Memorias de investigación
Research Publications in journals:
A novel pruning algorithm for mining long and maximum length frequent itemsets
Year:2019

Research Areas
  • Statistics

Information
Abstract
Frequent itemset mining is today one of the most popular data mining techniques. Its application is, how- ever, hindered by the high computational cost in many real-world datasets, especially for smaller values of support thresholds. In many cases, moreover, the large number of frequent itemsets discovered is over- whelming. In some real-world applications, it is sufficient to find a smaller subset of frequent itemsets, such as identifying the frequent itemsets with a maximum length. In this paper, we present a prun- ing algorithm, called LengthSort, that reduces the search space effectively and improves the efficiency of mining frequent itemsets with a maximum length. LengthSort prunes both the items and the transactions before constructing a Frequent Pattern tree structure. Our experiments on several datasets show that the proposed pruning techniques reduce the time needed to discover the frequent itemsets with a maximum length. The proposed pruning algorithm can also be applied to efficiently discover frequent itemsets that are longer than a user-specified threshold.
International
Si
JCR
Si
Title
Expert Systems With Applications
ISBN
0957-4174
Impact factor JCR
Impact info
Volume
142
10.1016/j.eswa.2019.113004
Journal number
From page
113004
To page
113027
Month
SIN MES
Ranking
Participants

Research Group, Departaments and Institutes related
  • Creador: Grupo de Investigación: Estadística computacional y Modelado estocástico
  • Departamento: Ingeniería de Organización, Administración de Empresas y Estadística