Home
/
Topics
/
Apriori algorithm

Topic

Apriori algorithm

About: Apriori algorithm is a research topic. Over the lifetime, 4105 publications have been published within this topic receiving 85965 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1970

Papers

PDF

Open Access

More filters

Proceedings Article•

An efficient algorithm for the incremental updation of association rules in large databases

[...]

Shiby Thomas¹, Sreenath Bodagala¹, Khaled Alsabti¹, Sanjay Ranka¹•Institutions (1)

University of Florida¹

14 Aug 1997

TL;DR: This paper proposes an incremental updating technique based on negative borders, for the maintenance of association rules when new transaction data is added to or deleted from a transaction database.

...read moreread less

Abstract: Efficient discovery of association rules in large databases is a well studied problem and several approaches have been proposed. However, it is non trivial to maintain the association rules current when the database is updated since, such updates could invalidate existing rules or introduce new rules. In this paper, we propose an incremental updating technique based on negative borders, for the maintenance of association rules when new transaction data is added to or deleted from a transaction database. An important feature of our algorithm is that it requires a full scan (exactly one) of the whole database only if the database update causes the negative border of the set of large itemsets to expand.

...read moreread less

249 citations

Book Chapter•DOI•

A tree-based approach for frequent pattern mining from uncertain data

[...]

Carson K. Leung¹, Mark Anthony F. Mateo¹, Dale A. Brajczuk¹•Institutions (1)

University of Manitoba¹

20 May 2008

TL;DR: A tree-based mining algorithm is proposed to efficiently find frequent patterns from uncertain data, where each item in the transactions is associated with an existential probability.

...read moreread less

Abstract: Many frequent pattern mining algorithms find patterns from traditional transaction databases, in which the content of each transaction--namely, items--is definitely known and precise. However, there are many real-life situations in which the content of transactions is uncertain. To deal with these situations, we propose a tree-based mining algorithm to efficiently find frequent patterns from uncertain data, where each item in the transactions is associated with an existential probability. Experimental results show the efficiency of our proposed algorithm.

...read moreread less

228 citations

Proceedings Article•DOI•

Apriori-based frequent itemset mining algorithms on MapReduce

[...]

Ming-Yen Lin¹, Pei-Yu Lee¹, Sue-Chen Hsueh²•Institutions (2)

Feng Chia University¹, Chaoyang University of Technology²

20 Feb 2012

TL;DR: DPC features in dynamically combining candidates of various lengths and outperforms both the straight-forward algorithm SPC and the fixed passes combined counting algorithm FPC, and shows that all the three algorithms scale up linearly with respect to dataset sizes and cluster sizes.

...read moreread less

Abstract: Many parallelization techniques have been proposed to enhance the performance of the Apriori-like frequent itemset mining algorithms. Characterized by both map and reduce functions, MapReduce has emerged and excels in the mining of datasets of terabyte scale or larger in either homogeneous or heterogeneous clusters. Minimizing the scheduling overhead of each map-reduce phase and maximizing the utilization of nodes in each phase are keys to successful MapReduce implementations. In this paper, we propose three algorithms, named SPC, FPC, and DPC, to investigate effective implementations of the Apriori algorithm in the MapReduce framework. DPC features in dynamically combining candidates of various lengths and outperforms both the straight-forward algorithm SPC and the fixed passes combined counting algorithm FPC. Extensive experimental results also show that all the three algorithms scale up linearly with respect to dataset sizes and cluster sizes.

...read moreread less

225 citations

Proceedings Article•DOI•

Transversing itemset lattices with statistical metric pruning

[...]

Shinichi Morishita¹, Jun Sese¹•Institutions (1)

University of Tokyo¹

01 May 2000

TL;DR: A method of estimating a tight upper bound on the statistical metric associated with any superset of an itemset, as well as the novel use of the resulting information of upper bounds to prune unproductive supersets while traversing itemset lattices is presented.

...read moreread less

Abstract: We study how to efficiently compute significant association rules according to common statistical measures such as a chi-squared value or correlation coefficient. For this purpose, one might consider to use of the Apriori algorithm, but the algorithm needs major conversion, because none of these statistical metrics are anti-monotone, and the use of higher support for reducing the search space cannot guarantee solutions in its the search space. We here present a method of estimating a tight upper bound on the statistical metric associated with any superset of an itemset, as well as the novel use of the resulting information of upper bounds to prune unproductive supersets while traversing itemset lattices. Experimental tests demonstrate the efficiency of this method.

...read moreread less

216 citations

Proceedings Article•DOI•

Clustering event logs using iterative partitioning

[...]

Adetokunbo Makanju¹, A. Nur Zincir-Heywood¹, Evangelos E. Milios¹•Institutions (1)

Dalhousie University¹

28 Jun 2009

TL;DR: This paper presents IPLoM (Iterative Partitioning Log Mining), a novel algorithm for the mining of clusters from event logs that outperforms the other algorithms statistically significantly, and is also able to achieve an average F- Measure performance 78% when the closest other algorithm achieves an F-Measure performance of 10%.

...read moreread less

Abstract: The importance of event logs, as a source of information in systems and network management cannot be overemphasized. With the ever increasing size and complexity of today's event logs, the task of analyzing event logs has become cumbersome to carry out manually. For this reason recent research has focused on the automatic analysis of these log files. In this paper we present IPLoM (Iterative Partitioning Log Mining), a novel algorithm for the mining of clusters from event logs. Through a 3-Step hierarchical partitioning process IPLoM partitions log data into its respective clusters. In its 4th and final stage IPLoM produces cluster descriptions or line formats for each of the clusters produced. Unlike other similar algorithms IPLoM is not based on the Apriori algorithm and it is able to find clusters in data whether or not its instances appear frequently. Evaluations show that IPLoM outperforms the other algorithms statistically significantly, and it is also able to achieve an average F-Measure performance 78% when the closest other algorithm achieves an F-Measure performance of 10%.

...read moreread less

212 citations

1
2
…
3
4
5
6
7
8
9
…
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

4,481

Papers

92,099

Citations

No. of papers in the topic in previous years
Year	Papers
2023	92
2022	291
2021	180
2020	216
2019	209
2018	223

Apriori algorithm

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics