An Efficient Algorithm for Mining Association Rules in Large Databases

Open AccessProceedings Article

An Efficient Algorithm for Mining Association Rules in Large Databases

Ashoka Savasere, +2 more

- pp 432-444

Chats0

TLDR

This paper presents an efficient algorithm for mining association rules that is fundamentally different from known algorithms and not only reduces the I/O overhead significantly but also has lower CPU overhead for most cases.

Abstract:

Mining for a.ssociation rules between items in a large database of sales transactions has been described as an important database mining problem. In this paper we present an efficient algorithm for mining association rules that is fundamentally different from known algorithms. Compared to previous algorithms, our algorithm not only reduces the I/O overhead significantly but also has lower CPU overhead for most cases. We have performed extensive experiments and compared the performance of our algorithm with one of the best existing algorithms. It was found that for large databases, the CPU overhead was reduced by as much as a factor of four and I/O was reduced by almost an order of magnitude. Hence this algorithm is especially suitable for very large size databases.

Citations

PDF

Open Access

More filters

Book

Data Mining: Concepts and Techniques

Jiawei Han, +2 more

TL;DR: This book presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects, and provides a comprehensive, practical look at the concepts and techniques you need to get the most out of real business data.

...read moreread less

Journal ArticleDOI

Mining frequent patterns without candidate generation

Jiawei Han, +2 more

TL;DR: This study proposes a novel frequent pattern tree (FP-tree) structure, which is an extended prefix-tree structure for storing compressed, crucial information about frequent patterns, and develops an efficient FP-tree-based mining method, FP-growth, for mining the complete set of frequent patterns by pattern fragment growth.

...read moreread less

Journal ArticleDOI

Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach

Jiawei Han, +3 more

- 01 Jan 2004 -

Data Mining and Knowledge Discovery

TL;DR: A novel frequent-pattern tree (FP-tree) structure is proposed, which is an extended prefix-tree structure for storing compressed, crucial information about frequent patterns, and an efficient FP-tree-based mining method, FP-growth, is developed for mining the complete set of frequent patterns by pattern fragment growth.

...read moreread less

Journal ArticleDOI

Data mining: an overview from a database perspective

Ming-Syan Chen, +2 more

- 01 Dec 1996 -

IEEE Transactions on Knowledge and Data ...

TL;DR: In this paper, a survey of the available data mining techniques is provided and a comparative study of such techniques is presented, based on a database researcher's point-of-view.

...read moreread less

Journal ArticleDOI

SPADE: An Efficient Algorithm for Mining Frequent Sequences

Mohammed J. Zaki

- 01 Jan 2001 -

Machine Learning

TL;DR: SPADE is a new algorithm for fast discovery of Sequential Patterns that utilizes combinatorial properties to decompose the original problem into smaller sub-problems, that can be independently solved in main-memory using efficient lattice search techniques, and using simple join operations.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Mining association rules between sets of items in large databases

Rakesh Agrawal, +2 more

TL;DR: An efficient algorithm is presented that generates all significant association rules between items in the database of customer transactions and incorporates buffer management and novel estimation and pruning techniques.

...read moreread less

Proceedings Article

Fast Algorithms for Mining Association Rules in Large Databases

Rakesh Agrawal, +1 more

Book

Knowledge Discovery in Databases

Gregory Piateski, +1 more

TL;DR: Knowledge Discovery in Databases brings together current research on the exciting problem of discovering useful and interesting knowledge in databases, which spans many different approaches to discovery, including inductive learning, bayesian statistics, semantic query optimization, knowledge acquisition for expert systems, information theory, and fuzzy 1 sets.

...read moreread less

Journal ArticleDOI

Parallel database systems: the future of high performance database systems

David J. DeWitt, +1 more

- 01 Jun 1992 -

Communications of The ACM

Proceedings Article

Knowledge Discovery in Databases: An Attribute-Oriented Approach

Jiawei Han, +2 more

TL;DR: An attribute-oriented induction method has been developed for knowledge discovery in databases that integrates a machine learning paradigm with set-oriented database operations and extracts generalized data from actual data in databases.

...read moreread less

An Efficient Algorithm for Mining Association Rules in Large Databases

Citations

Data Mining: Concepts and Techniques

Mining frequent patterns without candidate generation

Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach

Data mining: an overview from a database perspective

SPADE: An Efficient Algorithm for Mining Frequent Sequences

References

Mining association rules between sets of items in large databases

Fast Algorithms for Mining Association Rules in Large Databases

Knowledge Discovery in Databases

Parallel database systems: the future of high performance database systems

Knowledge Discovery in Databases: An Attribute-Oriented Approach

Related Papers (5)

Mining association rules between sets of items in large databases

Fast algorithms for mining association rules

Fast Algorithms for Mining Association Rules in Large Databases

Mining frequent patterns without candidate generation

Dynamic itemset counting and implication rules for market basket data