Data mining: an overview from a database perspective

doi:10.1109/69.553155

Journal Article

Ming-Syan Chen, +2 more

- 01 Dec 1996 -

IEEE Transactions on Knowledge and Data ...

- Vol. 8, Iss: 6, pp 866-883

TLDR

This paper surveys the available data mining techniques from a database researcher's point of view, classifies them, and presents a comparative study.

Abstract:

Mining information and knowledge from large databases has been recognized by many researchers as a key research topic in database systems and machine learning, and by many industrial companies as an important area with an opportunity of major revenues. Researchers in many different fields have shown great interest in data mining. Several emerging applications in information-providing services, such as data warehousing and online services over the Internet, also call for various data mining techniques to better understand user behavior, to improve the service provided and to increase business opportunities. In response to such a demand, this article provides a survey, from a database researcher's point of view, on the data mining techniques developed recently. A classification of the available data mining techniques is provided and a comparative study of such techniques is presented.

Citations

Book

Data Mining: Concepts and Techniques

Jiawei Han, +2 more

TL;DR: This book presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects, and provides a comprehensive, practical look at the concepts and techniques you need to get the most out of real business data.

Book

Data Mining: Practical Machine Learning Tools and Techniques

Ian H. Witten, +2 more

TL;DR: This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.

Book

Introduction to Data Mining

Pang-Ning Tan, +2 more

TL;DR: This book discusses data mining through the lens of cluster analysis, which examines the relationships between data, clusters, and algorithms, and some of the techniques used to solve these problems.

Journal Article

A trust-based consumer decision-making model in electronic commerce: The role of trust, perceived risk, and their antecedents

Dan J. Kim, +2 more

TL;DR: A theoretical framework describing the trust-based decision-making process a consumer uses when making a purchase from a given site is developed and the proposed model is tested using a Structural Equation Modeling technique on Internet consumer purchasing behavior data collected via a Web survey.

Data Mining: Concepts and Techniques (2nd edition)

Jiawei Han, +1 more

TL;DR: There have been many data mining books published in recent years, including Predictive Data Mining by Weiss and Indurkhya [WI98], Data Mining Solutions: Methods and Tools for Solving Real-World Problems by Westphal and Blaxton [WB98], and Mastering Data Mining: The Art and Science of Customer Relationship Management by Berry and Linoff [BL99].

References

Book

C4.5: Programs for Machine Learning

J. Ross Quinlan

TL;DR: A complete guide to the C4.5 system as implemented in C for the UNIX environment, which starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and overfitting.

Journal Article

Induction of Decision Trees

J. R. Quinlan

- 25 Mar 1986 -

Machine Learning

TL;DR: This paper summarizes an approach to synthesizing decision trees that has been used in a variety of systems, describes one such system, ID3, in detail, and discusses a reported shortcoming of the basic algorithm.

Proceedings Article

Mining association rules between sets of items in large databases

Rakesh Agrawal, +2 more

TL;DR: An efficient algorithm is presented that generates all significant association rules between items in the database of customer transactions and incorporates buffer management and novel estimation and pruning techniques.

Book

Probability, random variables and stochastic processes

Athanasios Papoulis

TL;DR: This book covers the meaning of probability, the axioms of probability, and the concept of a random variable, along with topics such as Markov chains and queueing theory.

Book

Probability, random variables, and stochastic processes

Athanasios Papoulis, +1 more

TL;DR: This book discusses the meaning of probability, the axioms of probability, repeated trials, and the concept of a random variable.

Related Papers (5)

Mining association rules between sets of items in large databases

Data Mining: Concepts and Techniques

Fast Algorithms for Mining Association Rules in Large Databases

Fast algorithms for mining association rules

Mining sequential patterns