scispace - formally typeset
Journal ArticleDOI

A Survey of Multiobjective Evolutionary Algorithms for Data Mining: Part I

TLDR
This two-part paper has surveyed different multiobjective evolutionary algorithms for clustering, association rule mining, and several other data mining tasks, and provided a general discussion on the scopes for future research in this domain.
Abstract
The aim of any data mining technique is to build an efficient predictive or descriptive model of a large amount of data. Applications of evolutionary algorithms have been found to be particularly useful for automatic processing of large quantities of raw noisy data for optimal parameter setting and to discover significant and meaningful information. Many real-life data mining problems involve multiple conflicting measures of performance, or objectives, which need to be optimized simultaneously. Under this context, multiobjective evolutionary algorithms are gradually finding more and more applications in the domain of data mining since the beginning of the last decade. In this two-part paper, we have made a comprehensive survey on the recent developments of multiobjective evolutionary algorithms for data mining problems. In this paper, Part I, some basic concepts related to multiobjective optimization and data mining are provided. Subsequently, various multiobjective evolutionary approaches for two major data mining tasks, namely feature selection and classification, are surveyed. In Part II of this paper, we have surveyed different multiobjective evolutionary algorithms for clustering, association rule mining, and several other data mining tasks, and provided a general discussion on the scopes for future research in this domain.

read more

Citations
More filters
Journal ArticleDOI

PlatEMO: A MATLAB Platform for Evolutionary Multi-Objective Optimization [Educational Forum]

TL;DR: PlatEMO as discussed by the authors is a MATLAB platform for evolutionary multi-objective optimization, which includes more than 50 multiobjective evolutionary algorithms and more than 100 multobjective test problems, along with several widely used performance indicators.
Posted Content

PlatEMO: A MATLAB Platform for Evolutionary Multi-Objective Optimization

TL;DR: The main features of PlatEMO are introduced and how to use it for performing comparative experiments, embedding new algorithms, creating new test problems, and developing performance indicators are illustrated.
Journal ArticleDOI

Big IoT Data Analytics: Architecture, Opportunities, and Open Research Challenges

TL;DR: The state-of-the-art research efforts directed toward big IoT data analytics are investigated, the relationship between big data analytics and IoT is explained, and several opportunities brought by data analytics in IoT paradigm are discussed.
Journal ArticleDOI

Many-Objective Evolutionary Algorithms: A Survey

TL;DR: A survey of MaOEAs is reported and seven classes of many-objective evolutionary algorithms proposed are categorized into seven classes: relaxed dominance based, diversity-based, aggregation- based, indicator-Based, reference set based, preference-based and dimensionality reduction approaches.
Journal ArticleDOI

Data mining for the Internet of Things: literature review and challenges

TL;DR: A systematic way to review data mining in knowledge view, technique view, and application view, including classification, clustering, association analysis, time series analysis and outlier analysis is given.
References
More filters
Book

Genetic algorithms in search, optimization, and machine learning

TL;DR: In this article, the authors present the computer techniques, mathematical tools, and research results that will enable both students and practitioners to apply genetic algorithms to problems in many fields, including computer programming and mathematics.
Journal ArticleDOI

A fast and elitist multiobjective genetic algorithm: NSGA-II

TL;DR: This paper suggests a non-dominated sorting-based MOEA, called NSGA-II (Non-dominated Sorting Genetic Algorithm II), which alleviates all of the above three difficulties, and modify the definition of dominance in order to solve constrained multi-objective problems efficiently.

Statistical learning theory

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.
Proceedings ArticleDOI

Mining association rules between sets of items in large databases

TL;DR: An efficient algorithm is presented that generates all significant association rules between items in the database of customer transactions and incorporates buffer management and novel estimation and pruning techniques.
Journal ArticleDOI

Silhouettes: a graphical aid to the interpretation and validation of cluster analysis

TL;DR: A new graphical display is proposed for partitioning techniques, where each cluster is represented by a so-called silhouette, which is based on the comparison of its tightness and separation, and provides an evaluation of clustering validity.
Related Papers (5)