scispace - formally typeset
Open AccessJournal ArticleDOI

CLUSTERING CATEGORICAL DATA USING k-MODES BASED ON CUCKOO SEARCH OPTIMIZATION ALGORITHM

TLDR
This paper proposes a new k-modes clustering algorithm that is combined with Cuckoo Search algorithm to obtain the global optimum solution for categorical clustering and shows the efficiency of the proposed algorithm.
Abstract
Cluster analysis is the unsupervised learning technique that finds the interesting patterns in the data objects without knowing class labels. Most of the real world dataset consists of categorical data. For example, social media analysis may have the categorical data like the gender as male or female. The k-modes clustering algorithm is the most widely used to group the categorical data, because it is easy to implement and efficient to handle the large amount of data. However, due to its random selection of initial centroids, it provides the local optimum solution. There are number of optimization algorithms are developed to obtain global optimum solution. Cuckoo Search algorithm is the population based metaheuristic optimization algorithms to provide the global optimum solution. Methods: In this paper, k-modes clustering algorithm is combined with Cuckoo Search algorithm to obtain the global optimum solution. Results: Experiments are conducted with benchmark datasets and the results are compared with k-modes and Particle Swarm Optimization with k-modes to prove the efficiency of the proposed algorithm.

read more

Content maybe subject to copyright    Report

Citations
More filters
Book

Information retrieval

TL;DR: The major change in the second edition of this book is the addition of a new chapter on probabilistic retrieval, which I think is one of the most interesting and active areas of research in information retrieval.
Journal ArticleDOI

Clustering Mixed Numeric and Categorical Data With Cuckoo Search

TL;DR: This research proposes CCS-K-Prototypes, a novel partitional Clustering algorithm based on Cuckoo Search and K-Prototype, for clustering mixed numeric and categorical data and suggests two formulas for the cuckoo to search for the potential solution around the existing solutions or in the entire attribute space.
Journal ArticleDOI

A comprehensive review of time use surveys in modelling occupant presence and behavior: Data, methods, and applications

TL;DR: A comprehensive review of the TUS’ available data, modelling methods, and implementations in building energy research and the main TUS-based models are categorized and explained in detail including their main features and limitations.
Proceedings ArticleDOI

Symptoms based Early Clinical Diagnosis of COVID-19 Cases using Hybrid and Ensemble Machine Learning Techniques

TL;DR: It is observed from the results that K-mode clustering followed by classification-based hybrid modelling resulted in improved classification accuracy in the clusters leading to an average accuracy of 87.17% and 87.24% with GB and RF respectively.
Book ChapterDOI

Hybrid Harris Hawks Optimization with Differential Evolution for Data Clustering

TL;DR: This chapter proposes a new hybridization strategy, namely, hybrid Harris Hawks optimization with differential evolution (DE) (H-HHO), to tackle the data clustering problem.
References
More filters
BookDOI

Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control and Artificial Intelligence

TL;DR: Initially applying his concepts to simply defined artificial systems with limited numbers of parameters, Holland goes on to explore their use in the study of a wide range of complex, naturally occuring processes, concentrating on systems having multiple factors that interact in nonlinear ways.
Journal ArticleDOI

Objective Criteria for the Evaluation of Clustering Methods

TL;DR: This article proposes several criteria which isolate specific aspects of the performance of a method, such as its retrieval of inherent structure, its sensitivity to resampling and the stability of its results in the light of new data.
Proceedings ArticleDOI

Cuckoo Search via Lévy flights

TL;DR: A new meta-heuristic algorithm, called Cuckoo Search (CS), is formulated, based on the obligate brood parasitic behaviour of some cuckoo species in combination with the Lévy flight behaviour ofSome birds and fruit flies, for solving optimization problems.
Journal ArticleDOI

Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values

TL;DR: Two algorithms which extend the k-means algorithm to categorical domains and domains with mixed numeric and categorical values are presented and are shown to be efficient when clustering large data sets, which is critical to data mining applications.
Dissertation

An analysis of particle swarm optimizers

TL;DR: This thesis presents a theoretical model that can be used to describe the long-term behaviour of the Particle Swarm Optimiser and results are presented to support the theoretical properties predicted by the various models, using synthetic benchmark functions to investigate specific properties.
Related Papers (5)