A New Rough-Fuzzy Clustering Algorithm and its Applications

doi:10.1007/978-81-322-1602-5_130

Home
/
Papers
/
A New Rough-Fuzzy Clustering Algorithm and its Applications

Book Chapter•DOI•

A New Rough-Fuzzy Clustering Algorithm and its Applications

Sushmita Paul¹, Pradipta Maji¹•Institutions (1)

Indian Statistical Institute¹

01 Jan 2014-pp 1245-1251

TL;DR: A robust rough-fuzzy clustering algorithm is applied here to identify clusters having similar objects and it can find overlapping and vaguely defined clusters with arbitrary shapes in noisy environment.

read less

Abstract: Cluster analysis is a technique that divides a given data set into a set of clusters in such a way that two objects from the same cluster are as similar as possible and the objects from different clusters are as dissimilar as possible. A robust rough-fuzzy \(c\)-means clustering algorithm is applied here to identify clusters having similar objects. Each cluster of the robust rough-fuzzy clustering algorithm is represented by a set of three parameters, namely, cluster prototype, a possibilistic fuzzy lower approximation, and a probabilistic fuzzy boundary. The possibilistic lower approximation helps in discovering clusters of various shapes. The cluster prototype depends on the weighting average of the possibilistic lower approximation and probabilistic boundary. The reported algorithm is robust in the sense that it can find overlapping and vaguely defined clusters with arbitrary shapes in noisy environment. The effectiveness of the clustering algorithm, along with a comparison with other clustering algorithms, is demonstrated on synthetic as well as coding and non-coding RNA expression data sets using some cluster validity indices.

...read moreread less

Citations

PDF

Open Access

More filters

Evolutionary approaches for feature selection in biological data

[...]

Vinh Q. Dang

01 Jan 2014

TL;DR: This dissertation aims to provide a history of web exceptionalism from 1989 to 2002, a period chosen in order to explore its roots as well as specific cases up to and including the year in which descriptions of “Web 2.0” began to circulate.

...read moreread less

Abstract: Data mining techniques have been used widely in many areas such as business, science, engineering and medicine The techniques allow a vast amount of data to be explored in order to extract useful information from the data One of the foci in the health area is finding interesting biomarkers from biomedical data Mass throughput data generated from microarrays and mass spectrometry from biological samples are high dimensional and is small in sample size Examples include DNA microarray datasets with up to 500,000 genes and mass spectrometry data with 300,000 m/z values While the availability of such datasets can aid in the development of techniques/drugs to improve diagnosis and treatment of diseases, a major challenge involves its analysis to extract useful and meaningful information The aims of this project are: 1) to investigate and develop feature selection algorithms that incorporate various evolutionary strategies, 2) using the developed algorithms to find the “most relevant” biomarkers contained in biological datasets and 3) and evaluate the goodness of extracted feature subsets for relevance (examined in terms of existing biomedical domain knowledge and from classification accuracy obtained using different classifiers) The project aims to generate good predictive models for classifying diseased samples from control

...read moreread less

4 citations

Journal Article•

Enforcement of Rough Fuzzy Clustering Based on Correlation Analysis

[...]

Revathy Subramanion, Parvathavarthini Balasubramanian, Shajunisha Noordeen

01 Jan 2017-The International Arab Journal of Information Technology

TL;DR: This paper proposes Fuzzy to Rough FuzzY Link Element (FRFLE) which is used as an important factor to conceptualize the rough fuzzy clustering from the fuzzy clusters result and shows that proposed RFCM algorithm using FRFLE deals with less computation time than the traditional RFCM algorithms.

...read moreread less

Abstract: Clustering is a standard approach in analysis of data and construction of separated similar groups. The most widely used robust soft clustering methods are fuzzy, rough and rough fuzzy clustering. The prominent feature of soft clustering leads to combine the rough and fuzzy sets. The Rough Fuzzy C-Means (RFCM) includes the lower and boundary estimation of rough sets, and fuzzy membership of fuzzy sets into c-means algorithm, the widespread RFCM needs more computation. To avoid this, this paper proposes Fuzzy to Rough Fuzzy Link Element (FRFLE) which is used as an important factor to conceptualize the rough fuzzy clustering from the fuzzy clustering result. Experiments with synthetic, standard and the different benchmark dataset shows the automation process of the FRFLE value, then the comparison between the results of general RFCM and RFCM using FRFLE is observed. Moreover, the performance analysis result shows that proposed RFCM algorithm using FRFLE deals with less computation time than the traditional RFCM algorithms.

...read moreread less

3 citations

Cites methods from "A New Rough-Fuzzy Clustering Algori..."

...RFCM algorithm has been widely used inmany applications [10,12,13,5]....
[...]

Book Chapter•DOI•

Rough K-means Algorithm Based on the Boundary Object Difference Metric

[...]

Pengda Zhong¹, Tengfei Zhang¹, Xiang Zhang¹, Xinyuan Hu¹, Wenfeng Zhang¹ - Show less +1 more•Institutions (1)

Nanjing University of Posts and Telecommunications¹

22 Oct 2021

TL;DR: Zhang et al. as discussed by the authors proposed a new rough k-means algorithm to measure the weight of boundary objects, which considers the distance from boundary objects to their neighbor points and the number of neighbor points together to dynamically calculate the weights of boundary object to clusters that may belong to.

...read moreread less

Abstract: Rough k-means algorithm can effectively deal with the problem of the fuzzy boundaries. But traditional rough k-means algorithm set unified weight for boundary object, ignoring the differences between individual objects. Membership degree method of rough fuzzy k-means algorithm is used to measure the membership degree of boundary object to the clusters that they may belong to, ignoring the distribution of neighbor points of the boundary object. So, according to the distribution of neighbor points of the boundary object, we put forward a new rough k-means algorithm to measure the weight of boundary objects. The proposed algorithm considers the distance from boundary objects to their neighbor points and the number of neighbor points of boundary objects together to dynamically calculate the weights of boundary object to clusters that may belong to. Simulation and experiment, through examples verify the effectiveness of the proposed method.

...read moreread less

References

PDF

Open Access

More filters

Journal Article•DOI•

A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters

[...]

J. C. Dunn

01 Jan 1973

TL;DR: Two fuzzy versions of the k-means optimal, least squared error partitioning problem are formulated for finite subsets X of a general inner product space; in both cases, the extremizing solutions are shown to be fixed points of a certain operator T on the class of fuzzy, k-partitions of X, and simple iteration of T provides an algorithm which has the descent property relative to the least squarederror criterion function.

...read moreread less

Abstract: Two fuzzy versions of the k-means optimal, least squared error partitioning problem are formulated for finite subsets X of a general inner product space. In both cases, the extremizing solutions are shown to be fixed points of a certain operator T on the class of fuzzy, k-partitions of X, and simple iteration of T provides an algorithm which has the descent property relative to the least squared error criterion function. In the first case, the range of T consists largely of ordinary (i.e. non-fuzzy) partitions of X and the associated iteration scheme is essentially the well known ISODATA process of Ball and Hall. However, in the second case, the range of T consists mainly of fuzzy partitions and the associated algorithm is new; when X consists of k compact well separated (CWS) clusters, Xi , this algorithm generates a limiting partition with membership functions which closely approximate the characteristic functions of the clusters Xi . However, when X is not the union of k CWS clusters, the limi...

...read moreread less

5,787 citations

A fuzzy relative of the isodata process and its use in detecting compact well-separated clusters

[...]

J. C. Dunn

01 Jan 1973

TL;DR: In this paper, two fuzzy versions of the k-means optimal, least squared error partitioning problem are formulated for finite subsets X of a general inner product space, and the extremizing solutions are shown to be fixed points of a certain operator T on the class of fuzzy, k-partitions of X, and simple iteration of T provides an algorithm which has the descent property relative to the LSE criterion function.

...read moreread less

5,254 citations

Journal Article•DOI•

A possibilistic approach to clustering

[...]

Raghu Krishnapuram¹, James M. Keller¹•Institutions (1)

University of Missouri¹

01 May 1993-IEEE Transactions on Fuzzy Systems

TL;DR: An appropriate objective function whose minimum will characterize a good possibilistic partition of the data is constructed, and the membership and prototype update equations are derived from necessary conditions for minimization of the criterion function.

...read moreread less

Abstract: The clustering problem is cast in the framework of possibility theory. The approach differs from the existing clustering methods in that the resulting partition of the data can be interpreted as a possibilistic partition, and the membership values can be interpreted as degrees of possibility of the points belonging to the classes, i.e., the compatibilities of the points with the class prototypes. An appropriate objective function whose minimum will characterize a good possibilistic partition of the data is constructed, and the membership and prototype update equations are derived from necessary conditions for minimization of the criterion function. The advantages of the resulting family of possibilistic algorithms are illustrated by several examples. >

...read moreread less

2,388 citations

Journal Article•DOI•

Numerical methods for fuzzy clustering

[...]

Enrique H. Ruspini¹•Institutions (1)

University of California, Los Angeles¹

01 Jul 1970-Information Sciences

TL;DR: In this paper, the authors considered the problem of decomposition of the probability density function of the original set into the weighted sum of the component fuzzy set densities, which is done by optimization of some functional defined over all possible fuzzy classifications.

...read moreread less

561 citations

Book•

Abstraction and pattern classification

[...]

Richard Bellman, Robert E. Kalaba, Lotfi A. Zadeh

01 Aug 1996

TL;DR: A general framework for the treatment of pattern-recognition problems is discussed, including the notion of a 'fuzzy' set and how this may be employed in a sequential experimental procedure to ascertain whether a symbol is a member of a particular set or not.

...read moreread less

Abstract: : This is a preliminary paper in which the authors discuss a general framework for the treatment of pattern-recognition problems. They make precise the notion of a 'fuzzy' set. Then they show how this may be employed in a sequential experimental procedure to ascertain whether a symbol is a member of a particular set or not. The close relation between the problem of pattern recognition and interpolation is stressed. (Author)

...read moreread less

378 citations