Robust Rough-Fuzzy C-Means Algorithm: Design and Applications in Coding and Non-coding RNA Expression Data Clustering

doi:10.3233/FI-2013-829

Citations

PDF

Open Access

More filters

Journal Article•DOI•

An improved fuzzy c -means clustering algorithm based on shadowed sets and PSO

[...]

Jian Zhang¹, Ling Shen²•Institutions (2)

Tongji University¹, University of Shanghai for Science and Technology²

01 Jan 2014-Computational Intelligence and Neuroscience

TL;DR: A modified fuzzy c-means algorithm (SP-FCM) based on particle swarm optimization (PSO) and shadowed sets to perform feature clustering and significantly improves the clustering effect.

...read moreread less

Abstract: To organize the wide variety of data sets automatically and acquire accurate classification, this paper presents a modified fuzzy c-means algorithm (SP-FCM) based on particle swarm optimization (PSO) and shadowed sets to perform feature clustering SP-FCM introduces the global search property of PSO to deal with the problem of premature convergence of conventional fuzzy clustering, utilizes vagueness balance property of shadowed sets to handle overlapping among clusters, and models uncertainty in class boundaries This new method uses Xie-Beni index as cluster validity and automatically finds the optimal cluster number within a specific range with cluster partitions that provide compact and well-separated clusters Experiments show that the proposed approach significantly improves the clustering effect

...read moreread less

29 citations

Cites methods from "Robust Rough-Fuzzy C-Means Algorith..."

...Moreover, shadowed set- and rough set-based clustering methods, namely, SPFCM, SRCM, RCM, and SCM, perform better than FCM....
[...]
...Fuzzy sets and rough sets have been incorporated in the c-means framework to develop the fuzzy c-means (FCM) [7] and rough c-means (RCM) [8] algorithms....
[...]
...In this section, the performance of FCM, RCM, shadowed 𝑐- means (SCM) [21], shadowed rough 𝑐-means (SRCM) [19], and SP-FCM algorithms is presented on four UCI datasets, four yeast gene expression datasets, and real data....
[...]
...Fuzzy sets and rough sets have been incorporated in the 𝑐-means framework to develop the fuzzy 𝑐-means (FCM) [7] and rough 𝑐-means (RCM) [8] algorithms....
[...]
...The SP-FCM and SRCM obtain the same effect and perform better than other clustering algorithms....
[...]

Journal Article•DOI•

FaRoC: Fast and Robust Supervised Canonical Correlation Analysis for Multimodal Omics Data

[...]

Ankita Mandal¹, Pradipta Maji¹•Institutions (1)

Indian Statistical Institute¹

01 Apr 2018-IEEE Transactions on Systems, Man, and Cybernetics

TL;DR: The formulation enables the proposed method to extract required number of correlated features sequentially with lesser computational cost as compared to existing methods, and provides an efficient way to find optimum regularization parameters employed in CCA.

...read moreread less

Abstract: One of the main problems associated with high dimensional multimodal real life data sets is how to extract relevant and significant features. In this regard, a fast and robust feature extraction algorithm, termed as FaRoC, is proposed, integrating judiciously the merits of canonical correlation analysis (CCA) and rough sets. The proposed method extracts new features sequentially from two multidimensional data sets by maximizing their relevance with respect to class label and significance with respect to already-extracted features. To generate canonical variables sequentially, an analytical formulation is introduced to establish the relation between regularization parameters and CCA. The formulation enables the proposed method to extract required number of correlated features sequentially with lesser computational cost as compared to existing methods. To compute both significance and relevance measures of a feature, the concept of hypercuboid equivalence partition matrix of rough hypercuboid approach is used. It also provides an efficient way to find optimum regularization parameters employed in CCA. The efficacy of the proposed FaRoC algorithm, along with a comparison with other existing methods, is extensively established on several real life data sets.

...read moreread less

23 citations

Journal Article•DOI•

Rough-probabilistic clustering and hidden Markov random field model for segmentation of HEp-2 cell and brain MR images

[...]

Abhirup Banerjee¹, Pradipta Maji¹•Institutions (1)

Indian Statistical Institute¹

01 Sep 2016

TL;DR: A new clustering algorithm, termed as rough-probabilistic clustering, is presented, integrating judiciously the merits of rough sets and a new probability distribution, called stomped normal (SN) distribution, for accurate and robust segmentation of images.

...read moreread less

Abstract: Graphical abstractDisplay Omitted The segmentation of images into different meaningful classes is an important task for automatic image analysis technique. The finite Gaussian mixture model is one of the popular models for parametric model based image segmentation. However, the normality assumption of this model induces certain limitations as a single representative value is considered to represent each class. In this regard, the paper presents a new clustering algorithm, termed as rough-probabilistic clustering, integrating judiciously the merits of rough sets and a new probability distribution, called stomped normal (SN) distribution. The intensity distribution of a class is represented by SN distribution, where each class consists of a crisp lower approximation and a probabilistic boundary region. The intensity distribution of any image is modeled as a mixture of finite number of SN distributions. The expectation-maximization algorithm is used to estimate the parameters of each class. Incorporating hidden Markov random field framework into rough-probabilistic clustering, a new method is proposed for accurate and robust segmentation of images. The performance of the proposed segmentation approach, along with a comparison with related methods, is demonstrated on a set of HEp-2 cell images, and synthetic and real brain MR images for different bias fields and noise levels.

...read moreread less

22 citations

Journal Article•DOI•

μHEM for identification of differentially expressed miRNAs using hypercuboid equivalence partition matrix

[...]

Sushmita Paul¹, Pradipta Maji¹•Institutions (1)

Indian Statistical Institute¹

04 Sep 2013-BMC Bioinformatics

TL;DR: The results on several microarray data sets demonstrate that the proposed method can bring a remarkable improvement on miRNA selection problem and is a potentially useful tool for exploration of miRNA expression data and identification of differentially expressed miRNAs worth further investigation.

...read moreread less

Abstract: The miRNAs, a class of short approximately 22‐nucleotide non‐coding RNAs, often act post‐transcriptionally to inhibit mRNA expression. In effect, they control gene expression by targeting mRNA. They also help in carrying out normal functioning of a cell as they play an important role in various cellular processes. However, dysregulation of miRNAs is found to be a major cause of a disease. It has been demonstrated that miRNA expression is altered in many human cancers, suggesting that they may play an important role as disease biomarkers. Multiple reports have also noted the utility of miRNAs for the diagnosis of cancer. Among the large number of miRNAs present in a microarray data, a modest number might be sufficient to classify human cancers. Hence, the identification of differentially expressed miRNAs is an important problem particularly for the data sets with large number of miRNAs and small number of samples. In this regard, a new miRNA selection algorithm, called μHEM, is presented based on rough hypercuboid approach. It selects a set of miRNAs from a microarray data by maximizing both relevance and significance of the selected miRNAs. The degree of dependency of sample categories on miRNAs is defined, based on the concept of hypercuboid equivalence partition matrix, to measure both relevance and significance of miRNAs. The effectiveness of the new approach is demonstrated on six publicly available miRNA expression data sets using support vector machine. The.632+ bootstrap error estimate is used to minimize the variability and biasedness of the derived results. An important finding is that the μHEM algorithm achieves lowest B.632+ error rate of support vector machine with a reduced set of differentially expressed miRNAs on four expression data sets compare to some existing machine learning and statistical methods, while for other two data sets, the error rate of the μHEM algorithm is comparable with the existing techniques. The results on several microarray data sets demonstrate that the proposed method can bring a remarkable improvement on miRNA selection problem. The method is a potentially useful tool for exploration of miRNA expression data and identification of differentially expressed miRNAs worth further investigation.

...read moreread less

14 citations

Cites methods from "Robust Rough-Fuzzy C-Means Algorith..."

...The theory of rough sets has also been successfully applied to microarray data analysis in [9,24-35]....
[...]

Journal Article•DOI•

Segmentation of bias field induced brain MR images using rough sets and stomped-t distribution

[...]

Abhirup Banerjee¹, Pradipta Maji²•Institutions (2)

University of Oxford¹, Indian Statistical Institute²

01 Dec 2019-Information Sciences

TL;DR: A novel method for simultaneous segmentation and bias field correction in brain MR images is presented, which integrates the concept of rough sets and the merit of a recently introduced probability distribution, called stomped-t (St-t) distribution.

...read moreread less

10 citations

Robust Rough-Fuzzy C-Means Algorithm: Design and Applications in Coding and Non-coding RNA Expression Data Clustering

Citations

Cites methods from "Robust Rough-Fuzzy C-Means Algorith..."

Cites methods from "Robust Rough-Fuzzy C-Means Algorith..."

References

"Robust Rough-Fuzzy C-Means Algorith..." refers methods in this paper

"Robust Rough-Fuzzy C-Means Algorith..." refers methods in this paper

"Robust Rough-Fuzzy C-Means Algorith..." refers background in this paper

"Robust Rough-Fuzzy C-Means Algorith..." refers methods in this paper

Related Papers (5)