scispace - formally typeset
Open AccessJournal ArticleDOI

Multi-Objective Optimized Fuzzy Clustering for Detecting Cell Clusters from Single-Cell Expression Profiles

TLDR
A multi-objective optimization-based fuzzy clustering approach for detecting cell clusters from scRNA-seq data that obtained differentially expressed genes (DEGs) using Limma through the comparison of expression of the samples between each resultant cluster and the remaining clusters.
Abstract
Rapid advance in single-cell RNA sequencing (scRNA-seq) allows measurement of the expression of genes at single-cell resolution in complex disease or tissue. While many methods have been developed to detect cell clusters from the scRNA-seq data, this task currently remains a main challenge. We proposed a multi-objective optimization-based fuzzy clustering approach for detecting cell clusters from scRNA-seq data. First, we conducted initial filtering and SCnorm normalization. We considered various case studies by selecting different cluster numbers ( c l = 2 to a user-defined number), and applied fuzzy c-means clustering algorithm individually. From each case, we evaluated the scores of four cluster validity index measures, Partition Entropy ( P E ), Partition Coefficient ( P C ), Modified Partition Coefficient ( M P C ), and Fuzzy Silhouette Index ( F S I ). Next, we set the first measure as minimization objective (↓) and the remaining three as maximization objectives (↑), and then applied a multi-objective decision-making technique, TOPSIS, to identify the best optimal solution. The best optimal solution (case study) that had the highest TOPSIS score was selected as the final optimal clustering. Finally, we obtained differentially expressed genes (DEGs) using Limma through the comparison of expression of the samples between each resultant cluster and the remaining clusters. We applied our approach to a scRNA-seq dataset for the rare intestinal cell type in mice [GEO ID: GSE62270, 23,630 features (genes) and 288 cells]. The optimal cluster result (TOPSIS optimal score= 0.858) comprised two clusters, one with 115 cells and the other 91 cells. The evaluated scores of the four cluster validity indices, F S I , P E , P C , and M P C for the optimized fuzzy clustering were 0.482, 0.578, 0.607, and 0.215, respectively. The Limma analysis identified 1240 DEGs (cluster 1 vs. cluster 2). The top ten gene markers were Rps21, Slc5a1, Crip1, Rpl15, Rpl3, Rpl27a, Khk, Rps3a1, Aldob and Rps17. In this list, Khk (encoding ketohexokinase) is a novel marker for the rare intestinal cell type. In summary, this method is useful to detect cell clusters from scRNA-seq data.

read more

Citations
More filters

Revealing the vectors of cellular identity with single-cell genomics

TL;DR: Single-cell genomics has now made it possible to create a comprehensive atlas of human cells and has reopened definitions of a cell's identity and of the ways in which identity is regulated by the cell's molecular circuitry.
Journal ArticleDOI

Dimension Reduction and Clustering Models for Single-Cell RNA Sequencing Data: A Comparative Study.

TL;DR: A comprehensive review and evaluation of four classical dimension reduction methods and five clustering models showed that the feature selection method contributed positively to high-dimensional and sparse scRNA-seq data and feature-extraction methods were able to promote clustering performance, although this was not eternally immutable.
Journal ArticleDOI

A Comparative Analysis of Single-Cell Transcriptome Identifies Reprogramming Driver Factors for Efficiency Improvement

TL;DR: These studies found that pathways for autophagy, endocytosis, and apoptosis were incompletely activated in nuclear transfer (NT) 2-cell arrest embryos, whereas extensively inhibited pathways for stem cell pluripotency maintenance, DNA repair, cell cycle, andautophagy may result in NT 4-cell embryos arrest.
Proceedings ArticleDOI

Combinatorial Auction-Based Fog Service Allocation Mechanism for IoT Applications

TL;DR: This article proposes two types of truthful mechanisms for resource allocation and pricing in fog computing, and investigates that the combinatorial auction based mechanism can essentially enhance the allocation of the resources with high proficiency and creating higher income for the fog providers.
References
More filters
Journal ArticleDOI

Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments

TL;DR: The hierarchical model of Lonnstedt and Speed (2002) is developed into a practical approach for general microarray experiments with arbitrary numbers of treatments and RNA samples and the moderated t-statistic is shown to follow a t-distribution with augmented degrees of freedom.
Book

Multiple Attribute Decision Making: Methods and Applications

TL;DR: In this paper, the authors present a classification of MADM methods by data type and propose a ranking method based on the degree of similarity of the MADM method to the original MADM algorithm.
Journal ArticleDOI

FCM: The fuzzy c-means clustering algorithm

TL;DR: A FORTRAN-IV coding of the fuzzy c -means (FCM) clustering program is transmitted, which generates fuzzy partitions and prototypes for any set of numerical data.
Related Papers (5)