Multi-Objective Optimized Fuzzy Clustering for Detecting Cell Clusters from Single-Cell Expression Profiles
Saurav Mallik, Zhongming Zhao +1 more
TLDR
A multi-objective optimization-based fuzzy clustering approach for detecting cell clusters from scRNA-seq data; differentially expressed genes (DEGs) are then obtained with Limma by comparing the expression of the samples in each resultant cluster against the remaining clusters.
Abstract:
Rapid advances in single-cell RNA sequencing (scRNA-seq) allow measurement of gene expression at single-cell resolution in complex disease or tissue. While many methods have been developed to detect cell clusters from scRNA-seq data, this task remains a major challenge. We proposed a multi-objective optimization-based fuzzy clustering approach for detecting cell clusters from scRNA-seq data. First, we conducted initial filtering and SCnorm normalization. We considered various case studies by selecting different cluster numbers (cl = 2 to a user-defined number), and applied the fuzzy c-means clustering algorithm to each case individually. For each case, we evaluated the scores of four cluster validity index measures: Partition Entropy (PE), Partition Coefficient (PC), Modified Partition Coefficient (MPC), and Fuzzy Silhouette Index (FSI). Next, we set the first measure as a minimization objective (↓) and the remaining three as maximization objectives (↑), and then applied a multi-objective decision-making technique, TOPSIS, to identify the optimal solution. The solution (case study) with the highest TOPSIS score was selected as the final optimal clustering. Finally, we obtained differentially expressed genes (DEGs) using Limma by comparing the expression of the samples in each resultant cluster against the remaining clusters. We applied our approach to a scRNA-seq dataset for the rare intestinal cell type in mice [GEO ID: GSE62270, 23,630 features (genes) and 288 cells]. The optimal cluster result (TOPSIS optimal score = 0.858) comprised two clusters, one with 115 cells and the other with 91 cells. The scores of the four cluster validity indices, FSI, PE, PC, and MPC, for the optimized fuzzy clustering were 0.482, 0.578, 0.607, and 0.215, respectively. The Limma analysis identified 1240 DEGs (cluster 1 vs. cluster 2). The top ten gene markers were Rps21, Slc5a1, Crip1, Rpl15, Rpl3, Rpl27a, Khk, Rps3a1, Aldob and Rps17. In this list, Khk (encoding ketohexokinase) is a novel marker for the rare intestinal cell type. In summary, this method is useful for detecting cell clusters from scRNA-seq data.
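The selection step described in the abstract (score each candidate cluster number on several validity indices, then rank the candidates with TOPSIS) can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the commonly used definitions of PC, PE (natural logarithm) and MPC, equal criterion weights and vector normalization in TOPSIS, and it takes FSI as a precomputed input rather than deriving it. The function names and the numbers in the toy table are hypothetical.

```python
import math

def partition_coefficient(u):
    """PC = (1/N) * sum of squared memberships; u holds one membership row per cell."""
    return sum(m * m for row in u for m in row) / len(u)

def partition_entropy(u):
    """PE = -(1/N) * sum of u*ln(u), treating 0*ln(0) as 0 (natural log assumed)."""
    return -sum(m * math.log(m) for row in u for m in row if m > 0) / len(u)

def modified_partition_coefficient(u):
    """MPC = 1 - c/(c-1) * (1 - PC), which rescales PC to the range [0, 1]."""
    c = len(u[0])
    return 1.0 - c / (c - 1) * (1.0 - partition_coefficient(u))

def topsis(rows, maximize):
    """Return one TOPSIS closeness score per row (equal weights assumed).

    rows:     one list of criterion values per candidate solution
    maximize: one bool per criterion (True = higher is better)
    """
    n_crit = len(maximize)
    # Vector-normalize each criterion column; guard against an all-zero column.
    norms = [math.sqrt(sum(r[j] ** 2 for r in rows)) or 1.0 for j in range(n_crit)]
    v = [[r[j] / norms[j] for j in range(n_crit)] for r in rows]
    # Ideal and anti-ideal points depend on the direction of each criterion.
    best = [max(col) if mx else min(col) for col, mx in zip(zip(*v), maximize)]
    worst = [min(col) if mx else max(col) for col, mx in zip(zip(*v), maximize)]
    scores = []
    for row in v:
        d_best = math.dist(row, best)
        d_worst = math.dist(row, worst)
        total = d_best + d_worst
        scores.append(d_worst / total if total > 0 else 0.0)
    return scores

# Hypothetical index values per candidate cluster number: [PE, PC, MPC, FSI].
candidates = {
    2: [0.50, 0.70, 0.40, 0.55],
    3: [0.90, 0.50, 0.25, 0.40],
    4: [1.20, 0.35, 0.13, 0.30],
}
directions = [False, True, True, True]  # PE is minimized, the rest maximized
scores = topsis(list(candidates.values()), directions)
best_cl = max(zip(candidates, scores), key=lambda pair: pair[1])[0]
print(best_cl, [round(s, 3) for s in scores])
```

As a consistency check on the MPC definition assumed here, the reported PC of 0.607 with two clusters gives 1 - 2 × (1 - 0.607) = 0.214, which agrees with the reported MPC of 0.215 up to rounding.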
Citations
Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq
Itay Tirosh, Benjamin Izar, Daniel J. Treacy, John J. Trombetta, Asaf Rotem, Christopher Rodman, Christine G. Lian, George F. Murphy, Mohammad Fallahi-Sichani, Ken Dutton-Regester, Jia-Ren Lin, Ofir Cohen, Parin Shah, Diana Lu, Alexandra-Chloé Villani, Aleksandr Andreev, E.M. Van Allen, Monica M. Bertagnolli, Peter K. Sorger, Ryan J. Sullivan, Keith T. Flaherty, Dennie T. Frederick, Judit Jané-Valbuena, Orit Rozenblatt-Rosen, Sanjay M. Prakadan, Marc H. Wadsworth, Alex S. Genshaft, Travis K. Hughes, Carly G. K. Ziegler, Samuel W. Kazer, Alethe Gaillard de Saint Germain, Kellie E. Kolb, Cory M. Johannessen, Clifford H. Yoon, Alex K. Shalek, Aviv Regev, Levi A. Garraway
TL;DR: Tirosh et al. applied single-cell RNA sequencing (RNA-seq) to 4645 single cells isolated from 19 patients, profiling malignant, immune, stromal, and endothelial cells.
Revealing the vectors of cellular identity with single-cell genomics
TL;DR: Single-cell genomics has now made it possible to create a comprehensive atlas of human cells and has reopened definitions of a cell's identity and of the ways in which identity is regulated by the cell's molecular circuitry.
Journal Article
Dimension Reduction and Clustering Models for Single-Cell RNA Sequencing Data: A Comparative Study.
TL;DR: A comprehensive review and evaluation of four classical dimension reduction methods and five clustering models showed that the feature selection method contributed positively to high-dimensional and sparse scRNA-seq data and that feature-extraction methods were able to improve clustering performance, although not in every case.
Journal Article
A Comparative Analysis of Single-Cell Transcriptome Identifies Reprogramming Driver Factors for Efficiency Improvement
TL;DR: These studies found that pathways for autophagy, endocytosis, and apoptosis were incompletely activated in nuclear transfer (NT) 2-cell arrest embryos, whereas extensively inhibited pathways for stem cell pluripotency maintenance, DNA repair, cell cycle, and autophagy may result in the arrest of NT 4-cell embryos.
Proceedings Article
Combinatorial Auction-Based Fog Service Allocation Mechanism for IoT Applications
TL;DR: This article proposes two types of truthful mechanisms for resource allocation and pricing in fog computing, and finds that the combinatorial auction-based mechanism can allocate resources more efficiently and generate higher income for the fog providers.
References
Journal Article
Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments
TL;DR: The hierarchical model of Lönnstedt and Speed (2002) is developed into a practical approach for general microarray experiments with arbitrary numbers of treatments and RNA samples, and the moderated t-statistic is shown to follow a t-distribution with augmented degrees of freedom.
Book
Multiple Attribute Decision Making: Methods and Applications
TL;DR: In this paper, the authors present a classification of MADM methods by data type and propose a ranking method based on the degree of similarity of the MADM method to the original MADM algorithm.
Journal Article
Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets
Evan Z. Macosko, Anindita Basu, Rahul Satija, James Nemesh, Karthik Shekhar, Melissa Goldman, Itay Tirosh, Allison R. Bialas, Nolan Kamitaki, Emily M. Martersteck, John J. Trombetta, David A. Weitz, Joshua R. Sanes, Alex K. Shalek, Aviv Regev, Steven A. McCarroll
TL;DR: Drop-seq profiles individual cells by separating them into nanoliter-sized aqueous droplets, associating a different barcode with each cell's RNAs, and sequencing them all together; it will accelerate biological discovery by enabling routine transcriptional profiling at single-cell resolution.
Journal Article
FCM: The fuzzy c-means clustering algorithm
TL;DR: A FORTRAN-IV coding of the fuzzy c-means (FCM) clustering program is transmitted, which generates fuzzy partitions and prototypes for any set of numerical data.
Journal Article
Massively parallel digital transcriptional profiling of single cells
Grace X.Y. Zheng, Jessica M. Terry, Phillip Belgrader, Paul Ryvkin, Zachary Bent, Ryan Wilson, Solongo B. Ziraldo, Tobias Daniel Wheeler, Geoffrey P. McDermott, Junjie Zhu, Mark T. Gregory, Joe Shuga, Luz Montesclaros, Jason G. Underwood, Donald A. Masquelier, Stefanie Y. Nishimura, Michael Schnall-Levin, Paul Wyatt, Christopher Hindson, Rajiv Bharadwaj, Alexander Wong, Kevin D. Ness, Lan Beppu, H. Joachim Deeg, Christopher McFarland, Keith R. Loeb, William J. Valente, Nolan G. Ericson, Emily A. Stevens, Jerald P. Radich, Tarjei S. Mikkelsen, Benjamin J. Hindson, Jason H. Bielas
TL;DR: A droplet-based system that enables 3′ mRNA counting of tens of thousands of single cells per sample is described and sequence variation in the transcriptome data is used to determine host and donor chimerism at single-cell resolution from bone marrow mononuclear cells isolated from transplant patients.