scispace - formally typeset
Open AccessJournal ArticleDOI

Model-based deep embedding for constrained clustering analysis of single cell RNA-seq data.

Reads0
Chats0
TLDR
Wang et al. as discussed by the authors proposed a principled clustering method named scDCC, which integrates domain knowledge into the clustering step to facilitate the biological interpretability of clusters, and subsequent cell type identification.
Abstract
Clustering is a critical step in single cell-based studies. Most existing methods support unsupervised clustering without the a priori exploitation of any domain knowledge. When confronted by the high dimensionality and pervasive dropout events of scRNA-Seq data, purely unsupervised clustering methods may not produce biologically interpretable clusters, which complicates cell type assignment. In such cases, the only recourse is for the user to manually and repeatedly tweak clustering parameters until acceptable clusters are found. Consequently, the path to obtaining biologically meaningful clusters can be ad hoc and laborious. Here we report a principled clustering method named scDCC, that integrates domain knowledge into the clustering step. Experiments on various scRNA-seq datasets from thousands to tens of thousands of cells show that scDCC can significantly improve clustering performance, facilitating the interpretability of clusters and downstream analyses, such as cell type assignment. Clustering cells based on similarities in gene expression is the first step towards identifying cell types in scRNASeq data. Here the authors incorporate biological knowledge into the clustering step to facilitate the biological interpretability of clusters, and subsequent cell type identification.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Self-supervised contrastive learning for integrative single cell RNA-seq data analysis

TL;DR: In this article , a self-supervised Contrastive LEArning framework for single-cell ribonucleic acid (RNA)-sequencing data representation and downstream analysis is presented.
Journal ArticleDOI

Clustering of single-cell multi-omics data with a multimodal deep learning method

TL;DR: Wang et al. as discussed by the authors developed a multimodal deep learning method, scMDC, for single-cell multi-omics data clustering analysis, which is an end-to-end deep model that explicitly characterizes different data sources and jointly learns latent features of deep embedding.
Journal ArticleDOI

scCNC: a method based on capsule network for clustering scRNA-seq data

TL;DR: Experiments show that scCNC can significantly improve clustering performance and facilitate downstream analyses, and also propose a Semi-supervised Greedy Iterative Training (SGIT) method used to train the whole network.
Journal ArticleDOI

UICPC: Centrality-based clustering for scRNA-seq data analysis without user input.

TL;DR: A centrality-clustering method named UICPC is proposed and its performance is compared with 9 state-of-the-art clustering methods on 11 real-world scRNA-seq datasets to demonstrate its effectiveness and usefulness in discovering cell groups.
Posted ContentDOI

Self-supervised contrastive learning for integrative single cell RNA-seq data analysis

TL;DR: Wang et al. as mentioned in this paper presented a self-supervised Contrastive LEArning framework for scRNA-seq (CLEAR) profile representation and downstream analysis, which overcomes the heterogeneity of the experimental data with a specifically designed representation learning task.
References
More filters
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
Journal ArticleDOI

Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2

TL;DR: This work presents DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates, which enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression.
Journal Article

Visualizing Data using t-SNE

TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.
Journal ArticleDOI

Reducing the Dimensionality of Data with Neural Networks

TL;DR: In this article, an effective way of initializing the weights that allows deep autoencoder networks to learn low-dimensional codes that work much better than principal components analysis as a tool to reduce the dimensionality of data is described.
Proceedings Article

Rectified Linear Units Improve Restricted Boltzmann Machines

TL;DR: Restricted Boltzmann machines were developed using binary stochastic hidden units that learn features that are better for object recognition on the NORB dataset and face verification on the Labeled Faces in the Wild dataset.
Related Papers (5)