scispace - formally typeset
Open AccessJournal ArticleDOI

f-scLVM: scalable and versatile factor analysis for single-cell RNA-seq.

TLDR
In this article, a method based on factor analysis that uses pathway annotations to guide the inference of interpretable factors underpinning the heterogeneity in gene expression in large cell populations is proposed.
Abstract
Single-cell RNA-sequencing (scRNA-seq) allows studying heterogeneity in gene expression in large cell populations. Such heterogeneity can arise due to technical or biological factors, making decomposing sources of variation difficult. We here describe f-scLVM (factorial single-cell latent variable model), a method based on factor analysis that uses pathway annotations to guide the inference of interpretable factors underpinning the heterogeneity. Our model jointly estimates the relevance of individual factors, refines gene set annotations, and infers factors without annotation. In applications to multiple scRNA-seq datasets, we find that f-scLVM robustly decomposes scRNA-seq datasets into interpretable components, thereby facilitating the identification of novel subpopulations.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

SCANPY: large-scale single-cell gene expression data analysis

TL;DR: This work presents Scanpy, a scalable toolkit for analyzing single-cell gene expression data that includes methods for preprocessing, visualization, clustering, pseudotime and trajectory inference, differential expression testing, and simulation of gene regulatory networks, and AnnData, a generic class for handling annotated data matrices.
Journal ArticleDOI

Current best practices in single-cell RNA-seq analysis: a tutorial.

TL;DR: The steps of a typical single‐cell RNA‐seq analysis, including pre‐processing (quality control, normalization, data correction, feature selection, and dimensionality reduction) and cell‐ and gene‐level downstream analysis, are detailed.
Journal ArticleDOI

Eleven grand challenges in single-cell data science

David Lähnemann, +71 more
- 07 Feb 2020 - 
TL;DR: This compendium is for established researchers, newcomers, and students alike, highlighting interesting and rewarding problems for the coming years in single-cell data science.
Journal ArticleDOI

Multi-Omics Factor Analysis—a framework for unsupervised integration of multi-omics data sets

TL;DR: Multi‐Omics Factor Analysis (MOFA) infers a set of (hidden) factors that capture biological and technical sources of variability that disentangles axes of heterogeneity that are shared across multiple modalities and those specific to individual data modalities.
References
More filters
Journal ArticleDOI

Adjusting batch effects in microarray expression data using empirical Bayes methods

TL;DR: This paper proposed parametric and non-parametric empirical Bayes frameworks for adjusting data for batch effects that is robust to outliers in small sample sizes and performs comparable to existing methods for large samples.
Journal ArticleDOI

An introduction to variational methods for graphical models

TL;DR: This paper presents a tutorial introduction to the use of variational methods for inference and learning in graphical models (Bayesian networks and Markov random fields), and describes a general framework for generating variational transformations based on convex duality.
Related Papers (5)