scispace - formally typeset
Open AccessJournal ArticleDOI

SCENIC: single-cell regulatory network inference and clustering.

TLDR
On a compendium of single-cell data from tumors and brain, it is demonstrated that cis-regulatory analysis can be exploited to guide the identification of transcription factors and cell states.
Abstract
We present SCENIC, a computational method for simultaneous gene regulatory network reconstruction and cell-state identification from single-cell RNA-seq data (http://scenicaertslaborg) On a compendium of single-cell data from tumors and brain, we demonstrate that cis-regulatory analysis can be exploited to guide the identification of transcription factors and cell states SCENIC provides critical biological insights into the mechanisms driving cellular heterogeneity

read more

Content maybe subject to copyright    Report

Figures
Citations
More filters
Journal ArticleDOI

Dimensionality reduction for visualizing single-cell data using UMAP.

TL;DR: Comparing the performance of UMAP with five other tools, it is found that UMAP provides the fastest run times, highest reproducibility and the most meaningful organization of cell clusters.
Journal ArticleDOI

Single-cell landscape of bronchoalveolar immune cells in patients with COVID-19.

TL;DR: Single-cell transcriptome and T cell receptor analysis of bronchoalveolar lavage fluid suggests enrichment of proinflammatory macrophages in patients with severe COVID-19 and the presence of clonally expanded CD8 + T cells in Patients with moderate CO VID-19.
Journal ArticleDOI

Current best practices in single-cell RNA-seq analysis: a tutorial.

TL;DR: The steps of a typical single‐cell RNA‐seq analysis, including pre‐processing (quality control, normalization, data correction, feature selection, and dimensionality reduction) and cell‐ and gene‐level downstream analysis, are detailed.
Journal ArticleDOI

Severe COVID-19 Is Marked by a Dysregulated Myeloid Cell Compartment.

Jonas Schulte-Schrepping, +137 more
- 17 Sep 2020 - 
TL;DR: This study provides detailed insights into the systemic immune response to SARS-CoV-2 infection and it reveals profound alterations in the myeloid cell compartment associated with severe COVID-19.
Journal ArticleDOI

Phenotype molding of stromal cells in the lung tumor microenvironment.

TL;DR: A comprehensive single-cell analysis of the lung cancer microenvironment reveals marked heterogeneity of transcriptional networks that defines novel clinically relevant stromal cell populations.
References
More filters
Journal ArticleDOI

Gene Ontology: tool for the unification of biology

TL;DR: The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing.
Journal ArticleDOI

Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.

TL;DR: By following this protocol, investigators are able to gain an in-depth understanding of the biological themes in lists of genes that are enriched in genome-scale studies.
Journal ArticleDOI

limma powers differential expression analyses for RNA-sequencing and microarray studies

TL;DR: The philosophy and design of the limma package is reviewed, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.
Journal ArticleDOI

MapReduce: simplified data processing on large clusters

TL;DR: This paper presents the implementation of MapReduce, a programming model and an associated implementation for processing and generating large data sets that runs on a large cluster of commodity machines and is highly scalable.
Journal ArticleDOI

Greedy function approximation: A gradient boosting machine.

TL;DR: A general gradient descent boosting paradigm is developed for additive expansions based on any fitting criterion, and specific algorithms are presented for least-squares, least absolute deviation, and Huber-M loss functions for regression, and multiclass logistic likelihood for classification.
Related Papers (5)
Frequently Asked Questions (12)
Q1. What have the authors contributed in "Scenic: single-cell regulatory network inference and clustering" ?

Here the authors describe a computational resource, called SCENIC ( Single Cell rEgulatory Network Inference and Clustering ), for the simultaneous reconstruction of gene regulatory networks ( GRNs ) and the identification of stable cell states, using single-cell RNA-seq data. Importantly, the authors show that cell state identification based on GRNs is robust towards batch-effects and technical-biases. The authors applied SCENIC to a compendium of single-cell data from the mouse and human brain and demonstrate that the proper combinations of transcription factors, target genes, enhancers, and cell types can be identified. The authors further validated these predictions by showing that two transcription factors are predominantly expressed in early metastatic sentinel lymph nodes. As scalable alternative to GENIE3, the authors also provide GRNboost, paving the way towards the network analysis across millions of single cells. Not peer-reviewed ) is the author/funder. 

The apparent differences between the tumors at single-cell level may be due to differences in copy number profiles, which are unique for each tumor and can have a strong impact on the gene expression profile 55,57,75. 

The databases used for the analyses presented in this paper are the "18k motif collection" from iRegulon (genebased motif rankings) for human and mouse. 

GiniClust 20 was run on the unlogged TPM matrix with the default parameters, which resulted in a matrix with 17843 genes and one single cluster. 

methods that exploit co-expression or networks for the analysis of single-cell RNA-seq data such as “network synthesis toolkit” 24, Pina’s approach 25, PAGODA 13, and SINCERA 26 have tentatively been developed. 

The authors removed these non-tumoral cells from the expression matrix using hierarchical clustering based on the markers cited in the article (mature oligodendrocytes and microglia, respectively). 

To test SCENIC performances the authors applied it to a scRNA-seq data set with well-known cell types from the adult mouse brain previously described in Zeisel et al. 

SCENIC identified an "interneuron-like" and a "excitatory neuron-like" subpopulation within the fetal quiescent cells in the human data set, expressing DLX1,2,5 and MAF, and NEUROD1. 

In conclusion, SCENIC competes with the best clustering methods to discovering cell types and correctly assigning cells to each cell type; but SCENIC goes beyond existing methods by reducing data dimensionality using TF regulons rather than principal components, thereby accounting for noise and removing technical biases, and uncovering master regulators and gene regulatory networks for each cell type. 

note that the first network-inference step is based on co-expression, and some authors recommend avoiding within sample normalizations (i.e. TPM) for this task because they may induce artificial co-variation 82. 

There are still limitations to using transcription factor motifs to filter and prune co-expression modules, the most obvious being that not for all transcription factors motifs are available, that some factors have motifs with higher information content than others, and that not all transcription factors are co-expressed with their target genes. 

This data set has been used extensively for benchmarking purposes 13,14,20,27–31 and contains the main cell types in hippocampus and somatosensory cortex, namely neurons (pyramidal excitatory neurons, and interneurons), glia (astrocytes, oligodendrocytes, microglia), and endothelial cells.