scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Single-cell and spatial transcriptomics enables probabilistic inference of cell type topography.

TL;DR: A probabilistic framework is presented that integrates single-cell and bulk spatial transcriptomics in order to spatially map cell types onto their respective tissues and applies to the developing human heart and mouse brain to demonstrate the power of the technique.
Abstract: The field of spatial transcriptomics is rapidly expanding, and with it the repertoire of available technologies. However, several of the transcriptome-wide spatial assays do not operate on a single cell level, but rather produce data comprised of contributions from a – potentially heterogeneous – mixture of cells. Still, these techniques are attractive to use when examining complex tissue specimens with diverse cell populations, where complete expression profiles are required to properly capture their richness. Motivated by an interest to put gene expression into context and delineate the spatial arrangement of cell types within a tissue, we here present a model-based probabilistic method that uses single cell data to deconvolve the cell mixtures in spatial data. To illustrate the capacity of our method, we use data from different experimental platforms and spatially map cell types from the mouse brain and developmental heart, which arrange as expected. Alma Andersson et al. present a probabilistic framework that integrates single-cell and bulk spatial transcriptomics in order to spatially map cell types onto their respective tissues. They apply their method to the developing human heart and mouse brain to demonstrate the power of the technique.

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI
11 Aug 2021-Nature
TL;DR: Spatial transcriptomics can also be used for hypothesis testing using experimental designs that compare time points or conditions, including genetic or environmental perturbations as mentioned in this paper, and is naturally amenable to integration with other data modalities, providing an expandable framework for insight into tissue organization.
Abstract: Deciphering the principles and mechanisms by which gene activity orchestrates complex cellular arrangements in multicellular organisms has far-reaching implications for research in the life sciences. Recent technological advances in next-generation sequencing- and imaging-based approaches have established the power of spatial transcriptomics to measure expression levels of all or most genes systematically throughout tissue space, and have been adopted to generate biological insights in neuroscience, development and plant biology as well as to investigate a range of disease contexts, including cancer. Similar to datasets made possible by genomic sequencing and population health surveys, the large-scale atlases generated by this technology lend themselves to exploratory data analysis for hypothesis generation. Here we review spatial transcriptomic technologies and describe the repertoire of operations available for paths of analysis of the resulting data. Spatial transcriptomics can also be deployed for hypothesis testing using experimental designs that compare time points or conditions—including genetic or environmental perturbations. Finally, spatial transcriptomic data are naturally amenable to integration with other data modalities, providing an expandable framework for insight into tissue organization. This review describes the state of spatial transcriptomics technologies and analysis tools that are being used to generate biological insights in diverse areas of biology.

358 citations

Journal ArticleDOI
TL;DR: In this paper, a single-cell and spatially resolved transcriptomics analysis of human breast cancers is presented, which reveals recurrent neoplastic cell heterogeneity and heterotypic interactions play central roles in disease progression.
Abstract: Breast cancers are complex cellular ecosystems where heterotypic interactions play central roles in disease progression and response to therapy. However, our knowledge of their cellular composition and organization is limited. Here we present a single-cell and spatially resolved transcriptomics analysis of human breast cancers. We developed a single-cell method of intrinsic subtype classification (SCSubtype) to reveal recurrent neoplastic cell heterogeneity. Immunophenotyping using cellular indexing of transcriptomes and epitopes by sequencing (CITE-seq) provides high-resolution immune profiles, including new PD-L1/PD-L2+ macrophage populations associated with clinical outcome. Mesenchymal cells displayed diverse functions and cell-surface protein expression through differentiation within three major lineages. Stromal-immune niches were spatially organized in tumors, offering insights into antitumor immune regulation. Using single-cell signatures, we deconvoluted large breast cancer cohorts to stratify them into nine clusters, termed 'ecotypes', with unique cellular compositions and clinical outcomes. This study provides a comprehensive transcriptional atlas of the cellular architecture of breast cancer.

303 citations

Journal ArticleDOI
TL;DR: In this paper, a suite of recently developed techniques that localize RNA within tissue, including multiplexed in situ hybridization and in situ sequencing (here defined as high-plex RNA imaging) and spatial barcoding, can help address this issue.
Abstract: Single-cell RNA sequencing (scRNA-seq) identifies cell subpopulations within tissue but does not capture their spatial distribution nor reveal local networks of intercellular communication acting in situ. A suite of recently developed techniques that localize RNA within tissue, including multiplexed in situ hybridization and in situ sequencing (here defined as high-plex RNA imaging) and spatial barcoding, can help address this issue. However, no method currently provides as complete a scope of the transcriptome as does scRNA-seq, underscoring the need for approaches to integrate single-cell and spatial data. Here, we review efforts to integrate scRNA-seq with spatial transcriptomics, including emerging integrative computational methods, and propose ways to effectively combine current methodologies.

288 citations

Journal ArticleDOI
TL;DR: BayesSpace is introduced, a fully Bayesian statistical method that uses the information from spatial neighborhoods for resolution enhancement of spatial transcriptomic data and for clustering analysis and shows that it improves identification of distinct intra-tissue transcriptional profiles from samples of the brain, melanoma, invasive ductal carcinoma and ovarian adenocarcinoma.
Abstract: Recent spatial gene expression technologies enable comprehensive measurement of transcriptomic profiles while retaining spatial context. However, existing analysis methods do not address the limited resolution of the technology or use the spatial information efficiently. Here, we introduce BayesSpace, a fully Bayesian statistical method that uses the information from spatial neighborhoods for resolution enhancement of spatial transcriptomic data and for clustering analysis. We benchmark BayesSpace against current methods for spatial and non-spatial clustering and show that it improves identification of distinct intra-tissue transcriptional profiles from samples of the brain, melanoma, invasive ductal carcinoma and ovarian adenocarcinoma. Using immunohistochemistry and an in silico dataset constructed from scRNA-seq data, we show that BayesSpace resolves tissue structure that is not detectable at the original resolution and identifies transcriptional heterogeneity inaccessible to histological analysis. Our results illustrate BayesSpace's utility in facilitating the discovery of biological insights from spatial transcriptomic datasets.

226 citations

References
More filters
Journal ArticleDOI
TL;DR: This work presents DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates, which enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression.
Abstract: In comparative high-throughput sequencing assays, a fundamental task is the analysis of count data, such as read counts per gene in RNA-seq, for evidence of systematic changes across experimental conditions. Small replicate numbers, discreteness, large dynamic range and the presence of outliers require a suitable statistical approach. We present DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates. This enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression. The DESeq2 package is available at http://www.bioconductor.org/packages/release/bioc/html/DESeq2.html .

47,038 citations

Journal ArticleDOI
TL;DR: EdgeR as mentioned in this paper is a Bioconductor software package for examining differential expression of replicated count data, which uses an overdispersed Poisson model to account for both biological and technical variability and empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference.
Abstract: Summary: It is expected that emerging digital gene expression (DGE) technologies will overtake microarray technologies in the near future for many functional genomics applications. One of the fundamental data analysis tasks, especially for gene expression studies, involves determining whether there is evidence that counts for a transcript or exon are significantly different across experimental conditions. edgeR is a Bioconductor software package for examining differential expression of replicated count data. An overdispersed Poisson model is used to account for both biological and technical variability. Empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference. The methodology can be used even with the most minimal levels of replication, provided at least one phenotype or experimental condition is replicated. The software may have other applications beyond sequencing data, such as proteome peptide count data. Availability: The package is freely available under the LGPL licence from the Bioconductor web site (http://bioconductor.org).

29,413 citations

Proceedings Article
01 Jan 2019
TL;DR: This paper details the principles that drove the implementation of PyTorch and how they are reflected in its architecture, and explains how the careful and pragmatic implementation of the key components of its runtime enables them to work together to achieve compelling performance.
Abstract: Deep learning frameworks have often focused on either usability or speed, but not both. PyTorch is a machine learning library that shows that these two goals are in fact compatible: it was designed from first principles to support an imperative and Pythonic programming style that supports code as a model, makes debugging easy and is consistent with other popular scientific computing libraries, while remaining efficient and supporting hardware accelerators such as GPUs. In this paper, we detail the principles that drove the implementation of PyTorch and how they are reflected in its architecture. We emphasize that every aspect of PyTorch is a regular Python program under the full control of its user. We also explain how the careful and pragmatic implementation of the key components of its runtime enables them to work together to achieve compelling performance. We demonstrate the efficiency of individual subsystems, as well as the overall speed of PyTorch on several commonly used benchmarks.

10,045 citations

Journal ArticleDOI
Ed S. Lein1, Michael Hawrylycz1, Nancy Ao2, Mikael Ayres1, Amy Bensinger1, Amy Bernard1, Andrew F. Boe1, Mark S. Boguski1, Mark S. Boguski3, Kevin S. Brockway1, Emi J. Byrnes1, Lin Chen1, Li Chen2, Tsuey-Ming Chen2, Mei Chi Chin1, Jimmy Chong1, Brian E. Crook1, Aneta Czaplinska2, Chinh Dang1, Suvro Datta1, Nick Dee1, Aimee L. Desaki1, Tsega Desta1, Ellen Diep1, Tim A. Dolbeare1, Matthew J. Donelan1, Hong-Wei Dong1, Jennifer G. Dougherty1, Ben J. Duncan1, Amanda Ebbert1, Gregor Eichele4, Lili K. Estin1, Casey Faber1, Benjamin A.C. Facer1, Rick Fields2, Shanna R. Fischer1, Tim P. Fliss1, Cliff Frensley1, Sabrina N. Gates1, Katie J. Glattfelder1, Kevin R. Halverson1, Matthew R. Hart1, John G. Hohmann1, Maureen P. Howell1, Darren P. Jeung1, Rebecca A. Johnson1, Patrick T. Karr1, Reena Kawal1, Jolene Kidney1, Rachel H. Knapik1, Chihchau L. Kuan1, James H. Lake1, Annabel R. Laramee1, Kirk D. Larsen1, Christopher Lau1, Tracy Lemon1, Agnes J. Liang2, Ying Liu2, Lon T. Luong1, Jesse Michaels1, Judith J. Morgan1, Rebecca J. Morgan1, Marty Mortrud1, Nerick Mosqueda1, Lydia Ng1, Randy Ng1, Geralyn J. Orta1, Caroline C. Overly1, Tu H. Pak1, Sheana Parry1, Sayan Dev Pathak1, Owen C. Pearson1, Ralph B. Puchalski1, Zackery L. Riley1, Hannah R. Rockett1, Stephen A. Rowland1, Joshua J. Royall1, Marcos J. Ruiz2, Nadia R. Sarno1, Katherine Schaffnit1, Nadiya V. Shapovalova1, Taz Sivisay1, Clifford R. Slaughterbeck1, Simon Smith1, Kimberly A. Smith1, Bryan I. Smith1, Andy J. Sodt1, Nick N. Stewart1, Kenda-Ruth Stumpf1, Susan M. Sunkin1, Madhavi Sutram1, Angelene Tam2, Carey D. Teemer1, Christina Thaller2, Carol L. Thompson1, Lee R. Varnam1, Axel Visel4, Axel Visel5, Ray M. Whitlock1, Paul Wohnoutka1, Crissa K. Wolkey1, Victoria Y. Wong1, Matthew J.A. Wood2, Murat B. Yaylaoglu2, Rob Young1, Brian L. Youngstrom1, Xu Feng Yuan1, Bin Zhang2, Theresa A. Zwingman1, Allan R. Jones1 
11 Jan 2007-Nature
TL;DR: An anatomically comprehensive digital atlas containing the expression patterns of ∼20,000 genes in the adult mouse brain is described, providing an open, primary data resource for a wide variety of further studies concerning brain organization and function.
Abstract: Molecular approaches to understanding the functional circuitry of the nervous system promise new insights into the relationship between genes, brain and behaviour. The cellular diversity of the brain necessitates a cellular resolution approach towards understanding the functional genomics of the nervous system. We describe here an anatomically comprehensive digital atlas containing the expression patterns of approximately 20,000 genes in the adult mouse brain. Data were generated using automated high-throughput procedures for in situ hybridization and data acquisition, and are publicly accessible online. Newly developed image-based informatics tools allow global genome-scale structural analysis and cross-correlation, as well as identification of regionally enriched genes. Unbiased fine-resolution analysis has identified highly specific cellular markers as well as extensive evidence of cellular heterogeneity not evident in classical neuroanatomical atlases. This highly standardized atlas provides an open, primary data resource for a wide variety of further studies concerning brain organization and function.

4,944 citations

Journal ArticleDOI
TL;DR: This work presents xCell, a novel gene signature-based method, and uses it to infer 64 immune and stromal cell types and shows that xCell outperforms other methods.
Abstract: Tissues are complex milieus consisting of numerous cell types. Several recent methods have attempted to enumerate cell subsets from transcriptomes. However, the available methods have used limited sources for training and give only a partial portrayal of the full cellular landscape. Here we present xCell, a novel gene signature-based method, and use it to infer 64 immune and stromal cell types. We harmonized 1822 pure human cell type transcriptomes from various sources and employed a curve fitting approach for linear comparison of cell types and introduced a novel spillover compensation technique for separating them. Using extensive in silico analyses and comparison to cytometry immunophenotyping, we show that xCell outperforms other methods. xCell is available at http://xCell.ucsf.edu/ .

2,040 citations

Related Papers (5)