scispace - formally typeset
Search or ask a question
Posted Content•DOI•

Direct Comparative Analysis of 10X Genomics Chromium and Smart-seq2

Xiliang Wang1, Yao He1, Qiming Zhang1, Xianwen Ren1, Zemin Zhang1 •
22 Apr 2019-bioRxiv (Cold Spring Harbor Laboratory)-pp 615013
TL;DR: A comprehensive benchmark analysis offers the basis for selecting the optimal scRNA-seq strategy based on the objectives of each study, and detects different sets of differentially expressed genes between cell clusters, indicating the complementary nature of these technologies.
Abstract: Single cell RNA sequencing (scRNA-seq) is widely used for profiling transcriptomes of individual cells. The droplet-based 10X Genomics Chromium (10X) approach and the plate-based Smart-seq2 full-length method are two frequently-used scRNA-seq platforms, yet there are only a few thorough and systematic comparisons of their advantages and limitations. Here, by directly comparing the scRNA-seq data by the two platforms from the same samples of CD45-cells, we systematically evaluated their features using a wide spectrum of analysis. Smart-seq2 detected more genes in a cell, especially low abundance transcripts as well as alternatively spliced transcripts, but captured higher proportion of mitochondrial genes. The composite of Smart-seq2 data also resembled bulk RNA-seq data better. For 10X-based data, we observed higher noise for mRNA in the low expression level. Despite the poly(A) enrichment, approximately 10-30% of all detected transcripts by both platforms were from non-coding genes, with lncRNA accounting for a higher proportion in 10X. 10X-based data displayed more severe dropout problem, especially for genes with lower expression levels. However, 10X-data can better detect rare cell types given its ability to cover a large number of cells. In addition, each platform detected different sets of differentially expressed genes between cell clusters, indicating the complementary nature of these technologies. Our comprehensive benchmark analysis offers the basis for selecting the optimal scRNA-seq strategy based on the objectives of each study.
Citations
More filters
01 Nov 2016
TL;DR: Single-cell genomics has now made it possible to create a comprehensive atlas of human cells and has reopened definitions of a cell's identity and of the ways in which identity is regulated by the cell's molecular circuitry.
Abstract: Single-cell genomics has now made it possible to create a comprehensive atlas of human cells. At the same time, it has reopened definitions of a cell's identity and of the ways in which identity is regulated by the cell's molecular circuitry. Emerging computational analysis methods, especially in single-cell RNA sequencing (scRNA-seq), have already begun to reveal, in a data-driven way, the diverse simultaneous facets of a cell's identity, from discrete cell types to continuous dynamic transitions and spatial locations. These developments will eventually allow a cell to be represented as a superposition of 'basis vectors', each determining a different (but possibly dependent) aspect of cellular organization and function. However, computational methods must also overcome considerable challenges-from handling technical noise and data scale to forming new abstractions of biology. As the scale of single-cell experiments continues to increase, new computational approaches will be essential for constructing and characterizing a reference map of cell identities.

372 citations

Journal Article•DOI•
TL;DR: The rapid production (and analysis) of droplets allows for exceptionally high-throughput experimentation and data acquisition, and configurable channel designs, coupled with on-demand control architectures, engender a range of robust manipulations.
Abstract: Microfluidic platforms have changed the paradigm of biochemical experimentation over the past three decades. Prominent within this technology set are droplet-based microfluidic systems, in which passive microfluidic structures are used to rapidly generate and manipulate sub-nanoliter volumes droplets within microchannel environments. Droplets are formed in a continuous and robust fashion through the extrusion and shearing of two mutually immiscible phases in a microchannel, with droplet volumes being precisely controlled through the variation of flow rate ratios and channel dimensions. The rapid production (and analysis) of droplets allows for exceptionally high-throughput experimentation and data acquisition, and configurable channel designs, coupled with on-demand control architectures, engender a range of robust manipulations, such as reagent dosing, droplet fusion, droplet splitting, washing, payload heating, incubation, content dilution and droplet sorting. Accordingly, and unsurprisingly, droplet-based microfluidic systems have become an indispensable and embedded tool within contemporary chemical and biological science.

161 citations

Journal Article•DOI•
Fenglin Liu1, Yuanyuan Zhang1, Lei Zhang1, Ziyi Li1, Qiao Fang1, Ranran Gao1, Zemin Zhang1 •
TL;DR: This study provides the first benchmarking to evaluate the performances of different SNV detection tools for scRNA-seq data and recommends SAMtools, Strelka2, FreeBayes, or CTAT, depending on the specific conditions of usage.
Abstract: Systematic interrogation of single-nucleotide variants (SNVs) is one of the most promising approaches to delineate the cellular heterogeneity and phylogenetic relationships at the single-cell level. While SNV detection from abundant single-cell RNA sequencing (scRNA-seq) data is applicable and cost-effective in identifying expressed variants, inferring sub-clones, and deciphering genotype-phenotype linkages, there is a lack of computational methods specifically developed for SNV calling in scRNA-seq. Although variant callers for bulk RNA-seq have been sporadically used in scRNA-seq, the performances of different tools have not been assessed. Here, we perform a systematic comparison of seven tools including SAMtools, the GATK pipeline, CTAT, FreeBayes, MuTect2, Strelka2, and VarScan2, using both simulation and scRNA-seq datasets, and identify multiple elements influencing their performance. While the specificities are generally high, with sensitivities exceeding 90% for most tools when calling homozygous SNVs in high-confident coding regions with sufficient read depths, such sensitivities dramatically decrease when calling SNVs with low read depths, low variant allele frequencies, or in specific genomic contexts. SAMtools shows the highest sensitivity in most cases especially with low supporting reads, despite the relatively low specificity in introns or high-identity regions. Strelka2 shows consistently good performance when sufficient supporting reads are provided, while FreeBayes shows good performance in the cases of high variant allele frequencies. We recommend SAMtools, Strelka2, FreeBayes, or CTAT, depending on the specific conditions of usage. Our study provides the first benchmarking to evaluate the performances of different SNV detection tools for scRNA-seq data.

76 citations

Posted Content•DOI•
01 Sep 2020-bioRxiv
TL;DR: Comparative high resolution analysis of prostate cancer bone metastasis shows a targeted approach for relieving local immunosuppression for therapeutic effect.
Abstract: Bone metastases are devastating complications of cancer. They are particularly common in prostate cancer, represent incurable disease and are refractory to immunotherapy. We sought to define distinct features of the bone marrow microenvironment by analyzing single cells from prostate cancer patients’ involved bone, uninvolved bone and distant bone sites as well as bone from cancer-free, orthopedic patients and healthy individuals. Metastatic prostate cancer was associated with multifaceted immune distortion, specifically exhaustion of distinct T cell subsets, appearance of macrophages with states specific to prostate cancer bone metastases. The chemokine CCL20 was notably overexpressed by myeloid cells, as was its cognate CCR6 receptor on T cells. Disruption of the CCL20-CCR6 axis in mice with syngeneic prostate bone metastases restored T cell reactivity and significantly prolonged animal survival. Comparative high resolution analysis of prostate cancer bone metastasis shows a targeted approach for relieving local immunosuppression for therapeutic effect.

56 citations


Cites methods from "Direct Comparative Analysis of 10X ..."

  • ...The 10x Chromium V2 protocol utilized captures a relatively modest percentage of mRNA molecules in each cell (Ding et al., 2020; Mereu, 2020; Wang, 2020)....

    [...]

Journal Article•DOI•
TL;DR: S spatial molecular imaging is described, a system that measures RNAs and proteins in intact biological samples at subcellular resolution by performing multiple cycles of nucleic acid hybridization of fluorescent molecular barcodes, and has high sensitivity and very low error rate.

52 citations

References
More filters
Journal Article•DOI•
TL;DR: An R package, clusterProfiler that automates the process of biological-term classification and the enrichment analysis of gene clusters and can be easily extended to other species and ontologies is presented.
Abstract: Increasing quantitative data generated from transcriptomics and proteomics require integrative strategies for analysis Here, we present an R package, clusterProfiler that automates the process of biological-term classification and the enrichment analysis of gene clusters The analysis module and visualization module were combined into a reusable workflow Currently, clusterProfiler supports three species, including humans, mice, and yeast Methods provided in this package can be easily extended to other species and ontologies The clusterProfiler package is released under Artistic-20 License within Bioconductor project The source code and vignette are freely available at http://bioconductororg/packages/release/bioc/html/clusterProfilerhtml

16,644 citations

Journal Article•DOI•
21 May 2015-Cell
TL;DR: Drop-seq will accelerate biological discovery by enabling routine transcriptional profiling at single-cell resolution by separating them into nanoliter-sized aqueous droplets, associating a different barcode with each cell's RNAs, and sequencing them all together.

5,506 citations

Journal Article•DOI•
TL;DR: Seurat is a computational strategy to infer cellular localization by integrating single-cell RNA-seq data with in situ RNA patterns, and correctly localizes rare subpopulations, accurately mapping both spatially restricted and scattered groups.
Abstract: Spatial localization is a key determinant of cellular fate and behavior, but methods for spatially resolved, transcriptome-wide gene expression profiling across complex tissues are lacking. RNA staining methods assay only a small number of transcripts, whereas single-cell RNA-seq, which measures global gene expression, separates cells from their native spatial context. Here we present Seurat, a computational strategy to infer cellular localization by integrating single-cell RNA-seq data with in situ RNA patterns. We applied Seurat to spatially map 851 single cells from dissociated zebrafish (Danio rerio) embryos and generated a transcriptome-wide map of spatial patterning. We confirmed Seurat's accuracy using several experimental approaches, then used the strategy to identify a set of archetypal expression patterns and spatial markers. Seurat correctly localizes rare subpopulations, accurately mapping both spatially restricted and scattered groups. Seurat will be applicable to mapping cellular localization within complex patterned tissues in diverse systems.

3,465 citations

01 May 2015
TL;DR: Drop-seq as discussed by the authors analyzes mRNA transcripts from thousands of individual cells simultaneously while remembering transcripts' cell of origin, and identifies 39 transcriptionally distinct cell populations, creating a molecular atlas of gene expression for known retinal cell classes and novel candidate cell subtypes.
Abstract: Cells, the basic units of biological structure and function, vary broadly in type and state. Single-cell genomics can characterize cell identity and function, but limitations of ease and scale have prevented its broad application. Here we describe Drop-seq, a strategy for quickly profiling thousands of individual cells by separating them into nanoliter-sized aqueous droplets, associating a different barcode with each cell's RNAs, and sequencing them all together. Drop-seq analyzes mRNA transcripts from thousands of individual cells simultaneously while remembering transcripts' cell of origin. We analyzed transcriptomes from 44,808 mouse retinal cells and identified 39 transcriptionally distinct cell populations, creating a molecular atlas of gene expression for known retinal cell classes and novel candidate cell subtypes. Drop-seq will accelerate biological discovery by enabling routine transcriptional profiling at single-cell resolution. VIDEO ABSTRACT.

3,365 citations

Journal Article•DOI•
TL;DR: It is found that lincRNA expression is strikingly tissue-specific compared with coding genes, and that l incRNAs are typically coexpressed with their neighboring genes, albeit to an extent similar to that of pairs of neighboring protein-coding genes.
Abstract: Large intergenic noncoding RNAs (lincRNAs) are emerging as key regulators of diverse cellular processes. Determining the function of individual lincRNAs remains a challenge. Recent advances in RNA sequencing (RNA-seq) and computational methods allow for an unprecedented analysis of such transcripts. Here, we present an integrative approach to define a reference catalog of >8000 human lincRNAs. Our catalog unifies previously existing annotation sources with transcripts we assembled from RNA-seq data collected from ~4 billion RNA-seq reads across 24 tissues and cell types. We characterize each lincRNA by a panorama of >30 properties, including sequence, structural, transcriptional, and orthology features. We found that lincRNA expression is strikingly tissue-specific compared with coding genes, and that lincRNAs are typically coexpressed with their neighboring genes, albeit to an extent similar to that of pairs of neighboring protein-coding genes. We distinguish an additional subset of transcripts that have high evolutionary conservation but may include short ORFs and may serve as either lincRNAs or small peptides. Our integrated, comprehensive, yet conservative reference catalog of human lincRNAs reveals the global properties of lincRNAs and will facilitate experimental studies and further functional classification of these genes.

3,114 citations