viSNE enables visualization of high dimensional single-cell data and reveals phenotypic heterogeneity of leukemia

doi:10.1038/NBT.2594

Journal Article•DOI•

El-ad David Amir¹, Kara L. Davis², Michelle D. Tadmor¹, Erin F. Simonds², Jacob H. Levine¹, Sean C. Bendall², Daniel K. Shenfeld¹, Smita Krishnaswamy¹, Garry P. Nolan², Dana Pe'er¹

Columbia University¹, Stanford University²

19 May 2013-Nature Biotechnology (NIH Public Access)-Vol. 31, Iss: 6, pp 545-552

TL;DR: In this article, the authors present viSNE, a tool that maps high-dimensional cytometry data onto two dimensions while conserving the high-dimensional structure of the data, using all pairwise distances in high dimension to determine each cell's location in the plot.

Abstract: New high-dimensional, single-cell technologies offer unprecedented resolution in the analysis of heterogeneous tissues. However, because these technologies can measure dozens of parameters simultaneously in individual cells, data interpretation can be challenging. Here we present viSNE, a tool that allows one to map high-dimensional cytometry data onto two dimensions, yet conserve the high-dimensional structure of the data. viSNE plots individual cells in a visual similar to a scatter plot, while using all pairwise distances in high dimension to determine each cell's location in the plot. We integrated mass cytometry with viSNE to map healthy and cancerous bone marrow samples. Healthy bone marrow automatically maps into a consistent shape, whereas leukemia samples map into malformed shapes that are distinct from healthy bone marrow and from each other. We also use viSNE and mass cytometry to compare leukemia diagnosis and relapse samples, and to identify a rare leukemia population reminiscent of minimal residual disease. viSNE can be applied to any multi-dimensional single-cell technology.
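The idea of using all pairwise distances in high dimension can be sketched in a few lines. The example below is an illustration only, not the viSNE implementation: it assumes Gaussian similarities of the kind used by t-SNE (listed under References), a single fixed bandwidth `sigma` rather than one tuned per cell, and four made-up cells measured on three markers. The function names are hypothetical.

```python
import math

def pairwise_sq_dists(points):
    """Squared Euclidean distance between every pair of rows."""
    return [[sum((a - b) ** 2 for a, b in zip(p, q)) for q in points] for p in points]

def conditional_affinities(points, sigma=1.0):
    """Row i holds p(j|i): Gaussian similarity of cell j to cell i, normalised over j != i.

    Assumption: one fixed sigma for all cells; t-SNE tunes it per point.
    """
    d2 = pairwise_sq_dists(points)
    rows = []
    for i, row in enumerate(d2):
        weights = [
            0.0 if j == i else math.exp(-dist / (2.0 * sigma ** 2))
            for j, dist in enumerate(row)
        ]
        total = sum(weights)
        rows.append([w / total for w in weights])
    return rows

# Four hypothetical cells measured on three markers (made-up values).
cells = [
    [0.0, 0.1, 0.0],
    [0.2, 0.0, 0.1],
    [3.0, 3.1, 2.9],
    [3.2, 2.9, 3.0],
]
P = conditional_affinities(cells, sigma=1.0)
```

Each row of `P` sums to one, and a cell assigns most of its probability to its nearest neighbours. A two-dimensional map is then chosen so that similarities between map points match these values as closely as possible.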

Citations

Journal Article•DOI•

Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets

Evan Z. Macosko¹,², Anindita Basu¹,², Rahul Satija²,³, James Nemesh¹,², Karthik Shekhar², Melissa Goldman¹,², Itay Tirosh², Allison R. Bialas⁴, Nolan Kamitaki¹,², Emily M. Martersteck¹, John J. Trombetta², David A. Weitz¹, Joshua R. Sanes¹, Alex K. Shalek²,⁵,⁶, Aviv Regev²,⁶,⁷, Steven A. McCarroll¹,²

Harvard University¹, Broad Institute², New York University³, Boston Children's Hospital⁴, Ragon Institute of MGH, MIT and Harvard⁵, Massachusetts Institute of Technology⁶, Howard Hughes Medical Institute⁷

21 May 2015-Cell

TL;DR: Drop-seq profiles thousands of individual cells by separating them into nanoliter-sized aqueous droplets, associating a different barcode with each cell's RNAs, and sequencing them all together; the authors expect it to accelerate biological discovery by enabling routine transcriptional profiling at single-cell resolution.

5,506 citations

Journal Article•DOI•

The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells

Cole Trapnell¹, Davide Cacchiarelli¹,², Jonna Grimsby², Prapti Pokharel², Shuqiang Li³, Michael A. Morse¹,², Niall J. Lennon², Kenneth J. Livak³, Tarjei S. Mikkelsen¹,², John L. Rinn¹,²,⁴

Harvard University¹, Broad Institute², Fluidigm Corporation³, Beth Israel Deaconess Medical Center⁴

23 Mar 2014-Nature Biotechnology

TL;DR: This work describes Monocle, an unsupervised algorithm that increases the temporal resolution of transcriptome dynamics using single-cell RNA-Seq data collected at multiple time points; applied to differentiating primary human myoblasts, it revealed switch-like changes in expression of key regulatory factors, sequential waves of gene regulation, and expression of regulators that were not known to act in differentiation.

Abstract: Defining the transcriptional dynamics of a temporal process such as cell differentiation is challenging owing to the high variability in gene expression between individual cells. Time-series gene expression analyses of bulk cells have difficulty distinguishing early and late phases of a transcriptional cascade or identifying rare subpopulations of cells, and single-cell proteomic methods rely on a priori knowledge of key distinguishing markers. Here we describe Monocle, an unsupervised algorithm that increases the temporal resolution of transcriptome dynamics using single-cell RNA-Seq data collected at multiple time points. Applied to the differentiation of primary human myoblasts, Monocle revealed switch-like changes in expression of key regulatory factors, sequential waves of gene regulation, and expression of regulators that were not known to act in differentiation. We validated some of these predicted regulators in a loss-of-function screen. Monocle can in principle be used to recover single-cell gene expression kinetics from a wide array of cellular processes, including differentiation, proliferation and oncogenic transformation.

4,119 citations

Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets

Evan Z. Macosko¹,², Anindita Basu¹,², Rahul Satija¹,³, James Nemesh¹,², Karthik Shekhar¹, Melissa Goldman¹,², Itay Tirosh¹, Allison R. Bialas⁴, Nolan Kamitaki¹,², Emily M. Martersteck², John J. Trombetta¹, David A. Weitz², Joshua R. Sanes², Alex K. Shalek¹,⁵,⁶, Aviv Regev¹,⁶,⁷, Steven A. McCarroll¹,²

Broad Institute¹, Harvard University², New York University³, Boston Children's Hospital⁴, Ragon Institute of MGH, MIT and Harvard⁵, Massachusetts Institute of Technology⁶, Howard Hughes Medical Institute⁷

01 May 2015

TL;DR: Drop-seq as discussed by the authors analyzes mRNA transcripts from thousands of individual cells simultaneously while remembering transcripts' cell of origin, and identifies 39 transcriptionally distinct cell populations, creating a molecular atlas of gene expression for known retinal cell classes and novel candidate cell subtypes.

Abstract: Cells, the basic units of biological structure and function, vary broadly in type and state. Single-cell genomics can characterize cell identity and function, but limitations of ease and scale have prevented its broad application. Here we describe Drop-seq, a strategy for quickly profiling thousands of individual cells by separating them into nanoliter-sized aqueous droplets, associating a different barcode with each cell's RNAs, and sequencing them all together. Drop-seq analyzes mRNA transcripts from thousands of individual cells simultaneously while remembering transcripts' cell of origin. We analyzed transcriptomes from 44,808 mouse retinal cells and identified 39 transcriptionally distinct cell populations, creating a molecular atlas of gene expression for known retinal cell classes and novel candidate cell subtypes. Drop-seq will accelerate biological discovery by enabling routine transcriptional profiling at single-cell resolution. VIDEO ABSTRACT.
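The barcoding step, which lets pooled reads be traced back to their cell of origin, can be illustrated with a toy demultiplexer. This is a sketch under stated assumptions, not the Drop-seq software: reads are represented as hypothetical `(cell_barcode, molecule_tag, gene)` tuples, the per-molecule tag used to collapse duplicate reads is an assumption not stated in the abstract, and all values are made up.

```python
from collections import defaultdict

def count_transcripts(reads):
    """Return {cell_barcode: {gene: number of distinct molecule tags}}.

    Assumption: each read is a (cell_barcode, molecule_tag, gene) tuple.
    """
    seen = defaultdict(set)
    for cell_barcode, molecule_tag, gene in reads:
        # Reads sharing barcode, tag and gene are counted once.
        seen[(cell_barcode, gene)].add(molecule_tag)
    counts = defaultdict(dict)
    for (cell_barcode, gene), tags in seen.items():
        counts[cell_barcode][gene] = len(tags)
    return dict(counts)

# Made-up pooled reads from two cells.
reads = [
    ("AAAC", "t1", "Rho"),
    ("AAAC", "t1", "Rho"),   # duplicate read of the same molecule
    ("AAAC", "t2", "Rho"),
    ("GGTT", "t1", "Pax6"),
    ("GGTT", "t3", "Rho"),
]
counts = count_transcripts(reads)
```

Grouping by barcode turns one pooled sequencing run into a per-cell expression table, which is the input that downstream clustering of cell populations would work from.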

3,365 citations

Journal Article•DOI•

SCANPY: large-scale single-cell gene expression data analysis

F. Alexander Wolf, Philipp Angerer, Fabian J. Theis¹

Technische Universität München¹

06 Feb 2018-Genome Biology

TL;DR: This work presents Scanpy, a scalable toolkit for analyzing single-cell gene expression data that includes methods for preprocessing, visualization, clustering, pseudotime and trajectory inference, differential expression testing, and simulation of gene regulatory networks, and AnnData, a generic class for handling annotated data matrices.

Abstract: Scanpy is a scalable toolkit for analyzing single-cell gene expression data. It includes methods for preprocessing, visualization, clustering, pseudotime and trajectory inference, differential expression testing, and simulation of gene regulatory networks. Its Python-based implementation efficiently deals with data sets of more than one million cells ( https://github.com/theislab/Scanpy ). Along with Scanpy, we present AnnData, a generic class for handling annotated data matrices ( https://github.com/theislab/anndata ).

3,343 citations

Cites methods from "viSNE enables visualization of high..."

...Specifically, SCANPY provides preprocessing comparable to SEURAT [10] and CELL RANGER [6], visualization through TSNE [11, 12], graph-drawing [13–15] and diffusion maps [11, 16, 17], clustering similar to PHENOGRAPH [18–20], identification of marker genes for clusters via differential expression tests and pseudotemporal ordering via diffusion pseudotime [21], which compares favorably [22] with MONOCLE 2 [22], and WISHBONE [23] (Fig....
[...]
...Parts of the toolkit rely on SCIKIT-LEARN [27], STATSMODELS [38], SEABORN [39], NETWORKX [28], IGRAPH [14], the TSNE package of [40], and the Louvain clustering package of [41]....
[...]
...We use pseudotemporal ordering from a root cell in the CD34+ cluster and detect a branching trajectory, visualized with TSNE and diffusion maps. b Speedup over CELL RANGER R kit....
[...]
...TSNE and graph-drawing (Fruchterman–Reingold) visualizations show cell-type annotations obtained by comparisons with bulk expression....
[...]

Journal Article•DOI•

Dimensionality reduction for visualizing single-cell data using UMAP.

Etienne Becht¹, Leland McInnes, John Healy, Charles-Antoine Dutertre¹, Immanuel Kwok¹, Lai Guan Ng¹, Florent Ginhoux¹, Evan W. Newell¹,²

Agency for Science, Technology and Research¹, Fred Hutchinson Cancer Research Center²

01 Jan 2019-Nature Biotechnology

TL;DR: Comparing the performance of UMAP with five other tools, it is found that UMAP provides the fastest run times, highest reproducibility and the most meaningful organization of cell clusters.

Abstract: Advances in single-cell technologies have enabled high-resolution dissection of tissue composition. Several tools for dimensionality reduction are available to analyze the large number of parameters generated in single-cell studies. Recently, a nonlinear dimensionality-reduction technique, uniform manifold approximation and projection (UMAP), was developed for the analysis of any type of high-dimensional data. Here we apply it to biological data, using three well-characterized mass cytometry and single-cell RNA sequencing datasets. Comparing the performance of UMAP with five other tools, we find that UMAP provides the fastest run times, highest reproducibility and the most meaningful organization of cell clusters. The work highlights the use of UMAP for improved visualization and interpretation of single-cell data.

3,016 citations

References

Journal Article•

Visualizing Data using t-SNE

Laurens van der Maaten, Geoffrey E. Hinton

01 Jan 2008-Journal of Machine Learning Research

TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.

Abstract: We present a new technique called “t-SNE” that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map. The technique is a variation of Stochastic Neighbor Embedding (Hinton and Roweis, 2002) that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map. t-SNE is better than existing techniques at creating a single map that reveals structure at many different scales. This is particularly important for high-dimensional data that lie on several different, but related, low-dimensional manifolds, such as images of objects from multiple classes seen from multiple viewpoints. For visualizing the structure of very large datasets, we show how t-SNE can use random walks on neighborhood graphs to allow the implicit structure of all of the data to influence the way in which a subset of the data is displayed. We illustrate the performance of t-SNE on a wide variety of datasets and compare it with many other non-parametric visualization techniques, including Sammon mapping, Isomap, and Locally Linear Embedding. The visualizations produced by t-SNE are significantly better than those produced by the other techniques on almost all of the datasets.
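For reference, the quantities behind this description can be written out as follows. The abstract itself gives no formulas; this is the standard t-SNE formulation, with notation chosen here: $x_i$ are the high-dimensional datapoints, $y_i$ their map locations, $n$ the number of points, and $\sigma_i$ a per-point bandwidth set from a user-chosen perplexity.

```latex
% High-dimensional affinities (Gaussian kernel, symmetrised)
p_{j|i} = \frac{\exp\left(-\lVert x_i - x_j \rVert^2 / 2\sigma_i^2\right)}
               {\sum_{k \neq i} \exp\left(-\lVert x_i - x_k \rVert^2 / 2\sigma_i^2\right)},
\qquad
p_{ij} = \frac{p_{j|i} + p_{i|j}}{2n}

% Low-dimensional affinities (Student-t kernel with one degree of freedom)
q_{ij} = \frac{\left(1 + \lVert y_i - y_j \rVert^2\right)^{-1}}
              {\sum_{k \neq l} \left(1 + \lVert y_k - y_l \rVert^2\right)^{-1}}

% Cost minimised over the map locations y_i
C = \mathrm{KL}(P \,\Vert\, Q) = \sum_{i \neq j} p_{ij} \log \frac{p_{ij}}{q_{ij}}
```

The heavy tails of the Student-t kernel let moderately distant points sit far apart in the map, which is what reduces the crowding in the center that the abstract mentions.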

30,124 citations

Journal Article•DOI•

Fast unfolding of communities in large networks

Vincent D. Blondel¹, Jean-Loup Guillaume¹,², Renaud Lambiotte¹,³, Etienne Lefebvre¹

Université catholique de Louvain¹, Pierre-and-Marie-Curie University², Imperial College London³

04 Mar 2008-arXiv: Physics and Society

TL;DR: This work proposes a heuristic method, based on modularity optimization, that is shown to outperform all other known community detection methods in terms of computation time while detecting communities of very good quality, as measured by the so-called modularity.

Abstract: We propose a simple method to extract the community structure of large networks. Our method is a heuristic method that is based on modularity optimization. It is shown to outperform all other known community detection methods in terms of computation time. Moreover, the quality of the communities detected is very good, as measured by the so-called modularity. This is shown first by identifying language communities in a Belgian mobile phone network of 2.6 million customers and by analyzing a web graph of 118 million nodes and more than one billion links. The accuracy of our algorithm is also verified on ad-hoc modular networks.
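The modularity score that the method optimizes can be computed directly for a small graph. The sketch below is not the authors' heuristic; it only evaluates the standard modularity of a given partition, assuming an undirected, unweighted graph without self-loops. The function name and the six-node example graph are made up for illustration.

```python
from collections import defaultdict

def modularity(edges, community):
    """Modularity Q of a partition of an undirected, unweighted graph.

    Q = sum over communities c of  L_c / m - (d_c / (2 m))^2,
    where m is the number of edges, L_c the number of edges inside c,
    and d_c the summed degree of the nodes in c.
    """
    m = len(edges)
    internal = defaultdict(int)
    degree_sum = defaultdict(int)
    for u, v in edges:
        degree_sum[community[u]] += 1
        degree_sum[community[v]] += 1
        if community[u] == community[v]:
            internal[community[u]] += 1
    return sum(internal[c] / m - (degree_sum[c] / (2 * m)) ** 2 for c in degree_sum)

# Two triangles joined by a single bridge edge (made-up example).
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
split = {0: "a", 1: "a", 2: "a", 3: "b", 4: "b", 5: "b"}
merged = {node: "a" for node in range(6)}

q_split = modularity(edges, split)    # 5/14, about 0.357
q_merged = modularity(edges, merged)  # 0.0
```

Splitting the graph at the bridge scores higher than lumping all nodes together, which is the kind of comparison a modularity-optimizing heuristic makes when deciding where community boundaries go.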

13,519 citations

Journal Article•DOI•

RNA-Seq: a revolutionary tool for transcriptomics

Zhong Wang¹, Mark Gerstein¹, Michael Snyder¹

Yale University¹

01 Jan 2009-Nature Reviews Genetics

TL;DR: The RNA-Seq approach to transcriptome profiling that uses deep-sequencing technologies provides a far more precise measurement of levels of transcripts and their isoforms than other methods.

Abstract: RNA-Seq is a recently developed approach to transcriptome profiling that uses deep-sequencing technologies. Studies using this method have already altered our view of the extent and complexity of eukaryotic transcriptomes. RNA-Seq also provides a far more precise measurement of levels of transcripts and their isoforms than other methods. This article describes the RNA-Seq approach, the challenges associated with its application, and the advances made so far in characterizing several eukaryote transcriptomes.

11,528 citations

Journal Article•DOI•

Fast unfolding of communities in large networks

Vincent D. Blondel¹, Jean-Loup Guillaume¹,², Renaud Lambiotte¹,³, Etienne Lefebvre¹

Université catholique de Louvain¹, Pierre-and-Marie-Curie University², Imperial College London³

01 Oct 2008-Journal of Statistical Mechanics: Theory and Experiment

TL;DR: In this paper, the authors proposed a simple method to extract the community structure of large networks based on modularity optimization, which is shown to outperform all other known community detection methods in terms of computation time.

Abstract: We propose a simple method to extract the community structure of large networks. Our method is a heuristic method that is based on modularity optimization. It is shown to outperform all other known community detection methods in terms of computation time. Moreover, the quality of the communities detected is very good, as measured by the so-called modularity. This is shown first by identifying language communities in a Belgian mobile phone network of 2 million customers and by analysing a web graph of 118 million nodes and more than one billion links. The accuracy of our algorithm is also verified on ad hoc modular networks.

11,078 citations

Journal Article•DOI•

Single-Cell Mass Cytometry of Differential Immune and Drug Responses Across a Human Hematopoietic Continuum

Sean C. Bendall¹, Erin F. Simonds¹, Peng Qiu¹, El-ad David Amir², Peter O. Krutzik¹, Rachel Finck¹, Robert V. Bruggner¹, Rachel D. Melamed², Angelica Trejo¹, Olga Ornatsky³, Robert S. Balderas, Sylvia K. Plevritis¹, Karen Sachs¹, Dana Pe'er², Scott D. Tanner³, Garry P. Nolan¹

Stanford University¹, Columbia University², University of Toronto³

06 May 2011-Science

TL;DR: Single-cell “mass cytometry” analyses provide system-wide views of immune signaling in healthy human hematopoiesis, against which drug action and disease can be compared for mechanistic studies and pharmacologic intervention.

Abstract: Flow cytometry is an essential tool for dissecting the functional complexity of hematopoiesis. We used single-cell "mass cytometry" to examine healthy human bone marrow, measuring 34 parameters simultaneously in single cells (binding of 31 antibodies, viability, DNA content, and relative cell size). The signaling behavior of cell subsets spanning a defined hematopoietic hierarchy was monitored with 18 simultaneous markers of functional signaling states perturbed by a set of ex vivo stimuli and inhibitors. The data set allowed for an algorithmically driven assembly of related cell types defined by surface antigen expression, providing a superimposable map of cell signaling responses in combination with drug inhibition. Visualized in this manner, the analysis revealed previously unappreciated instances of both precise signaling responses that were bounded within conventionally defined cell subsets and more continuous phosphorylation responses that crossed cell population boundaries in unexpected manners yet tracked closely with cellular phenotype. Collectively, such single-cell analyses provide system-wide views of immune signaling in healthy human hematopoiesis, against which drug action and disease can be compared for mechanistic studies and pharmacologic intervention.

2,147 citations