Home
/
Authors
/
Swati Ranade

Author

Swati Ranade

Other affiliations: Massachusetts Eye and Ear Infirmary, Life Technologies, Applied Biosystems

Bio: Swati Ranade is an academic researcher from Pacific Biosciences. The author has contributed to research in topics: DNA sequencing & Human leukocyte antigen. The author has an hindex of 16, co-authored 30 publications receiving 3695 citations. Previous affiliations of Swati Ranade include Massachusetts Eye and Ear Infirmary & Life Technologies.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Stem cell transcriptome profiling via massive-scale mRNA sequencing.

[...]

Nicole Cloonan¹, Alistair R. R. Forrest¹, Alistair R. R. Forrest², Gabriel Kolle¹, Brooke Gardiner¹, Geoffrey J. Faulkner¹, Mellissa K Brown¹, Darrin Taylor¹, Anita L Steptoe¹, Shivangi Wani¹, Graeme Bethel¹, Alan J. Robertson¹, Andrew C. Perkins¹, Stephen J. Bruce¹, Clarence Lee³, Swati Ranade³, Heather E. Peckham³, Jonathan M. Manning³, Kevin McKernan³, Sean M. Grimmond¹ - Show less +16 more•Institutions (3)

University of Queensland¹, Griffith University², Applied Biosystems³

30 May 2008-Nature Methods

TL;DR: A massive-scale RNA sequencing protocol, short quantitative random RNA libraries or SQRL, is developed, highlighting how SQRL can be used to characterize transcriptome content and dynamics in a quantitative and reproducible manner, and suggesting that the understanding of transcriptional complexity is far from complete.

...read moreread less

Abstract: We developed a massive-scale RNA sequencing protocol, short quantitative random RNA libraries or SQRL, to survey the complexity, dynamics and sequence content of transcriptomes in a near-complete fashion. This method generates directional, random-primed, linear cDNA libraries that are optimized for next-generation short-tag sequencing. We surveyed the poly(A)+ transcriptomes of undifferentiated mouse embryonic stem cells (ESCs) and embryoid bodies (EBs) at an unprecedented depth (10 Gb), using the Applied Biosystems SOLiD technology. These libraries capture the genomic landscape of expression, state-specific expression, single-nucleotide polymorphisms (SNPs), the transcriptional activity of repeat elements, and both known and new alternative splicing events. We investigated the impact of transcriptional complexity on current models of key signaling pathways controlling ESC pluripotency and differentiation, highlighting how SQRL can be used to characterize transcriptome content and dynamics in a quantitative and reproducible manner, and suggesting that our understanding of transcriptional complexity is far from complete.

...read moreread less

1,119 citations

Journal Article•DOI•

A high-resolution, nucleosome position map of C. elegans reveals a lack of universal sequence-dictated positioning

[...]

Anton Valouev¹, Jeffrey Ichikawa, Thaisan Tonthat, Jeremy R. Stuart, Swati Ranade, Heather E. Peckham, Kathy Zeng, Joel A. Malek, Gina Costa, Kevin McKernan, Arend Sidow, Andrew Fire, Steven M. Johnson - Show less +9 more•Institutions (1)

Stanford University¹

01 Jul 2008-Genome Research

TL;DR: These analyses provide a global view of the chromatin architecture of a multicellular animal at extremely high density and resolution and release this data set, via the UCSC Genome Browser, as a resource for the high-resolution analysis of chromatin conformation and DNA accessibility at individual loci within the C. elegans genome.

...read moreread less

Abstract: Using the massively parallel technique of sequencing by oligonucleotide ligation and detection (SOLiD; Applied Biosystems), we have assessed the in vivo positions of more than 44 million putative nucleosome cores in the multicellular genetic model organism Caenorhabditis elegans. These analyses provide a global view of the chromatin architecture of a multicellular animal at extremely high density and resolution. While we observe some degree of reproducible positioning throughout the genome in our mixed stage population of animals, we note that the major chromatin feature in the worm is a diversity of allowed nucleosome positions at the vast majority of individual loci. While absolute positioning of nucleosomes can vary substantially, relative positioning of nucleosomes (in a repeated array structure likely to be maintained at least in part by steric constraints) appears to be a significant property of chromatin structure. The high density of nucleosomal reads enabled a substantial extension of previous analysis describing the usage of individual oligonucleotide sequences along the span of the nucleosome core and linker. We release this data set, via the UCSC Genome Browser, as a resource for the high-resolution analysis of chromatin conformation and DNA accessibility at individual loci within the C. elegans genome.

...read moreread less

630 citations

Journal Article•DOI•

Sequence and structural variation in a human genome uncovered by short-read, massively parallel ligation sequencing using two-base encoding

[...]

Kevin McKernan¹, Heather E. Peckham¹, Gina Costa¹, Stephen F. McLaughlin¹, Yutao Fu¹, Eric F. Tsung¹, Christopher Clouser¹, Cisyla Duncan¹, Jeffrey K. Ichikawa¹, Clarence Lee¹, Zheng Zhang¹, Swati Ranade¹, Eileen T. Dimalanta¹, Fiona Hyland¹, Tanya Sokolsky¹, Lei Zhang¹, Andrew Sheridan¹, Haoning Fu¹, Cynthia L. Hendrickson², Bin Li¹, Lev Kotler¹, Jeremy R. Stuart¹, Joel A. Malek³, Jonathan M. Manning¹, Alena A. Antipova¹, Damon S. Perez¹, Michael P. Moore¹, Kathleen C. Hayashibara¹, Michael R. Lyons¹, Robert E. Beaudoin¹, Brittany E. Coleman¹, Michael W. Laptewicz¹, Adam Sannicandro¹, Michael D. Rhodes¹, Rajesh Gottimukkala¹, Shan Yang¹, Vineet Bafna⁴, Ali Bashir⁴, Andrew MacBride, Can Alkan⁵, Jeffrey M. Kidd, Evan E. Eichler⁵, Martin G. Reese, Francisco M. De La Vega¹, Alan Blanchard¹ - Show less +41 more•Institutions (5)

Life Technologies¹, New England Biolabs², Cornell University³, University of California, San Diego⁴, University of Washington⁵

01 Sep 2009-Genome Research

TL;DR: Dozens of mutations previously described in OMIM and hundreds of nonsynonymous single-nucleotide and structural variants in genes previously implicated in disease are identified in this individual.

...read moreread less

Abstract: We describe the genome sequencing of an anonymous individual of African origin using a novel ligation-based sequencing assay that enables a unique form of error correction that improves the raw accuracy of the aligned reads to >99.9%, allowing us to accurately call SNPs with as few as two reads per allele. We collected several billion mate-paired reads yielding approximately 18x haploid coverage of aligned sequence and close to 300x clone coverage. Over 98% of the reference genome is covered with at least one uniquely placed read, and 99.65% is spanned by at least one uniquely placed mate-paired clone. We identify over 3.8 million SNPs, 19% of which are novel. Mate-paired data are used to physically resolve haplotype phases of nearly two-thirds of the genotypes obtained and produce phased segments of up to 215 kb. We detect 226,529 intra-read indels, 5590 indels between mate-paired reads, 91 inversions, and four gene fusions. We use a novel approach for detecting indels between mate-paired reads that are smaller than the standard deviation of the insert size of the library and discover deletions in common with those detected with our intra-read approach. Dozens of mutations previously described in OMIM and hundreds of nonsynonymous single-nucleotide and structural variants in genes previously implicated in disease are identified in this individual. There is more genetic variation in the human genome still to be uncovered, and we provide guidance for future surveys in populations and cancer biopsies.

...read moreread less

595 citations

Journal Article•DOI•

Rapid whole-genome mutational profiling using next-generation sequencing technologies

[...]

Douglas Smith, Aaron R. Quinlan¹, Heather E. Peckham², Kathryn Makowsky, Wei Tao, Betty Woolf, Lei Shen, William F. Donahue, Nadeem Tusneem, Michael Strömberg¹, Donald A. Stewart¹, Lu Zhang¹, Swati Ranade², Jason Warner², Clarence Lee², Brittney E. Coleman², Zheng Zhang², Stephen F. McLaughlin², Joel A. Malek², Jon M. Sorenson², Alan Blanchard², Jarrod Chapman³, David Hillman³, Feng Chen³, Daniel S. Rokhsar³, Kevin McKernan², Thomas W. Jeffries⁴, Gabor T. Marth¹, Paul G. Richardson³ - Show less +25 more•Institutions (4)

Boston College¹, Applied Biosystems², United States Department of Energy³, United States Department of Agriculture⁴

01 Oct 2008-Genome Research

TL;DR: It is shown that new high-throughput, massively parallel sequencing technologies can completely and accurately characterize a mutant genome relative to a previously sequenced parental (reference) strain and that detecting mutations in evolved and engineered organisms is rapid and cost-effective at the whole-genome level using new sequencing technologies.

...read moreread less

Abstract: Forward genetic mutational studies, adaptive evolution, and phenotypic screening are powerful tools for creating new variant organisms with desirable traits. However, mutations generated in the process cannot be easily identified with traditional genetic tools. We show that new high-throughput, massively parallel sequencing technologies can completely and accurately characterize a mutant genome relative to a previously sequenced parental (reference) strain. We studied a mutant strain of Pichia stipitis, a yeast capable of converting xylose to ethanol. This unusually efficient mutant strain was developed through repeated rounds of chemical mutagenesis, strain selection, transformation, and genetic manipulation over a period of seven years. We resequenced this strain on three different sequencing platforms. Surprisingly, we found fewer than a dozen mutations in open reading frames. All three sequencing technologies were able to identify each single nucleotide mutation given at least 10–15-fold nominal sequence coverage. Our results show that detecting mutations in evolved and engineered organisms is rapid and cost-effective at the whole-genome level using new sequencing technologies. Identification of specific mutations in strains with altered phenotypes will add insight into specific gene functions and guide further metabolic engineering efforts.

...read moreread less

289 citations

Journal Article•DOI•

Reconstructing complex regions of genomes using long-read sequencing technology

[...]

John Huddleston¹, Swati Ranade², Maika Malig¹, Francesca Antonacci³, Mark Chaisson¹, Lawrence Hon², Peter H. Sudmant¹, Tina Graves⁴, Can Alkan⁵, Megan Y. Dennis¹, Richard K. Wilson⁴, Stephen Turner², Jonas Korlach², Evan E. Eichler¹ - Show less +10 more•Institutions (5)

University of Washington¹, Pacific Biosciences², University of Bari³, Washington University in St. Louis⁴, Bilkent University⁵

13 Jan 2014-Genome Research

TL;DR: It is shown that it is possible to resolve regions that are complex in a genome-wide context but simple in isolation for a fraction of the time and cost of traditional methods using long-read single molecule, real-time (SMRT) sequencing and assembly technology from Pacific Biosciences.

...read moreread less

Abstract: Obtaining high-quality sequence continuity of complex regions of recent segmental duplication remains one of the major challenges of finishing genome assemblies. In the human and mouse genomes, this was achieved by targeting large-insert clones using costly and laborious capillary-based sequencing approaches. Sanger shotgun sequencing of clone inserts, however, has now been largely abandoned, leaving most of these regions unresolved in newer genome assemblies generated primarily by next-generation sequencing hybrid approaches. Here we show that it is possible to resolve regions that are complex in a genome-wide context but simple in isolation for a fraction of the time and cost of traditional methods using long-read single molecule, real-time (SMRT) sequencing and assembly technology from Pacific Biosciences (PacBio). We sequenced and assembled BAC clones corresponding to a 1.3-Mbp complex region of chromosome 17q21.31, demonstrating 99.994% identity to Sanger assemblies of the same clones. We targeted 44 differences using Illumina sequencing and find that PacBio and Sanger assemblies share a comparable number of validated variants, albeit with different sequence context biases. Finally, we targeted a poorly assembled 766-kbp duplicated region of the chimpanzee genome and resolved the structure and organization for a fraction of the cost and time of traditional finishing approaches. Our data suggest a straightforward path for upgrading genomes to a higher quality finished state.

...read moreread less

245 citations

1
2
3
4
…
5
6

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data

[...]

Aaron McKenna¹, Matthew Hanna, Eric Banks, Andrey Sivachenko, Kristian Cibulskis, Andrew Kernytsky, Kiran V. Garimella, David Altshuler, Stacey Gabriel, Mark J. Daly, Mark A. DePristo - Show less +7 more•Institutions (1)

Broad Institute¹

01 Sep 2010-Genome Research

TL;DR: The GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

...read moreread less

Abstract: Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS—the 1000 Genome pilot alone includes nearly five terabases—make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

...read moreread less

20,557 citations

Journal Article•DOI•

Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation

[...]

Cole Trapnell¹, Cole Trapnell², Brian A. Williams³, Geo Pertea¹, Ali Mortazavi³, Gordon Kwan³, Marijke J. van Baren⁴, Steven L. Salzberg¹, Barbara J. Wold³, Lior Pachter² - Show less +6 more•Institutions (4)

University of Maryland, College Park¹, University of California, Berkeley², California Institute of Technology³, Washington University in St. Louis⁴

01 May 2010-Nature Biotechnology

TL;DR: The results suggest that Cufflinks can illuminate the substantial regulatory flexibility and complexity in even this well-studied model of muscle development and that it can improve transcriptome-based genome annotation.

...read moreread less

Abstract: High-throughput mRNA sequencing (RNA-Seq) promises simultaneous transcript discovery and abundance estimation. However, this would require algorithms that are not restricted by prior gene annotations and that account for alternative transcription and splicing. Here we introduce such algorithms in an open-source software program called Cufflinks. To test Cufflinks, we sequenced and analyzed >430 million paired 75-bp RNA-Seq reads from a mouse myoblast cell line over a differentiation time series. We detected 13,692 known transcripts and 3,724 previously unannotated ones, 62% of which are supported by independent expression data or by homologous genes in other species. Over the time series, 330 genes showed complete switches in the dominant transcription start site (TSS) or splice isoform, and we observed more subtle shifts in 1,304 other genes. These results suggest that Cufflinks can illuminate the substantial regulatory flexibility and complexity in even this well-studied model of muscle development and that it can improve transcriptome-based genome annotation.

...read moreread less

13,337 citations

Journal Article•DOI•

RNA-Seq: a revolutionary tool for transcriptomics

[...]

Zhong Wang¹, Mark Gerstein¹, Michael Snyder¹•Institutions (1)

Yale University¹

01 Jan 2009-Nature Reviews Genetics

TL;DR: The RNA-Seq approach to transcriptome profiling that uses deep-sequencing technologies provides a far more precise measurement of levels of transcripts and their isoforms than other methods.

...read moreread less

Abstract: RNA-Seq is a recently developed approach to transcriptome profiling that uses deep-sequencing technologies. Studies using this method have already altered our view of the extent and complexity of eukaryotic transcriptomes. RNA-Seq also provides a far more precise measurement of levels of transcripts and their isoforms than other methods. This article describes the RNA-Seq approach, the challenges associated with its application, and the advances made so far in characterizing several eukaryote transcriptomes.

...read moreread less

11,528 citations

Journal Article•DOI•

TopHat: discovering splice junctions with RNA-Seq

[...]

Cole Trapnell¹, Lior Pachter¹, Steven L. Salzberg¹•Institutions (1)

University of Maryland, College Park¹

01 May 2009-Bioinformatics

TL;DR: The TopHat pipeline is much faster than previous systems, mapping nearly 2.2 million reads per CPU hour, which is sufficient to process an entire RNA-Seq experiment in less than a day on a standard desktop computer.

...read moreread less

Abstract: Motivation: A new protocol for sequencing the messenger RNA in a cell, known as RNA-Seq, generates millions of short sequence fragments in a single run. These fragments, or ‘reads’, can be used to measure levels of gene expression and to identify novel splice variants of genes. However, current software for aligning RNA-Seq data to a genome relies on known splice junctions and cannot identify novel ones. TopHat is an efficient read-mapping algorithm designed to align reads from an RNA-Seq experiment to a reference genome without relying on known splice sites. Results: We mapped the RNA-Seq reads from a recent mammalian RNA-Seq experiment and recovered more than 72% of the splice junctions reported by the annotation-based software from that study, along with nearly 20 000 previously unreported junctions. The TopHat pipeline is much faster than previous systems, mapping nearly 2.2 million reads per CPU hour, which is sufficient to process an entire RNA-Seq experiment in less than a day on a standard desktop computer. We describe several challenges unique to ab initio splice site discovery from RNA-Seq reads that will require further algorithm development. Availability: TopHat is free, open-source software available from http://tophat.cbcb.umd.edu Contact: ude.dmu.sc@eloc Supplementary information: Supplementary data are available at Bioinformatics online.

...read moreread less

11,473 citations

Journal Article•DOI•

Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks

[...]

Cole Trapnell¹, Adam Roberts², Loyal A. Goff³, Loyal A. Goff¹, Loyal A. Goff⁴, Geo Pertea⁵, Daehwan Kim⁶, Daehwan Kim⁷, David R. Kelley¹, David R. Kelley⁴, Harold Pimentel², Steven L. Salzberg⁵, John L. Rinn⁴, John L. Rinn¹, Lior Pachter² - Show less +11 more•Institutions (7)

Broad Institute¹, University of California, Berkeley², Massachusetts Institute of Technology³, Harvard University⁴, Johns Hopkins University⁵, University of Maryland, College Park⁶, Johns Hopkins University School of Medicine⁷

01 Mar 2012-Nature Protocols

TL;DR: This protocol begins with raw sequencing reads and produces a transcriptome assembly, lists of differentially expressed and regulated genes and transcripts, and publication-quality visualizations of analysis results, which takes less than 1 d of computer time for typical experiments and ∼1 h of hands-on time.

...read moreread less

Abstract: Recent advances in high-throughput cDNA sequencing (RNA-seq) can reveal new genes and splice variants and quantify expression genome-wide in a single assay. The volume and complexity of data from RNA-seq experiments necessitate scalable, fast and mathematically principled analysis software. TopHat and Cufflinks are free, open-source software tools for gene discovery and comprehensive expression analysis of high-throughput mRNA sequencing (RNA-seq) data. Together, they allow biologists to identify new genes and new splice variants of known ones, as well as compare gene and transcript expression under two or more conditions. This protocol describes in detail how to use TopHat and Cufflinks to perform such analyses. It also covers several accessory tools and utilities that aid in managing data, including CummeRbund, a tool for visualizing RNA-seq analysis results. Although the procedure assumes basic informatics skills, these tools assume little to no background with RNA-seq analysis and are meant for novices and experts alike. The protocol begins with raw sequencing reads and produces a transcriptome assembly, lists of differentially expressed and regulated genes and transcripts, and publication-quality visualizations of analysis results. The protocol's execution time depends on the volume of transcriptome sequencing data and available computing resources but takes less than 1 d of computer time for typical experiments and ∼1 h of hands-on time.

...read moreread less

10,913 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse