RNA-Seq: a revolutionary tool for transcriptomics

doi:10.1038/NRG2484

Journal Article•DOI•

Zhong Wang¹, Mark Gerstein¹, Michael Snyder¹•Institutions (1)

Yale University¹

01 Jan 2009-Nature Reviews Genetics (Nature Publishing Group)-Vol. 10, Iss: 1, pp 57-63

TL;DR: RNA-Seq, an approach to transcriptome profiling that uses deep-sequencing technologies, provides a far more precise measurement of the levels of transcripts and their isoforms than other methods.

Abstract: RNA-Seq is a recently developed approach to transcriptome profiling that uses deep-sequencing technologies. Studies using this method have already altered our view of the extent and complexity of eukaryotic transcriptomes. RNA-Seq also provides a far more precise measurement of levels of transcripts and their isoforms than other methods. This article describes the RNA-Seq approach, the challenges associated with its application, and the advances made so far in characterizing several eukaryote transcriptomes.

Citations

Journal Article•DOI•

Impact of genomic polymorphisms on the repertoire of human MHC class I-associated peptides

Diana Paola Granados¹, Dev Sriranganadane¹, Tariq Daouda¹, Antoine Zieger¹, Céline M. Laumont¹, Olivier Caron-Lizotte¹, Geneviève Boucher¹, Marie-Pierre Hardy¹, Patrick Gendron¹, Caroline Côté¹, Sébastien Lemieux¹, Pierre Thibault¹, Claude Perreault¹•Institutions (1)

Université de Montréal¹

09 Apr 2014-Nature Communications

TL;DR: The method provides fundamental insights into the relationship between the genomic self and the immune self and accelerates the discovery of polymorphic MIPs (also known as minor histocompatibility antigens), which play a major role in allorecognition.

Abstract: For decades, the global impact of genomic polymorphisms on the repertoire of peptides presented by major histocompatibility complex (MHC) has remained a matter of speculation. Here we present a novel approach that enables high-throughput discovery of polymorphic MHC class I-associated peptides (MIPs), which play a major role in allorecognition. On the basis of comprehensive analyses of the genomic landscape of MIPs eluted from B lymphoblasts of two MHC-identical siblings, we show that 0.5% of non-synonymous single nucleotide variations are represented in the MIP repertoire. The 34 polymorphic MIPs found in our subjects are encoded by bi-allelic loci with dominant and recessive alleles. Our analyses show that, at the population level, 12% of the MIP-coding exome is polymorphic. Our method provides fundamental insights into the relationship between the genomic self and the immune self and accelerates the discovery of polymorphic MIPs (also known as minor histocompatibility antigens).

95 citations

Journal Article•DOI•

Methodologies to decipher the cell secretome.

Paromita Mukherjee¹, Sridhar Mani¹•Institutions (1)

Albert Einstein College of Medicine¹

01 Nov 2013-Biochimica et Biophysica Acta

TL;DR: This review aims to discuss the methodologies available along with their potential advantages and disadvantages to identify secretory proteins in cell secretomes.

95 citations

Journal Article•DOI•

Systematic integration of RNA-Seq statistical algorithms for accurate detection of differential gene expression patterns

Panagiotis Moulos, Pantelis Hatzis

27 Feb 2015-Nucleic Acids Research

TL;DR: A new method (PANDORA) is presented that combines multiple algorithms toward a summarized result, more efficiently reflecting true experimental outcomes, by optimizing the tradeoff between standard performance measurements, such as precision and sensitivity.

Abstract: RNA-Seq is gradually becoming the standard tool for transcriptomic expression studies in biological research. Although considerable progress has been recorded in the development of statistical algorithms for the detection of differentially expressed genes using RNA-Seq data, the list of detected genes can differ significantly between algorithms. We present a new method (PANDORA) that combines multiple algorithms toward a summarized result, more efficiently reflecting true experimental outcomes. This is achieved through the systematic combination of several analysis algorithms, by weighting their outcomes according to their performance with realistically simulated data sets generated from real data. Results supported by the analysis of both simulated and real data from different organisms as well as correlation with PolII occupancy demonstrate that PANDORA improves the detection of differential expression. It accomplishes this by optimizing the tradeoff between standard performance measurements, such as precision and sensitivity.

95 citations

Cites background from "RNA-Seq: a revolutionary tool for t..."

...One of the common applications of RNA-Seq (1) is genome-wide transcript expression profiling and detection of differentially expressed genes (DEGs) across distinct biological conditions....

Journal Article•DOI•

Integrative "omic" analysis of experimental bacteremia identifies a metabolic signature that distinguishes human sepsis from systemic inflammatory response syndromes

Raymond J. Langley, Jennifer L. Tipper¹, Shannon E. Bruse¹, Rebecca M. Baron², Ephraim L. Tsalik³,⁴, James Huntley⁵, Angela J. Rogers², Richard J. Jaramillo¹, Denise O'Donnell¹, William Mega¹, Mignon Keaton, Elizabeth Kensicki, Lee Gazourian², Laura E. Fredenburgh², Anthony F. Massaro², Ronny M. Otero⁶, Vance G. Fowler⁴, Emanuel P. Rivers⁶, Christopher W. Woods³,⁴, Stephen F. Kingsmore⁷,⁸, Mohan L. Sopori¹, Mark A. Perrella², Augustine M.K. Choi⁹, Kevin S. Harrod¹•Institutions (9)

Lovelace Respiratory Research Institute¹, Brigham and Women's Hospital², Veterans Health Administration³, Duke University⁴, University of Colorado Boulder⁵, Henry Ford Health System⁶, Children's Mercy Hospital⁷, University of Missouri–Kansas City⁸, Houston Methodist Hospital⁹

15 Aug 2014-American Journal of Respiratory and Critical Care Medicine

TL;DR: A model of sepsis based on reciprocal metabolomic and transcriptomic data was developed in primates and validated in two human patient cohorts, and it is anticipated that the identified parameters will facilitate early diagnosis and management of sepsis.

Abstract: Rationale: Sepsis is a leading cause of morbidity and mortality. Currently, early diagnosis and the progression of the disease are difficult to make. The integration of metabolomic and transcriptomic data in a primate model of sepsis may provide a novel molecular signature of clinical sepsis. Objectives: To develop a biomarker panel to characterize sepsis in primates and ascertain its relevance to early diagnosis and progression of human sepsis. Methods: Intravenous inoculation of Macaca fascicularis with Escherichia coli produced mild to severe sepsis, lung injury, and death. Plasma samples were obtained before and after 1, 3, and 5 days of E. coli challenge and at the time of killing. At necropsy, blood, lung, kidney, and spleen samples were collected. An integrative analysis of the metabolomic and transcriptomic datasets was performed to identify a panel of sepsis biomarkers. Measurements and Main Results: The extent of E. coli invasion, respiratory distress, lethargy, and mortality was dependent on the bacterial dose. Metabolomic and transcriptomic changes characterized severe infections and death, and indicated impaired mitochondrial, peroxisomal, and liver functions. Analysis of the pulmonary transcriptome and plasma metabolome suggested impaired fatty acid catabolism regulated by peroxisome-proliferator activated receptor signaling. A representative four-metabolite model effectively diagnosed sepsis in primates (area under the curve, 0.966) and in two human sepsis cohorts (area under the curve, 0.78 and 0.82). Conclusions: A model of sepsis based on reciprocal metabolomic and transcriptomic data was developed in primates and validated in two human patient cohorts. It is anticipated that the identified parameters will facilitate early diagnosis and management of sepsis.

95 citations

Cites methods from "RNA-Seq: a revolutionary tool for t..."

...We used RNAseq because of the dynamic range and high correlations to quantitative polymerase chain reaction (42)....

host defenses in the chytridiomycosis-susceptible frog Atelopus zeteki

Amy Ellison, Anna E. Savage, Grace V. DiRenzo, Penny F. Langhammer, Karen R. Lips, Kelly R. Zamudio

01 Jan 2014

TL;DR: Evidence of acquired immune responses generated against chytridiomycosis is shown, including increased expression of immunoglobulins and major histocompatibility complex genes, and suppression of key immune responses by Bd is likely an important factor in the lethality of this fungus.

Abstract: The emergence of the disease chytridiomycosis caused by the chytrid fungus Batrachochytrium dendrobatidis (Bd) has been implicated in dramatic global amphibian declines. Although many species have undergone catastrophic declines and/or extinctions, others appear to be unaffected or persist at reduced frequencies after Bd outbreaks. The reasons behind this variance in disease outcomes are poorly understood: differences in host immune responses have been proposed, yet previous studies suggest a lack of robust immune responses to Bd in susceptible species. Here, we sequenced transcriptomes from clutch-mates of a highly susceptible amphibian, Atelopus zeteki, with different infection histories. We found significant changes in expression of numerous genes involved in innate and inflammatory responses in infected frogs despite high susceptibility to chytridiomycosis. We show evidence of acquired immune responses generated against Bd, including increased expression of immunoglobulins and major histocompatibility complex genes. In addition, fungal-killing genes had significantly greater expression in frogs previously exposed to Bd compared with Bd-naïve frogs, including chitinase and serine-type proteases. However, our results appear to confirm recent in vitro evidence of immune suppression by Bd, demonstrated by decreased expression of lymphocyte genes in the spleen of infected compared with control frogs. We propose susceptibility to chytridiomycosis is not due to lack of Bd-specific immune responses but instead is caused by failure of those responses to be effective. Ineffective immune pathway activation and timing of antibody production are discussed as potential mechanisms. However, in light of our findings, suppression of key immune responses by Bd is likely an important factor in the lethality of this fungus.

95 citations

Cites result from "RNA-Seq: a revolutionary tool for t..."

...…it is possible that other susceptible species have weak immune responses to Bd, it is also possible that these contrasting findings are due to the greater sensitivity of RNASeq gene expression analyses over microarray-based experiments used in prior studies (Marioni et al. 2008; Wang et al. 2009)....

References

Journal Article•DOI•

Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Ali Mortazavi¹, Brian A. Williams¹, Kenneth McCue¹, Lorian Schaeffer¹, Barbara J. Wold¹•Institutions (1)

California Institute of Technology¹

29 Jun 2008-Nature Methods

TL;DR: Although >90% of uniquely mapped reads fell within known exons, the remaining data suggest new and revised gene models, including changed or additional promoters, exons and 3′ untranscribed regions, as well as new candidate microRNA precursors.

Abstract: We have mapped and quantified mouse transcriptomes by deeply sequencing them and recording how frequently each gene is represented in the sequence sample (RNA-Seq). This provides a digital measure of the presence and prevalence of transcripts from known and previously unknown genes. We report reference measurements composed of 41–52 million mapped 25-base-pair reads for poly(A)-selected RNA from adult mouse brain, liver and skeletal muscle tissues. We used RNA standards to quantify transcript prevalence and to test the linear range of transcript detection, which spanned five orders of magnitude. Although >90% of uniquely mapped reads fell within known exons, the remaining data suggest new and revised gene models, including changed or additional promoters, exons and 3′ untranscribed regions, as well as new candidate microRNA precursors. RNA splice events, which are not readily measured by standard gene expression microarray or serial analysis of gene expression methods, were detected directly by mapping splice-crossing sequence reads. We observed 1.45 × 10⁵ distinct splices, and alternative splices were prominent, with 3,500 different genes expressing one or more alternate internal splices.

The mRNA population specifies a cell’s identity and helps to govern its present and future activities. This has made transcriptome analysis a general phenotyping method, with expression microarrays of many kinds in routine use. Here we explore the possibility that transcriptome analysis, transcript discovery and transcript refinement can be done effectively in large and complex mammalian genomes by ultra-high-throughput sequencing. Expression microarrays are currently the most widely used methodology for transcriptome analysis, although some limitations persist. These include hybridization and cross-hybridization artifacts (1–3), dye-based detection issues and design constraints that preclude or seriously limit the detection of RNA splice patterns and previously unmapped genes. These issues have made it difficult for standard array designs to provide full sequence comprehensiveness (coverage of all possible genes, including unknown ones, in large genomes) or transcriptome comprehensiveness (reliable detection of all RNAs of all prevalence classes, including the least abundant ones that are physiologically relevant). Other
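The "digital measure" described in the abstract above rests on counting how many reads map to each gene. One common way to make such counts comparable across genes and samples is RPKM (reads per kilobase of exon model per million mapped reads). The abstract itself does not name this normalization, so treating it as the measure is an assumption here, and the function and parameter names below are hypothetical. A minimal sketch:

```python
def rpkm(read_count: int, gene_length_bp: int, total_mapped_reads: int) -> float:
    """Reads per kilobase of exon model per million mapped reads.

    RPKM = 1e9 * C / (N * L), where C is the number of reads mapped to the
    gene, N is the total number of mapped reads in the sample, and L is the
    gene's exon length in base pairs. All names here are illustrative.
    """
    if gene_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("gene length and total mapped reads must be positive")
    if read_count < 0:
        raise ValueError("read count cannot be negative")
    return read_count * 1e9 / (gene_length_bp * total_mapped_reads)


# Hypothetical numbers: 500 reads on a 2 kb gene in a 50-million-read sample.
example_value = rpkm(500, 2_000, 50_000_000)
```

Dividing by gene length and by sequencing depth means that a long gene or a deeply sequenced sample does not look more highly expressed simply because it collects more reads.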

12,293 citations

Patent•DOI•

Serial analysis of gene expression

Kenneth W. Kinzler¹, Victor Velculescu², Bert Vogelstein², Lin Zhang²•Institutions (2)

Johns Hopkins University¹, Howard Hughes Medical Institute²

04 Oct 2000-Science

TL;DR: Serial analysis of gene expression (SAGE) should provide a broadly applicable means for the quantitative cataloging and comparison of expressed genes in a variety of normal, developmental, and disease states.

Abstract: PROBLEM TO BE SOLVED: To provide a method for preparing a short nucleotide sequence (tag) which is useful to identify a cDNA oligonucleotide and is derived from a restricted position in a mRNA or a cDNA. SOLUTION: This is the method of preparing a tag for identifying the cDNA oligonucleotide. The above method comprises preparing the cDNA oligonucleotide bearing 5' and 3' terminals, collecting cDNA fragments by cutting the cDNA oligonucleotide with a restriction enzyme at the first restriction endonuclease site, separating a cDNA oligonucleotide bearing 5' or 3' terminal and connecting an oligonucleotide linker to the isolated cDNA fragment bearing the cDNA oligonucleotide 5' or 3' terminal. Here, the oligonucleotide linker contains the recognition site of the second restriction endonuclease enzyme and the isolated cDNA fragment is cut with the second restriction endonuclease enzyme which cuts the cDNA fragment in a section separated from the recognition site to obtain the tag for identifying the cDNA oligonucleotide.

4,437 citations

Journal Article•DOI•

Mapping short DNA sequencing reads and calling variants using mapping quality scores

Heng Li¹, Jue Ruan, Richard Durbin•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Nov 2008-Genome Research

TL;DR: This work describes MAQ, software that builds assemblies by mapping shotgun short reads to a reference genome and uses quality scores to derive genotype calls of the consensus sequence of a diploid genome, e.g., from a human sample.

Abstract: New sequencing technologies promise a new era in the use of DNA sequence. However, some of these technologies produce very short reads, typically of a few tens of base pairs, and to use these reads effectively requires new algorithms and software. In particular, there is a major issue in efficiently aligning short reads to a reference genome and handling ambiguity or lack of accuracy in this alignment. Here we introduce the concept of mapping quality, a measure of the confidence that a read actually comes from the position it is aligned to by the mapping algorithm. We describe the software MAQ that can build assemblies by mapping shotgun short reads to a reference genome, using quality scores to derive genotype calls of the consensus sequence of a diploid genome, e.g., from a human sample. MAQ makes full use of mate-pair information and estimates the error probability of each read alignment. Error probabilities are also derived for the final genotype calls, using a Bayesian statistical model that incorporates the mapping qualities, error probabilities from the raw sequence quality scores, sampling of the two haplotypes, and an empirical model for correlated errors at a site. Both read mapping and genotype calling are evaluated on simulated data and real data. MAQ is accurate, efficient, versatile, and user-friendly. It is freely available at http://maq.sourceforge.net.
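The mapping quality introduced in the abstract above is a confidence measure for a read's placement. Such scores are conventionally reported on a Phred scale, Q = -10 · log10(P), where P is the probability that the alignment is wrong. The abstract does not spell out the scale, so the sketch below assumes it, and the function names and the cap of 99 are hypothetical rather than MAQ's own interface:

```python
import math


def mapping_quality(p_wrong: float, cap: int = 99) -> int:
    """Phred-scaled mapping quality from the probability of a wrong alignment.

    Q = -10 * log10(p_wrong), rounded to an integer and capped at `cap`
    (the cap value is an illustrative assumption).
    """
    if not 0.0 <= p_wrong <= 1.0:
        raise ValueError("p_wrong must be a probability in [0, 1]")
    if p_wrong == 0.0:
        return cap  # log10(0) is undefined; report the maximum score
    return min(cap, round(-10.0 * math.log10(p_wrong)))


def error_probability(quality: float) -> float:
    """Inverse transform: probability that the alignment is wrong."""
    return 10.0 ** (-quality / 10.0)
```

On this scale, a 1-in-1,000 chance of a misplaced read corresponds to a quality of 30, and each additional 10 points means a tenfold drop in the error probability.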

2,927 citations

Journal Article•DOI•

RNA-seq: An assessment of technical reproducibility and comparison with gene expression arrays

John C. Marioni¹, Christopher E. Mason, Shrikant Mane, Matthew Stephens, Yoav Gilad•Institutions (1)

University of Chicago¹

01 Sep 2008-Genome Research

TL;DR: It is found that the Illumina sequencing data are highly replicable, with relatively little technical variation, and thus, for many purposes, it may suffice to sequence each mRNA sample only once (i.e., using one lane).

Abstract: Ultra-high-throughput sequencing is emerging as an attractive alternative to microarrays for genotyping, analysis of methylation patterns, and identification of transcription factor binding sites. Here, we describe an application of the Illumina sequencing (formerly Solexa sequencing) platform to study mRNA expression levels. Our goals were to estimate technical variance associated with Illumina sequencing in this context and to compare its ability to identify differentially expressed genes with existing array technologies. To do so, we estimated gene expression differences between liver and kidney RNA samples using multiple sequencing replicates, and compared the sequencing data to results obtained from Affymetrix arrays using the same RNA samples. We find that the Illumina sequencing data are highly replicable, with relatively little technical variation, and thus, for many purposes, it may suffice to sequence each mRNA sample only once (i.e., using one lane). The information in a single lane of Illumina sequencing data appears comparable to that in a single array in enabling identification of differentially expressed genes, while allowing for additional analyses such as detection of low-expressed genes, alternative splice variants, and novel transcripts. Based on our observations, we propose an empirical protocol and a statistical framework for the analysis of gene expression using ultra-high-throughput sequencing technology.

2,834 citations

Journal Article•DOI•

SOAP: short oligonucleotide alignment program

Ruiqiang Li¹, Yingrui Li², Karsten Kristiansen², Jun Wang²•Institutions (2)

Beijing Genomics Institute¹, University of Southern Denmark²

01 Mar 2008-Bioinformatics

TL;DR: The program SOAP is designed to handle the huge amounts of short reads generated by parallel sequencing using the new generation Illumina-Solexa sequencing technology, which supports multi-threaded parallel computing and has a batch module for multiple query sets.

Abstract: Summary: We have developed a program SOAP for efficient gapped and ungapped alignment of short oligonucleotides onto reference sequences. The program is designed to handle the huge amounts of short reads generated by parallel sequencing using the new generation Illumina-Solexa sequencing technology. SOAP is compatible with numerous applications, including single-read or pair-end resequencing, small RNA discovery and mRNA tag sequence mapping. SOAP is a command-driven program, which supports multi-threaded parallel computing, and has a batch module for multiple query sets. Availability: http://soap.genomics.org.cn Contact: soap@genomics.org.cn

2,729 citations