scispace - formally typeset
Search or ask a question
Journal ArticleDOI

RNA-Seq: a revolutionary tool for transcriptomics

01 Jan 2009-Nature Reviews Genetics (Nature Publishing Group)-Vol. 10, Iss: 1, pp 57-63
TL;DR: The RNA-Seq approach to transcriptome profiling that uses deep-sequencing technologies provides a far more precise measurement of levels of transcripts and their isoforms than other methods.
Abstract: RNA-Seq is a recently developed approach to transcriptome profiling that uses deep-sequencing technologies. Studies using this method have already altered our view of the extent and complexity of eukaryotic transcriptomes. RNA-Seq also provides a far more precise measurement of levels of transcripts and their isoforms than other methods. This article describes the RNA-Seq approach, the challenges associated with its application, and the advances made so far in characterizing several eukaryote transcriptomes.

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI
01 May 2014-Gut
TL;DR: In vitro and in vivo functional assays prove that ADAR1 functions as an oncogene while ADAR2 has tumour suppressive ability in HCC, highlighting the fact that the differentially expressed ADARs in tumours has great prognostic value and diagnostic potential for HCC.
Abstract: Objective Hepatocellular carcinoma (HCC) is a heterogeneous tumour displaying a complex variety of genetic and epigenetic changes. In human cancers, aberrant post-transcriptional modifications, such as alternative splicing and RNA editing, may lead to tumour specific transcriptome diversity. Design By utilising large scale transcriptome sequencing of three paired HCC clinical specimens and their adjacent non-tumour (NT) tissue counterparts at depth, we discovered an average of 20 007 inferred A to I (adenosine to inosine) RNA editing events in transcripts. The roles of the double stranded RNA specific ADAR (Adenosine DeAminase that act on RNA) family members (ADARs) and the altered gene specific editing patterns were investigated in clinical specimens, cell models and mice. Results HCC displays a severely disrupted A to I RNA editing balance. ADAR1 and ADAR2 manipulate the A to I imbalance of HCC via their differential expression in HCC compared with NT liver tissues. Patients with ADAR1 overexpression and ADAR2 downregulation in tumours demonstrated an increased risk of liver cirrhosis and postoperative recurrence and had poor prognoses. Due to the differentially expressed ADAR1 and ADAR2 in tumours, the altered gene specific editing activities, which was reflected by the hyper-editing of FLNB (filamin B, β) and the hypo-editing of COPA (coatomer protein complex, subunit α), are closely associated with HCC pathogenesis. In vitro and in vivo functional assays prove that ADAR1 functions as an oncogene while ADAR2 has tumour suppressive ability in HCC. Conclusions These findings highlight the fact that the differentially expressed ADARs in tumours, which are responsible for an A to I editing imbalance, has great prognostic value and diagnostic potential for HCC.

182 citations

Journal ArticleDOI
TL;DR: It is argued here, however, that new multidisciplinary approaches—especially the integration of epidemiology with systems biology in what is called “systems epidemiology”—will be required to eliminate TB.
Abstract: Renewed efforts in tuberculosis (TB) research have led to important new insights into the biology and epidemiology of this devastating disease. Yet, in the face of the modern epidemics of HIV/AIDS, diabetes, and multidrug resistance—all of which contribute to susceptibility to TB—global control of the disease will remain a formidable challenge for years to come. New high-throughput genomics technologies are already contributing to studies of TB's epidemiology, comparative genomics, evolution, and host–pathogen interaction. We argue here, however, that new multidisciplinary approaches—especially the integration of epidemiology with systems biology in what we call “systems epidemiology”—will be required to eliminate TB.

182 citations


Cites background or methods from "RNA-Seq: a revolutionary tool for t..."

  • ...Recent studies, for example, have reported a role for small regulatory RNAs in M. tuberculosis [58], and there is little doubt more regulatory RNAs will soon be identified by RNA-seq [57]....

    [...]

  • ...used to study both the host and the pathogen is sequence-based transcriptomics, in which gene expression is measured by whole genome sequencing of RNA transcripts; a method referred to as RNA-seq [57]....

    [...]

  • ...Another approach increasingly being PLoS Pathogens | www.plospathogens.org 5 October 2009 | Volume 5 | Issue 10 | e1000600 used to study both the host and the pathogen is sequence-based transcriptomics, in which gene expression is measured by whole genome sequencing of RNA transcripts; a method referred to as RNA-seq [57]....

    [...]

  • ...tuberculosis [58], and there is little doubt more regulatory RNAs will soon be identified by RNA-seq [57]....

    [...]

  • ...RNA-seq is particularly useful for genome-wide studies of small regulatory RNAs, as such studies are more difficult to perform using standard DNA microarrays....

    [...]

Journal ArticleDOI
TL;DR: These findings, focusing on results from profiling mammalian transcriptomes, include the detection of substantial new transcript complexity, the elucidation of binding maps and regulatory properties of RNA-binding proteins, and new insights into the links between different steps in pre-mRNA processing.
Abstract: Recent papers have described the first application of high-throughput sequencing (HTS) technologies to the characterization of transcriptomes. These studies emphasize the tremendous power of this new technology, in terms of both profiling coverage and quantitative accuracy. Initial discoveries include the detection of substantial new transcript complexity, the elucidation of binding maps and regulatory properties of RNA-binding proteins, and new insights into the links between different steps in pre-mRNA processing. We review these findings, focusing on results from profiling mammalian transcriptomes. The strengths and limitations of HTS relative to microarray profiling are discussed. We also consider how future advances in HTS technology are likely to transform our understanding of integrated cellular networks operating at the RNA level.

181 citations


Cites result from "RNA-Seq: a revolutionary tool for t..."

  • ...For a more detailed comparison of current HTS technologies and how these compare with microarray-based methods, we refer the reader to a recent review by Wang et al. (2009)....

    [...]

Journal ArticleDOI
Li-Xin Xiang1, Ding-Xin He1, Wei-ren Dong1, Yi-wen Zhang1, Jian-Zhong Shao1 
TL;DR: An overall conservation of the components and transcriptome alterations underlying innate and adaptive immunity in fish and other vertebrate models is indicated, suggesting the acquisition of numerous fish-specific immune system components during early vertebrate evolution.
Abstract: Systematic research on fish immunogenetics is indispensable in understanding the origin and evolution of immune systems. This has long been a challenging task because of the limited number of deep sequencing technologies and genome backgrounds of non-model fish available. The newly developed Solexa/Illumina RNA-seq and Digital gene expression (DGE) are high-throughput sequencing approaches and are powerful tools for genomic studies at the transcriptome level. This study reports the transcriptome profiling analysis of bacteria-challenged Lateolabrax japonicus using RNA-seq and DGE in an attempt to gain insights into the immunogenetics of marine fish. RNA-seq analysis generated 169,950 non-redundant consensus sequences, among which 48,987 functional transcripts with complete or various length encoding regions were identified. More than 52% of these transcripts are possibly involved in approximately 219 known metabolic or signalling pathways, while 2,673 transcripts were associated with immune-relevant genes. In addition, approximately 8% of the transcripts appeared to be fish-specific genes that have never been described before. DGE analysis revealed that the host transcriptome profile of Vibrio harveyi- challenged L. japonicus is considerably altered, as indicated by the significant up- or down-regulation of 1,224 strong infection-responsive transcripts. Results indicated an overall conservation of the components and transcriptome alterations underlying innate and adaptive immunity in fish and other vertebrate models. Analysis suggested the acquisition of numerous fish-specific immune system components during early vertebrate evolution. This study provided a global survey of host defence gene activities against bacterial challenge in a non-model marine fish. Results can contribute to the in-depth study of candidate genes in marine fish immunity, and help improve current understanding of host-pathogen interactions and evolutionary history of immunogenetics from fish to mammals.

181 citations


Cites background from "RNA-Seq: a revolutionary tool for t..."

  • ...Recently developed RNA deep sequencing technologies, such as Solexa/Illumina RNA-seq and Digital gene expression (DGE), have dramatically changed the way immune-related genes in fish are identified because these technologies facilitate the investigation of the functional complexity of transcriptomes [11,12]....

    [...]

Journal ArticleDOI
TL;DR: The Allen Human Brain Atlas is a freely available multimodal atlas of gene expression and anatomy comprising a comprehensive 'all genes-all structures' array-based dataset of geneexpression and complementary in situ hybridization gene expression studies targeting selected genes in specific brain regions.

181 citations

References
More filters
Journal ArticleDOI
TL;DR: Although >90% of uniquely mapped reads fell within known exons, the remaining data suggest new and revised gene models, including changed or additional promoters, exons and 3′ untranscribed regions, as well as new candidate microRNA precursors.
Abstract: We have mapped and quantified mouse transcriptomes by deeply sequencing them and recording how frequently each gene is represented in the sequence sample (RNA-Seq). This provides a digital measure of the presence and prevalence of transcripts from known and previously unknown genes. We report reference measurements composed of 41–52 million mapped 25-base-pair reads for poly(A)-selected RNA from adult mouse brain, liver and skeletal muscle tissues. We used RNA standards to quantify transcript prevalence and to test the linear range of transcript detection, which spanned five orders of magnitude. Although >90% of uniquely mapped reads fell within known exons, the remaining data suggest new and revised gene models, including changed or additional promoters, exons and 3′ untranscribed regions, as well as new candidate microRNA precursors. RNA splice events, which are not readily measured by standard gene expression microarray or serial analysis of gene expression methods, were detected directly by mapping splice-crossing sequence reads. We observed 1.45 × 10 5 distinct splices, and alternative splices were prominent, with 3,500 different genes expressing one or more alternate internal splices. The mRNA population specifies a cell’s identity and helps to govern its present and future activities. This has made transcriptome analysis a general phenotyping method, with expression microarrays of many kinds in routine use. Here we explore the possibility that transcriptome analysis, transcript discovery and transcript refinement can be done effectively in large and complex mammalian genomes by ultra-high-throughput sequencing. Expression microarrays are currently the most widely used methodology for transcriptome analysis, although some limitations persist. These include hybridization and cross-hybridization artifacts 1–3 , dye-based detection issues and design constraints that preclude or seriously limit the detection of RNA splice patterns and previously unmapped genes. These issues have made it difficult for standard array designs to provide full sequence comprehensiveness (coverage of all possible genes, including unknown ones, in large genomes) or transcriptome comprehensiveness (reliable detection of all RNAs of all prevalence classes, including the least abundant ones that are physiologically relevant). Other

12,293 citations

PatentDOI
04 Oct 2000-Science
TL;DR: Serial analysis of gene expression (SAGE) should provide a broadly applicable means for the quantitative cataloging and comparison of expressed genes in a variety of normal, developmental, and disease states.
Abstract: PROBLEM TO BE SOLVED: To provide a method for preparing a short nucleotide sequence (tag) which is useful to identify a cDNA oligonucleotide and is derived from a restricted position in a mRNA or a cDNA. SOLUTION: This is the method of preparing a tag for identifying the cDNA oligonucleotide. The above method comprises preparing the cDNA oligonucleotide bearing 5' and 3' terminals, collecting cDNA fragments by cutting the cDNA oligonucleotide with a restriction enzyme at the first restriction endonuclease site, separating a cDNA oligonucleotide bearing 5' or 3' terminal and connecting an oligonucleotide linker to the isolated cDNA fragment bearing the cDNA oligonucleotide 5' or 3' terminal. Here, the oligonucleotide linker contains the recognition site of the second restriction endonuclease enzyme and the isolated cDNA fragment is cut with the second restriction endonuclease enzyme which cuts the cDNA fragment in a section separated from the recognition site to obtain the tag for identifying the cDNA oligonucleotide.

4,437 citations

Journal ArticleDOI
TL;DR: This work describes the software MAQ, software that can build assemblies by mapping shotgun short reads to a reference genome, using quality scores to derive genotype calls of the consensus sequence of a diploid genome, e.g., from a human sample.
Abstract: New sequencing technologies promise a new era in the use of DNA sequence. However, some of these technologies produce very short reads, typically of a few tens of base pairs, and to use these reads effectively requires new algorithms and software. In particular, there is a major issue in efficiently aligning short reads to a reference genome and handling ambiguity or lack of accuracy in this alignment. Here we introduce the concept of mapping quality, a measure of the confidence that a read actually comes from the position it is aligned to by the mapping algorithm. We describe the software MAQ that can build assemblies by mapping shotgun short reads to a reference genome, using quality scores to derive genotype calls of the consensus sequence of a diploid genome, e.g., from a human sample. MAQ makes full use of mate-pair information and estimates the error probability of each read alignment. Error probabilities are also derived for the final genotype calls, using a Bayesian statistical model that incorporates the mapping qualities, error probabilities from the raw sequence quality scores, sampling of the two haplotypes, and an empirical model for correlated errors at a site. Both read mapping and genotype calling are evaluated on simulated data and real data. MAQ is accurate, efficient, versatile, and user-friendly. It is freely available at http://maq.sourceforge.net.

2,927 citations

Journal ArticleDOI
TL;DR: It is found that the Illumina sequencing data are highly replicable, with relatively little technical variation, and thus, for many purposes, it may suffice to sequence each mRNA sample only once (i.e., using one lane).
Abstract: Ultra-high-throughput sequencing is emerging as an attractive alternative to microarrays for genotyping, analysis of methylation patterns, and identification of transcription factor binding sites. Here, we describe an application of the Illumina sequencing (formerly Solexa sequencing) platform to study mRNA expression levels. Our goals were to estimate technical variance associated with Illumina sequencing in this context and to compare its ability to identify differentially expressed genes with existing array technologies. To do so, we estimated gene expression differences between liver and kidney RNA samples using multiple sequencing replicates, and compared the sequencing data to results obtained from Affymetrix arrays using the same RNA samples. We find that the Illumina sequencing data are highly replicable, with relatively little technical variation, and thus, for many purposes, it may suffice to sequence each mRNA sample only once (i.e., using one lane). The information in a single lane of Illumina sequencing data appears comparable to that in a single array in enabling identification of differentially expressed genes, while allowing for additional analyses such as detection of low-expressed genes, alternative splice variants, and novel transcripts. Based on our observations, we propose an empirical protocol and a statistical framework for the analysis of gene expression using ultra-high-throughput sequencing technology.

2,834 citations

Journal ArticleDOI
TL;DR: The program SOAP is designed to handle the huge amounts of short reads generated by parallel sequencing using the new generation Illumina-Solexa sequencing technology, which supports multi-threaded parallel computing and has a batch module for multiple query sets.
Abstract: Summary: We have developed a program SOAP for efficient gapped and ungapped alignment of short oligonucleotides onto reference sequences. The program is designed to handle the huge amounts of short reads generated by parallel sequencing using the new generation Illumina-Solexa sequencing technology. SOAP is compatible with numerous applications, including single-read or pair-end resequencing, small RNA discovery and mRNA tag sequence mapping. SOAP is a command-driven program, which supports multi-threaded parallel computing, and has a batch module for multiple query sets. Availability: http://soap.genomics.org.cn Contact: soap@genomics.org.cn

2,729 citations


"RNA-Seq: a revolutionary tool for t..." refers methods in this paper

  • ...There are several programs for mapping reads to the genome, including ELAND, SOA...

    [...]