scispace - formally typeset
Open AccessJournal ArticleDOI

Assessment of transcript reconstruction methods for RNA-seq

TLDR
The results show that most algorithms are able to identify discrete transcript components with high success rates but that assembly of complete isoform structures poses a major challenge even when all constituent elements are identified.
Abstract
We evaluated 25 protocol variants of 14 independent computational methods for exon identification, transcript reconstruction and expression-level quantification from RNA-seq data. Our results show that most algorithms are able to identify discrete transcript components with high success rates but that assembly of complete isoform structures poses a major challenge even when all constituent elements are identified. Expression-level estimates also varied widely across methods, even when based on similar transcript models. Consequently, the complexity of higher eukaryotic genomes imposes severe limitations on transcript recall and splice product discrimination that are likely to remain limiting factors for the analysis of current-generation RNA-seq data.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

StringTie enables improved reconstruction of a transcriptome from RNA-seq reads

TL;DR: StringTie, a computational method that applies a network flow algorithm originally developed in optimization theory, together with optional de novo assembly, to assemble these complex data sets into transcripts produces more complete and accurate reconstructions of genes and better estimates of expression levels.
Journal ArticleDOI

GENCODE reference annotation for the human and mouse genomes.

TL;DR: This work generates primary data, creates bioinformatics tools and provides analysis to support the work of expert manual gene annotators and automated gene annotation pipelines to identify and characterise gene loci to the highest standard.
Journal ArticleDOI

A survey of best practices for RNA-seq data analysis

TL;DR: All of the major steps in RNA-seq data analysis are reviewed, including experimental design, quality control, read alignment, quantification of gene and transcript levels, visualization, differential gene expression, alternative splicing, functional analysis, gene fusion detection and eQTL mapping.
Journal ArticleDOI

PacBio Sequencing and Its Applications.

TL;DR: Single-molecule, real-time sequencing developed by Pacific BioSciences offers longer read lengths than the second-generation sequencing technologies, making it well-suited for unsolved problems in genome, transcriptome, and epigenetics research.
References
More filters
Journal ArticleDOI

Fast and accurate short read alignment with Burrows–Wheeler transform

TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.
Journal ArticleDOI

Regression Shrinkage and Selection via the Lasso

TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.
Journal ArticleDOI

STAR: ultrafast universal RNA-seq aligner

TL;DR: The Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure outperforms other aligners by a factor of >50 in mapping speed.
Journal ArticleDOI

RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome

TL;DR: It is shown that accurate gene-level abundance estimates are best obtained with large numbers of short single-end reads, and estimates of the relative frequencies of isoforms within single genes may be improved through the use of paired- end reads, depending on the number of possible splice forms for each gene.
Related Papers (5)
Trending Questions (1)
What are the potential limitations and challenges in analyzing transcripts for the purpose of assessing foreign language acquisition?

The provided paper is about the assessment of transcript reconstruction methods for RNA-seq data. It does not provide information about the potential limitations and challenges in analyzing transcripts for the purpose of assessing foreign language acquisition.