Open Access Journal Article

TransRate: reference-free quality assessment of de novo transcriptome assemblies

TLDR
TransRate is a tool for reference-free quality assessment of de novo transcriptome assemblies that uses only the sequenced reads and the assembly as input. Applied to published assemblies, it reveals that variance in the quality of the input data explains 43% of the variance in the quality of published de novo transcriptome assemblies.
Abstract
TransRate is a tool for reference-free quality assessment of de novo transcriptome assemblies. Using only the sequenced reads and the assembly as input, we show that multiple common artifacts of de novo transcriptome assembly can be readily detected. These include chimeras, structural errors, incomplete assembly, and base errors. TransRate evaluates these errors to produce a diagnostic quality score for each contig, and these contig scores are integrated to evaluate whole assemblies. Thus, TransRate can be used for de novo assembly filtering and optimization as well as comparison of assemblies generated using different methods from the same input reads. Applying the method to a data set of 155 published de novo transcriptome assemblies, we deconstruct the contribution that assembly method, read length, read quantity, and read quality make to the accuracy of de novo transcriptome assemblies and reveal that variance in the quality of the input data explains 43% of the variance in the quality of published de novo transcriptome assemblies. Because TransRate is reference-free, it is suitable for assessment of assemblies of all types of RNA, including assemblies of long noncoding RNA, rRNA, mRNA, and mixed RNA samples.
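
The abstract says that per-contig scores are integrated into a whole-assembly score and can be used for filtering, but it does not give the formulas. The Python sketch below is only an illustration of that idea under stated assumptions: each contig is assumed to have a few error components in [0, 1] that are multiplied together, and the assembly score is assumed to be the geometric mean of contig scores scaled by the fraction of reads that support the assembly. The function names, the `mapped_fraction` parameter, and the cutoff are hypothetical and are not TransRate's API.

```python
import math
from typing import Dict, List, Sequence


def contig_score(components: Sequence[float]) -> float:
    """Multiply per-contig error components, each assumed to lie in [0, 1]."""
    score = 1.0
    for c in components:
        if not 0.0 <= c <= 1.0:
            raise ValueError("component scores must lie in [0, 1]")
        score *= c
    return score


def assembly_score(contig_scores: Sequence[float], mapped_fraction: float) -> float:
    """Geometric mean of contig scores, scaled by an assumed read-support fraction."""
    if not contig_scores:
        raise ValueError("need at least one contig score")
    if any(s <= 0.0 for s in contig_scores):
        # A single zero drives the geometric mean to zero.
        return 0.0
    log_mean = sum(math.log(s) for s in contig_scores) / len(contig_scores)
    return math.exp(log_mean) * mapped_fraction


def filter_contigs(scores: Dict[str, float], cutoff: float) -> List[str]:
    """Keep the names of contigs whose score is at least the (hypothetical) cutoff."""
    return [name for name, s in scores.items() if s >= cutoff]


if __name__ == "__main__":
    # Made-up component values, for illustration only.
    scores = {
        "contig_1": contig_score([0.9, 0.8, 1.0, 0.95]),
        "contig_2": contig_score([0.4, 0.5, 0.9, 0.6]),
    }
    print(assembly_score(list(scores.values()), mapped_fraction=0.85))
    print(filter_contigs(scores, cutoff=0.5))
```

A multiplicative contig score means that any single severe error type pulls the contig's score down, which is one plausible way to make the score diagnostic; the scoring actually used by TransRate should be taken from the full paper.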

