Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences

doi:10.12688/F1000RESEARCH.7563.1

Open AccessJournal ArticleDOI

Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences

Charlotte Soneson, +4 more

- 30 Dec 2015 -

F1000Research

- Vol. 4, pp 1521-1521

Chats0

TLDR

It is illustrated that while the presence of differential isoform usage can lead to inflated false discovery rates in differential expression analyses on simple count matrices and transcript-level abundance estimates improve the performance in simulated data, the difference is relatively minor in several real data sets.

Abstract:

High-throughput sequencing of cDNA (RNA-seq) is used extensively to characterize the transcriptome of cells. Many transcriptomic studies aim at comparing either abundance levels or the transcriptome composition between given conditions, and as a first step, the sequencing reads must be used as the basis for abundance quantification of transcriptomic features of interest, such as genes or transcripts. Various quantification approaches have been proposed, ranging from simple counting of reads that overlap given genomic regions to more complex estimation of underlying transcript abundances. In this paper, we show that gene-level abundance estimates and statistical inference offer advantages over transcript-level analyses, in terms of performance and interpretability. We also illustrate that the presence of differential isoform usage can lead to inflated false discovery rates in differential gene expression analyses on simple count matrices but that this can be addressed by incorporating offsets derived from transcript-level abundance estimates. We also show that the problem is relatively minor in several real data sets. Finally, we provide an R package ( tximport) to help users integrate transcript-level abundance estimates from common quantification pipelines into count-based statistical inference engines.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

The commensal microbiome is associated with anti-PD-1 efficacy in metastatic melanoma patients

Vyara Matson, +7 more

- 05 Jan 2018 -

Science

TL;DR: The results suggest that the commensal microbiome may have a mechanistic impact on antitumor immunity in human cancer patients and could lead to improved tumor control, augmented T cell responses, and greater efficacy of anti–PD-L1 therapy.

...read moreread less

Journal ArticleDOI

RNA sequencing: the teenage years

Rory Stark, +2 more

- 24 Jul 2019 -

Nature Reviews Genetics

TL;DR: Advances in RNA-sequencing technologies and methods over the past decade are discussed and adaptations that are enabling a fuller understanding of RNA biology are outlined, from when and where an RNA is expressed to the structures it adopts.

...read moreread less

Journal ArticleDOI

Heavy-tailed prior distributions for sequence count data: removing the noise and preserving large differences.

Anqi Zhu, +2 more

- 01 Jun 2019 -

Bioinformatics

TL;DR: The proposed method, Approximate Posterior Estimation for generalized linear model, apeglm, has lower bias than previously proposed shrinkage estimators, while still reducing variance for those genes with little information for statistical inference.

...read moreread less

Journal ArticleDOI

The complete sequence of a human genome

Sergey Koren, +6 more

- 01 Apr 2022 -

Science

TL;DR: The T2T-CHM13-T2T Consortium presented a complete 3.055 billion-base pair sequence of a human genome, including gapless assemblies for all chromosomes except Y, corrected errors in the prior references, and introduced nearly 200 million base pairs of sequence containing gene predictions, 99 of which are predicted to be protein coding as discussed by the authors .

...read moreread less

Journal ArticleDOI

Functional aspects of meningeal lymphatics in ageing and Alzheimer’s disease

Sandro Da Mesquita, +25 more

- 25 Jul 2018 -

Nature

TL;DR: It is shown that meningeal lymphatic vessels drain macromolecules from the CNS (cerebrospinal and interstitial fluids) into the cervical lymph nodes in mice and improves brain perfusion and learning and memory performance.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2

Michael I. Love, +3 more

- 05 Dec 2014 -

Genome Biology

TL;DR: This work presents DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates, which enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression.

...read moreread less

Journal ArticleDOI

STAR: ultrafast universal RNA-seq aligner

Alexander Dobin, +8 more

- 01 Jan 2013 -

Bioinformatics

TL;DR: The Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure outperforms other aligners by a factor of >50 in mapping speed.

...read moreread less

Journal ArticleDOI

edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Mark D. Robinson, +2 more

- 01 Jan 2010 -

Bioinformatics

TL;DR: EdgeR as mentioned in this paper is a Bioconductor software package for examining differential expression of replicated count data, which uses an overdispersed Poisson model to account for both biological and technical variability and empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference.

...read moreread less

Journal ArticleDOI

limma powers differential expression analyses for RNA-sequencing and microarray studies

Matthew E. Ritchie, +7 more

- 20 Apr 2015 -

Nucleic Acids Research

TL;DR: The philosophy and design of the limma package is reviewed, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

...read moreread less

Journal ArticleDOI

HTSeq—a Python framework to work with high-throughput sequencing data

Simon Anders, +2 more

- 15 Jan 2015 -

Bioinformatics

TL;DR: This work presents HTSeq, a Python library to facilitate the rapid development of custom scripts for high-throughput sequencing data analysis, and presents htseq-count, a tool developed with HTSequ that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.

...read moreread less