Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences
Reads0
Chats0
TLDR
It is illustrated that while the presence of differential isoform usage can lead to inflated false discovery rates in differential expression analyses on simple count matrices and transcript-level abundance estimates improve the performance in simulated data, the difference is relatively minor in several real data sets.Abstract:
High-throughput sequencing of cDNA (RNA-seq) is used extensively to characterize the transcriptome of cells. Many transcriptomic studies aim at comparing either abundance levels or the transcriptome composition between given conditions, and as a first step, the sequencing reads must be used as the basis for abundance quantification of transcriptomic features of interest, such as genes or transcripts. Various quantification approaches have been proposed, ranging from simple counting of reads that overlap given genomic regions to more complex estimation of underlying transcript abundances. In this paper, we show that gene-level abundance estimates and statistical inference offer advantages over transcript-level analyses, in terms of performance and interpretability. We also illustrate that the presence of differential isoform usage can lead to inflated false discovery rates in differential gene expression analyses on simple count matrices but that this can be addressed by incorporating offsets derived from transcript-level abundance estimates. We also show that the problem is relatively minor in several real data sets. Finally, we provide an R package ( tximport) to help users integrate transcript-level abundance estimates from common quantification pipelines into count-based statistical inference engines.read more
Citations
More filters
Journal ArticleDOI
The commensal microbiome is associated with anti-PD-1 efficacy in metastatic melanoma patients
Vyara Matson,Jessica Fessler,Riyue Bao,Tara Chongsuwat,Yuanyuan Zha,Maria-Luisa Alegre,Jason J. Luke,Thomas F. Gajewski +7 more
TL;DR: The results suggest that the commensal microbiome may have a mechanistic impact on antitumor immunity in human cancer patients and could lead to improved tumor control, augmented T cell responses, and greater efficacy of anti–PD-L1 therapy.
Journal ArticleDOI
RNA sequencing: the teenage years
TL;DR: Advances in RNA-sequencing technologies and methods over the past decade are discussed and adaptations that are enabling a fuller understanding of RNA biology are outlined, from when and where an RNA is expressed to the structures it adopts.
Journal ArticleDOI
Heavy-tailed prior distributions for sequence count data: removing the noise and preserving large differences.
TL;DR: The proposed method, Approximate Posterior Estimation for generalized linear model, apeglm, has lower bias than previously proposed shrinkage estimators, while still reducing variance for those genes with little information for statistical inference.
Journal ArticleDOI
The complete sequence of a human genome
Sergey Koren,Sergey Nurk,Mikko Rautiainen,B Ren,Weijun Zhu,Richard Lawless,Саидмуродов Мамур Таирович +6 more
TL;DR: The T2T-CHM13-T2T Consortium presented a complete 3.055 billion-base pair sequence of a human genome, including gapless assemblies for all chromosomes except Y, corrected errors in the prior references, and introduced nearly 200 million base pairs of sequence containing gene predictions, 99 of which are predicted to be protein coding as discussed by the authors .
Journal ArticleDOI
Functional aspects of meningeal lymphatics in ageing and Alzheimer’s disease
Sandro Da Mesquita,Antoine Louveau,Andrea Vaccari,Igor Smirnov,R. Chase Cornelison,Kathryn M. Kingsmore,Christian Contarino,Christian Contarino,Suna Onengut-Gumuscu,Emily Farber,Daniel M.S. Raper,Daniel M.S. Raper,Kenneth E. Viar,Romie D. Powell,Wendy Baker,Nisha Dabhi,Robin Bai,Rui Cao,Song Hu,Stephen S. Rich,Jennifer M. Munson,Jennifer M. Munson,M. Beatriz S. Lopes,Christopher C. Overall,Scott T. Acton,Jonathan Kipnis +25 more
TL;DR: It is shown that meningeal lymphatic vessels drain macromolecules from the CNS (cerebrospinal and interstitial fluids) into the cervical lymph nodes in mice and improves brain perfusion and learning and memory performance.
References
More filters
Journal ArticleDOI
Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2
TL;DR: This work presents DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates, which enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression.
Journal ArticleDOI
STAR: ultrafast universal RNA-seq aligner
Alexander Dobin,Carrie A. Davis,Felix Schlesinger,Jorg Drenkow,Chris Zaleski,Sonali Jha,Philippe Batut,Mark Chaisson,Thomas R. Gingeras +8 more
TL;DR: The Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure outperforms other aligners by a factor of >50 in mapping speed.
Journal ArticleDOI
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.
TL;DR: EdgeR as mentioned in this paper is a Bioconductor software package for examining differential expression of replicated count data, which uses an overdispersed Poisson model to account for both biological and technical variability and empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference.
Journal ArticleDOI
limma powers differential expression analyses for RNA-sequencing and microarray studies
Matthew E. Ritchie,Belinda Phipson,Di Wu,Yifang Hu,Charity W. Law,Wei Shi,Gordon K. Smyth,Gordon K. Smyth +7 more
TL;DR: The philosophy and design of the limma package is reviewed, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.
Journal ArticleDOI
HTSeq—a Python framework to work with high-throughput sequencing data
TL;DR: This work presents HTSeq, a Python library to facilitate the rapid development of custom scripts for high-throughput sequencing data analysis, and presents htseq-count, a tool developed with HTSequ that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.