Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2
Reads0
Chats0
TLDR
This work presents DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates, which enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression.Abstract:
In comparative high-throughput sequencing assays, a fundamental task is the analysis of count data, such as read counts per gene in RNA-seq, for evidence of systematic changes across experimental conditions. Small replicate numbers, discreteness, large dynamic range and the presence of outliers require a suitable statistical approach. We present DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates. This enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression. The DESeq2 package is available at http://www.bioconductor.org/packages/release/bioc/html/DESeq2.html
.read more
Citations
More filters
Journal ArticleDOI
HTSeq—a Python framework to work with high-throughput sequencing data
TL;DR: This work presents HTSeq, a Python library to facilitate the rapid development of custom scripts for high-throughput sequencing data analysis, and presents htseq-count, a tool developed with HTSequ that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.
Journal ArticleDOI
Comprehensive Integration of Single-Cell Data.
Tim Stuart,Andrew Butler,Paul J. Hoffman,Christoph Hafemeister,Efthymia Papalexi,William M. Mauck,Yuhan Hao,Marlon Stoeckius,Peter Smibert,Rahul Satija +9 more
TL;DR: A strategy to "anchor" diverse datasets together, enabling us to integrate single-cell measurements not only across scRNA-seq technologies, but also across different modalities.
Journal ArticleDOI
Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown
TL;DR: This protocol describes all the steps necessary to process a large set of raw sequencing reads and create lists of gene transcripts, expression levels, and differentially expressed genes and transcripts.
Journal ArticleDOI
Imbalanced Host Response to SARS-CoV-2 Drives Development of COVID-19.
Daniel Blanco-Melo,Benjamin E. Nilsson-Payant,Wen-Chun Liu,Skyler Uhl,Daisy A. Hoagland,Rasmus Møller,Tristan X. Jordan,Kohei Oishi,Maryline Panis,David H. Sachs,Taia T. Wang,Robert E. Schwartz,Jean K. Lim,Randy A. Albrecht,Benjamin R. tenOever +14 more
TL;DR: It is proposed that reduced innate antiviral defenses coupled with exuberant inflammatory cytokine production are the defining and driving features of COVID-19.
Journal ArticleDOI
The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update
Enis Afgan,Dannon Baker,Bérénice Batut,Marius van den Beek,Dave Bouvier,Martin Čech,John Chilton,Dave Clements,Nate Coraor,Björn Grüning,Aysam Guerler,Jennifer Hillman-Jackson,Saskia Hiltemann,Vahid Jalili,Helena Rasche,Nicola Soranzo,Jeremy Goecks,James Taylor,Anton Nekrutenko,Daniel Blankenberg +19 more
TL;DR: Improvements to Galaxy's core framework, user interface, tools, and training materials enable Galaxy to be used for analyzing tens of thousands of datasets, and >5500 tools are now available from the Galaxy ToolShed.
References
More filters
Journal ArticleDOI
Controlling the false discovery rate: a practical and powerful approach to multiple testing
Yoav Benjamini,Yosef Hochberg +1 more
TL;DR: In this paper, a different approach to problems of multiple significance testing is presented, which calls for controlling the expected proportion of falsely rejected hypotheses -the false discovery rate, which is equivalent to the FWER when all hypotheses are true but is smaller otherwise.
Journal ArticleDOI
Handbook of Mathematical Functions
Journal ArticleDOI
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.
TL;DR: EdgeR as mentioned in this paper is a Bioconductor software package for examining differential expression of replicated count data, which uses an overdispersed Poisson model to account for both biological and technical variability and empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference.
Book
Generalized Linear Models
Peter McCullagh,John A. Nelder +1 more
TL;DR: In this paper, a generalization of the analysis of variance is given for these models using log- likelihoods, illustrated by examples relating to four distributions; the Normal, Binomial (probit analysis, etc.), Poisson (contingency tables), and gamma (variance components).
Book
The Elements of Statistical Learning: Data Mining, Inference, and Prediction
TL;DR: In this paper, the authors describe the important ideas in these areas in a common conceptual framework, and the emphasis is on concepts rather than mathematics, with a liberal use of color graphics.