Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2
TL;DR
This work presents DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates, which enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression.
Abstract:
In comparative high-throughput sequencing assays, a fundamental task is the analysis of count data, such as read counts per gene in RNA-seq, for evidence of systematic changes across experimental conditions. Small replicate numbers, discreteness, large dynamic range and the presence of outliers require a suitable statistical approach. We present DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates. This enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression. The DESeq2 package is available at http://www.bioconductor.org/packages/release/bioc/html/DESeq2.html.
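The idea of shrinking fold changes can be illustrated with a minimal sketch. It is not the DESeq2 implementation: it assumes a simple normal-normal model in which each observed log2 fold change has a known standard error and the prior is a zero-centered normal with a hypothetical width `prior_sd`. The function name and the example numbers are illustrative only.

```python
def shrink_lfc(lfc, se, prior_sd):
    """Posterior mean of a log2 fold change under a zero-centered normal prior.

    Assumption (not taken from the paper): the observed estimate is normal
    with standard error `se`, and the prior is normal with mean 0 and
    standard deviation `prior_sd`.
    """
    prior_var = prior_sd ** 2
    # Weight in [0, 1): close to 1 for precise estimates, close to 0 for noisy ones.
    weight = prior_var / (prior_var + se ** 2)
    return weight * lfc


# Two hypothetical genes with the same raw estimate but different precision.
well_measured = shrink_lfc(lfc=2.0, se=0.2, prior_sd=1.0)
noisy = shrink_lfc(lfc=2.0, se=2.0, prior_sd=1.0)

print(f"well measured: {well_measured:.3f}")
print(f"noisy:         {noisy:.3f}")
```

Under these assumptions the imprecise estimate is pulled strongly toward zero while the precise one barely moves, which is the sense in which shrinkage makes effect sizes more comparable across genes.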
Citations
Journal Article
HTSeq—a Python framework to work with high-throughput sequencing data
TL;DR: This work presents HTSeq, a Python library to facilitate the rapid development of custom scripts for high-throughput sequencing data analysis, and htseq-count, a tool developed with HTSeq that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.
Journal Article
Comprehensive Integration of Single-Cell Data.
Tim Stuart, Andrew Butler, Paul J. Hoffman, Christoph Hafemeister, Efthymia Papalexi, William M. Mauck, Yuhan Hao, Marlon Stoeckius, Peter Smibert, Rahul Satija
TL;DR: A strategy to "anchor" diverse datasets together, enabling us to integrate single-cell measurements not only across scRNA-seq technologies, but also across different modalities.
Journal Article
Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown
TL;DR: This protocol describes all the steps necessary to process a large set of raw sequencing reads and create lists of gene transcripts, expression levels, and differentially expressed genes and transcripts.
Journal Article
Imbalanced Host Response to SARS-CoV-2 Drives Development of COVID-19.
Daniel Blanco-Melo, Benjamin E. Nilsson-Payant, Wen-Chun Liu, Skyler Uhl, Daisy A. Hoagland, Rasmus Møller, Tristan X. Jordan, Kohei Oishi, Maryline Panis, David H. Sachs, Taia T. Wang, Robert E. Schwartz, Jean K. Lim, Randy A. Albrecht, Benjamin R. tenOever
TL;DR: It is proposed that reduced innate antiviral defenses coupled with exuberant inflammatory cytokine production are the defining and driving features of COVID-19.
Journal Article
The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update
Enis Afgan, Dannon Baker, Bérénice Batut, Marius van den Beek, Dave Bouvier, Martin Čech, John Chilton, Dave Clements, Nate Coraor, Björn Grüning, Aysam Guerler, Jennifer Hillman-Jackson, Saskia Hiltemann, Vahid Jalili, Helena Rasche, Nicola Soranzo, Jeremy Goecks, James Taylor, Anton Nekrutenko, Daniel Blankenberg
TL;DR: Improvements to Galaxy's core framework, user interface, tools, and training materials enable Galaxy to be used for analyzing tens of thousands of datasets, and >5500 tools are now available from the Galaxy ToolShed.
References
Journal Article
Differential expression analysis for sequence count data.
Simon Anders, Wolfgang Huber
TL;DR: A method based on the negative binomial distribution, with variance and mean linked by local regression, is proposed and an implementation, DESeq, as an R/Bioconductor package is presented.
Journal Article
Bioconductor: open software development for computational biology and bioinformatics
Robert Gentleman, Vincent J. Carey, Douglas M. Bates, Benjamin M. Bolstad, Marcel Dettling, Sandrine Dudoit, Byron Ellis, Laurent Gautier, Yongchao Ge, Jeff Gentry, Kurt Hornik, Torsten Hothorn, Wolfgang Huber, Stefano Maria Iacus, Rafael A. Irizarry, Friedrich Leisch, Cheng Li, Martin Maechler, A. J. Rossini, Günther Sawitzki, Colin A. Smith, Gordon K. Smyth, Luke Tierney, Jean Yang, Jianhua Zhang
TL;DR: Details of the aims and methods of Bioconductor, the collaborative creation of extensible software for computational biology and bioinformatics, and current challenges are described.
Journal Article
Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments
TL;DR: The hierarchical model of Lonnstedt and Speed (2002) is developed into a practical approach for general microarray experiments with arbitrary numbers of treatments and RNA samples and the moderated t-statistic is shown to follow a t-distribution with augmented degrees of freedom.
Journal Article
TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions
Daehwan Kim, Geo Pertea, Cole Trapnell, Harold Pimentel, Kelley Ryan Matthew, Steven L. Salzberg
TL;DR: TopHat2 is described, which incorporates many significant enhancements to TopHat, and combines the ability to identify novel splice sites with direct mapping to known transcripts, producing sensitive and accurate alignments, even for highly repetitive genomes or in the presence of pseudogenes.
Journal Article
Generalized Linear Models
TL;DR: This is the first book on generalized linear models written by authors not mostly associated with the biological sciences, and it is thoroughly enjoyable to read.