Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2
Reads0
Chats0
TLDR
This work presents DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates, which enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression.Abstract:
In comparative high-throughput sequencing assays, a fundamental task is the analysis of count data, such as read counts per gene in RNA-seq, for evidence of systematic changes across experimental conditions. Small replicate numbers, discreteness, large dynamic range and the presence of outliers require a suitable statistical approach. We present DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates. This enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression. The DESeq2 package is available at http://www.bioconductor.org/packages/release/bioc/html/DESeq2.html
.read more
Citations
More filters
Journal ArticleDOI
HTSeq—a Python framework to work with high-throughput sequencing data
TL;DR: This work presents HTSeq, a Python library to facilitate the rapid development of custom scripts for high-throughput sequencing data analysis, and presents htseq-count, a tool developed with HTSequ that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.
Journal ArticleDOI
Comprehensive Integration of Single-Cell Data.
Tim Stuart,Andrew Butler,Paul J. Hoffman,Christoph Hafemeister,Efthymia Papalexi,William M. Mauck,Yuhan Hao,Marlon Stoeckius,Peter Smibert,Rahul Satija +9 more
TL;DR: A strategy to "anchor" diverse datasets together, enabling us to integrate single-cell measurements not only across scRNA-seq technologies, but also across different modalities.
Journal ArticleDOI
Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown
TL;DR: This protocol describes all the steps necessary to process a large set of raw sequencing reads and create lists of gene transcripts, expression levels, and differentially expressed genes and transcripts.
Journal ArticleDOI
Imbalanced Host Response to SARS-CoV-2 Drives Development of COVID-19.
Daniel Blanco-Melo,Benjamin E. Nilsson-Payant,Wen-Chun Liu,Skyler Uhl,Daisy A. Hoagland,Rasmus Møller,Tristan X. Jordan,Kohei Oishi,Maryline Panis,David H. Sachs,Taia T. Wang,Robert E. Schwartz,Jean K. Lim,Randy A. Albrecht,Benjamin R. tenOever +14 more
TL;DR: It is proposed that reduced innate antiviral defenses coupled with exuberant inflammatory cytokine production are the defining and driving features of COVID-19.
Journal ArticleDOI
The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update
Enis Afgan,Dannon Baker,Bérénice Batut,Marius van den Beek,Dave Bouvier,Martin Čech,John Chilton,Dave Clements,Nate Coraor,Björn Grüning,Aysam Guerler,Jennifer Hillman-Jackson,Saskia Hiltemann,Vahid Jalili,Helena Rasche,Nicola Soranzo,Jeremy Goecks,James Taylor,Anton Nekrutenko,Daniel Blankenberg +19 more
TL;DR: Improvements to Galaxy's core framework, user interface, tools, and training materials enable Galaxy to be used for analyzing tens of thousands of datasets, and >5500 tools are now available from the Galaxy ToolShed.
References
More filters
Journal ArticleDOI
Classification and clustering of sequencing data using a Poisson model
TL;DR: In this paper, the authors propose new approaches for performing classification and clustering of observations on the basis of sequencing data using a Poisson log linear model, which is an analog of diagonal linear discriminant analysis.
Journal ArticleDOI
Errors in RNA-Seq quantification affect genes of relevance to human disease
Christelle Robert,Mick Watson +1 more
TL;DR: It is shown that it is possible to use data that may otherwise have been discarded to measure group-level expression, and that such data contains biologically relevant information.
Journal ArticleDOI
A powerful and flexible approach to the analysis of RNA sequence count data
TL;DR: BBSeq is described, which incorporates a simple beta-binomial generalized linear model, combined with simple outlier detection and testing approaches, which appears to have favorable characteristics in power and flexibility.
Posted ContentDOI
Salmon: Accurate, Versatile and Ultrafast Quantification from RNA-seq Data using Lightweight-Alignment
TL;DR: Salmon is introduced, a novel method and software tool for transcript quantication that exhibits state-of-the-art accuracy while being signicantly faster than most other tools.
Journal ArticleDOI
Classification and clustering of sequencing data using a Poisson model
TL;DR: Using a Poisson log linear model, an analog of diagonal linear discriminant analysis that is appropriate for sequencing data is developed and an approach for clustering sequencing data using a new dissimilarity measure that is based upon the Poisson model is proposed.