scispace - formally typeset
Open AccessJournal ArticleDOI

Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2

TLDR
This work presents DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates, which enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression.
Abstract
In comparative high-throughput sequencing assays, a fundamental task is the analysis of count data, such as read counts per gene in RNA-seq, for evidence of systematic changes across experimental conditions. Small replicate numbers, discreteness, large dynamic range and the presence of outliers require a suitable statistical approach. We present DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates. This enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression. The DESeq2 package is available at http://www.bioconductor.org/packages/release/bioc/html/DESeq2.html .

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

HTSeq—a Python framework to work with high-throughput sequencing data

TL;DR: This work presents HTSeq, a Python library to facilitate the rapid development of custom scripts for high-throughput sequencing data analysis, and presents htseq-count, a tool developed with HTSequ that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.
Journal ArticleDOI

Comprehensive Integration of Single-Cell Data.

TL;DR: A strategy to "anchor" diverse datasets together, enabling us to integrate single-cell measurements not only across scRNA-seq technologies, but also across different modalities.
Journal ArticleDOI

Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown

TL;DR: This protocol describes all the steps necessary to process a large set of raw sequencing reads and create lists of gene transcripts, expression levels, and differentially expressed genes and transcripts.
References
More filters
Journal ArticleDOI

Testing significance relative to a fold-change threshold is a TREAT

TL;DR: This work presents a method, t-tests relative to a threshold (TREAT), that allows researchers to test formally the hypothesis (with associated p-values) that the differential expression in a microarray experiment is greater than a given threshold.
Journal ArticleDOI

A variance-stabilizing transformation for gene-expression microarray data.

TL;DR: A transformation is introduced that stabilizes the variance of microarray data across the full range of expression, and simulation studies suggest that this transformation approximately symmetrizes micro array data.
Journal ArticleDOI

Finding consistent patterns: A nonparametric approach for identifying differential expression in RNA-Seq data

TL;DR: A simple, non-parametric method with resampling to account for the different sequencing depths is introduced, and it is found that the method discovers more consistent patterns than competing methods.

Di↵erential analysis of count data - the DESeq2 package

TL;DR: The package DESeq2 provides methods to test for di↵erential expression by use of negative binomial generalized linear models; the estimates of dispersion and logarithmic fold changes incorporate data-driven prior distributions.
Journal ArticleDOI

GFOLD: a generalized fold change for ranking differentially expressed genes from RNA-seq data

TL;DR: The GFOLD (generalized fold change) algorithm is presented, which overcomes the shortcomings of P-value and fold change calculated by existing RNA-seq analysis methods and gives more stable and biological meaningful gene rankings when only a single biological replicate is available.
Related Papers (5)