scispace - formally typeset
Open AccessJournal ArticleDOI

voom: precision weights unlock linear model analysis tools for RNA-seq read counts

TLDR
New normal linear modeling strategies are presented for analyzing read counts from RNA-seq experiments, and the voom method estimates the mean-variance relationship of the log-counts, generates a precision weight for each observation and enters these into the limma empirical Bayes analysis pipeline.
Abstract
New normal linear modeling strategies are presented for analyzing read counts from RNA-seq experiments. The voom method estimates the mean-variance relationship of the log-counts, generates a precision weight for each observation and enters these into the limma empirical Bayes analysis pipeline. This opens access for RNA-seq analysts to a large body of methodology developed for microarrays. Simulation studies show that voom performs as well or better than count-based RNA-seq methods even when the data are generated according to the assumptions of the earlier methods. Two case studies illustrate the use of linear modeling and gene set testing methods.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Next generation sequencing technology and genomewide data analysis: Perspectives for retinal research

TL;DR: The basic design of commonly used NGS-based methods, specifically whole exome sequencing, transcriptome, and epigenome profiling are illustrated, and recommendations for data analyses are provided.
Posted ContentDOI

Understanding sequencing data as compositions: an outlook and review

TL;DR: The principles of compositional data analysis (CoDA) are summarized, evidence for why sequencing data are compositional, methods available for analyzing sequencing data, and future directions are highlighted to highlight future directions with regard to this field of study.
Journal ArticleDOI

seq-ImmuCC: Cell-Centric View of Tissue Transcriptome Measuring Cellular Compositions of Immune Microenvironment From Mouse RNA-Seq Data.

TL;DR: A computational model named seq-ImmuCC was developed to infer the relative proportions of 10 major immune cells in mouse tissues from RNA-Seq data and generated the comprehensive landscape of immune cell compositions in 27 normal mouse tissues, which provided a comprehensive and informative measurement for the immune microenvironment inside tumor tissues.
Journal ArticleDOI

Co-expression analysis of high-throughput transcriptome sequencing data with Poisson mixture models.

TL;DR: A Poisson mixture model is proposed using a rigorous framework for parameter estimation as well as the choice of the appropriate number of clusters for clustering DGE profiles as a means to discover groups of co-expressed genes.
References
More filters
Journal ArticleDOI

edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

TL;DR: EdgeR as mentioned in this paper is a Bioconductor software package for examining differential expression of replicated count data, which uses an overdispersed Poisson model to account for both biological and technical variability and empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference.
Book

Generalized Linear Models

TL;DR: In this paper, a generalization of the analysis of variance is given for these models using log- likelihoods, illustrated by examples relating to four distributions; the Normal, Binomial (probit analysis, etc.), Poisson (contingency tables), and gamma (variance components).
Journal ArticleDOI

featureCounts: an efficient general-purpose program for assigning sequence reads to genomic features

TL;DR: FeatureCounts as discussed by the authors is a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments, which implements highly efficient chromosome hashing and feature blocking techniques.
Journal ArticleDOI

Differential expression analysis for sequence count data.

Simon Anders, +1 more
- 27 Oct 2010 - 
TL;DR: A method based on the negative binomial distribution, with variance and mean linked by local regression, is proposed and an implementation, DESeq, as an R/Bioconductor package is presented.
Related Papers (5)