voom: precision weights unlock linear model analysis tools for RNA-seq read counts
Charity W. Law, Yunshun Chen, Wei Shi, Gordon K. Smyth
TLDR
New normal linear modeling strategies are presented for analyzing read counts from RNA-seq experiments, and the voom method estimates the mean-variance relationship of the log-counts, generates a precision weight for each observation and enters these into the limma empirical Bayes analysis pipeline.
Abstract:
New normal linear modeling strategies are presented for analyzing read counts from RNA-seq experiments. The voom method estimates the mean-variance relationship of the log-counts, generates a precision weight for each observation and enters these into the limma empirical Bayes analysis pipeline. This opens access for RNA-seq analysts to a large body of methodology developed for microarrays. Simulation studies show that voom performs as well as or better than count-based RNA-seq methods even when the data are generated according to the assumptions of the earlier methods. Two case studies illustrate the use of linear modeling and gene set testing methods.
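The weighting step described in the abstract can be illustrated with a minimal sketch. It is not the limma implementation. It follows the outline of the published method as commonly described (log2 counts per million with a 0.5 offset, a per-gene linear fit, a trend of the square root of the residual standard deviation against the mean log-count, and weights equal to the trend raised to the power -4), with three simplifying assumptions: the design is a one-way layout given by group labels rather than a general design matrix, a straight-line fit stands in for the LOWESS trend, and a small floor keeps the predicted trend positive. The function name `voom_weights` and the simulated counts are hypothetical.

```python
import math
import random
from statistics import mean


def voom_weights(counts, groups):
    """Sketch of voom-style precision weights.

    counts: list of per-gene lists of read counts (genes x samples).
    groups: one group label per sample (one-way layout; an assumption
            of this sketch, voom itself accepts any design matrix).
    Returns (log_cpm, weights), both genes x samples.
    """
    n_samples = len(groups)
    lib_size = [sum(gene[i] for gene in counts) for i in range(n_samples)]
    # Per-sample offset that converts a log2-cpm value to a log2-count.
    offset = [math.log2(R + 1.0) - math.log2(1e6) for R in lib_size]
    members = {g: [i for i, lab in enumerate(groups) if lab == g]
               for g in set(groups)}
    resid_df = n_samples - len(members)

    log_cpm, fitted, sqrt_sd, mean_log_count = [], [], [], []
    for gene in counts:
        # log2 counts per million; the 0.5 offset keeps zero counts finite.
        y = [math.log2((r + 0.5) / (R + 1.0) * 1e6)
             for r, R in zip(gene, lib_size)]
        # For a one-way layout the least-squares fit is the group mean.
        group_mean = {g: mean(y[i] for i in idx) for g, idx in members.items()}
        mu = [group_mean[lab] for lab in groups]
        rss = sum((a - b) ** 2 for a, b in zip(y, mu))
        log_cpm.append(y)
        fitted.append(mu)
        sqrt_sd.append(math.sqrt(math.sqrt(rss / resid_df)))
        mean_log_count.append(mean(y) + mean(offset))

    # Mean-variance trend: a straight line here, standing in for LOWESS.
    xbar, ybar = mean(mean_log_count), mean(sqrt_sd)
    sxx = sum((x - xbar) ** 2 for x in mean_log_count)
    slope = sum((x - xbar) * (s - ybar)
                for x, s in zip(mean_log_count, sqrt_sd)) / sxx
    lo, hi = min(mean_log_count), max(mean_log_count)

    def trend(x):
        x = min(max(x, lo), hi)            # no extrapolation beyond the data
        return max(ybar + slope * (x - xbar), 1e-3)  # floor: sketch-only guard

    # Predicted sqrt-sd at each fitted log-count; weight = 1 / variance.
    weights = [[trend(m + o) ** -4 for m, o in zip(mu, offset)]
               for mu in fitted]
    return log_cpm, weights


# Simulated counts for illustration only: 50 genes, two groups of three samples.
random.seed(0)
groups = ["A", "A", "A", "B", "B", "B"]
counts = [[random.randint(0, 20 * (g + 1)) for _ in groups] for g in range(50)]
log_cpm, weights = voom_weights(counts, groups)
```

The returned `weights` would then be supplied, alongside `log_cpm`, to a weighted least-squares fit for each gene, which is the role the precision weights play when they enter the limma pipeline.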
Citations
Journal Article
Nuclear Transcriptomes of the Seven Neuronal Cell Types That Constitute the Drosophila Mushroom Bodies
TL;DR: This high-quality, cell type-level transcriptome catalog for the Drosophila MB provides a valuable resource for the fly neuroscience community and a full accounting of the neurotransmitter receptors, transporters, neurotransmitter biosynthetic enzymes, neuropeptides, and neuropeptide receptors expressed within each of these cell types.
Posted Content
DIABLO - an integrative, multi-omics, multivariate method for multi-group classification
Amrit Singh, Benoit Gautier, Casey P. Shannon, Michael Vacher, Florian Rohart, Scott J. Tebbutt, Kim-Anh Lê Cao
TL;DR: DIABLO (Data Integration Analysis for Biomarker discovery using a Latent component method for Omics studies) models the correlation structure between omics datasets, resulting in an improved ability to associate biomarkers across multiple functional levels with phenotypes of interest.
Journal Article
Integration of targeted metabolomics and transcriptomics identifies deregulation of phosphatidylcholine metabolism in Huntington's disease peripheral blood samples.
Anastasios Mastrokolias, René Pool, Eleni Mina, Kristina Hettne, Erik van Duijn, Roos C. van der Mast, Gert-Jan B. van Ommen, Peter A C 't Hoen, Cornelia Prehn, Jerzy Adamski, Willeke M. C. van Roon-Mom
TL;DR: The notion that phosphatidylcholine metabolism is deregulated in HD blood and that these metabolite alterations are associated with specific gene expression changes is supported.
Journal Article
A novel statistical method for quantitative comparison of multiple ChIP-seq datasets
TL;DR: This work develops a statistical method to perform quantitative comparison of multiple ChIP-seq datasets and detect genomic regions showing differential protein binding or histone modification and demonstrates that the proposed method provides more accurate and robust results compared with existing ones.
Posted Content
St. Jude Cloud—a Pediatric Cancer Genomic Data Sharing Ecosystem
Clay McLeod, Alexander M. Gout, Xin Zhou, Delaram Rahbarinia, Andrew Thrasher, Scott Newman, Kirby Birch, Michael Macias, David Finkelstein, Jobin Sunny, Rahul Mudunuri, Brent A. Orr, Madison Treadway, Bob Davidson, Tracy Ard, Andrew Swistak, Stephanie Wiggins, Scott G. Foy, Samuel W. Brady, Jian Wang, Edgar Sioson, Shuoguo Wang, J. Robert Michael, Yu Liu, Xiaotu Ma, Aman Patel, Michael N. Edmonson, Mark R. Wilkinson, Andrew Frantz, Ti-Cheng Chang, Liqing Tian, Shaohua Lei, Christopher P. Meyer, Naina Thangaraj, Pamella Tater, Vijay Kandali, Singer Ma, Tuan Nguyen, Omar Serang, Irina McGuire, Nedra Robison, Darrell Gentry, Xing Tang, Lance E. Palmer, Gang Wu, Ed Suh, Leigh Tanner, James McMurry, Matthew Lear, Zhaoming Wang, Carmen L. Wilson, Yong Cheng, Mitch Weiss, Gregory T. Armstrong, Leslie L. Robison, Yutaka Yasui, Kim E. Nichols, David W. Ellison, Chitanya Bangur, Charles G. Mullighan, Suzanne J. Baker, Michael A. Dyer, Geralyn Miller, Michael Rusch, Richard Daly, Keith Perry, James R. Downing, Jinghui Zhang
TL;DR: The value of the St. Jude Cloud ecosystem is demonstrated through use cases that classify 48 pediatric cancer subtypes by gene expression profiling and map mutational signatures across 35 subtypes of pediatric cancer.
References
Journal Article
Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles
Aravind Subramanian, Pablo Tamayo, Vamsi K. Mootha, Sayan Mukherjee, Benjamin L. Ebert, Michael A. Gillette, Amanda G. Paulovich, Scott L. Pomeroy, Todd R. Golub, Eric S. Lander, Jill P. Mesirov
TL;DR: The Gene Set Enrichment Analysis (GSEA) method focuses on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation.
Journal Article
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.
TL;DR: edgeR is a Bioconductor software package for examining differential expression of replicated count data. It uses an overdispersed Poisson model to account for both biological and technical variability, and empirical Bayes methods moderate the degree of overdispersion across transcripts, improving the reliability of inference.
Book
Generalized Linear Models
Peter McCullagh, John A. Nelder
TL;DR: A generalization of the analysis of variance is given for these models using log-likelihoods, illustrated by examples relating to four distributions: the Normal, Binomial (probit analysis, etc.), Poisson (contingency tables), and gamma (variance components).
Journal Article
featureCounts: an efficient general-purpose program for assigning sequence reads to genomic features
TL;DR: featureCounts is a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments. It implements highly efficient chromosome hashing and feature blocking techniques.
Journal Article
Differential expression analysis for sequence count data.
Simon Anders, Wolfgang Huber
TL;DR: A method based on the negative binomial distribution, with variance and mean linked by local regression, is proposed and an implementation, DESeq, as an R/Bioconductor package is presented.