Differential abundance analysis for microbial marker-gene surveys
TLDR
It is shown that metagenomeSeq outperforms the tools currently used in this field and relies on a novel normalization technique and a statistical model that accounts for undersampling in large-scale marker-gene studies.Abstract:
We introduce a methodology to assess differential abundance in sparse high-throughput microbial marker-gene survey data. Our approach, implemented in the metagenomeSeq Bioconductor package, relies on a novel normalization technique and a statistical model that accounts for undersampling-a common feature of large-scale marker-gene studies. Using simulated data and several published microbiota data sets, we show that metagenomeSeq outperforms the tools currently used in this field.read more
Citations
More filters
Journal ArticleDOI
Establishing microbial composition measurement standards with reference frames
James T. Morton,Clarisse Marotz,Alex D. Washburne,Justin D. Silverman,Livia S. Zaramela,Anna Edlund,Karsten Zengler,Rob Knight +7 more
TL;DR: The notion of “reference frames” is defined, which provide deep intuition about the compositional nature of microbiome data, which allow reassessment of published relative abundance data to reveal reproducible microbial changes from standard sequencing output without the need for new assays.
Journal ArticleDOI
The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies
J. Paul Brooks,David J. Edwards,Michael D. Harwich,Maria C. Rivera,Jennifer M. Fettweis,Myrna G. Serrano,Robert Reris,Nihar U. Sheth,Bernice Huang,Philippe H. Girerd,Jerome F. Strauss,Kimberly K. Jefferson,Gregory A. Buck +12 more
TL;DR: An experimental protocol using mock communities for quantifying and characterizing bias introduced in the sample processing pipeline and developed models to predict the “true” composition of environmental samples based on the observed proportions, which were applied to clinical vaginal samples from a single subject during four visits.
Journal ArticleDOI
Simplified and representative bacterial community of maize roots.
TL;DR: A greatly simplified, yet representative, synthetic bacterial model community that allowed us to study the community assembly dynamics and function on axenic maize seedlings and suggested that E. cloacae plays the role of keystone species in this model ecosystem.
Journal ArticleDOI
Reproducible RNA-seq analysis using recount2.
Leonardo Collado-Torres,Abhinav Nellore,Kai Kammers,Kai Kammers,Shannon E. Ellis,Margaret A. Taub,Kasper D. Hansen,Andrew E. Jaffe,Ben Langmead,Jeffrey T. Leek +9 more
TL;DR: This work presents a new generation of probabilistic models that combine RNAseq, a very simple and scalable approach to genome annotation, with real-time information about the “wear and tear” of the genome.
Journal ArticleDOI
Multivariable association discovery in population-scale meta-omics studies.
Himel Mallick,Himel Mallick,Ali Rahnavard,Lauren J. McIver,Lauren J. McIver,Siyuan Ma,Siyuan Ma,Yancong Zhang,Yancong Zhang,Long H. Nguyen,Timothy L. Tickle,George Weingart,George Weingart,Boyu Ren,Boyu Ren,Emma Schwager,Emma Schwager,Suvo Chatterjee,Kelsey N. Thompson,Jeremy E. Wilkinson,Ayshwarya Subramanian,Ayshwarya Subramanian,Yiren Lu,Levi Waldron,Joseph N. Paulson,Eric A. Franzosa,Eric A. Franzosa,Héctor Corrada Bravo,Curtis Huttenhower,Curtis Huttenhower +29 more
TL;DR: MaAsLin 2 (Microbiome Multivariable Associations with Linear Models) as mentioned in this paper uses generalized linear and mixed models to accommodate a wide variety of modern epidemiological studies, including cross-sectional and longitudinal designs, as well as a variety of data types with or without covariates and repeated measurements.
References
More filters
Journal ArticleDOI
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.
TL;DR: EdgeR as mentioned in this paper is a Bioconductor software package for examining differential expression of replicated count data, which uses an overdispersed Poisson model to account for both biological and technical variability and empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference.
Journal ArticleDOI
QIIME allows analysis of high-throughput community sequencing data.
J. Gregory Caporaso,Justin Kuczynski,Jesse Stombaugh,Kyle Bittinger,Frederic D. Bushman,Elizabeth K. Costello,Noah Fierer,Antonio Gonzalez Peña,Julia K. Goodrich,Jeffrey I. Gordon,Gavin A. Huttley,Scott T. Kelley,Dan Knights,Jeremy E. Koenig,Ruth E. Ley,Catherine A. Lozupone,Daniel McDonald,Brian D. Muegge,Meg Pirrung,Jens Reeder,Joel Sevinsky,Peter J. Turnbaugh,William A. Walters,Jeremy Widmann,Tanya Yatsunenko,Jesse R. Zaneveld,Rob Knight,Rob Knight +27 more
TL;DR: An overview of the analysis pipeline and links to raw data and processed output from the runs with and without denoising are provided.
Journal ArticleDOI
Naïve Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy
TL;DR: The RDP Classifier can rapidly and accurately classify bacterial 16S rRNA sequences into the new higher-order taxonomy proposed in Bergey's Taxonomic Outline of the Prokaryotes, and the majority of the classification errors appear to be due to anomalies in the current taxonomies.
Journal ArticleDOI
Differential expression analysis for sequence count data.
Simon Anders,Wolfgang Huber +1 more
TL;DR: A method based on the negative binomial distribution, with variance and mean linked by local regression, is proposed and an implementation, DESeq, as an R/Bioconductor package is presented.
Journal ArticleDOI
Metagenomic biomarker discovery and explanation
Nicola Segata,Jacques Izard,Jacques Izard,Levi Waldron,Dirk Gevers,Larisa Miropolsky,Wendy S. Garrett,Curtis Huttenhower +7 more
TL;DR: A new method for metagenomic biomarker discovery is described and validates by way of class comparison, tests of biological consistency and effect size estimation to address the challenge of finding organisms, genes, or pathways that consistently explain the differences between two or more microbial communities.
Related Papers (5)
QIIME allows analysis of high-throughput community sequencing data.
J. Gregory Caporaso,Justin Kuczynski,Jesse Stombaugh,Kyle Bittinger,Frederic D. Bushman,Elizabeth K. Costello,Noah Fierer,Antonio Gonzalez Peña,Julia K. Goodrich,Jeffrey I. Gordon,Gavin A. Huttley,Scott T. Kelley,Dan Knights,Jeremy E. Koenig,Ruth E. Ley,Catherine A. Lozupone,Daniel McDonald,Brian D. Muegge,Meg Pirrung,Jens Reeder,Joel Sevinsky,Peter J. Turnbaugh,William A. Walters,Jeremy Widmann,Tanya Yatsunenko,Jesse R. Zaneveld,Rob Knight,Rob Knight +27 more