Differential abundance analysis for microbial marker-gene surveys
TL;DR
metagenomeSeq, which relies on a novel normalization technique and a statistical model that accounts for undersampling in large-scale marker-gene studies, is shown to outperform the tools currently used in this field.
Abstract
We introduce a methodology to assess differential abundance in sparse high-throughput microbial marker-gene survey data. Our approach, implemented in the metagenomeSeq Bioconductor package, relies on a novel normalization technique and a statistical model that accounts for undersampling, a common feature of large-scale marker-gene studies. Using simulated data and several published microbiota data sets, we show that metagenomeSeq outperforms the tools currently used in this field.
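The abstract does not spell out the normalization technique. In the metagenomeSeq paper it is cumulative sum scaling (CSS), which divides each sample's counts by the sum of the counts at or below a chosen quantile of that sample, so that a few dominant taxa do not distort the scaling. The following is a minimal Python sketch of that idea, not the package's implementation: the function name `css_normalize`, the fixed `quantile` (the package selects it from the data), the nearest-rank quantile rule, and the `scale` constant are all assumptions made for illustration.

```python
import math


def css_normalize(counts, quantile=0.5, scale=1000.0):
    """Sketch of cumulative sum scaling (hypothetical helper, not the package API).

    counts: list of samples, each a list of per-feature counts.
    quantile: fixed quantile of the nonzero counts (an assumption; the real
              method chooses it adaptively from the data).
    scale: constant applied after division so values stay readable.
    """
    normalized = []
    for sample in counts:
        nonzero = sorted(c for c in sample if c > 0)
        if not nonzero:
            # An empty sample has nothing to scale.
            normalized.append([0.0] * len(sample))
            continue
        # Nearest-rank quantile of the nonzero counts.
        rank = max(1, math.ceil(quantile * len(nonzero)))
        threshold = nonzero[rank - 1]
        # Scaling factor: sum of all counts at or below the threshold.
        factor = sum(c for c in sample if c <= threshold)
        normalized.append([scale * c / factor for c in sample])
    return normalized


# Two samples with similar low-abundance features but one dominant taxon
# in the first sample.
samples = [
    [0, 1, 2, 3, 100],
    [0, 2, 4, 6, 10],
]
normalized = css_normalize(samples)
```

In this toy input the low-abundance features of both samples end up on the same scale, whereas dividing by the total count would shrink the first sample's values because of its single dominant feature.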
Citations
Journal Article
NetCoMi: network construction and comparison for microbiome data in R
TL;DR: NetCoMi (Network Construction and comparison for Microbiome data), an R package that integrates existing methods for each analysis step in a single reproducible computational workflow, is introduced, enabling insights into whether single taxa, groups of taxa or the overall network structure change between groups.
Journal Article
Airway Microbiota Dynamics Uncover a Critical Window for Interplay of Pathogenic Bacteria and Allergy in Childhood Respiratory Disease.
Shu Mei Teo, Howard H.F. Tang, Danny Mok, Louise M. Judd, Stephen C. Watts, Kym Pham, Barbara J. Holt, Merci M.H. Kusel, Michael Serralha, Niamh M. Troy, Yury A. Bochkov, Kristine Grindle, Robert F. Lemanske, Sebastian L. Johnston, James E. Gern, Peter D. Sly, Patrick G. Holt, Kathryn E. Holt, Michael Inouye
TL;DR: It is demonstrated that repeated cycles of infection-associated lower airway inflammation drive the pathogenesis of persistent wheezing disease in children, and monitoring NPM composition may enable early detection and intervention in high-risk children.
Journal Article
Negative binomial mixed models for analyzing microbiome count data
Xinyan Zhang, Himel Mallick, Zaixiang Tang, Lei Zhang, Xiangqin Cui, Andrew K. Benson, Nengjun Yi
TL;DR: A flexible and efficient IWLS (Iterative Weighted Least Squares) algorithm is developed to fit the proposed negative binomial mixed models (NBMMs) for detecting the association between the microbiome and host environmental/clinical factors for correlated microbiome count data.
Journal Article
MixMC: A Multivariate Statistical Framework to Gain Insight into Microbial Communities.
Kim-Anh Lê Cao, Mary-Ellen Costello, Vanessa Anne Lakis, Francois Bartolo, Xin-Yi Chua, Rémi Brazeilles, Pascale Rondeau
TL;DR: MixMC is a multivariate data analysis framework for metagenomic biomarker discovery that accounts for the compositional nature of 16S data and enables detection of subtle differences when high inter-subject variability is present because microbial sampling is performed repeatedly on the same subjects but in multiple habitats.
Journal Article
Microbial Composition Predicts Genital Tract Inflammation and Persistent Bacterial Vaginosis in South African Adolescent Females.
Katie Lennard, Smritee Dabee, Shaun L. Barnabas, Enock Havyarimana, Anna K. Blakney, Shameem Z. Jaumdally, Gerrit Botha, Nonhlanhla N. Mkhize, Linda-Gail Bekker, David A. Lewis, Glenda Gray, Nicola Mulder, Jo-Ann S. Passmore, Heather B. Jaspan
TL;DR: It is proposed that women with this BVAB1-dominated subtype may have chronic genital inflammation due to persistent BV, which may place them at a particularly high risk for HIV infection.
References
Journal Article
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.
TL;DR: edgeR is a Bioconductor software package for examining differential expression of replicated count data. It uses an overdispersed Poisson model to account for both biological and technical variability, and empirical Bayes methods to moderate the degree of overdispersion across transcripts, improving the reliability of inference.
Journal Article
QIIME allows analysis of high-throughput community sequencing data.
J. Gregory Caporaso, Justin Kuczynski, Jesse Stombaugh, Kyle Bittinger, Frederic D. Bushman, Elizabeth K. Costello, Noah Fierer, Antonio Gonzalez Peña, Julia K. Goodrich, Jeffrey I. Gordon, Gavin A. Huttley, Scott T. Kelley, Dan Knights, Jeremy E. Koenig, Ruth E. Ley, Catherine A. Lozupone, Daniel McDonald, Brian D. Muegge, Meg Pirrung, Jens Reeder, Joel Sevinsky, Peter J. Turnbaugh, William A. Walters, Jeremy Widmann, Tanya Yatsunenko, Jesse R. Zaneveld, Rob Knight
TL;DR: An overview of the analysis pipeline and links to raw data and processed output from the runs with and without denoising are provided.
Journal Article
Naïve Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy
TL;DR: The RDP Classifier can rapidly and accurately classify bacterial 16S rRNA sequences into the new higher-order taxonomy proposed in Bergey's Taxonomic Outline of the Prokaryotes, and the majority of the classification errors appear to be due to anomalies in the current taxonomies.
Journal Article
Differential expression analysis for sequence count data.
Simon Anders, Wolfgang Huber
TL;DR: A method based on the negative binomial distribution, with variance and mean linked by local regression, is proposed and an implementation, DESeq, as an R/Bioconductor package is presented.
Journal Article
Metagenomic biomarker discovery and explanation
Nicola Segata, Jacques Izard, Levi Waldron, Dirk Gevers, Larisa Miropolsky, Wendy S. Garrett, Curtis Huttenhower
TL;DR: A new method for metagenomic biomarker discovery is described and validated by way of class comparison, tests of biological consistency and effect size estimation, to address the challenge of finding organisms, genes, or pathways that consistently explain the differences between two or more microbial communities.