Differential abundance analysis for microbial marker-gene surveys
TLDR
It is shown that metagenomeSeq outperforms the tools currently used in this field and relies on a novel normalization technique and a statistical model that accounts for undersampling in large-scale marker-gene studies.Abstract:
We introduce a methodology to assess differential abundance in sparse high-throughput microbial marker-gene survey data. Our approach, implemented in the metagenomeSeq Bioconductor package, relies on a novel normalization technique and a statistical model that accounts for undersampling-a common feature of large-scale marker-gene studies. Using simulated data and several published microbiota data sets, we show that metagenomeSeq outperforms the tools currently used in this field.read more
Citations
More filters
Posted Content
Testing for differential abundance in compositional counts data, with application to microbiome studies
TL;DR: A novel approach for differential abundance testing of compositional data, with a non-neglible amount of "zeros", which uses a set of reference taxa, which are non-differentially abundant, to identify which taxa differ in the microbiome community across groups.
Journal ArticleDOI
Biomarker discovery in inflammatory bowel diseases using network-based feature selection.
Mostafa M. Abbas,John Matta,Thanh Le,Halima Bensmail,Tayo Obafemi-Ajayi,Vasant Honavar,Yasser EL-Manzalawy,Yasser EL-Manzalawy +7 more
TL;DR: An integrative approach to Network-Based Biomarker Discovery (NBBD) which integrates network analyses methods for prioritizing potential biomarkers and machine learning techniques for assessing the discriminative power of the prioritized biomarkers is presented.
Journal ArticleDOI
Environmental and host factors shaping the gut microbiota diversity of brown frog Rana dybowskii.
TL;DR: It is found that alpha and beta diversity varied significantly during different developmental stages of brown frog R. dybowskii, and the AIC-based model results suggested that development was the only variable that needed inclusion in the redundancy analysis (RDA) to explain the variance in taxa.
Posted ContentDOI
Normalization methods for microbial abundance data strongly affect correlation estimates
TL;DR: It is shown that log-ratio and variance-stabilization transformations provide the most consistent estimates across experiments of different sample sizes, and that data analysis methods that rely on correlation, such as data clustering and network inference, differ depending on the normalization schemes.
Journal ArticleDOI
Chicken Intestinal Mycobiome: Initial Characterization and Its Response to Bacitracin Methylene Disalicylate.
Kelsy Robinson,Yingping Xiao,Timothy J. Johnson,Binlong Chen,Qing Yang,Wentao Lyu,Wang Jing,Nicole Fansler,Sage Becker,Jing Liu,Hua Yang,Guolong Zhang +11 more
TL;DR: The biogeography of the mycobiota along the GI tract of day 28 broiler chicks is characterized and its possible shift in response to bacitracin methylene disalicylate (BMD) is examined through Illumina sequencing of the internal transcribed spacer 2 (ITS2) region of fungal rRNA genes.
References
More filters
Journal ArticleDOI
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.
TL;DR: EdgeR as mentioned in this paper is a Bioconductor software package for examining differential expression of replicated count data, which uses an overdispersed Poisson model to account for both biological and technical variability and empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference.
Journal ArticleDOI
QIIME allows analysis of high-throughput community sequencing data.
J. Gregory Caporaso,Justin Kuczynski,Jesse Stombaugh,Kyle Bittinger,Frederic D. Bushman,Elizabeth K. Costello,Noah Fierer,Antonio Gonzalez Peña,Julia K. Goodrich,Jeffrey I. Gordon,Gavin A. Huttley,Scott T. Kelley,Dan Knights,Jeremy E. Koenig,Ruth E. Ley,Catherine A. Lozupone,Daniel McDonald,Brian D. Muegge,Meg Pirrung,Jens Reeder,Joel Sevinsky,Peter J. Turnbaugh,William A. Walters,Jeremy Widmann,Tanya Yatsunenko,Jesse R. Zaneveld,Rob Knight,Rob Knight +27 more
TL;DR: An overview of the analysis pipeline and links to raw data and processed output from the runs with and without denoising are provided.
Journal ArticleDOI
Naïve Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy
TL;DR: The RDP Classifier can rapidly and accurately classify bacterial 16S rRNA sequences into the new higher-order taxonomy proposed in Bergey's Taxonomic Outline of the Prokaryotes, and the majority of the classification errors appear to be due to anomalies in the current taxonomies.
Journal ArticleDOI
Differential expression analysis for sequence count data.
Simon Anders,Wolfgang Huber +1 more
TL;DR: A method based on the negative binomial distribution, with variance and mean linked by local regression, is proposed and an implementation, DESeq, as an R/Bioconductor package is presented.
Journal ArticleDOI
Metagenomic biomarker discovery and explanation
Nicola Segata,Jacques Izard,Jacques Izard,Levi Waldron,Dirk Gevers,Larisa Miropolsky,Wendy S. Garrett,Curtis Huttenhower +7 more
TL;DR: A new method for metagenomic biomarker discovery is described and validates by way of class comparison, tests of biological consistency and effect size estimation to address the challenge of finding organisms, genes, or pathways that consistently explain the differences between two or more microbial communities.
Related Papers (5)
QIIME allows analysis of high-throughput community sequencing data.
J. Gregory Caporaso,Justin Kuczynski,Jesse Stombaugh,Kyle Bittinger,Frederic D. Bushman,Elizabeth K. Costello,Noah Fierer,Antonio Gonzalez Peña,Julia K. Goodrich,Jeffrey I. Gordon,Gavin A. Huttley,Scott T. Kelley,Dan Knights,Jeremy E. Koenig,Ruth E. Ley,Catherine A. Lozupone,Daniel McDonald,Brian D. Muegge,Meg Pirrung,Jens Reeder,Joel Sevinsky,Peter J. Turnbaugh,William A. Walters,Jeremy Widmann,Tanya Yatsunenko,Jesse R. Zaneveld,Rob Knight,Rob Knight +27 more