scispace - formally typeset
Open AccessJournal ArticleDOI

phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data.

Paul J. McMurdie, +1 more
- 22 Apr 2013 - 
- Vol. 8, Iss: 4
Reads0
Chats0
TLDR
The phyloseq project for R is a new open-source software package dedicated to the object-oriented representation and analysis of microbiome census data in R, which supports importing data from a variety of common formats, as well as many analysis techniques.
Abstract
Background The analysis of microbial communities through DNA sequencing brings many challenges: the integration of different types of data with methods from ecology, genetics, phylogenetics, multivariate statistics, visualization and testing. With the increased breadth of experimental designs now being pursued, project-specific statistical analyses are often needed, and these analyses are often difficult (or impossible) for peer researchers to independently reproduce. The vast majority of the requisite tools for performing these analyses reproducibly are already implemented in R and its extensions (packages), but with limited support for high throughput microbiome census data. Results Here we describe a software project, phyloseq, dedicated to the object-oriented representation and analysis of microbiome census data in R. It supports importing data from a variety of common formats, as well as many analysis techniques. These include calibration, filtering, subsetting, agglomeration, multi-table comparisons, diversity analysis, parallelized Fast UniFrac, ordination methods, and production of publication-quality graphics; all in a manner that is easy to document, share, and modify. We show how to apply functions from other R packages to phyloseq-represented data, illustrating the availability of a large number of open source analysis techniques. We discuss the use of phyloseq with tools for reproducible research, a practice common in other fields but still rare in the analysis of highly parallel microbiome census data. We have made available all of the materials necessary to completely reproduce the analysis and figures included in this article, an example of best practices for reproducible research. Conclusions The phyloseq project for R is a new open-source software package, freely available on the web from both GitHub and Bioconductor.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Unraveling the Composition of the Root-Associated Bacterial Microbiota of Phragmites australis and Typha latifolia.

TL;DR: It is demonstrated that, despite a different composition of the initial basin inoculum, the microbiota associated with the rhizosphere and rhizoplane of P. australis and T. latifolia tends to converge toward a common taxonomic composition dominated by members of the phyla Actinob bacteria, Firmicutes, Proteobacteria, and Planctomycetes, which indicates the existence of a selecting process acting at the root–soil interface of these aquatic plants
Journal ArticleDOI

Chloroplast sequence variation and the efficacy of peptide nucleic acids for blocking host amplification in plant microbiome studies

TL;DR: A validated framework to modify universal PNA clamps to accommodate host variation in organellar sequences is provided, and it is found that pPNA type had no effect on the detection of individual bacterial taxa, or estimates of within and between sample bacterial diversity, suggesting that the modification did not introduce bias against particular bacterial lineages.
Journal ArticleDOI

A critical evaluation of ecological indices for the comparative analysis of microbial communities based on molecular datasets

TL;DR: Generally applicable ecological indices for the statistical analysis of microbial community composition and dynamics based on fingerprinting and NGS datasets are presented warranting interstudy comparability and intuitive interpretability.
Journal ArticleDOI

Comparative assessment of autochthonous bacterial and fungal communities and microbial biomarkers of polluted agricultural soils of the Terra dei Fuochi

TL;DR: Investigation of the indigenous bacterial and fungal community structure as well as the impact of pollutants on their diversity and richness in contaminated and noncontaminated soils of a National Interest Priority Site of Campania Region (Italy) showed that the indigenous microbial communities were most strongly affected by contamination rather than by site of origin.
References
More filters
Journal Article

R: A language and environment for statistical computing.

R Core Team
- 01 Jan 2014 - 
TL;DR: Copyright (©) 1999–2012 R Foundation for Statistical Computing; permission is granted to make and distribute verbatim copies of this manual provided the copyright notice and permission notice are preserved on all copies.
Book

ggplot2: Elegant Graphics for Data Analysis

TL;DR: This book describes ggplot2, a new data visualization package for R that uses the insights from Leland Wilkisons Grammar of Graphics to create a powerful and flexible system for creating data graphics.
Related Papers (5)