phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data.
Paul J. McMurdie,Susan Holmes +1 more
TLDR
The phyloseq project for R is a new open-source software package dedicated to the object-oriented representation and analysis of microbiome census data in R, which supports importing data from a variety of common formats, as well as many analysis techniques.Abstract:
Background The analysis of microbial communities through DNA sequencing brings many challenges: the integration of different types of data with methods from ecology, genetics, phylogenetics, multivariate statistics, visualization and testing. With the increased breadth of experimental designs now being pursued, project-specific statistical analyses are often needed, and these analyses are often difficult (or impossible) for peer researchers to independently reproduce. The vast majority of the requisite tools for performing these analyses reproducibly are already implemented in R and its extensions (packages), but with limited support for high throughput microbiome census data. Results Here we describe a software project, phyloseq, dedicated to the object-oriented representation and analysis of microbiome census data in R. It supports importing data from a variety of common formats, as well as many analysis techniques. These include calibration, filtering, subsetting, agglomeration, multi-table comparisons, diversity analysis, parallelized Fast UniFrac, ordination methods, and production of publication-quality graphics; all in a manner that is easy to document, share, and modify. We show how to apply functions from other R packages to phyloseq-represented data, illustrating the availability of a large number of open source analysis techniques. We discuss the use of phyloseq with tools for reproducible research, a practice common in other fields but still rare in the analysis of highly parallel microbiome census data. We have made available all of the materials necessary to completely reproduce the analysis and figures included in this article, an example of best practices for reproducible research. Conclusions The phyloseq project for R is a new open-source software package, freely available on the web from both GitHub and Bioconductor.read more
Citations
More filters
Journal ArticleDOI
Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2
Evan Bolyen,Jai Ram Rideout,Matthew R. Dillon,Nicholas A. Bokulich,Christian C. Abnet,Gabriel A. Al-Ghalith,Harriet Alexander,Harriet Alexander,Eric J. Alm,Manimozhiyan Arumugam,Francesco Asnicar,Yang Bai,Jordan E. Bisanz,Kyle Bittinger,Asker Daniel Brejnrod,Colin J. Brislawn,C. Titus Brown,Benjamin J. Callahan,Andrés Mauricio Caraballo-Rodríguez,John Chase,Emily K. Cope,Ricardo Silva,Christian Diener,Pieter C. Dorrestein,Gavin M. Douglas,Daniel M. Durall,Claire Duvallet,Christian F. Edwardson,Madeleine Ernst,Madeleine Ernst,Mehrbod Estaki,Jennifer Fouquier,Julia M. Gauglitz,Sean M. Gibbons,Sean M. Gibbons,Deanna L. Gibson,Antonio Gonzalez,Kestrel Gorlick,Jiarong Guo,Benjamin Hillmann,Susan Holmes,Hannes Holste,Curtis Huttenhower,Curtis Huttenhower,Gavin A. Huttley,Stefan Janssen,Alan K. Jarmusch,Lingjing Jiang,Benjamin D. Kaehler,Benjamin D. Kaehler,Kyo Bin Kang,Kyo Bin Kang,Christopher R. Keefe,Paul Keim,Scott T. Kelley,Dan Knights,Irina Koester,Tomasz Kosciolek,Jorden Kreps,Morgan G. I. Langille,Joslynn S. Lee,Ruth E. Ley,Ruth E. Ley,Yong-Xin Liu,Erikka Loftfield,Catherine A. Lozupone,Massoud Maher,Clarisse Marotz,Bryan D Martin,Daniel McDonald,Lauren J. McIver,Lauren J. McIver,Alexey V. Melnik,Jessica L. Metcalf,Sydney C. Morgan,Jamie Morton,Ahmad Turan Naimey,Jose A. Navas-Molina,Jose A. Navas-Molina,Louis-Félix Nothias,Stephanie B. Orchanian,Talima Pearson,Samuel L. Peoples,Samuel L. Peoples,Daniel Petras,Mary L. Preuss,Elmar Pruesse,Lasse Buur Rasmussen,Adam R. Rivers,Michael S. Robeson,Patrick Rosenthal,Nicola Segata,Michael Shaffer,Arron Shiffer,Rashmi Sinha,Se Jin Song,John R. Spear,Austin D. Swafford,Luke R. Thompson,Luke R. Thompson,Pedro J. Torres,Pauline Trinh,Anupriya Tripathi,Peter J. Turnbaugh,Sabah Ul-Hasan,Justin J. J. van der Hooft,Fernando Vargas,Yoshiki Vázquez-Baeza,Emily Vogtmann,Max von Hippel,William A. Walters,Yunhu Wan,Mingxun Wang,Jonathan Warren,Kyle C. Weber,Kyle C. Weber,Charles H. D. Williamson,Amy D. Willis,Zhenjiang Zech Xu,Jesse R. Zaneveld,Yilong Zhang,Qiyun Zhu,Rob Knight,J. Gregory Caporaso +123 more
TL;DR: QIIME 2 development was primarily funded by NSF Awards 1565100 to J.G.C. and R.K.P. and partial support was also provided by the following: grants NIH U54CA143925 and U54MD012388.
Journal ArticleDOI
ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data
TL;DR: An r package, ggtree, which provides programmable visualization and annotation of phylogenetic trees, which can read more tree file formats than other softwares, and support visualization of phylo, multiphylo, phylo4, phyla4d, obkdata and phyloseq tree objects defined in other r packages.
Journal ArticleDOI
Waste not, want not: why rarefying microbiome data is inadmissible.
Paul J. McMurdie,Susan Holmes +1 more
TL;DR: It is advocated that investigators avoid rarefying altogether and supported statistical theory is provided that simultaneously accounts for library size differences and biological variability using an appropriate mixture model.
Journal ArticleDOI
Microbiome Datasets Are Compositional: And This Is Not Optional.
TL;DR: The purpose of this review is to alert investigators to the dangers inherent in ignoring the compositional nature of the data, and point out that HTS datasets derived from microbiome studies can and should be treated as compositions at all stages of analysis.
Journal ArticleDOI
Human gut microbes impact host serum metabolome and insulin sensitivity
Helle Krogh Pedersen,Valborg Gudmundsdottir,Henrik Nielsen,Tuulia Hyötyläinen,Tuulia Hyötyläinen,Trine G. Nielsen,Benjamin A. H. Jensen,Kristoffer Forslund,Falk Hildebrand,Falk Hildebrand,Edi Prifti,Edi Prifti,Gwen Falony,Florence Levenez,Joël Doré,Ismo Mattila,Ismo Mattila,Damian R. Plichta,Päivi Pöhö,Päivi Pöhö,Lars Hellgren,Manimozhiyan Arumugam,Shinichi Sunagawa,Sara Vieira-Silva,Torben Jørgensen,Torben Jørgensen,Jacob Bak Holm,Kajetan Trošt,Karsten Kristiansen,Susanne Brix,Jeroen Raes,Jeroen Raes,Jun Wang,Torben Hansen,Torben Hansen,Peer Bork,Søren Brunak,Søren Brunak,Matej Orešič,Matej Orešič,Matej Orešič,S. Dusko Ehrlich,S. Dusko Ehrlich,Oluf Pedersen +43 more
TL;DR: It is shown how the human gut microbiome impacts the serum metabolome and associates with insulin resistance in 277 non-diabetic Danish individuals and suggested that microbial targets may have the potential to diminish insulin resistance and reduce the incidence of common metabolic and cardiovascular disorders.
References
More filters
Journal ArticleDOI
Structure, function and diversity of the healthy human microbiome
Curtis Huttenhower,Curtis Huttenhower,Dirk Gevers,Rob Knight,Rob Knight,Sahar Abubucker,Jonathan H. Badger,Asif T. Chinwalla,Heather Huot Creasy,Ashlee M. Earl,Michael Fitzgerald,Robert S. Fulton,Michelle G. Giglio,Kymberlie Hallsworth-Pepin,Elizabeth A. Lobos,Ramana Madupu,Vincent Magrini,John Martin,Makedonka Mitreva,Donna M. Muzny,Erica Sodergren,James Versalovic,Aye Wollam,Kim C. Worley,Jennifer R. Wortman,Sarah Young,Qiandong Zeng,Kjersti Aagaard,Olukemi O. Abolude,Emma Allen-Vercoe,Eric J. Alm,Eric J. Alm,Lucia Alvarado,Gary L. Andersen,Scott Anderson,Elizabeth L. Appelbaum,Harindra Arachchi,Gary C. Armitage,Cesar Arze,Tulin Ayvaz,Carl C. Baker,Lisa Begg,Tsegahiwot Belachew,Veena Bhonagiri,Monika Bihan,Martin J. Blaser,Toby Bloom,Vivien Bonazzi,J. Paul Brooks,Gregory A. Buck,Christian J. Buhay,Dana A. Busam,Joseph L. Campbell,Shane Canon,Brandi L. Cantarel,Patrick S. G. Chain,Patrick S. G. Chain,I. Min A. Chen,Lei Chen,Shaila Chhibba,Ken Chu,Dawn Ciulla,Jose C. Clemente,Sandra W. Clifton,Sean Conlan,Jonathan Crabtree,Mary A. Cutting,Noam J. Davidovics,Catherine C. Davis,Todd Z. DeSantis,Carolyn Deal,Kimberley D. Delehaunty,Floyd E. Dewhirst,Elena Deych,Yan Ding,David J. Dooling,Shannon Dugan,Wm. Michael Dunne,Wm. Michael Dunne,A. Scott Durkin,Robert C. Edgar,Rachel L. Erlich,Candace N. Farmer,Ruth M. Farrell,Karoline Faust,Michael Feldgarden,Victor Felix,Sheila Fisher,Anthony A. Fodor,Larry J. Forney,Leslie Foster,Valentina Di Francesco,Jonathan Friedman,Dennis C. Friedrich,Catrina Fronick,Lucinda Fulton,Hongyu Gao,Nathalia Garcia,Georgia Giannoukos,Christina Giblin,Maria Y. Giovanni,Jonathan M. Goldberg,Johannes B. Goll,Antonio Gonzalez,Allison D. Griggs,Sharvari Gujja,Susan Kinder Haake,Brian J. Haas,Holli A. Hamilton,Emily L. Harris,Theresa A. Hepburn,Brandi Herter,Diane E. Hoffmann,Michael Holder,Clinton Howarth,Katherine H. Huang,Susan M. Huse,Jacques Izard,Janet K. Jansson,Huaiyang Jiang,Catherine Jordan,Vandita Joshi,James A. Katancik,Wendy A. Keitel,Scott T. Kelley,Cristyn Kells,Nicholas B. King,Dan Knights,Heidi H. Kong,Omry Koren,Sergey Koren,Karthik Kota,Christie Kovar,Nikos C. Kyrpides,Patricio S. La Rosa,Sandra L. Lee,Katherine P. Lemon,Niall J. Lennon,Cecil M. Lewis,Lora Lewis,Ruth E. Ley,Kelvin Li,Konstantinos Liolios,Bo Liu,Yue Liu,Chien Chi Lo,Catherine A. Lozupone,R. Dwayne Lunsford,Tessa Madden,Anup Mahurkar,Peter J. Mannon,Elaine R. Mardis,Victor M. Markowitz,Victor M. Markowitz,Konstantinos Mavromatis,Jamison McCorrison,Daniel McDonald,Jean E. McEwen,Amy L. McGuire,Pamela McInnes,Teena Mehta,Kathie A. Mihindukulasuriya,Jason R. Miller,Patrick Minx,Irene Newsham,Chad Nusbaum,Michelle Oglaughlin,Joshua Orvis,Ioanna Pagani,Krishna Palaniappan,Shital M. Patel,Matthew D. Pearson,Jane Peterson,Mircea Podar,Craig Pohl,Katherine S. Pollard,Mihai Pop,Margaret Priest,Lita M. Proctor,Xiang Qin,Jeroen Raes,Jacques Ravel,Jeffrey G. Reid,Mina Rho,Rosamond Rhodes,Kevin Riehle,Maria C. Rivera,Beltran Rodriguez-Mueller,Yu-Hui Rogers,Matthew C. Ross,Carsten Russ,Ravi Sanka,Pamela Sankar,J. Fah Sathirapongsasuti,Jeffery A. Schloss,Patrick D. Schloss,Thomas M. Schmidt,Matthew B. Scholz,Lynn M. Schriml,Alyxandria M. Schubert,Nicola Segata,Julia A. Segre,William D. Shannon,Richard R. Sharp,Thomas J. Sharpton,Narmada Shenoy,Nihar U. Sheth,Gina A. Simone,Indresh Singh,Christopher Smillie,Jack D. Sobel,Daniel D. Sommer,Paul Spicer,Granger G. Sutton,Sean M. Sykes,Diana Tabbaa,Mathangi Thiagarajan,Chad Tomlinson,Manolito Torralba,Todd J. Treangen,Rebecca Truty,Tatiana A. Vishnivetskaya,Jason Walker,Lu Wang,Zhengyuan Wang,Doyle V. Ward,Wesley C. Warren,Mark A. Watson,Christopher Wellington,Kris A. Wetterstrand,James R. White,Katarzyna Wilczek-Boney,Yuanqing Wu,Kristine M. Wylie,Todd Wylie,Chandri Yandava,Liang Ye,Yuzhen Ye,Shibu Yooseph,Bonnie P. Youmans,Lan Zhang,Yanjiao Zhou,Yiming Zhu,Laurie Zoloth,Jeremy Zucker,Bruce W. Birren,Richard A. Gibbs,Sarah K. Highlander,Barbara A. Methé,Karen E. Nelson,Joseph F. Petrosino,George M. Weinstock,Richard K. Wilson,Owen White +253 more
TL;DR: The Human Microbiome Project Consortium reported the first results of their analysis of microbial communities from distinct, clinically relevant body habitats in a human cohort; the insights into the microbial communities of a healthy population lay foundations for future exploration of the epidemiology, ecology and translational applications of the human microbiome as discussed by the authors.
Journal ArticleDOI
Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences
Weizhong Li,Adam Godzik +1 more
TL;DR: Cd-hit-2d compares two protein datasets and reports similar matches between them; cd- Hit-est clusters a DNA/RNA sequence database and cd- hit-est-2D compares two nucleotide datasets.
Journal ArticleDOI
Sequencing technologies-the next generation
TL;DR: A technical review of template preparation, sequencing and imaging, genome alignment and assembly approaches, and recent advances in current and near-term commercially available NGS instruments is presented.
Book
The C++ Programming Language
TL;DR: Bjarne Stroustrup makes C even more accessible to those new to the language, while adding advanced information and techniques that even expert C programmers will find invaluable.
Journal ArticleDOI
Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample
J. Gregory Caporaso,Christian L. Lauber,William A. Walters,Donna Berg-Lyons,Catherine A. Lozupone,Peter J. Turnbaugh,Noah Fierer,Rob Knight +7 more
TL;DR: This work sequences a diverse array of 25 environmental samples and three known “mock communities” at a depth averaging 3.1 million reads per sample to demonstrate excellent consistency in taxonomic recovery and recapture diversity patterns that were previously reported on the basis of metaanalysis of many studies from the literature.
Related Papers (5)
QIIME allows analysis of high-throughput community sequencing data.
J. Gregory Caporaso,Justin Kuczynski,Jesse Stombaugh,Kyle Bittinger,Frederic D. Bushman,Elizabeth K. Costello,Noah Fierer,Antonio Gonzalez Peña,Julia K. Goodrich,Jeffrey I. Gordon,Gavin A. Huttley,Scott T. Kelley,Dan Knights,Jeremy E. Koenig,Ruth E. Ley,Catherine A. Lozupone,Daniel McDonald,Brian D. Muegge,Meg Pirrung,Jens Reeder,Joel Sevinsky,Peter J. Turnbaugh,William A. Walters,Jeremy Widmann,Tanya Yatsunenko,Jesse R. Zaneveld,Rob Knight,Rob Knight +27 more