Naïve Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy
TLDR
The RDP Classifier can rapidly and accurately classify bacterial 16S rRNA sequences into the new higher-order taxonomy proposed in Bergey's Taxonomic Outline of the Prokaryotes, and the majority of the classification errors appear to be due to anomalies in the current taxonomies.Abstract:
The Ribosomal Database Project (RDP) Classifier, a naive Bayesian classifier, can rapidly and accurately classify bacterial 16S rRNA sequences into the new higher-order taxonomy proposed in Bergey's Taxonomic Outline of the Prokaryotes (2nd ed., release 5.0, Springer-Verlag, New York, NY, 2004). It provides taxonomic assignments from domain to genus, with confidence estimates for each assignment. The majority of classifications (98%) were of high estimated confidence (≥95%) and high accuracy (98%). In addition to being tested with the corpus of 5,014 type strain sequences from Bergey's outline, the RDP Classifier was tested with a corpus of 23,095 rRNA sequences as assigned by the NCBI into their alternative higher-order taxonomy. The results from leave-one-out testing on both corpora show that the overall accuracies at all levels of confidence for near-full-length and 400-base segments were 89% or above down to the genus level, and the majority of the classification errors appear to be due to anomalies in the current taxonomies. For shorter rRNA segments, such as those that might be generated by pyrosequencing, the error rate varied greatly over the length of the 16S rRNA gene, with segments around the V2 and V4 variable regions giving the lowest error rates. The RDP Classifier is suitable both for the analysis of single rRNA sequences and for the analysis of libraries of thousands of sequences. Another related tool, RDP Library Compare, was developed to facilitate microbial-community comparison based on 16S rRNA gene sequence libraries. It combines the RDP Classifier with a statistical test to flag taxa differentially represented between samples. The RDP Classifier and RDP Library Compare are available online at http://rdp.cme.msu.edu/.read more
Citations
More filters
Journal ArticleDOI
QIIME allows analysis of high-throughput community sequencing data.
J. Gregory Caporaso,Justin Kuczynski,Jesse Stombaugh,Kyle Bittinger,Frederic D. Bushman,Elizabeth K. Costello,Noah Fierer,Antonio Gonzalez Peña,Julia K. Goodrich,Jeffrey I. Gordon,Gavin A. Huttley,Scott T. Kelley,Dan Knights,Jeremy E. Koenig,Ruth E. Ley,Catherine A. Lozupone,Daniel McDonald,Brian D. Muegge,Meg Pirrung,Jens Reeder,Joel Sevinsky,Peter J. Turnbaugh,William A. Walters,Jeremy Widmann,Tanya Yatsunenko,Jesse R. Zaneveld,Rob Knight,Rob Knight +27 more
TL;DR: An overview of the analysis pipeline and links to raw data and processed output from the runs with and without denoising are provided.
Journal ArticleDOI
Metagenomic biomarker discovery and explanation
Nicola Segata,Jacques Izard,Jacques Izard,Levi Waldron,Dirk Gevers,Larisa Miropolsky,Wendy S. Garrett,Curtis Huttenhower +7 more
TL;DR: A new method for metagenomic biomarker discovery is described and validates by way of class comparison, tests of biological consistency and effect size estimation to address the challenge of finding organisms, genes, or pathways that consistently explain the differences between two or more microbial communities.
Journal ArticleDOI
Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample
J. Gregory Caporaso,Christian L. Lauber,William A. Walters,Donna Berg-Lyons,Catherine A. Lozupone,Peter J. Turnbaugh,Noah Fierer,Rob Knight +7 more
TL;DR: This work sequences a diverse array of 25 environmental samples and three known “mock communities” at a depth averaging 3.1 million reads per sample to demonstrate excellent consistency in taxonomic recovery and recapture diversity patterns that were previously reported on the basis of metaanalysis of many studies from the literature.
Journal Article
Structure, function and diversity of the healthy human microbiome
Curtis Huttenhower,Dirk Gevers,Rob Knight,Sahar Abubucker,Jonathan H. Badger,Asif T. Chinwalla,Heather Huot Creasy,Ashlee M. Earl,Michael Fitzgerald,Robert S. Fulton,Michelle G. Giglio,Kymberlie Hallsworth-Pepin,Elizabeth A. Lobos,Ramana Madupu,Vincent Magrini,John Martin,Makedonka Mitreva,Donna M. Muzny,Erica Sodergren,James Versalovic,Aye Wollam,Kim C. Worley,Jennifer R. Wortman,Sarah Young,Qiandong Zeng,Kjersti Aagaard,Olukemi O. Abolude,Emma Allen-Vercoe,Eric J. Alm,Lucia Alvarado,Gary L. Andersen,Scott Anderson,Elizabeth L. Appelbaum,Harindra Arachchi,Gary C. Armitage,Cesar Arze,Tulin Ayvaz,Carl C. Baker,Lisa Begg,Tsegahiwot Belachew,Veena Bhonagiri,Monika Bihan,Martin J. Blaser,Toby Bloom,Vivien Bonazzi,J. Paul Brooks,Gregory A. Buck,Christian J. Buhay,Dana A. Busam,Joseph L. Campbell,Shane Canon,Brandi L. Cantarel,Patrick S. G. Chain,I-Min A. Chen,Lei Chen,Shaila Chhibba,Ken Chu,Dawn Ciulla,Jose C. Clemente,Sandra W. Clifton,Sean Conlan,Jonathan Crabtree,Mary A. Cutting,Noam J. Davidovics,Catherine C. Davis,Todd Z. DeSantis,Carolyn Deal,Kimberley D. Delehaunty,Floyd E. Dewhirst,Elena Deych,Yan Ding,David J. Dooling,Shannon Dugan,Wm. Michael Dunne,A. Scott Durkin,Robert C. Edgar,Rachel L. Erlich,Candace N. Farmer,Ruth M. Farrell,Karoline Faust,Michael Feldgarden,Victor Felix,Sheila A. Fisher,Anthony A. Fodor,Larry J. Forney,Leslie Foster,Valentina Di Francesco,Jonathan Friedman,Dennis C. Friedrich,Catrina Fronick,Lucinda Fulton,Hongyu Gao,Nathalia Garcia,Georgia Giannoukos,Christina Giblin,Maria Y. Giovanni,Jonathan M. Goldberg,Johannes B. Goll,Antonio Gonzalez,Allison D. Griggs,Sharvari Gujja,Susan Kinder Haake,Brian J. Haas,Holli A. Hamilton,Emily L. Harris,Theresa A. Hepburn,Brandi Herter,Diane E. Hoffmann,Michael Holder,Clinton Howarth,Katherine H. Huang,Susan M. Huse,Jacques Izard,Janet K. Jansson,Huaiyang Jiang,Craig T. Jordan,Vandita Joshi,James A. Katancik,Wendy A. Keitel,Scott T. Kelley,Cristyn Kells,Nicholas B. King,Dan Knights,Heidi H. Kong,Omry Koren,Sergey Koren,Karthik Kota,Christie Kovar,Nikos C. Kyrpides,Patricio S. La Rosa,Sandra L. Lee,Katherine P. Lemon,Niall J. Lennon,Cecil M. Lewis,Lora Lewis,Ruth E. Ley,Kelvin Li,Konstantinos Liolios,Bo Liu,Yue Liu,Chien-Chi Lo,Catherine A. Lozupone,R. Dwayne Lunsford,Tessa Madden,Anup Mahurkar,Peter J. Mannon,Elaine R. Mardis,Victor M. Markowitz,Konstantinos Mavromatis,Jamison McCorrison,Daniel McDonald,Jean E. McEwen,Amy L. McGuire,Pamela McInnes,Teena Mehta,Kathie A. Mihindukulasuriya,Jason R. Miller,Patrick Minx,Irene Newsham,Chad Nusbaum,Michelle O'Laughlin,Joshua Orvis,Ioanna Pagani,Krishna Palaniappan,Shital M. Patel,Matthew D. Pearson,Jane Peterson,Mircea Podar,Craig Pohl,Katherine S. Pollard,Mihai Pop,Margaret Priest,Lita M. Proctor,Xiang Qin,Jeroen Raes,Jacques Ravel,Jeffrey G. Reid,Mina Rho,Rosamond Rhodes,Kevin Riehle,Maria C. Rivera,Beltran Rodriguez-Mueller,Yu-Hui Rogers,Matthew C. Ross,Carsten Russ,Ravi Sanka,Pamela Sankar,J. Fah Sathirapongsasuti,Jeffery A. Schloss,Patrick D. Schloss,Thomas M. Schmidt,Matthew B. Scholz,Lynn M. Schriml,Alyxandria M. Schubert,Nicola Segata,Julia A. Segre,William D. Shannon,Richard R. Sharp,Thomas J. Sharpton,Narmada Shenoy,Nihar U. Sheth,Gina A. Simone,Indresh Singh,Christopher Smillie,Jack D. Sobel,Daniel D. Sommer,Paul Spicer,Granger G. Sutton,Sean M. Sykes,Diana Tabbaa,Mathangi Thiagarajan,Chad Tomlinson,Manolito Torralba,Todd J. Treangen,Rebecca Truty,Tatiana A. Vishnivetskaya,Jason Walker,Lu Wang,Zhengyuan Wang,Doyle V. Ward,Wesley C. Warren,Mark A. Watson,Christopher Wellington,Kris A. Wetterstrand,James R. White,Katarzyna Wilczek-Boney,Yuanqing Wu,Kristine M. Wylie,Todd Wylie,Chandri N. Yandava,Liang Ye,Yuzhen Ye,Shibu Yooseph,Bonnie P. Youmans,Lan Zhang,Yanjiao Zhou,Yiming Zhu,Laurie Zoloth,Jeremy Zucker,Bruce W. Birren,Richard A. Gibbs,Sarah K. Highlander,Barbara A. Methé,Karen E. Nelson,Joseph F. Petrosino,George M. Weinstock,Richard K. Wilson,Owen White +247 more
TL;DR: The Human Microbiome Project has analysed the largest cohort and set of distinct, clinically relevant body habitats so far, finding the diversity and abundance of each habitat’s signature microbes to vary widely even among healthy subjects, with strong niche specialization both within and among individuals.
Journal ArticleDOI
Enterotypes of the human gut microbiome
Manimozhiyan Arumugam,Jeroen Raes,Eric Pelletier,Denis Le Paslier,Takuji Yamada,Daniel R. Mende,Gabriel Fernandes,Julien Tap,Thomas Brüls,Jean-Michel Batto,Marcelo Bertalan,Natalia Borruel,Francesc Casellas,Leyden Fernández,Laurent Gautier,Torben Hansen,Masahira Hattori,Tetsuya Hayashi,Michiel Kleerebezem,Ken Kurokawa,Marion Leclerc,Florence Levenez,Chaysavanh Manichanh,H. Bjørn Nielsen,Trine Nielsen,Nicolas Pons,Julie Poulain,Junjie Qin,Thomas Sicheritz-Pontén,Sebastian Tims,David Torrents,Edgardo Ugarte,Erwin G. Zoetendal,Jun Wang,Francisco Guarner,Oluf Pedersen,Willem M. de Vos,Søren Brunak,Joël Doré,Jean Weissenbach,S. Dusko Ehrlich,Peer Bork +41 more
TL;DR: Three robust clusters (referred to as enterotypes hereafter) are identified that are not nation or continent specific and confirmed in two published, larger cohorts, indicating that intestinal microbiota variation is generally stratified, not continuous.
References
More filters
Book
Bergey's Manual of Systematic Bacteriology
TL;DR: BCL3 and Sheehy cite Bergey's manual of determinative bacteriology of which systematic bacteriology, first edition, is an expansion.
Journal ArticleDOI
Database resources of the National Center for Biotechnology Information
David L. Wheeler,Deanna M. Church,Ron Edgar,Scott Federhen,Wolfgang Helmberg,Thomas L. Madden,Joan Pontius,Gregory D. Schuler,Lynn M. Schriml,Edwin Sequeira,Tugba O. Suzek,Tatiana Tatusova,Lukas Wagner +12 more
TL;DR: In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI’s website.
Journal ArticleDOI
UniFrac: a New Phylogenetic Method for Comparing Microbial Communities
Catherine A. Lozupone,Rob Knight +1 more
TL;DR: The results illustrate that UniFrac provides a new way of characterizing microbial communities, using the wealth of environmental rRNA sequences, and allows quantitative insight into the factors that underlie the distribution of lineages among environments.
Journal ArticleDOI
Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya.
TL;DR: It is proposed that a formal system of organisms be established in which above the level of kingdom there exists a new taxon called a "domain." Life on this planet would be seen as comprising three domains, the Bacteria, the Archaea, and the Eucarya, each containing two or more kingdoms.
Journal ArticleDOI
Microbial diversity in the deep sea and the underexplored “rare biosphere”
Mitchell L. Sogin,Hilary G. Morrison,Julie A. Huber,David B. Mark Welch,Susan M. Huse,Phillip R. Neal,Jesús M. Arrieta,Gerhard J. Herndl +7 more
TL;DR: It is shown that bacterial communities of deep water masses of the North Atlantic and diffuse flow hydrothermal vents are one to two orders of magnitude more complex than previously reported for any microbial environment.
Related Papers (5)
QIIME allows analysis of high-throughput community sequencing data.
J. Gregory Caporaso,Justin Kuczynski,Jesse Stombaugh,Kyle Bittinger,Frederic D. Bushman,Elizabeth K. Costello,Noah Fierer,Antonio Gonzalez Peña,Julia K. Goodrich,Jeffrey I. Gordon,Gavin A. Huttley,Scott T. Kelley,Dan Knights,Jeremy E. Koenig,Ruth E. Ley,Catherine A. Lozupone,Daniel McDonald,Brian D. Muegge,Meg Pirrung,Jens Reeder,Joel Sevinsky,Peter J. Turnbaugh,William A. Walters,Jeremy Widmann,Tanya Yatsunenko,Jesse R. Zaneveld,Rob Knight,Rob Knight +27 more
Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities
Patrick D. Schloss,Patrick D. Schloss,Sarah L. Westcott,Sarah L. Westcott,Thomas Ryabin,Justine R. Hall,Martin Hartmann,Emily B. Hollister,Ryan A. Lesniewski,Brian B. Oakley,Donovan H. Parks,Courtney J. Robinson,Jason W. Sahl,Blaz Stres,Gerhard G. Thallinger,David J. Van Horn,Carolyn F. Weber +16 more