Structure, function and diversity of the healthy human microbiome

doi:10.1038/NATURE11234

Home
/
Papers
/
Structure, function and diversity of the healthy human microbiome

Journal Article•DOI•

Structure, function and diversity of the healthy human microbiome

Curtis Huttenhower¹, Curtis Huttenhower², Dirk Gevers², Rob Knight³ +250 more•Institutions (42)

14 Jun 2012-Nature (Nature Publishing Group)-Vol. 486, Iss: 7402, pp 207-214

TL;DR: The Human Microbiome Project Consortium reported the first results of their analysis of microbial communities from distinct, clinically relevant body habitats in a human cohort; the insights into the microbial communities of a healthy population lay foundations for future exploration of the epidemiology, ecology and translational applications of the human microbiome as discussed by the authors.

read less

Abstract: The Human Microbiome Project Consortium reports the first results of their analysis of microbial communities from distinct, clinically relevant body habitats in a human cohort; the insights into the microbial communities of a healthy population lay foundations for future exploration of the epidemiology, ecology and translational applications of the human microbiome.

...read moreread less

Content maybe subject to copyright Report

Citations

PDF

Open Access

More filters

Journal Article•DOI•

DADA2: High-resolution sample inference from Illumina amplicon data

[...]

Benjamin J. Callahan¹, Paul J. McMurdie, Michael J. Rosen¹, Andrew W. Han, Amy Jo A. Johnson, Susan Holmes¹ - Show less +2 more•Institutions (1)

Stanford University¹

01 Jul 2016-Nature Methods

TL;DR: The open-source software package DADA2 for modeling and correcting Illumina-sequenced amplicon errors is presented, revealing a diversity of previously undetected Lactobacillus crispatus variants.

...read moreread less

Abstract: We present the open-source software package DADA2 for modeling and correcting Illumina-sequenced amplicon errors (https://github.com/benjjneb/dada2). DADA2 infers sample sequences exactly and resolves differences of as little as 1 nucleotide. In several mock communities, DADA2 identified more real variants and output fewer spurious sequences than other methods. We applied DADA2 to vaginal samples from a cohort of pregnant women, revealing a diversity of previously undetected Lactobacillus crispatus variants.

...read moreread less

14,505 citations

Journal Article•DOI•

UPARSE: highly accurate OTU sequences from microbial amplicon reads

[...]

Robert C. Edgar

01 Oct 2013-Nature Methods

TL;DR: The UPARSE pipeline reports operational taxonomic unit (OTU) sequences with ≤1% incorrect bases in artificial microbial community tests, compared with >3% correct bases commonly reported by other methods.

...read moreread less

Abstract: Amplified marker-gene sequences can be used to understand microbial community structure, but they suffer from a high level of sequencing and amplification artifacts. The UPARSE pipeline reports operational taxonomic unit (OTU) sequences with ≤1% incorrect bases in artificial microbial community tests, compared with >3% incorrect bases commonly reported by other methods. The improved accuracy results in far fewer OTUs, consistently closer to the expected number of species in a community.

...read moreread less

11,329 citations

Journal Article•DOI•

phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data.

[...]

Paul J. McMurdie¹, Susan Holmes¹•Institutions (1)

Stanford University¹

22 Apr 2013-PLOS ONE

TL;DR: The phyloseq project for R is a new open-source software package dedicated to the object-oriented representation and analysis of microbiome census data in R, which supports importing data from a variety of common formats, as well as many analysis techniques.

...read moreread less

Abstract: Background The analysis of microbial communities through DNA sequencing brings many challenges: the integration of different types of data with methods from ecology, genetics, phylogenetics, multivariate statistics, visualization and testing. With the increased breadth of experimental designs now being pursued, project-specific statistical analyses are often needed, and these analyses are often difficult (or impossible) for peer researchers to independently reproduce. The vast majority of the requisite tools for performing these analyses reproducibly are already implemented in R and its extensions (packages), but with limited support for high throughput microbiome census data. Results Here we describe a software project, phyloseq, dedicated to the object-oriented representation and analysis of microbiome census data in R. It supports importing data from a variety of common formats, as well as many analysis techniques. These include calibration, filtering, subsetting, agglomeration, multi-table comparisons, diversity analysis, parallelized Fast UniFrac, ordination methods, and production of publication-quality graphics; all in a manner that is easy to document, share, and modify. We show how to apply functions from other R packages to phyloseq-represented data, illustrating the availability of a large number of open source analysis techniques. We discuss the use of phyloseq with tools for reproducible research, a practice common in other fields but still rare in the analysis of highly parallel microbiome census data. We have made available all of the materials necessary to completely reproduce the analysis and figures included in this article, an example of best practices for reproducible research. Conclusions The phyloseq project for R is a new open-source software package, freely available on the web from both GitHub and Bioconductor.

...read moreread less

11,272 citations

Journal Article•DOI•

Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences

[...]

Morgan G. I. Langille¹, Jesse R. Zaneveld², J. Gregory Caporaso³, J. Gregory Caporaso⁴, Daniel McDonald⁵, Dan Knights⁶, Joshua A Reyes⁷, Jose C. Clemente⁵, Deron E. Burkepile⁸, Rebecca Vega Thurber², Rob Knight⁵, Rob Knight⁹, Robert G. Beiko¹, Curtis Huttenhower⁷, Curtis Huttenhower¹⁰ - Show less +11 more•Institutions (10)

Dalhousie University¹, Oregon State University², Argonne National Laboratory³, Northern Arizona University⁴, University of Colorado Boulder⁵, University of Minnesota⁶, Harvard University⁷, Florida International University⁸, Howard Hughes Medical Institute⁹, Broad Institute¹⁰

01 Sep 2013-Nature Biotechnology

TL;DR: The results demonstrate that phylogeny and function are sufficiently linked that this 'predictive metagenomic' approach should provide useful insights into the thousands of uncultivated microbial communities for which only marker gene surveys are currently available.

...read moreread less

Abstract: Profiling phylogenetic marker genes, such as the 16S rRNA gene, is a key tool for studies of microbial communities but does not provide direct evidence of a community's functional capabilities. Here we describe PICRUSt (phylogenetic investigation of communities by reconstruction of unobserved states), a computational approach to predict the functional composition of a metagenome using marker gene data and a database of reference genomes. PICRUSt uses an extended ancestral-state reconstruction algorithm to predict which gene families are present and then combines gene families to estimate the composite metagenome. Using 16S information, PICRUSt recaptures key findings from the Human Microbiome Project and accurately predicts the abundance of gene families in host-associated and environmental communities, with quantifiable uncertainty. Our results demonstrate that phylogeny and function are sufficiently linked that this 'predictive metagenomic' approach should provide useful insights into the thousands of uncultivated microbial communities for which only marker gene surveys are currently available.

...read moreread less

6,860 citations

Journal Article•DOI•

VSEARCH: a versatile open source tool for metagenomics

[...]

Torbjørn Rognes¹, Torbjørn Rognes², Tomas Flouri³, Tomas Flouri⁴, Ben Nichols⁵, Christopher Quince⁶, Christopher Quince⁵, Frédéric Mahé⁷ - Show less +4 more•Institutions (7)

Oslo University Hospital¹, University of Oslo², Karlsruhe Institute of Technology³, Heidelberg Institute for Theoretical Studies⁴, University of Glasgow⁵, University of Warwick⁶, Kaiserslautern University of Technology⁷

18 Oct 2016-PeerJ

TL;DR: VSEARCH is here shown to be more accurate than USEARCH when performing searching, clustering, chimera detection and subsampling, while on a par with US EARCH for paired-ends read merging and dereplication.

...read moreread less

Abstract: Background: VSEARCH is an open source and free of charge multithreaded 64-bit tool for processing and preparing metagenomics, genomics and population genomics nucleotide sequence data. It is designed as an alternative to the widely used USEARCH tool (Edgar, 2010) for which the source code is not publicly available, algorithm details are only rudimentarily described, and only a memory-confined 32-bit version is freely available for academic use. Methods: When searching nucleotide sequences, VSEARCH uses a fast heuristic based on words shared by the query and target sequences in order to quickly identify similar sequences, a similar strategy is probably used in USEARCH. VSEARCH then performs optimal global sequence alignment of the query against potential target sequences, using full dynamic programming instead of the seed-and-extend heuristic used by USEARCH. Pairwise alignments are computed in parallel using vectorisation and multiple threads. Results: VSEARCH includes most commands for analysing nucleotide sequences available in USEARCH version 7 and several of those available in USEARCH version 8, including searching (exact or based on global alignment), clustering by similarity (using length pre-sorting, abundance pre-sorting or a user-defined order), chimera detection (reference-based or de novo), dereplication (full length or prefix), pairwise alignment, reverse complementation, sorting, and subsampling. VSEARCH also includes commands for FASTQ file processing, i.e., format detection, filtering, read quality statistics, and merging of paired reads. Furthermore, VSEARCH extends functionality with several new commands and improvements, including shuffling, rereplication, masking of low-complexity sequences with the well-known DUST algorithm, a choice among different similarity definitions, and FASTQ file format conversion. VSEARCH is here shown to be more accurate than USEARCH when performing searching, clustering, chimera detection and subsampling, while on a par with USEARCH for paired-ends read merging. VSEARCH is slower than USEARCH when performing clustering and chimera detection, but significantly faster when performing paired-end reads merging and dereplication. VSEARCH is available at https://github.com/torognes/vsearch under either the BSD 2-clause license or the GNU General Public License version 3.0. Discussion: VSEARCH has been shown to be a fast, accurate and full-fledged alternative to USEARCH. A free and open-source versatile tool for sequence analysis is now available to the metagenomics community.

...read moreread less

5,850 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

References

PDF

Open Access

More filters

Journal Article•DOI•

QIIME allows analysis of high-throughput community sequencing data.

[...]

J. Gregory Caporaso¹, Justin Kuczynski¹, Jesse Stombaugh¹, Kyle Bittinger², Frederic D. Bushman², Elizabeth K. Costello¹, Noah Fierer³, Antonio Gonzalez Peña¹, Julia K. Goodrich¹, Jeffrey I. Gordon⁴, Gavin A. Huttley⁵, Scott T. Kelley⁶, Dan Knights¹, Jeremy E. Koenig⁷, Ruth E. Ley⁷, Catherine A. Lozupone¹, Daniel McDonald¹, Brian D. Muegge⁴, Meg Pirrung¹, Jens Reeder¹, Joel Sevinsky, Peter J. Turnbaugh⁴, William A. Walters¹, Jeremy Widmann¹, Tanya Yatsunenko⁴, Jesse R. Zaneveld¹, Rob Knight¹, Rob Knight⁸ - Show less +24 more•Institutions (8)

University of Colorado Boulder¹, University of Pennsylvania², Cooperative Institute for Research in Environmental Sciences³, Washington University in St. Louis⁴, Australian National University⁵, San Diego State University⁶, Cornell University⁷, Howard Hughes Medical Institute⁸

11 Apr 2010-Nature Methods

TL;DR: An overview of the analysis pipeline and links to raw data and processed output from the runs with and without denoising are provided.

...read moreread less

Abstract: Supplementary Figure 1 Overview of the analysis pipeline. Supplementary Table 1 Details of conventionally raised and conventionalized mouse samples. Supplementary Discussion Expanded discussion of QIIME analyses presented in the main text; Sequencing of 16S rRNA gene amplicons; QIIME analysis notes; Expanded Figure 1 legend; Links to raw data and processed output from the runs with and without denoising.

...read moreread less

28,911 citations

Journal Article•DOI•

A human gut microbial gene catalogue established by metagenomic sequencing

[...]

Junjie Qin¹, Ruiqiang Li¹, Jeroen Raes², Manimozhiyan Arumugam, Kristoffer Sølvsten Burgdorf, Chaysavanh Manichanh, Trine Nielsen, Nicolas Pons³, Florence Levenez³, Takuji Yamada, Daniel R. Mende, Junhua Li¹, Junming Xu¹, Shaochuan Li¹, Dongfang Li¹, Jianjun Cao¹, Bo Wang¹, Huiqing Liang¹, Huisong Zheng¹, Yinlong Xie¹, Julien Tap³, Patricia Lepage³, Marcelo Bertalan, Jean-Michel Batto³, Torben Hansen, Denis Le Paslier, Allan Linneberg, H. Bjørn Nielsen, Eric Pelletier, Pierre Renault³, Thomas Sicheritz-Pontén, Keith Turner⁴, Hongmei Zhu¹, Chang Yu¹, Shengting Li¹, Min Jian¹, Yan Zhou¹, Yingrui Li¹, Xiuqing Zhang¹, Songgang Li¹, Nan Qin¹, Huanming Yang¹, Jian Wang¹, Søren Brunak, Joël Doré³, Francisco Guarner⁵, Karsten Kristiansen, Oluf Pedersen, Julian Parkhill, Jean Weissenbach, Peer Bork, S. Dusko Ehrlich³, Jun Wang¹ - Show less +49 more•Institutions (5)

Beijing Genomics Institute¹, Vrije Universiteit Brussel², Institut national de la recherche agronomique³, Wellcome Trust Sanger Institute⁴, Hebron University⁵

04 Mar 2010-Nature

TL;DR: The Illumina-based metagenomic sequencing, assembly and characterization of 3.3 million non-redundant microbial genes, derived from 576.7 gigabases of sequence, from faecal samples of 124 European individuals are described, indicating that the entire cohort harbours between 1,000 and 1,150 prevalent bacterial species and each individual at least 160 such species.

...read moreread less

Abstract: To understand the impact of gut microbes on human health and well-being it is crucial to assess their genetic potential. Here we describe the Illumina-based metagenomic sequencing, assembly and characterization of 3.3 million non-redundant microbial genes, derived from 576.7 gigabases of sequence, from faecal samples of 124 European individuals. The gene set, ~150 times larger than the human gene complement, contains an overwhelming majority of the prevalent (more frequent) microbial genes of the cohort and probably includes a large proportion of the prevalent human intestinal microbial genes. The genes are largely shared among individuals of the cohort. Over 99% of the genes are bacterial, indicating that the entire cohort harbours between 1,000 and 1,150 prevalent bacterial species and each individual at least 160 such species, which are also largely shared. We define and describe the minimal gut metagenome and the minimal gut bacterial genome in terms of functions present in all individuals and most bacteria, respectively

...read moreread less

9,268 citations

Journal Article•DOI•

A core gut microbiome in obese and lean twins

[...]

Peter J. Turnbaugh¹, Micah Hamady², Tanya Yatsunenko¹, Brandi L. Cantarel³, Alexis E. Duncan¹, Ruth E. Ley¹, Mitchell L. Sogin⁴, William J. Jones⁵, Bruce A. Roe⁶, Jason P. Affourtit, Michael Egholm, Bernard Henrissat³, Andrew C. Heath¹, Rob Knight², Jeffrey I. Gordon¹ - Show less +11 more•Institutions (6)

Washington University in St. Louis¹, University of Colorado Boulder², Centre national de la recherche scientifique³, Marine Biological Laboratory⁴, University of South Carolina⁵, University of Oklahoma⁶

22 Jan 2009-Nature

TL;DR: The faecal microbial communities of adult female monozygotic and dizygotic twin pairs concordant for leanness or obesity, and their mothers are characterized to address how host genotype, environmental exposure and host adiposity influence the gut microbiome.

...read moreread less

Abstract: The human distal gut harbours a vast ensemble of microbes (the microbiota) that provide important metabolic capabilities, including the ability to extract energy from otherwise indigestible dietary polysaccharides. Studies of a few unrelated, healthy adults have revealed substantial diversity in their gut communities, as measured by sequencing 16S rRNA genes, yet how this diversity relates to function and to the rest of the genes in the collective genomes of the microbiota (the gut microbiome) remains obscure. Studies of lean and obese mice suggest that the gut microbiota affects energy balance by influencing the efficiency of calorie harvest from the diet, and how this harvested energy is used and stored. Here we characterize the faecal microbial communities of adult female monozygotic and dizygotic twin pairs concordant for leanness or obesity, and their mothers, to address how host genotype, environmental exposure and host adiposity influence the gut microbiome. Analysis of 154 individuals yielded 9,920 near full-length and 1,937,461 partial bacterial 16S rRNA sequences, plus 2.14 gigabases from their microbiomes. The results reveal that the human gut microbiome is shared among family members, but that each person's gut microbial community varies in the specific bacterial lineages present, with a comparable degree of co-variation between adult monozygotic and dizygotic twin pairs. However, there was a wide array of shared microbial genes among sampled individuals, comprising an extensive, identifiable 'core microbiome' at the gene, rather than at the organismal lineage, level. Obesity is associated with phylum-level changes in the microbiota, reduced bacterial diversity and altered representation of bacterial genes and metabolic pathways. These results demonstrate that a diversity of organismal assemblages can nonetheless yield a core microbiome at a functional level, and that deviations from this core are associated with different physiological states (obese compared with lean).

...read moreread less

6,970 citations

Journal Article•DOI•

Faecalibacterium prausnitzii is an anti-inflammatory commensal bacterium identified by gut microbiota analysis of Crohn disease patients

[...]

Harry Sokol¹, Bénédicte Pigneur, Laurie Watterlot, Omar Lakhdari, Luis G. Bermúdez-Humarán, Jean-Jacques Gratadoux, Sébastien Blugeon, Chantal Bridonneau, Jean-Pierre Furet², Gérard Corthier, Corinne Grangette, Nadia Vasquez, Philippe Pochart, Germain Trugnan³, Ginette Thomas³, Hervé M. Blottière, Joël Doré¹, Philippe Marteau, Philippe Seksik³, Philippe Langella - Show less +16 more•Institutions (3)

Institut national de la recherche agronomique¹, Micalis Institute², Pierre-and-Marie-Curie University³

28 Oct 2008-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The results suggest that counterbalancing dysbiosis using F. prausnitzii as a probiotic is a promising strategy in CD treatment and exhibits anti-inflammatory effects on cellular and TNBS colitis models, partly due to secreted metabolites able to block NF-κB activation and IL-8 production.

...read moreread less

Abstract: A decrease in the abundance and biodiversity of intestinal bacteria within the dominant phylum Firmicutes has been observed repeatedly in Crohn disease (CD) patients. In this study, we determined the composition of the mucosa-associated microbiota of CD patients at the time of surgical resection and 6 months later using FISH analysis. We found that a reduction of a major member of Firmicutes, Faecalibacterium prausnitzii, is associated with a higher risk of postoperative recurrence of ileal CD. A lower proportion of F. prausnitzii on resected ileal Crohn mucosa also was associated with endoscopic recurrence at 6 months. To evaluate the immunomodulatory properties of F. prausnitzii we analyzed the anti-inflammatory effects of F. prausnitzii in both in vitro (cellular models) and in vivo [2,4,6-trinitrobenzenesulphonic acid (TNBS)-induced] colitis in mice. In Caco-2 cells transfected with a reporter gene for NF-kappaB activity, F. prausnitzii had no effect on IL-1beta-induced NF-kappaB activity, whereas the supernatant abolished it. In vitro peripheral blood mononuclear cell stimulation by F. prausnitzii led to significantly lower IL-12 and IFN-gamma production levels and higher secretion of IL-10. Oral administration of either live F. prausnitzii or its supernatant markedly reduced the severity of TNBS colitis and tended to correct the dysbiosis associated with TNBS colitis, as demonstrated by real-time quantitative PCR (qPCR) analysis. F. prausnitzii exhibits anti-inflammatory effects on cellular and TNBS colitis models, partly due to secreted metabolites able to block NF-kappaB activation and IL-8 production. These results suggest that counterbalancing dysbiosis using F. prausnitzii as a probiotic is a promising strategy in CD treatment.

...read moreread less

3,653 citations

Journal Article•DOI•

Vaginal microbiome of reproductive-age women

[...]

Jacques Ravel¹, Pawel Gajer¹, Zaid Abdo², G. Maria Schneider², Sara S. K. Koenig¹, Stacey L. McCulle¹, Shara Karlebach³, Reshma Gorle¹, Jennifer Russell¹, Carol O. Tacket¹, Rebecca M. Brotman¹, Catherine C. Davis⁴, Kevin A. Ault³, Ligia Peralta¹, Larry J. Forney² - Show less +11 more•Institutions (4)

University of Maryland, Baltimore¹, University of Idaho², Emory University³, Procter & Gamble⁴

15 Mar 2011-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The inherent differences within and between women in different ethnic groups strongly argues for a more refined definition of the kinds of bacterial communities normally found in healthy women and the need to appreciate differences between individuals so they can be taken into account in risk assessment and disease diagnosis.

...read moreread less

Abstract: The means by which vaginal microbiomes help prevent urogenital diseases in women and maintain health are poorly understood. To gain insight into this, the vaginal bacterial communities of 396 asymptomatic North American women who represented four ethnic groups (white, black, Hispanic, and Asian) were sampled and the species composition characterized by pyrosequencing of barcoded 16S rRNA genes. The communities clustered into five groups: four were dominated by Lactobacillus iners, L. crispatus, L. gasseri, or L. jensenii, whereas the fifth had lower proportions of lactic acid bacteria and higher proportions of strictly anaerobic organisms, indicating that a potential key ecological function, the production of lactic acid, seems to be conserved in all communities. The proportions of each community group varied among the four ethnic groups, and these differences were statistically significant [χ(2)(10) = 36.8, P < 0.0001]. Moreover, the vaginal pH of women in different ethnic groups also differed and was higher in Hispanic (pH 5.0 ± 0.59) and black (pH 4.7 ± 1.04) women as compared with Asian (pH 4.4 ± 0.59) and white (pH 4.2 ± 0.3) women. Phylotypes with correlated relative abundances were found in all communities, and these patterns were associated with either high or low Nugent scores, which are used as a factor for the diagnosis of bacterial vaginosis. The inherent differences within and between women in different ethnic groups strongly argues for a more refined definition of the kinds of bacterial communities normally found in healthy women and the need to appreciate differences between individuals so they can be taken into account in risk assessment and disease diagnosis.

...read moreread less

2,848 citations