A core gut microbiome in obese and lean twins
TL;DR: The faecal microbial communities of adult female monozygotic and dizygotic twin pairs concordant for leanness or obesity, and their mothers are characterized to address how host genotype, environmental exposure and host adiposity influence the gut microbiome.
Abstract: The human distal gut harbours a vast ensemble of microbes (the microbiota) that provide important metabolic capabilities, including the ability to extract energy from otherwise indigestible dietary polysaccharides. Studies of a few unrelated, healthy adults have revealed substantial diversity in their gut communities, as measured by sequencing 16S rRNA genes, yet how this diversity relates to function and to the rest of the genes in the collective genomes of the microbiota (the gut microbiome) remains obscure. Studies of lean and obese mice suggest that the gut microbiota affects energy balance by influencing the efficiency of calorie harvest from the diet, and how this harvested energy is used and stored. Here we characterize the faecal microbial communities of adult female monozygotic and dizygotic twin pairs concordant for leanness or obesity, and their mothers, to address how host genotype, environmental exposure and host adiposity influence the gut microbiome. Analysis of 154 individuals yielded 9,920 near full-length and 1,937,461 partial bacterial 16S rRNA sequences, plus 2.14 gigabases from their microbiomes. The results reveal that the human gut microbiome is shared among family members, but that each person's gut microbial community varies in the specific bacterial lineages present, with a comparable degree of co-variation between adult monozygotic and dizygotic twin pairs. However, there was a wide array of shared microbial genes among sampled individuals, comprising an extensive, identifiable 'core microbiome' at the gene, rather than at the organismal lineage, level. Obesity is associated with phylum-level changes in the microbiota, reduced bacterial diversity and altered representation of bacterial genes and metabolic pathways. These results demonstrate that a diversity of organismal assemblages can nonetheless yield a core microbiome at a functional level, and that deviations from this core are associated with different physiological states (obese compared with lean).
Citations
More filters
••
TL;DR: An overview of the analysis pipeline and links to raw data and processed output from the runs with and without denoising are provided.
Abstract: Supplementary Figure 1 Overview of the analysis pipeline. Supplementary Table 1 Details of conventionally raised and conventionalized mouse samples. Supplementary Discussion Expanded discussion of QIIME analyses presented in the main text; Sequencing of 16S rRNA gene amplicons; QIIME analysis notes; Expanded Figure 1 legend; Links to raw data and processed output from the runs with and without denoising.
28,911 citations
Cites methods from "A core gut microbiome in obese and ..."
...8 million bacterial 16S rRNA sequences from previously published studies: it also includes reads from different regions of the 16S rRNA gene (variable region 2 (V2) versus variable region 6 (V6)).(1) A step-by-step guide to the QIIME analysis can be found in the ‘QIIME commands’ section of this document....
[...]
••
University of Michigan1, University of Massachusetts Amherst2, University of New Mexico3, University of British Columbia4, Texas A&M University5, University of Minnesota6, University of Warwick7, Dalhousie University8, Colorado School of Mines9, University of Ljubljana10, Graz University of Technology11, Louisiana State University12
TL;DR: M mothur is used as a case study to trim, screen, and align sequences; calculate distances; assign sequences to operational taxonomic units; and describe the α and β diversity of eight marine samples previously characterized by pyrosequencing of 16S rRNA gene fragments.
Abstract: mothur aims to be a comprehensive software package that allows users to use a single piece of software to analyze community sequence data. It builds upon previous tools to provide a flexible and powerful software package for analyzing sequencing data. As a case study, we used mothur to trim, screen, and align sequences; calculate distances; assign sequences to operational taxonomic units; and describe the alpha and beta diversity of eight marine samples previously characterized by pyrosequencing of 16S rRNA gene fragments. This analysis of more than 222,000 sequences was completed in less than 2 h with a laptop computer.
17,350 citations
••
TL;DR: The Illumina-based metagenomic sequencing, assembly and characterization of 3.3 million non-redundant microbial genes, derived from 576.7 gigabases of sequence, from faecal samples of 124 European individuals are described, indicating that the entire cohort harbours between 1,000 and 1,150 prevalent bacterial species and each individual at least 160 such species.
Abstract: To understand the impact of gut microbes on human health and well-being it is crucial to assess their genetic potential. Here we describe the Illumina-based metagenomic sequencing, assembly and characterization of 3.3 million non-redundant microbial genes, derived from 576.7 gigabases of sequence, from faecal samples of 124 European individuals. The gene set, ~150 times larger than the human gene complement, contains an overwhelming majority of the prevalent (more frequent) microbial genes of the cohort and probably includes a large proportion of the prevalent human intestinal microbial genes. The genes are largely shared among individuals of the cohort. Over 99% of the genes are bacterial, indicating that the entire cohort harbours between 1,000 and 1,150 prevalent bacterial species and each individual at least 160 such species, which are also largely shared. We define and describe the minimal gut metagenome and the minimal gut bacterial genome in terms of functions present in all individuals and most bacteria, respectively
9,268 citations
••
University of Washington1, Sapienza University of Rome2, Mekelle University3, University of Texas at San Antonio4, King Saud bin Abdulaziz University for Health Sciences5, Debre markos University6, Emory University7, University of Oxford8, University of Cartagena9, United Nations Population Fund10, University of Birmingham11, Stanford University12, Aga Khan University13, University of Melbourne14, National Taiwan University15, University of Cambridge16, University of California, San Diego17, Public Health Foundation of India18, Public Health England19, University of Peradeniya20, Harvard University21, National Institutes of Health22, Tehran University of Medical Sciences23, Auckland University of Technology24, University of Sheffield25, University of Western Australia26, Karolinska Institutet27, Birzeit University28, Brandeis University29, American Cancer Society30, Ochsner Medical Center31, Yonsei University32, University of Bristol33, Heidelberg University34, Vanderbilt University35, South African Medical Research Council36, Jordan University of Science and Technology37, New Generation University College38, Northeastern University39, Simmons College40, Norwegian Institute of Public Health41, Boston University42, Chinese Center for Disease Control and Prevention43, University of Bari44, University of São Paulo45, University of Otago46, University of Crete47, International Centre for Diarrhoeal Disease Research, Bangladesh48, Fred Hutchinson Cancer Research Center49, Teikyo University50, Bhabha Atomic Research Centre51, University of Tokyo52, Finnish Institute of Occupational Health53, Heriot-Watt University54, University of Alabama at Birmingham55, Griffith University56, National Center for Disease Control and Public Health57, University of California, Irvine58, Johns Hopkins University59, New York University60, University of Queensland61, Universidade Federal de Minas Gerais62, National Research University – Higher School of Economics63, University of Bergen64, Columbia University65, Shandong University66, University of North Carolina at Chapel Hill67, Fujita Health University68, Korea University69, Chongqing Medical University70, Zhejiang University71
TL;DR: The global, regional, and national prevalence of overweight and obesity in children and adults during 1980-2013 is estimated using a spatiotemporal Gaussian process regression model to estimate prevalence with 95% uncertainty intervals (UIs).
9,180 citations
••
TL;DR: A new method for metagenomic biomarker discovery is described and validates by way of class comparison, tests of biological consistency and effect size estimation to address the challenge of finding organisms, genes, or pathways that consistently explain the differences between two or more microbial communities.
Abstract: This study describes and validates a new method for metagenomic biomarker discovery by way of class comparison, tests of biological consistency and effect size estimation. This addresses the challenge of finding organisms, genes, or pathways that consistently explain the differences between two or more microbial communities, which is a central problem to the study of metagenomics. We extensively validate our method on several microbiomes and a convenient online interface for the method is provided at http://huttenhower.sph.harvard.edu/lefse/.
9,057 citations
Cites background from "A core gut microbiome in obese and ..."
...differences are detected even among the gut microbiomes of twins [29], and both human microbiomes and environmental communities are thought to be characterized by the presence of a long tail of rare organisms [30-32]....
[...]
References
More filters
••
TL;DR: Although primarily developed for protein interaction analysis, the resource has also been successfully applied to comparative genomics, phylogenetics and network studies, which are all facilitated by programmatic access to the database backend and the availability of compact download files.
Abstract: Information on protein–protein interactions is still mostly limited to a small number of model organisms, and originates from a wide variety of experimental and computational techniques The database and online resource STRING generalizes access to protein interaction data, by integrating known and predicted interactions from a variety of sources The underlying infrastructure includes a consistent body of completely sequenced genomes and exhaustive orthology classifications, based on which interaction evidence is transferred between organisms Although primarily developed for protein interaction analysis, the resource has also been successfully applied to comparative genomics, phylogenetics and network studies, which are all facilitated by programmatic access to the database backend and the availability of compact download files As of release 7, STRING has almost doubled to 373 distinct organisms, and contains more than 15 million proteins for which associations have been pre-computed Novel features include AJAX-based web-navigation, inclusion of additional resources such as BioGRID, and detailed protein domain annotation STRING is available at http:// stringemblde/
669 citations
••
TL;DR: FASTX and FASTY are used to scan the Mycoplasma genitalium, Haemophilus influenzae, and Methanococcus jannaschii genomes for unidentified or misidentified protein-coding genes and are found to be quite accurate, except when an out-of-frame translation produces a low-complexity protein sequence.
618 citations
••
TL;DR: The data indicate that factors related to the host genotype have an important effect on determining the bacterial composition in the GI tract.
Abstract: The gastrointestinal (GI) tract is one of the most complex ecosystems consisting of microbial and host cells. It is suggested that the host genotype, the physiology of the host and environmental fa...
594 citations
••
TL;DR: It is suggested that CNV analysis in phenotypically discordant monozygotic twins may provide a powerful tool for identifying disease-predisposition loci and that caution should be exercised when interpreting disease causality of de novo CNVs found in patients based on analysis of a single tissue in routine disease-related DNA diagnostics.
Abstract: The exploration of copy-number variation (CNV), notably of somatic cells, is an understudied aspect of genome biology. Any differences in the genetic makeup between twins derived from the same zygote represent an irrefutable example of somatic mosaicism. We studied 19 pairs of monozygotic twins with either concordant or discordant phenotype by using two platforms for genome-wide CNV analyses and showed that CNVs exist within pairs in both groups. These findings have an impact on our views of genotypic and phenotypic diversity in monozygotic twins and suggest that CNV analysis in phenotypically discordant monozygotic twins may provide a powerful tool for identifying disease-predisposition loci. Our results also imply that caution should be exercised when interpreting disease causality of de novo CNVs found in patients based on analysis of a single tissue in routine disease-related DNA diagnostics.
567 citations