scispace - formally typeset
Search or ask a question

Showing papers by "Rob Knight published in 2020"


Journal ArticleDOI
11 Mar 2020-Nature
TL;DR: Microbial nucleic acids are detected in samples of tissues and blood from more than 10,000 patients with cancer, and machine learning is used to show that these can be used to discriminate between and among different types of cancer, suggesting a new microbiome-based diagnostic approach.
Abstract: Systematic characterization of the cancer microbiome provides the opportunity to develop techniques that exploit non-human, microorganism-derived molecules in the diagnosis of a major human disease. Following recent demonstrations that some types of cancer show substantial microbial contributions1–10, we re-examined whole-genome and whole-transcriptome sequencing studies in The Cancer Genome Atlas11 (TCGA) of 33 types of cancer from treatment-naive patients (a total of 18,116 samples) for microbial reads, and found unique microbial signatures in tissue and blood within and between most major types of cancer. These TCGA blood signatures remained predictive when applied to patients with stage Ia–IIc cancer and cancers lacking any genomic alterations currently measured on two commercial-grade cell-free tumour DNA platforms, despite the use of very stringent decontamination analyses that discarded up to 92.3% of total sequence data. In addition, we could discriminate among samples from healthy, cancer-free individuals (n = 69) and those from patients with multiple types of cancer (prostate, lung, and melanoma; 100 samples in total) solely using plasma-derived, cell-free microbial nucleic acids. This potential microbiome-based oncology diagnostic tool warrants further exploration. Microbial nucleic acids are detected in samples of tissues and blood from more than 10,000 patients with cancer, and machine learning is used to show that these can be used to discriminate between and among different types of cancer, suggesting a new microbiome-based diagnostic approach.

524 citations


Journal ArticleDOI
04 Nov 2020-Nature
TL;DR: It is demonstrated that for numerous prevalent, high-burden human diseases, matching cases and controls for confounding variables reduces observed differences in the microbiota and the incidence of spurious associations.
Abstract: Low concordance between studies that examine the role of microbiota in human diseases is a pervasive challenge that limits the capacity to identify causal relationships between host-associated microorganisms and pathology. The risk of obtaining false positives is exacerbated by wide interindividual heterogeneity in microbiota composition1, probably due to population-wide differences in human lifestyle and physiological variables2 that exert differential effects on the microbiota. Here we infer the greatest, generalized sources of heterogeneity in human gut microbiota profiles and also identify human lifestyle and physiological characteristics that, if not evenly matched between cases and controls, confound microbiota analyses to produce spurious microbial associations with human diseases. We identify alcohol consumption frequency and bowel movement quality as unexpectedly strong sources of gut microbiota variance that differ in distribution between healthy participants and participants with a disease and that can confound study designs. We demonstrate that for numerous prevalent, high-burden human diseases, matching cases and controls for confounding variables reduces observed differences in the microbiota and the incidence of spurious associations. On this basis, we present a list of host variables that we recommend should be captured in human microbiota studies for the purpose of matching comparison groups, which we anticipate will increase robustness and reproducibility in resolving the members of the gut microbiota that are truly associated with human disease.

277 citations


Journal ArticleDOI
TL;DR: PhyloPhlAn 3.0 can assign genomes from isolate sequencing or MAGs to species-level genome bins built from >230,000 publically available sequences, and reconstructs strain-level phylogenies from among the closest species using clade-specific maximally informative markers.
Abstract: Microbial genomes are available at an ever-increasing pace, as cultivation and sequencing become cheaper and obtaining metagenome-assembled genomes (MAGs) becomes more effective. Phylogenetic placement methods to contextualize hundreds of thousands of genomes must thus be efficiently scalable and sensitive from closely related strains to divergent phyla. We present PhyloPhlAn 3.0, an accurate, rapid, and easy-to-use method for large-scale microbial genome characterization and phylogenetic analysis at multiple levels of resolution. PhyloPhlAn 3.0 can assign genomes from isolate sequencing or MAGs to species-level genome bins built from >230,000 publically available sequences. For individual clades of interest, it reconstructs strain-level phylogenies from among the closest species using clade-specific maximally informative markers. At the other extreme of resolution, it scales to large phylogenies comprising >17,000 microbial species. Examples including Staphylococcus aureus isolates, gut metagenomes, and meta-analyses demonstrate the ability of PhyloPhlAn 3.0 to support genomic and metagenomic analyses.

277 citations


Journal ArticleDOI
26 Feb 2020-Nature
TL;DR: It is found that the microbiota affects the chemistry of all organs, including amino acid conjugations of host bile acids that were used to produce phenylalanocholic acid, tyrosocholic acid and leucocholic Acid, which have not previously been characterized despite extensive research on bile-acid chemistry.
Abstract: A mosaic of cross-phylum chemical interactions occurs between all metazoans and their microbiomes. A number of molecular families that are known to be produced by the microbiome have a marked effect on the balance between health and disease1–9. Considering the diversity of the human microbiome (which numbers over 40,000 operational taxonomic units10), the effect of the microbiome on the chemistry of an entire animal remains underexplored. Here we use mass spectrometry informatics and data visualization approaches11–13 to provide an assessment of the effects of the microbiome on the chemistry of an entire mammal by comparing metabolomics data from germ-free and specific-pathogen-free mice. We found that the microbiota affects the chemistry of all organs. This included the amino acid conjugations of host bile acids that were used to produce phenylalanocholic acid, tyrosocholic acid and leucocholic acid, which have not previously been characterized despite extensive research on bile-acid chemistry14. These bile-acid conjugates were also found in humans, and were enriched in patients with inflammatory bowel disease or cystic fibrosis. These compounds agonized the farnesoid X receptor in vitro, and mice gavaged with the compounds showed reduced expression of bile-acid synthesis genes in vivo. Further studies are required to confirm whether these compounds have a physiological role in the host, and whether they contribute to gut diseases that are associated with microbiome dysbiosis. Metabolomics data from germ-free and specific-pathogen-free mice reveal effects of the microbiome on host chemistry, identifying conjugations of bile acids that are also enriched in patients with inflammatory bowel disease or cystic fibrosis.

251 citations


Journal ArticleDOI
11 Feb 2020-eLife
TL;DR: It is proposed that exposure to microbial amyloids in the gastrointestinal tract can accelerate αSyn aggregation and disease in the gut and the brain.
Abstract: Amyloids are a class of protein with unique self-aggregation properties, and their aberrant accumulation can lead to cellular dysfunctions associated with neurodegenerative diseases. While genetic and environmental factors can influence amyloid formation, molecular triggers and/or facilitators are not well defined. Growing evidence suggests that non-identical amyloid proteins may accelerate reciprocal amyloid aggregation in a prion-like fashion. While humans encode ~30 amyloidogenic proteins, the gut microbiome also produces functional amyloids. For example, curli are cell surface amyloid proteins abundantly expressed by certain gut bacteria. In mice overexpressing the human amyloid α-synuclein (αSyn), we reveal that colonization with curli-producing Escherichia coli promotes αSyn pathology in the gut and the brain. Curli expression is required for E. coli to exacerbate αSyn-induced behavioral deficits, including intestinal and motor impairments. Purified curli subunits accelerate αSyn aggregation in biochemical assays, while oral treatment of mice with a gut-restricted amyloid inhibitor prevents curli-mediated acceleration of pathology and behavioral abnormalities. We propose that exposure to microbial amyloids in the gastrointestinal tract can accelerate αSyn aggregation and disease in the gut and the brain.

212 citations


Posted ContentDOI
Alexander Kurilshikov1, Carolina Medina-Gomez2, Rodrigo Bacigalupe3, Djawad Radjabzadeh2, Jun Wang4, Ayse Demirkan5, Ayse Demirkan1, Caroline I. Le Roy6, Juan Antonio Raygoza Garay7, Juan Antonio Raygoza Garay8, Casey T. Finnicum9, Xingrong Liu10, Daria V. Zhernakova11, Daria V. Zhernakova1, Marc Jan Bonder1, Tue H. Hansen12, Fabian Frost13, Malte C. Rühlemann14, Williams Turpin8, Williams Turpin7, Jee-Young Moon15, Han-Na Kim16, Kreete Lüll17, Elad Barkan18, Shiraz A. Shah19, Myriam Fornage20, Joanna Szopinska-Tokov, Zachary D. Wallen21, Dmitrii Borisevich12, Lars Agréus10, Anna Andreasson22, Corinna Bang14, Larbi Bedrani8, Jordana T. Bell6, Hans Bisgaard19, Michael Boehnke23, Dorret I. Boomsma24, Robert D. Burk15, Annique Claringbould1, Kenneth Croitoru7, Kenneth Croitoru8, Gareth E. Davies24, Cornelia M. van Duijn25, Cornelia M. van Duijn2, Liesbeth Duijts2, Gwen Falony3, Jingyuan Fu1, Adriaan van der Graaf1, Torben Hansen12, Georg Homuth13, David A. Hughes26, Richard G. IJzerman27, Matthew A. Jackson25, Matthew A. Jackson6, Vincent W. V. Jaddoe2, Marie Joossens3, Torben Jørgensen12, Daniel Keszthelyi28, Rob Knight29, Markku Laakso30, Matthias Laudes, Lenore J. Launer31, Wolfgang Lieb14, Aldons J. Lusis32, Ad A.M. Masclee28, Henriette A. Moll2, Zlatan Mujagic28, Qi Qibin15, Daphna Rothschild18, Hocheol Shin16, Søren J. Sørensen12, Claire J. Steves6, Jonathan Thorsen19, Nicholas J. Timpson26, Raul Y. Tito3, Sara Vieira-Silva3, Uwe Völker13, Henry Völzke13, Urmo Võsa1, Kaitlin H Wade26, Susanna Walter33, Kyoko Watanabe24, Stefan Weiss13, Frank Ulrich Weiss13, Omer Weissbrod34, Harm-Jan Westra1, Gonneke Willemsen24, Haydeh Payami21, Daisy Jonkers28, Alejandro Arias Vasquez35, Eco J. C. de Geus24, Katie A. Meyer36, Jakob Stokholm19, Eran Segal18, Elin Org17, Cisca Wijmenga1, Hyung Lae Kim37, Robert C. Kaplan38, Tim D. Spector6, André G. Uitterlinden2, Fernando Rivadeneira2, Andre Franke14, Markus M. Lerch13, Lude Franke1, Serena Sanna1, Serena Sanna39, Mauro D'Amato, Oluf Pedersen12, Andrew D. Paterson8, Robert Kraaij2, Jeroen Raes3, Alexandra Zhernakova1 
16 Dec 2020-bioRxiv
TL;DR: A phenome-wide association study and Mendelian randomization identified enrichment of microbiome trait loci in the metabolic, nutrition and environment domains and suggested the microbiome has causal effects in ulcerative colitis and rheumatoid arthritis.
Abstract: To study the effect of host genetics on gut microbiome composition, the MiBioGen consortium curated and analyzed genome-wide genotypes and 16S fecal microbiome data from 18,340 individuals (24 cohorts). Microbial composition showed high variability across cohorts: only 9 out of 410 genera were detected in more than 95% samples. A genome-wide association study (GWAS) of host genetic variation in relation to microbial taxa identified 31 loci affecting microbiome at a genome-wide significant (P

210 citations


Journal ArticleDOI
TL;DR: This investigation highlights the interconnection patterns across microbiomes in various environments and emphasizes the importance of understanding co-occurrence feature of microbiomes from a network perspective.
Abstract: Microbial interactions shape the structure and function of microbial communities; microbial co-occurrence networks in specific environments have been widely developed to explore these complex systems, but their interconnection pattern across microbiomes in various environments at the global scale remains unexplored. Here, we have inferred an Earth microbial co-occurrence network from a communal catalog with 23,595 samples and 12,646 exact sequence variants from 14 environments in the Earth Microbiome Project dataset. This non-random scale-free Earth microbial co-occurrence network consisted of 8 taxonomy distinct modules linked with different environments, which featured environment specific microbial co-occurrence relationships. Different topological features of subnetworks inferred from datasets trimmed into uniform size indicate distinct co-occurrence patterns in the microbiomes of various environments. The high number of specialist edges highlights that environmental specific co-occurrence relationships are essential features across microbiomes. The microbiomes of various environments were clustered into two groups, which were mainly bridged by the microbiomes of plant and animal surface. Acidobacteria Gp2 and Nisaea were identified as hubs in most of subnetworks. Negative edges proportions ranged from 1.9% in the soil subnetwork to 48.9% the non-saline surface subnetwork, suggesting various environments experience distinct intensities of competition or niche differentiation. This investigation highlights the interconnection patterns across microbiomes in various environments and emphasizes the importance of understanding co-occurrence feature of microbiomes from a network perspective.

185 citations


Journal ArticleDOI
07 Jan 2020-Mbio
TL;DR: Assessment of gut microbiomes from ∼900 vertebrate species, including 315 mammals and 491 birds, concludes that host-gut microbiome phylosymbiosis depends on factors convergently absent in birds and bats, potentially associated with physiological adaptations to flight.
Abstract: Diet and host phylogeny drive the taxonomic and functional contents of the gut microbiome in mammals, yet it is unknown whether these patterns hold across all vertebrate lineages. Here, we assessed gut microbiomes from ∼900 vertebrate species, including 315 mammals and 491 birds, assessing contributions of diet, phylogeny, and physiology to structuring gut microbiomes. In most nonflying mammals, strong correlations exist between microbial community similarity, host diet, and host phylogenetic distance up to the host order level. In birds, by contrast, gut microbiomes are only very weakly correlated to diet or host phylogeny. Furthermore, while most microbes resident in mammalian guts are present in only a restricted taxonomic range of hosts, most microbes recovered from birds show little evidence of host specificity. Notably, among the mammals, bats host especially bird-like gut microbiomes, with little evidence for correlation to host diet or phylogeny. This suggests that host-gut microbiome phylosymbiosis depends on factors convergently absent in birds and bats, potentially associated with physiological adaptations to flight. Our findings expose major variations in the behavior of these important symbioses in endothermic vertebrates and may signal fundamental evolutionary shifts in the cost/benefit framework of the gut microbiome.IMPORTANCE In this comprehensive survey of microbiomes of >900 species, including 315 mammals and 491 birds, we find a striking convergence of the microbiomes of birds and animals that fly. In nonflying mammals, diet and short-term evolutionary relatedness drive the microbiome, and many microbial species are specific to a particular kind of mammal, but flying mammals and birds break this pattern with many microbes shared across different species, with little correlation either with diet or with relatedness of the hosts. This finding suggests that adaptation to flight breaks long-held relationships between hosts and their microbes.

176 citations


Journal ArticleDOI
29 Oct 2020-Cell
TL;DR: A subset of mucosal-associated gut bacteria that consistently translocated and remained viable in CrF in CD ileal surgical resections is discovered, and Clostridium innocuum is identified as a signature of this consortium with strain variation between mucosal and adipose isolates, suggesting preference for lipid-rich environments.

162 citations


Journal ArticleDOI
TL;DR: This tutorial analyzes a subset of the Early Childhood Antibiotics and the Microbiome study, which tracked the microbiome composition and development of 43 infants in the United States from birth to 2 years of age, identifying microbiome associations with antibiotic exposure, delivery mode, and diet.
Abstract: QIIME 2 is a completely re-engineered microbiome bioinformatics platform based on the popular QIIME platform, which it has replaced. QIIME 2 facilitates comprehensive and fully reproducible microbiome data science, improving accessibility to diverse users by adding multiple user interfaces. QIIME 2 can be combined with Qiita, an open-source web-based platform, to re-use available data for meta-analysis. The following basic protocol describes how to install QIIME 2 on a single computer and analyze microbiome sequence data, from processing of raw DNA sequence reads through generating publishable interactive figures. These interactive figures allow readers of a study to interact with data with the same ease as its authors, advancing microbiome science transparency and reproducibility. We also show how plug-ins developed by the community to add analysis capabilities can be installed and used with QIIME 2, enhancing various aspects of microbiome analyses-e.g., improving taxonomic classification accuracy. Finally, we illustrate how users can perform meta-analyses combining different datasets using readily available public data through Qiita. In this tutorial, we analyze a subset of the Early Childhood Antibiotics and the Microbiome (ECAM) study, which tracked the microbiome composition and development of 43 infants in the United States from birth to 2 years of age, identifying microbiome associations with antibiotic exposure, delivery mode, and diet. For more information about QIIME 2, see https://qiime2.org. To troubleshoot or ask questions about QIIME 2 and microbiome analysis, join the active community at https://forum.qiime2.org. © 2020 The Authors. Basic Protocol: Using QIIME 2 with microbiome data Support Protocol: Further microbiome analyses.

146 citations


Journal ArticleDOI
TL;DR: Differences in taxonomic composition and functional potential varied across studies, but Akkermansia was most consistently reported to be relatively more abundant with aging, whereas Faecalibacterium, Bacteroidaceae, and Lachnospiraceae were relatively reduced.
Abstract: Aging is determined by complex interactions among genetic and environmental factors. Increasing evidence suggests that the gut microbiome lies at the core of many age-associated changes, including immune system dysregulation and susceptibility to diseases. The gut microbiota undergoes extensive changes across the lifespan, and age-related processes may influence the gut microbiota and its related metabolic alterations. The aim of this systematic review was to summarize the current literature on aging-associated alterations in diversity, composition, and functional features of the gut microbiota. We identified 27 empirical human studies of normal and successful aging suitable for inclusion. Alpha diversity of microbial taxa, functional pathways, and metabolites was higher in older adults, particularly among the oldest-old adults, compared to younger individuals. Beta diversity distances significantly differed across various developmental stages and were different even between oldest-old and younger-old adults. Differences in taxonomic composition and functional potential varied across studies, but Akkermansia was most consistently reported to be relatively more abundant with aging, whereas Faecalibacterium, Bacteroidaceae, and Lachnospiraceae were relatively reduced. Older adults have reduced pathways related to carbohydrate metabolism and amino acid synthesis; however, oldest-old adults exhibited functional differences that distinguished their microbiota from that of young-old adults, such as greater potential for short-chain fatty acid production and increased butyrate derivatives. Although a definitive interpretation is limited by the cross-sectional design of published reports, we integrated findings of microbial composition and downstream functional pathways and metabolites, offering possible explanations regarding age-related processes.


Journal ArticleDOI
01 Oct 2020
TL;DR: It is shown that fructose promotes gut-barrier deterioration and subsequent endotoxaemia that in turn induces hepatic lipogenesis by activation TLR signalling in liver macrophages, which is linked to the development of hepatic steatosis.
Abstract: Benign hepatosteatosis, affected by lipid uptake, de novo lipogenesis and fatty acid (FA) oxidation, progresses to non-alcoholic steatohepatitis (NASH) on stress and inflammation. A key macronutrient proposed to increase hepatosteatosis and NASH risk is fructose. Excessive intake of fructose causes intestinal-barrier deterioration and endotoxaemia. However, how fructose triggers these alterations and their roles in hepatosteatosis and NASH pathogenesis remain unknown. Here we show, using mice, that microbiota-derived Toll-like receptor (TLR) agonists promote hepatosteatosis without affecting fructose-1-phosphate (F1P) and cytosolic acetyl-CoA. Activation of mucosal-regenerative gp130 signalling, administration of the YAP-induced matricellular protein CCN1 or expression of the antimicrobial peptide Reg3b (beta) peptide counteract fructose-induced barrier deterioration, which depends on endoplasmic-reticulum stress and subsequent endotoxaemia. Endotoxin engages TLR4 to trigger TNF production by liver macrophages, thereby inducing lipogenic enzymes that convert F1P and acetyl-CoA to FA in both mouse and human hepatocytes.

Journal ArticleDOI
TL;DR: The metabolic benefits of gastric bypass surgery and diet were similar and were apparently related to weight loss itself, with no evident clinically important effects independent of weight loss.
Abstract: Background Some studies have suggested that in people with type 2 diabetes, Roux-en-Y gastric bypass has therapeutic effects on metabolic function that are independent of weight loss. Meth...

Journal ArticleDOI
TL;DR: A webenabled mass spectrometry (MS) search engine, named Mass Spectrometry Search Tool (MASST), is introduced by enabling searches of all small-molecule tandem MS (MS/MS) data in public metabolomics repositories, it is proposed that MASST will unlock these resources for clinical, environmental and natural product applications.
Abstract: To the Editor — We introduce a webenabled mass spectrometry (MS) search engine, named Mass Spectrometry Search Tool (MASST; https://masst.ucsd.edu). By enabling searches of all small-molecule tandem MS (MS/MS) data in public metabolomics repositories, we posit that MASST will unlock these resources for clinical, environmental and natural product applications. Introduced in 1990, a tool for discovering related protein or gene sequences named Basic Local Alignment Search Tool (BLAST) enabled researchers to query entire public sequence data repositories through a web interface (WebBLAST; https://blast.ncbi.nlm.nih.gov/Blast.cgi)1. WebBLAST is one of the most widely cited and used bioinformatics tools because it permits any researcher to answer simple questions, such as ‘is a protein or DNA sequence common or rare?’. In the early days of public gene and protein databases, metadata, which include descriptions of sample, population or technical details, were limited. No deposition standards existed, except for the Short Read Archive and European Nucleotide Archive, which include experimental details for sequencing, instrumental details and sample description, such as the source of a sample. The current status of much MS data in the public domain is reminiscent of the DNA databanks of the 1990s. To increase usage and unlock the potential of openly available MS resources, we set out to build an infrastructure to enable WebBLAST for MS. Algorithms developed for MS data, including molecular networking2 and fragmentation trees3, enable similarity searches against reference libraries of known molecules, whereas powerful metabolomics analysis software infrastructures, such as MS-DIAL4, MetaboAnalyst5, XCMS Online6 and HMDB7, focus on annotation of MS/MS spectra, or finding statistical relationships between molecular features. However, none of the existing tools enable searching a single MS/MS spectrum for identical or analogous MS/MS spectra against public data in repositories, including unknown molecules. Finding specific MS/MS spectra of interest, including unannotated spectra or structural analogs, in public repositories of metabolomics MS data and natural product MS data, is not possible. Deposition of untargeted MS data in the public domain is experiencing rapid growth. In March 2017, 910 metabolomics datasets were available8; by January 2019, there were >2,000 downloadable metabolomics datasets (about half of these datasets contain MS/MS data)9. Despite the availability of metabolomics and natural product data, including environmental and clinical MS datasets, public small-molecule MS data are hardly reused10. Now that there is a huge amount of small-molecule untargeted MS datasets publicly available (~1,100 untargeted datasets and ~110,000,000 spectra in ~150,000 files as of December 11, 2018), we felt that the time was right to develop MASST, to enable reuse of these MS data. MASST comprises a web-based system to search the public data repository part of the GNPS/MassIVE knowledge base11 and an analysis infrastructure for a single MS/ MS spectrum. The developments required for MASST searches included converting deposited public data to a uniform open format12 (irrespective of instrument type and original data format), the ability to trace the file from which each MS/MS spectrum originated, and a reporting system that shows all identical or similar MS/MS spectra found in public data along with their associated metadata. MASST development has been possible for two main reasons: first, adoption of universal, non-vendor-specific MS data formats has increased, which means that multiple publicly available datasets have been converted to the same data format13, and second, the recently developed ability to connect all public data in GNPS/MassIVE and connect each MS/MS spectrum to its metadata entries had not been developed yet. A MASST report also includes matches to any reference spectra in public MS/ MS spectral libraries, if the matches are within the user-specified search parameters. Libraries include GNPS usercontributed spectra11, GNPS libraries11, all three MassBanks14 (https://massbank.eu/ MassBank/, https://mona.fiehnlab.ucdavis. edu/), ReSpect15, MIADB/Beniddir16, Sumner/Bruker, CASMI17, PNNL lipids18, Sirenas/Gates, EMBL MCF and several other libraries, listed at https://gnps.ucsd.edu/ ProteoSAFe/libraries.jsp. Visualization of the MASST matches uses a mirror view (Fig. 1). MASST can search against various repositories, including GNPS/MassIVE11, Metabolomics Workbench19, MetaboLights20 or the non-redundant (nr) MS/MS library of all unique MS/MS spectra from all three repositories combined. MASST searching using multiple repositories was enabled by converting data uploaded to the Metabolomics Workbench and MetaboLights repositories to the same open MS format in the GNPS/MassIVE data storage environment. Instructions on how to upload to GNPS/MassIVE can be found at https://ccms-ucsd.github.io/ GNPSDocumentation/datasets/. All public data in GNPS/MassIVE becomes MASST-searchable. MASST searches output results according to userdefined search parameters. The report returns the origin of the matched MS/ MS spectrum with respect to the dataset and file information and any metadata associated with the file (Fig. 1). Datasets and files can be tagged with sample or spectral information by the community of MASST users, and this information then becomes part of the metadata reported back in future MASST searches. We also curated ~34,000 additional MS files with ~340,000 tags, mostly from human-associated samples, but also from microbes, food and indoor and outdoor environments, to provide a good foundation for MASST searches. Metadata can be associated with MS/ MS spectra in the GNPS/MassIVE upload portal at the dataset level, file level or single annotated spectrum level. Examples of metadata include instrument type, phylogeny (according to the National Center for Biotechnology Information (NCBI) taxonomy) and keywords at the dataset level; phylogeny, sample type, age, sex, body site (defined using the Uberon anatomy ontology21) and disease22 at the file level; and source, biological activity and structural class information at the single annotated spectrum level. In addition, GNPS/MassIVE is compatible with metadata formats from other software tools (e.g., QIIME2 and Qiita), which are used to analyze microbiome data and have a controlled vocabulary that can be imported23,24. Any sample information uploaded to GNPS/MassIVE from another repository (e.g., from MetaboLights and Metabolomics workbench) is also included in the MASST report. At present, there is only limited metadata at the dataset and file level, but the metadata in the public domain can provide insights into the types of MS/MS signals being analyzed (Box 1 contains examples of usage). Although the amount and quality of metadata is increasing25, datasets do not always have detailed metadata. To allay this problem, re-annotation of metadata as knowledge increases, while retaining provenance of all changes, is possible in


Posted ContentDOI
13 Sep 2020-medRxiv
TL;DR: A combination of genetics and dietary habits was shown to strongly shape the abundances of certain key bacterial members of the gut microbiota, and explain their genetic association, and this work identifies putative causal relationships between gut microbes and complex diseases using MR.
Abstract: Co-evolution between humans and the microbial communities colonizing them has resulted in an intimate assembly of thousands of microbial species mutualistically living on and in their body and impacting multiple aspects of host physiology and health. Several studies examining whether human genetic variation can affect gut microbiota suggest a complex combination of environmental and host factors. Here, we leverage a single large-scale population-based cohort of 5,959 genotyped individuals with matched gut microbial shotgun metagenomes, dietary information and health records up to 16 years post-sampling, to characterize human genetic variations associated with microbial abundances, and predict possible causal links with various diseases using Mendelian randomization (MR). Genome-wide association study (GWAS) identified 583 independent SNP-taxon associations at genome-wide significance (p

Journal ArticleDOI
TL;DR: The results suggest that the signal quality, ease of set-up and portability of the dry electrode EEG headset used in this study comply with the needs of clinical applications.
Abstract: Dry electrode electroencephalogram (EEG) recording combined with wireless data transmission offers an alternative tool to conventional wet electrode EEG systems. However, the question remains whether the signal quality of dry electrode recordings is comparable to wet electrode recordings in the clinical context. We recorded the resting state EEG (rsEEG), the visual evoked potentials (VEP) and the visual P300 (P3) from 16 healthy subjects (age range: 26–79 years) and 16 neurological patients who reported subjective memory impairment (age range: 50–83 years). Each subject took part in two recordings on different days, one with 19 dry electrodes and another with 19 wet electrodes. They reported their preferred EEG system. Comparisons of the rsEEG recordings were conducted qualitatively by independent visual evaluation by two neurologists blinded to the EEG system used and quantitatively by spectral analysis of the rsEEG. The P100 visual evoked potential (VEP) and P3 event-related potential (ERP) were compared in terms of latency, amplitude and pre-stimulus noise. The majority of subjects preferred the dry electrode headset. Both neurologists reported that all rsEEG traces were comparable between the wet and dry electrode headsets. Absolute Alpha and Beta power during rest did not statistically differ between the two EEG systems (p > 0.05 in all cases). However, Theta and Delta power was slightly higher with the dry electrodes (p = 0.0004 for Theta and p 0.10 in all cases) with a similar spatial distribution for both wet and dry electrode systems. These results suggest that the signal quality, ease of set-up and portability of the dry electrode EEG headset used in our study comply with the needs of clinical applications.

Journal ArticleDOI
01 Jun 2020
TL;DR: Qurro as mentioned in this paper is a tool that allows users to select features from along the rank plot to compute a log-ratio; this action updates both the rank plots and the sample plots through displaying the current logratios of samples.
Abstract: Many tools for dealing with compositional ' 'omics' data produce feature-wise values that can be ranked in order to describe features' associations with some sort of variation. These values include differentials (which describe features' associations with specified covariates) and feature loadings (which describe features' associations with variation along a given axis in a biplot). Although prior work has discussed the use of these 'rankings' as a starting point for exploring the log-ratios of particularly high- or low-ranked features, such exploratory analyses have previously been done using custom code to visualize feature rankings and the log-ratios of interest. This approach is laborious, prone to errors and raises questions about reproducibility. To address these problems we introduce Qurro, a tool that interactively visualizes a plot of feature rankings (a 'rank plot') alongside a plot of selected features' log-ratios within samples (a 'sample plot'). Qurro's interface includes various controls that allow users to select features from along the rank plot to compute a log-ratio; this action updates both the rank plot (through highlighting selected features) and the sample plot (through displaying the current log-ratios of samples). Here, we demonstrate how this unique interface helps users explore feature rankings and log-ratios simply and effectively.

Journal ArticleDOI
17 Mar 2020
TL;DR: A combination of omics-based analyses and multi-omic approaches found that fermented food consumers have subtle differences in their gut microbiota structure, which is enriched in conjugated linoleic acid, thought to be beneficial.
Abstract: Lifestyle factors, such as diet, strongly influence the structure, diversity, and composition of the microbiome. While we have witnessed over the last several years a resurgence of interest in fermented foods, no study has specifically explored the effects of their consumption on gut microbiota in large cohorts. To assess whether the consumption of fermented foods is associated with a systematic signal in the gut microbiome and metabolome, we used a multi-omic approach (16S rRNA amplicon sequencing, metagenomic sequencing, and untargeted mass spectrometry) to analyze stool samples from 6,811 individuals from the American Gut Project, including 115 individuals specifically recruited for their frequency of fermented food consumption for a targeted 4-week longitudinal study. We observed subtle but statistically significant differences between consumers and nonconsumers in beta diversity as well as differential taxa between the two groups. We found that the metabolome of fermented food consumers was enriched with conjugated linoleic acid (CLA), a putatively health-promoting molecule. Cross-omic analyses between metagenomic sequencing and mass spectrometry suggest that CLA may be driven by taxa associated with fermented food consumers. Collectively, we found modest yet persistent signatures associated with fermented food consumption that appear present in multiple -omic types which motivate further investigation of how different types of fermented food impact the gut microbiome and overall health.IMPORTANCE Public interest in the effects of fermented food on the human gut microbiome is high, but limited studies have explored the association between fermented food consumption and the gut microbiome in large cohorts. Here, we used a combination of omics-based analyses to study the relationship between the microbiome and fermented food consumption in thousands of people using both cross-sectional and longitudinal data. We found that fermented food consumers have subtle differences in their gut microbiota structure, which is enriched in conjugated linoleic acid, thought to be beneficial. The results suggest that further studies of specific kinds of fermented food and their impacts on the microbiome and health will be useful.

Journal ArticleDOI
TL;DR: This study provides the first evidence of significant associations between exposure to air pollutants and the compositional and functional profile of the human gut microbiome using whole-genome sequencing.

Journal ArticleDOI
TL;DR: Citizen-scientists used a crowdsourcing model to obtain oral bacterial composition data from guests at the Denver Museum of Nature & Science to determine if previously suspected oral microbiome associations with an individual’s demographics, lifestyle, and/or genetics are robust and generalizable enough to be detected within a general population.
Abstract: Oral microbiome dysbiosis has been associated with various local and systemic human diseases such as dental caries, periodontal disease, obesity, and cardiovascular disease. Bacterial composition may be affected by age, oral health, diet, and geography, although information about the natural variation found in the general public is still lacking. In this study, citizen-scientists used a crowdsourcing model to obtain oral bacterial composition data from guests at the Denver Museum of Nature & Science to determine if previously suspected oral microbiome associations with an individual's demographics, lifestyle, and/or genetics are robust and generalizable enough to be detected within a general population. Consistent with past research, we found bacterial composition to be more diverse in youth microbiomes when compared to adults. Adult oral microbiomes were predominantly impacted by oral health habits, while youth microbiomes were impacted by biological sex and weight status. The oral pathogen Treponema was detected more commonly in adults without recent dentist visits and in obese youth. Additionally, oral microbiomes from participants of the same family were more similar to each other than to oral microbiomes from non-related individuals. These results suggest that previously reported oral microbiome associations are observable in a human population containing the natural variation commonly found in the general public. Furthermore, these results support the use of crowdsourced data as a valid methodology to obtain community-based microbiome data.

Journal ArticleDOI
11 Feb 2020
TL;DR: The skin microbiome provides the best prediction of age, suggesting a model in which physiological aging occurs concomitantly with the loss of key taxa over a lifetime, enabling potential microbiome-targeted therapeutic strategies to prevent aging.
Abstract: Human gut microbiomes are known to change with age, yet the relative value of human microbiomes across the body as predictors of age, and prediction robustness across populations is unknown. In this study, we tested the ability of the oral, gut, and skin (hand and forehead) microbiomes to predict age in adults using random forest regression on data combined from multiple publicly available studies, evaluating the models in each cohort individually. Intriguingly, the skin microbiome provides the best prediction of age (mean ± standard deviation, 3.8 ± 0.45 years, versus 4.5 ± 0.14 years for the oral microbiome and 11.5 ± 0.12 years for the gut microbiome). This also agrees with forensic studies showing that the skin microbiome predicts postmortem interval better than microbiomes from other body sites. Age prediction models constructed from the hand microbiome generalized to the forehead and vice versa, across cohorts, and results from the gut microbiome generalized across multiple cohorts (United States, United Kingdom, and China). Interestingly, taxa enriched in young individuals (18 to 30 years) tend to be more abundant and more prevalent than taxa enriched in elderly individuals (>60 yrs), suggesting a model in which physiological aging occurs concomitantly with the loss of key taxa over a lifetime, enabling potential microbiome-targeted therapeutic strategies to prevent aging.IMPORTANCE Considerable evidence suggests that the gut microbiome changes with age or even accelerates aging in adults. Whether the age-related changes in the gut microbiome are more or less prominent than those for other body sites and whether predictions can be made about a person's age from a microbiome sample remain unknown. We therefore combined several large studies from different countries to determine which body site's microbiome could most accurately predict age. We found that the skin was the best, on average yielding predictions within 4 years of chronological age. This study sets the stage for future research on the role of the microbiome in accelerating or decelerating the aging process and in the susceptibility for age-related diseases.

Journal ArticleDOI
TL;DR: Men with higher levels of 1,25( OH)2D and higher activation ratios, but not 25(OH)D itself, are more likely to possess butyrate producing bacteria that are associated with better gut microbial health.
Abstract: The vitamin D receptor is highly expressed in the gastrointestinal tract where it transacts gene expression. With current limited understanding of the interactions between the gut microbiome and vitamin D, we conduct a cross-sectional analysis of 567 older men quantifying serum vitamin D metabolites using LC-MSMS and defining stool sub-Operational Taxonomic Units from16S ribosomal RNA gene sequencing data. Faith’s Phylogenetic Diversity and non-redundant covariate analyses reveal that the serum 1,25(OH)2D level explains 5% of variance in α-diversity. In β-diversity analyses using unweighted UniFrac, 1,25(OH)2D is the strongest factor assessed, explaining 2% of variance. Random forest analyses identify 12 taxa, 11 in the phylum Firmicutes, eight of which are positively associated with either 1,25(OH)2D and/or the hormone-to-prohormone [1,25(OH)2D/25(OH)D] “activation ratio.” Men with higher levels of 1,25(OH)2D and higher activation ratios, but not 25(OH)D itself, are more likely to possess butyrate producing bacteria that are associated with better gut microbial health. Here, the authors investigate associations of vitamin D metabolites with gut microbiome in a cross-sectional analysis of 567 elderly men enrolled in the Osteoporotic Fractures in Men (MrOS) Study and find larger alpha-diversity correlates with high 1,25(OH)2D and high 24,25(OH)2D and higher ratios of activation and catabolism.

Journal ArticleDOI
TL;DR: The authors use metabolomics and sequencing to assess changes in chemicals and microbial communities, including fungi and microeukaryotes, across an urbanization gradient in South America, and indicate that urbanization has large-scale effects on chemical and microbial exposures and on the human microbiota.
Abstract: Urbanization represents a profound shift in human behaviour, and has considerable cultural and health-associated consequences1,2. Here, we investigate chemical and microbial characteristics of houses and their human occupants across an urbanization gradient in the Amazon rainforest, from a remote Peruvian Amerindian village to the Brazilian city of Manaus. Urbanization was found to be associated with reduced microbial outdoor exposure, increased contact with housing materials, antimicrobials and cleaning products, and increased exposure to chemical diversity. The degree of urbanization correlated with changes in the composition of house bacterial and microeukaryotic communities, increased house and skin fungal diversity, and an increase in the relative abundance of human skin-associated fungi and bacteria in houses. Overall, our results indicate that urbanization has large-scale effects on chemical and microbial exposures and on the human microbiota.

Journal ArticleDOI
TL;DR: Those who typically experience frequent gastrointestinal symptoms reported significantly less bowel discomfort or diarrhea, significantly less gas or bloating, more regular bowel movements, and better stool consistency when regularly consuming C. reinhardtii.

Journal ArticleDOI
Alan K. Jarmusch1, Alan K. Jarmusch2, Mingxun Wang2, Mingxun Wang1, Christine M. Aceves2, Christine M. Aceves1, Rohit S. Advani1, Rohit S. Advani2, Shaden Aguirre1, Shaden Aguirre2, Alexander A. Aksenov2, Alexander A. Aksenov1, Gajender Aleti2, Allegra T. Aron2, Allegra T. Aron1, Anelize Bauermeister2, Anelize Bauermeister3, Sanjana Bolleddu1, Sanjana Bolleddu2, Amina Bouslimani1, Amina Bouslimani2, Andrés M. C. Rodríguez1, Andrés M. C. Rodríguez2, Rama Chaar2, Rama Chaar1, Roxana Coras2, Emmanuel O. Elijah2, Emmanuel O. Elijah1, Madeleine Ernst1, Madeleine Ernst2, Madeleine Ernst4, Julia M. Gauglitz1, Julia M. Gauglitz2, Emily C. Gentry1, Emily C. Gentry2, Makhai Husband1, Makhai Husband2, Scott A. Jarmusch5, Kenneth L. Jones1, Kenneth L. Jones2, Zdenek Kamenik6, Audrey Le Gouellec7, Aileen Lu2, Aileen Lu1, Laura-Isobel McCall8, Kerry L. McPhail9, Michael J. Meehan1, Michael J. Meehan2, Alexey V. Melnik1, Alexey V. Melnik2, Riya Christina Menezes10, Yessica Alejandra Montoya Giraldo11, Ngoc Hung Nguyen1, Ngoc Hung Nguyen2, Louis-Félix Nothias1, Louis-Félix Nothias2, Mélissa Nothias-Esposito2, Mélissa Nothias-Esposito1, Morgan Panitchpakdi1, Morgan Panitchpakdi2, Daniel Petras1, Daniel Petras2, Robert A. Quinn12, Nicole Sikora2, Nicole Sikora1, Justin J. J. van der Hooft13, Justin J. J. van der Hooft2, Fernando Vargas2, Fernando Vargas1, Alison Vrbanac2, Kelly C. Weldon2, Kelly C. Weldon1, Rob Knight, Nuno Bandeira1, Nuno Bandeira2, Pieter C. Dorrestein 
TL;DR: Repository-scale reanalysis of public mass spectrometry-based metabolomics data is facilitated by the Reanalysis of Data User (ReDU) interface, a system that uses consistent formatting and controlled vocabularies for metadata capture.
Abstract: We present ReDU ( https://redu.ucsd.edu/ ), a system for metadata capture of public mass spectrometry-based metabolomics data, with validated controlled vocabularies. Systematic capture of knowledge enables the reanalysis of public data and/or co-analysis of one’s own data. ReDU enables multiple types of analyses, including finding chemicals and associated metadata, comparing the shared and different chemicals between groups of samples, and metadata-filtered, repository-scale molecular networking. Repository-scale reanalysis of public mass spectrometry-based metabolomics data is facilitated by the Reanalysis of Data User (ReDU) interface, a system that uses consistent formatting and controlled vocabularies for metadata capture.

Journal ArticleDOI
TL;DR: The associations between overall gut taxonomic composition and BP are weak, individuals with hypertension demonstrate changes in several genera, highlighting the need for experimental studies.
Abstract: Background Several small-scale animal studies have suggested that gut microbiota and blood pressure (BP) are linked. However, results from human studies remain scarce and conflicting. We wanted to elucidate the multivariable-adjusted association between gut metagenome and BP in a large, representative, well-phenotyped population sample. We performed a focused analysis to examine the previously reported inverse associations between sodium intake and Lactobacillus abundance and between Lactobacillus abundance and BP. Methods and Results We studied a population sample of 6953 Finns aged 25 to 74 years (mean age, 49.2±12.9 years; 54.9% women). The participants underwent a health examination, which included BP measurement, stool collection, and 24-hour urine sampling (N=829). Gut microbiota was analyzed using shallow shotgun metagenome sequencing. In age- and sex-adjusted models, the α (within-sample) and β (between-sample) diversities of taxonomic composition were strongly related to BP indexes (P<0.001 for most). In multivariable-adjusted models, β diversity was only associated with diastolic BP (P=0.032). However, we observed significant, mainly positive, associations between BP indexes and 45 microbial genera (P<0.05), of which 27 belong to the phylum Firmicutes. Interestingly, we found mostly negative associations between 19 distinct Lactobacillus species and BP indexes (P<0.05). Of these, greater abundance of the known probiotic Lactobacillus paracasei was associated with lower mean arterial pressure and lower dietary sodium intake (P<0.001 for both). Conclusions Although the associations between overall gut taxonomic composition and BP are weak, individuals with hypertension demonstrate changes in several genera. We demonstrate strong negative associations of certain Lactobacillus species with sodium intake and BP, highlighting the need for experimental studies.


Journal ArticleDOI
TL;DR: A single-tube Transposase Enzyme Linked Long-read Sequencing (TELL-seqTM) technology is developed, which enables a low-cost, high-accuracy and high-throughput short-read second-generation sequencer to generate over 100 kb long-range sequencing information with as little as 0.1 ng input material.
Abstract: Long-range sequencing information is required for haplotype phasing, de novo assembly, and structural variation detection. Current long-read sequencing technologies can provide valuable long-range information but at a high cost with low accuracy and high DNA input requirements. We have developed a single-tube Transposase Enzyme Linked Long-read Sequencing (TELL-seq) technology, which enables a low-cost, high-accuracy, and high-throughput short-read second-generation sequencer to generate over 100 kb of long-range sequencing information with as little as 0.1 ng input material. In a PCR tube, millions of clonally barcoded beads are used to uniquely barcode long DNA molecules in an open bulk reaction without dilution and compartmentation. The barcoded linked-reads are used to successfully assemble genomes ranging from microbes to human. These linked-reads also generate megabase-long phased blocks and provide a cost-effective tool for detecting structural variants in a genome, which are important to identify compound heterozygosity in recessive Mendelian diseases and discover genetic drivers and diagnostic biomarkers in cancers.