scispace - formally typeset
Search or ask a question
Author

Rob Knight

Bio: Rob Knight is an academic researcher from University of California, San Diego. The author has contributed to research in topics: Microbiome & Gut flora. The author has an hindex of 201, co-authored 1061 publications receiving 253207 citations. Previous affiliations of Rob Knight include Anschutz Medical Campus & University of Sydney.
Topics: Microbiome, Gut flora, Medicine, Metagenomics, Biology


Papers
More filters
Journal ArticleDOI
TL;DR: DeepFRI as mentioned in this paper is a graph convolutional network for predicting protein functions by leveraging sequence features extracted from a protein language model and protein structures, which scales to the size of current sequence repositories.
Abstract: The rapid increase in the number of proteins in sequence databases and the diversity of their functions challenge computational approaches for automated function prediction. Here, we introduce DeepFRI, a Graph Convolutional Network for predicting protein functions by leveraging sequence features extracted from a protein language model and protein structures. It outperforms current leading methods and sequence-based Convolutional Neural Networks and scales to the size of current sequence repositories. Augmenting the training set of experimental structures with homology models allows us to significantly expand the number of predictable functions. DeepFRI has significant de-noising capability, with only a minor drop in performance when experimental structures are replaced by protein models. Class activation mapping allows function predictions at an unprecedented resolution, allowing site-specific annotations at the residue-level in an automated manner. We show the utility and high performance of our method by annotating structures from the PDB and SWISS-MODEL, making several new confident function predictions. DeepFRI is available as a webserver at https://beta.deepfri.flatironinstitute.org/ .

158 citations

Journal ArticleDOI
TL;DR: In this paper, the conditional probability that each molecule is present given the presence of a specific microorganism was estimated by using neural networks to infer the interactions between microbially produced metabolites and inflammatory bowel disease.
Abstract: Integrating multiomics datasets is critical for microbiome research; however, inferring interactions across omics datasets has multiple statistical challenges. We solve this problem by using neural networks (https://github.com/biocore/mmvec) to estimate the conditional probability that each molecule is present given the presence of a specific microorganism. We show with known environmental (desert soil biocrust wetting) and clinical (cystic fibrosis lung) examples, our ability to recover microbe-metabolite relationships, and demonstrate how the method can discover relationships between microbially produced metabolites and inflammatory bowel disease.

157 citations

Journal ArticleDOI
27 May 2015-PLOS ONE
TL;DR: The results indicate that the impact of exercise on gut microbiota composition as well as body composition may depend on the developmental stage during which exercise is initiated.
Abstract: The mammalian intestine harbors a complex microbial ecosystem that influences many aspects of host physiology. Exposure to specific microbes early in development affects host metabolism, immune function, and behavior across the lifespan. Just as the physiology of the developing organism undergoes a period of plasticity, the developing microbial ecosystem is characterized by instability and may also be more sensitive to change. Early life thus presents a window of opportunity for manipulations that produce adaptive changes in microbial composition. Recent insights have revealed that increasing physical activity can increase the abundance of beneficial microbial species. We therefore investigated whether six weeks of wheel running initiated in the juvenile period (postnatal day 24) would produce more robust and stable changes in microbial communities versus exercise initiated in adulthood (postnatal day 70) in male F344 rats. 16S rRNA gene sequencing was used to characterize the microbial composition of juvenile versus adult runners and their sedentary counterparts across multiple time points during exercise and following exercise cessation. Alpha diversity measures revealed that the microbial communities of young runners were less even and diverse, a community structure that reflects volatility and malleability. Juvenile onset exercise altered several phyla and, notably, increased Bacteroidetes and decreased Firmicutes, a configuration associated with leanness. At the genus level of taxonomy, exercise altered more genera in juveniles than in the adults and produced patterns associated with adaptive metabolic consequences. Given the potential of these changes to contribute to a lean phenotype, we examined body composition in juvenile versus adult runners. Interestingly, exercise produced persistent increases in lean body mass in juvenile but not adult runners. Taken together, these results indicate that the impact of exercise on gut microbiota composition as well as body composition may depend on the developmental stage during which exercise is initiated.

156 citations

Journal ArticleDOI
TL;DR: A model is developed to diagnose ECC from healthy samples with 70% accuracy and predict, with 81% accuracy, future ECC onsets for samples clinically perceived as healthy, so that caries onset in apparently healthy teeth can be predicted using microbiota, when appropriately de-trended for age.

156 citations

Journal ArticleDOI
TL;DR: It is shown that MHC IIb polymorphism is associated with among‐individual variation in gut microbiota within a single wild vertebrate population of a small fish, the threespine stickleback, and speculated that macroparasite‐driven selection on MHC has the potential to indirectly alter the host gut microbiota, and vice versa.
Abstract: Animals harbour diverse communities of symbiotic bacteria, which differ dramatically among host individuals. This heterogeneity poses an immunological challenge: distinguishing between mutualistic and pathogenic members of diverse and host-specific microbial communities. We propose that Major Histocompatibility class II (MHC) genotypes contribute to recognition and regulation of gut microbes, and thus, MHC polymorphism contributes to microbial variation among hosts. Here, we show that MHC IIb polymorphism is associated with among-individual variation in gut microbiota within a single wild vertebrate population of a small fish, the threespine stickleback. We sampled stickleback from Cedar Lake, on Vancouver Island, and used next-generation sequencing to genotype the sticklebacks' gut microbiota (16S sequencing) and their MHC class IIb exon 2 sequences. The presence of certain MHC motifs was associated with altered relative abundance (increase or decrease) of some microbial Families. The effect sizes are modest and entail a minority of microbial taxa, but these results represent the first indication that MHC genotype may affect gut microbiota composition in natural populations (MHC-microbe associations have also been found in a few studies of lab mice). Surprisingly, these MHC effects were frequently sex-dependent. Finally, hosts with more diverse MHC motifs had less diverse gut microbiota. One implication is that MHC might influence the efficacy of therapeutic strategies to treat dysbiosis-associated disease, including the outcome of microbial transplants between healthy and diseased patients. We also speculate that macroparasite-driven selection on MHC has the potential to indirectly alter the host gut microbiota, and vice versa.

155 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: An overview of the analysis pipeline and links to raw data and processed output from the runs with and without denoising are provided.
Abstract: Supplementary Figure 1 Overview of the analysis pipeline. Supplementary Table 1 Details of conventionally raised and conventionalized mouse samples. Supplementary Discussion Expanded discussion of QIIME analyses presented in the main text; Sequencing of 16S rRNA gene amplicons; QIIME analysis notes; Expanded Figure 1 legend; Links to raw data and processed output from the runs with and without denoising.

28,911 citations

28 Jul 2005
TL;DR: PfPMP1)与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作�ly.
Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1(PfPMP1)与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员,通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

18,940 citations

Journal ArticleDOI
TL;DR: The extensively curated SILVA taxonomy and the new non-redundant SILVA datasets provide an ideal reference for high-throughput classification of data from next-generation sequencing approaches.
Abstract: SILVA (from Latin silva, forest, http://www.arb-silva.de) is a comprehensive web resource for up to date, quality-controlled databases of aligned ribosomal RNA (rRNA) gene sequences from the Bacteria, Archaea and Eukaryota domains and supplementary online services. The referred database release 111 (July 2012) contains 3 194 778 small subunit and 288 717 large subunit rRNA gene sequences. Since the initial description of the project, substantial new features have been introduced, including advanced quality control procedures, an improved rRNA gene aligner, online tools for probe and primer evaluation and optimized browsing, searching and downloading on the website. Furthermore, the extensively curated SILVA taxonomy and the new non-redundant SILVA datasets provide an ideal reference for high-throughput classification of data from next-generation sequencing approaches.

18,256 citations

Journal ArticleDOI
TL;DR: M mothur is used as a case study to trim, screen, and align sequences; calculate distances; assign sequences to operational taxonomic units; and describe the α and β diversity of eight marine samples previously characterized by pyrosequencing of 16S rRNA gene fragments.
Abstract: mothur aims to be a comprehensive software package that allows users to use a single piece of software to analyze community sequence data. It builds upon previous tools to provide a flexible and powerful software package for analyzing sequencing data. As a case study, we used mothur to trim, screen, and align sequences; calculate distances; assign sequences to operational taxonomic units; and describe the alpha and beta diversity of eight marine samples previously characterized by pyrosequencing of 16S rRNA gene fragments. This analysis of more than 222,000 sequences was completed in less than 2 h with a laptop computer.

17,350 citations

Journal ArticleDOI
TL;DR: UCLUST is a new clustering method that exploits USEARCH to assign sequences to clusters and offers several advantages over the widely used program CD-HIT, including higher speed, lower memory use, improved sensitivity, clustering at lower identities and classification of much larger datasets.
Abstract: Motivation: Biological sequence data is accumulating rapidly, motivating the development of improved high-throughput methods for sequence classification. Results: UBLAST and USEARCH are new algorithms enabling sensitive local and global search of large sequence databases at exceptionally high speeds. They are often orders of magnitude faster than BLAST in practical applications, though sensitivity to distant protein relationships is lower. UCLUST is a new clustering method that exploits USEARCH to assign sequences to clusters. UCLUST offers several advantages over the widely used program CD-HIT, including higher speed, lower memory use, improved sensitivity, clustering at lower identities and classification of much larger datasets. Availability: Binaries are available at no charge for non-commercial use at http://www.drive5.com/usearch Contact: [email protected] Supplementary information:Supplementary data are available at Bioinformatics online.

17,301 citations