scispace - formally typeset
Search or ask a question
Institution

J. Craig Venter Institute

NonprofitLa Jolla, California, United States
About: J. Craig Venter Institute is a nonprofit organization based out in La Jolla, California, United States. It is known for research contribution in the topics: Genome & Gene. The organization has 1268 authors who have published 2300 publications receiving 304083 citations. The organization is also known as: JCVI & The Institute for Genomic Research.
Topics: Genome, Gene, Genomics, Population, Microbiome


Papers
More filters
Journal ArticleDOI
TL;DR: Electrochemical, metabolic, and transcriptional responses of EET-active microbial communities established on poised electrodes to changes in the surface redox potentials and surrounding substrates are shown, suggesting that a wide variety of Eet-active microbes utilizing diverse EET mechanisms may work together to provide such EET -active communities with an impressive ability to handle major changes in surface potential and carbon source availability.
Abstract: Some microbes can capture energy through redox reactions with electron flow to solid-phase electron acceptors, such as metal-oxides or poised electrodes, via extracellular electron transfer (EET). While diverse oxide minerals, exhibiting different surface redox potentials, are widely distributed on Earth, little is known about how microbes sense and use the minerals. Here we show electrochemical, metabolic, and transcriptional responses of EET-active microbial communities established on poised electrodes to changes in the surface redox potentials (as electron acceptors) and surrounding substrates (as electron donors). Combination of genome-centric stimulus-induced metatranscriptomics and metabolic pathway investigation revealed that nine Geobacter/Pelobacter microbes performed EET activity differently according to their preferable surface potentials and substrates. While the Geobacter/Pelobacter microbes coded numerous numbers of multi-heme c-type cytochromes and conductive e-pili, wide variations in gene expression were seen in response to altering surrounding substrates and surface potentials, accelerating EET via poised electrode or limiting EET via an open circuit system. These flexible responses suggest that a wide variety of EET-active microbes utilizing diverse EET mechanisms may work together to provide such EET-active communities with an impressive ability to handle major changes in surface potential and carbon source availability.

50 citations

Journal ArticleDOI
TL;DR: The incremental clustering method is shown to be much faster than the original approach in identifying genes, grouping sequences into existing protein families, and also identifying novel families that have multiple members in a metagenomic dataset.
Abstract: The identification and study of proteins from metagenomic datasets can shed light on the roles and interactions of the source organisms in their communities. However, metagenomic datasets are characterized by the presence of organisms with varying GC composition, codon usage biases etc., and consequently gene identification is challenging. The vast amount of sequence data also requires faster protein family classification tools. We present a computational improvement to a sequence clustering approach that we developed previously to identify and classify protein coding genes in large microbial metagenomic datasets. The clustering approach can be used to identify protein coding genes in prokaryotes, viruses, and intron-less eukaryotes. The computational improvement is based on an incremental clustering method that does not require the expensive all-against-all compute that was required by the original approach, while still preserving the remote homology detection capabilities. We present evaluations of the clustering approach in protein-coding gene identification and classification, and also present the results of updating the protein clusters from our previous work with recent genomic and metagenomic sequences. The clustering results are available via CAMERA, (http://camera.calit2.net). The clustering paradigm is shown to be a very useful tool in the analysis of microbial metagenomic data. The incremental clustering method is shown to be much faster than the original approach in identifying genes, grouping sequences into existing protein families, and also identifying novel families that have multiple members in a metagenomic dataset. These clusters provide a basis for further studies of protein families.

50 citations

Journal ArticleDOI
TL;DR: Collection and stabilization of stool microbiome samples with the DNA Genotek collection device, combined with the extraction and WGS, provides a robust, reproducible workflow that enables standardized global collection, storage, and analysis of stool for microbiome studies.
Abstract: As reports on possible associations between microbes and the host increase in number, more meaningful interpretations of this information require an ability to compare data sets across studies. This is dependent upon standardization of workflows to ensure comparability both within and between studies. Here we propose the standard use of an alternate collection and stabilization method that would facilitate such comparisons. The DNA Genotek OMNIgene∙Gut Stool Microbiome Kit was compared to the currently accepted community standard of freezing to store human stool samples prior to whole genome sequencing (WGS) for microbiome studies. This stabilization and collection device allows for ambient temperature storage, automation, and ease of shipping/transfer of samples. The device permitted the same data reproducibility as with frozen samples, and yielded higher recovery of nucleic acids. Collection and stabilization of stool microbiome samples with the DNA Genotek collection device, combined with our extraction and WGS, provides a robust, reproducible workflow that enables standardized global collection, storage, and analysis of stool for microbiome studies.

50 citations

Journal ArticleDOI
TL;DR: A large set of bacterial, archaeal, and viral genomes recovered from the environment are leveraged to expand understanding of existing RuBisCO diversity and the evolutionary processes responsible for its distribution, and to suggest that lateral gene transfer of RuBISCO may have facilitated major shifts in carbon metabolism in several important bacterial and Archaeal lineages.
Abstract: Ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCO) is considered to be the most abundant enzyme on Earth. Despite this, its full diversity and distribution across the domains of life remain to be determined. Here, we leverage a large set of bacterial, archaeal, and viral genomes recovered from the environment to expand our understanding of existing RuBisCO diversity and the evolutionary processes responsible for its distribution. Specifically, we report a new type of RuBisCO present in Candidate Phyla Radiation (CPR) bacteria that is related to the archaeal Form III enzyme and contains the amino acid residues necessary for carboxylase activity. Genome-level metabolic analyses supported the inference that these RuBisCO function in a CO2-incorporating pathway that consumes nucleotides. Importantly, some Gottesmanbacteria (CPR) also encode a phosphoribulokinase that may augment carbon metabolism through a partial Calvin-Benson-Bassham cycle. Based on the scattered distribution of RuBisCO and its discordant evolutionary history, we conclude that this enzyme has been extensively laterally transferred across the CPR bacteria and DPANN archaea. We also report RuBisCO-like proteins in phage genomes from diverse environments. These sequences cluster with proteins in the Beckwithbacteria (CPR), implicating phage as a possible mechanism of RuBisCO transfer. Finally, we synthesize our metabolic and evolutionary analyses to suggest that lateral gene transfer of RuBisCO may have facilitated major shifts in carbon metabolism in several important bacterial and archaeal lineages.

50 citations

Journal ArticleDOI
TL;DR: The sum of gene deletion and gene duplication rates was correlated with the rate of structural chromosome rearrangements and was highest in the Ae.
Abstract: Nine different regions totaling 9.7 Mb of the 4.02 Gb Aegilops tauschii genome were sequenced using the Sanger sequencing technology and compared with orthologous Brachypodium distachyon, Oryza sativa (rice), and Sorghum bicolor (sorghum) genomic sequences. The ancestral gene content in these regions was inferred and used to estimate gene deletion and gene duplication rates along each branch of the phylogenetic tree relating the four species. The total gene number in the extant Ae. tauschii genome was estimated to be 36,371. The gene deletion and gene duplication rates and total gene numbers in the four genomes were used to estimate the total gene number in each node of the phylogenetic tree. The common ancestor of the Brachypodieae and Triticeae lineages was estimated to have had 28,558 genes, and the common ancestor of the Panicoideae, Ehrhartoideae, and Pooideae subfamilies was estimated to have had 27,152 or 28,350 genes, depending on the ancestral gene scenario. Relative to the Brachypodieae and Triticeae common ancestor, the gene number was reduced in B. distachyon by 3,026 genes and increased in Ae. tauschii by 7,813 genes. The sum of gene deletion and gene duplication rates, which reflects the rate of gene synteny loss, was correlated with the rate of structural chromosome rearrangements and was highest in the Ae. tauschii lineage and lowest in the rice lineage. The high rate of gene space evolution in the Ae. tauschii lineage accounts for the fact that, contrary to the expectations, the level of synteny between the phylogenetically more related Ae. tauschii and B. distachyon genomes is similar to the level of synteny between the Ae. tauschii genome and the genomes of the less related rice and sorghum. The ratio of gene duplication to gene deletion rates in these four grass species closely parallels both the total number of genes in a species and the overall genome size. Because the overall genome size is to a large extent a function of the repeated sequence content in a genome, we suggest that the amount and activity of repeated sequences are important factors determining the number of genes in a genome.

50 citations


Authors

Showing all 1274 results

NameH-indexPapersCitations
John R. Yates1771036129029
Anders M. Dale156823133891
Ronald W. Davis155644151276
Steven L. Salzberg147407231756
Mark Raymond Adams1471187135038
Nicholas J. Schork12558762131
William R. Jacobs11849048638
Ian T. Paulsen11235469460
Michael B. Brenner11139344771
Kenneth H. Nealson10848351100
Claire M. Fraser10835276292
Stephen L. Hoffman10445838597
Michael J. Brownstein10227447929
Amalio Telenti10242140509
John Quackenbush9942767029
Network Information
Related Institutions (5)
Wellcome Trust Sanger Institute
9.6K papers, 1.2M citations

94% related

Broad Institute
11.6K papers, 1.5M citations

92% related

Cold Spring Harbor Laboratory
6.6K papers, 1M citations

92% related

Pasteur Institute
50.3K papers, 2.5M citations

92% related

Howard Hughes Medical Institute
34.6K papers, 5.2M citations

92% related

Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
20233
202211
2021116
2020141
2019154
2018157