Institution

J. Craig Venter Institute

Nonprofit•La Jolla, California, United States

About: J. Craig Venter Institute is a nonprofit organization based in La Jolla, California, United States. It is known for research contributions on the topics of genome and gene. The organization has 1268 authors who have published 2300 publications receiving 304083 citations. The organization is also known as JCVI and The Institute for Genomic Research.

Topics: Genome, Gene, Genomics, Population, Microbiome

Papers

Journal Article

High contiguity Arabidopsis thaliana genome assembly with a single nanopore flow cell.

Todd P. Michael¹, Florian Jupe²,³, Felix Bemm⁴, Stanley Motley¹, Justin P. Sandoval², Christa Lanz⁴, Olivier Loudet⁵, Detlef Weigel⁴, Joseph R. Ecker²

J. Craig Venter Institute¹, Salk Institute for Biological Studies², Monsanto³, Max Planck Society⁴, Université Paris-Saclay⁵

07 Feb 2018-Nature Communications

TL;DR: It is demonstrated that even when the purpose is to understand complex structural variation at a single region of the genome, complete genome assembly is becoming the simplest way to achieve this goal.

Abstract: The handheld Oxford Nanopore MinION sequencer generates ultra-long reads with minimal cost and time requirements, which makes sequencing genomes at the bench feasible. Here, we sequence the gold standard Arabidopsis thaliana genome (KBS-Mac-74 accession) on the bench with the MinION sequencer, and assemble the genome using typical consumer computing hardware (4 Cores, 16 Gb RAM) into chromosome arms (62 contigs with an N50 length of 12.3 Mb). We validate the contiguity and quality of the assembly with two independent single-molecule technologies, Bionano optical genome maps and Pacific Biosciences Sequel sequencing. The new A. thaliana KBS-Mac-74 genome enables resolution of a quantitative trait locus that had previously been recalcitrant to a Sanger-based BAC sequencing approach. In summary, we demonstrate that even when the purpose is to understand complex structural variation at a single region of the genome, complete genome assembly is becoming the simplest way to achieve this goal.

250 citations

Journal Article

Phylogenetic discovery bias in Bacillus anthracis using single-nucleotide polymorphisms from whole-genome sequencing.

Talima Pearson¹, Joseph D. Busch¹, Jacques Ravel², Timothy D. Read², Shane D. Rhoton¹, Jana M. U'Ren¹, Tatum S. Simonson¹, Sergey Kachur¹, Rebecca R. Leadem¹, Michelle L. Cardon¹, Matthew N. Van Ert¹, Lynn Y. Huynh¹, Claire M. Fraser², Paul Keim¹,³

Northern Arizona University¹, J. Craig Venter Institute², Translational Genomics Research Institute³

14 Sep 2004-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: Using whole-genome comparisons of five diverse strains of Bacillus anthracis to facilitate SNP discovery shows that only polymorphisms lying along the evolutionary pathway between reference strains will be observed, and shows how divergent branches in topologies collapse to single points but provide accurate information on internodal distances and points of origin for ancestral clades.

Abstract: Phylogenetic reconstruction using molecular data is often subject to homoplasy, leading to inaccurate conclusions about phylogenetic relationships among operational taxonomic units. Compared with other molecular markers, single-nucleotide polymorphisms (SNPs) exhibit extremely low mutation rates, making them rare in recently emerged pathogens, but they are less prone to homoplasy and thus extremely valuable for phylogenetic analyses. Despite their phylogenetic potential, ascertainment bias occurs when SNP characters are discovered through biased taxonomic sampling; by using whole-genome comparisons of five diverse strains of Bacillus anthracis to facilitate SNP discovery, we show that only polymorphisms lying along the evolutionary pathway between reference strains will be observed. We illustrate this in theoretical and simulated data sets in which complex phylogenetic topologies are reduced to linear evolutionary models. Using a set of 990 SNP markers, we also show how divergent branches in our topologies collapse to single points but provide accurate information on internodal distances and points of origin for ancestral clades. These data allowed us to determine the ancestral root of B. anthracis, showing that it lies closer to a newly described "C" branch than to either of two previously described "A" or "B" branches. In addition, subclade rooting of the C branch revealed unequal evolutionary rates that seem to be correlated with ecological parameters and strain attributes. Our use of nonhomoplastic whole-genome SNP characters allows branch points and clade membership to be estimated with great precision, providing greater insight into epidemiological, ecological, and forensic questions.

249 citations

Journal Article

Sequencing Medicago truncatula expressed sequenced tags using 454 Life Sciences technology

Foo Cheung, Brian J. Haas, Susanne M. D. Goldberg¹, Gregory D. May², Yongli Xiao, Christopher D. Town

J. Craig Venter Institute¹, National Center for Genome Resources²

24 Oct 2006-BMC Genomics

TL;DR: Due to the large number of reads afforded by the 454 DNA sequencing technology, it is effective in revealing the expression of transcripts from a broad range of GO categories and contains many rare transcripts in normalized cDNA libraries, although only a limited portion of their sequence is uncovered.

Abstract: In this study, we addressed whether a single 454 Life Science GS20 sequencing run provides new gene discovery from a normalized cDNA library, and whether the short reads produced via this technology are of value in gene structure annotation. A single 454 GS20 sequencing run on adapter-ligated cDNA, from a normalized cDNA library, generated 292,465 reads that were reduced to 252,384 reads with an average read length of 92 nucleotides after cleaning. After clustering and assembly, a total of 184,599 unique sequences were generated containing over 400 SSRs. The 454 sequences generated hits to more genes than a comparable amount of sequence from MtGI. Although short, the 454 reads are of sufficient length to map to a unique genome location as effectively as longer ESTs produced by conventional sequencing. Functional interpretation of the sequences was carried out by Gene Ontology assignments from matches to Arabidopsis and was shown to cover a broad range of GO categories. 53,796 assemblies and singletons (29%) had no match in the existing MtGI. Within the previously unobserved Medicago transcripts, thousands had matches in a comprehensive protein database and one or more of the TIGR Plant Gene Indices. Approximately 20% of these novel sequences could be found in the Medicago genome sequence. A total of 70,026 reads generated by the 454 technology were mapped to 785 Medicago finished BACs using PASA and over 1,000 gene models required modification. In parallel to 454 sequencing, 4,445 5'-prime reads were generated by conventional sequencing using the same library and from the assembled sequences it was shown to contain about 52% full length cDNAs encoding proteins from 50 to over 500 amino acids in length. 
Due to the large number of reads afforded by the 454 DNA sequencing technology, it is effective in revealing the expression of transcripts from a broad range of GO categories and contains many rare transcripts in normalized cDNA libraries, although only a limited portion of their sequence is uncovered. As with longer ESTs, 454 reads can be mapped uniquely onto genomic sequence to provide support for, and modifications of, gene predictions.

248 citations

Journal Article

ePlant: Visualizing and Exploring Multiple Levels of Data for Hypothesis Generation in Plant Biology

Jamie Waese¹, Jim Fan², Asher Pasha¹, Hans Yu¹, Geoffrey Fucile³, Ruian Shi¹, Matthew Cumming¹, Lawrence A. Kelley⁴, Michael J.E. Sternberg⁴, Vivek Krishnakumar⁵, Erik S. Ferlanti⁵, Jason R. Miller⁵, Christopher D. Town⁵, Wolfgang Stuerzlinger⁶, Nicholas J. Provart¹

University of Toronto¹, University of Waterloo², Swiss Institute of Bioinformatics³, Imperial College London⁴, J. Craig Venter Institute⁵, University of British Columbia⁶

01 Aug 2017-The Plant Cell

TL;DR: The development of ePlant is described and several examples illustrating its integrative features for hypothesis generation are presented, including the process of deploying ePlant as an “app” on Araport.

Abstract: A big challenge in current systems biology research arises when different types of data must be accessed from separate sources and visualized using separate tools. The high cognitive load required to navigate such a workflow is detrimental to hypothesis generation. Accordingly, there is a need for a robust research platform that incorporates all data and provides integrated search, analysis, and visualization features through a single portal. Here, we present ePlant (http://bar.utoronto.ca/eplant), a visual analytic tool for exploring multiple levels of Arabidopsis thaliana data through a zoomable user interface. ePlant connects to several publicly available web services to download genome, proteome, interactome, transcriptome, and 3D molecular structure data for one or more genes or gene products of interest. Data are displayed with a set of visualization tools that are presented using a conceptual hierarchy from big to small, and many of the tools combine information from more than one data type. We describe the development of ePlant in this article and present several examples illustrating its integrative features for hypothesis generation. We also describe the process of deploying ePlant as an “app” on Araport. Building on readily available web services, the code for ePlant is freely available for any other biological species research.

247 citations

Journal Article

Identification of protective and broadly conserved vaccine antigens from the genome of extraintestinal pathogenic Escherichia coli

Danilo Gomes Moriel¹, Isabella Bertoldi, Angela Spagnuolo, Sara Marchi, Roberto Rosini, Barbara Nesta, Ilaria Pastorello, Vanja A Mariani Corea, Giulia Torricelli, Elena Cartocci, Silvana Savino, Maria Scarselli, Ulrich Dobrindt², Joerg Hacker², Hervé Tettelin³,⁴, Luke J. Tallon³,⁴, Steven A. Sullivan⁴,⁵, Lothar H. Wieler⁶, Christa Ewers⁶, Derek Pickard⁷, Gordon Dougan⁷, Maria Rita Fontana, Rino Rappuoli, Mariagrazia Pizza, Laura Serino

Novartis¹, University of Würzburg², University of Maryland, Baltimore³, J. Craig Venter Institute⁴, New York University⁵, Free University of Berlin⁶, Wellcome Trust Sanger Institute⁷

18 May 2010-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The genome sequence of ExPEC IHE3034 (ST95), isolated from a case of neonatal meningitis, is determined. The gene encoding the most protective antigen was detected in most of the E. coli isolates, was highly conserved in sequence, and was found to be exported by a type II secretion system that seems to be nonfunctional in nonpathogenic strains.

Abstract: Extraintestinal pathogenic Escherichia coli (ExPEC) are a common cause of disease in both mammals and birds. A vaccine to prevent such infections would be desirable given the increasing antibiotic resistance of these bacteria. We have determined the genome sequence of ExPEC IHE3034 (ST95) isolated from a case of neonatal meningitis and compared this to available genome sequences of other ExPEC strains and a few nonpathogenic E. coli. We found 19 genomic islands present in the genome of IHE3034, which are absent in the nonpathogenic E. coli isolates. By using subtractive reverse vaccinology we identified 230 antigens present in ExPEC but absent (or present with low similarity) in nonpathogenic strains. Nine antigens were protective in a mouse challenge model. Some of them were also present in other pathogenic non-ExPEC strains, suggesting that a broadly protective E. coli vaccine may be possible. The gene encoding the most protective antigen was detected in most of the E. coli isolates, highly conserved in sequence and found to be exported by a type II secretion system which seems to be nonfunctional in nonpathogenic strains.

245 citations

Authors

Showing all 1274 results

Name	H-index	Papers	Citations
John R. Yates	177	1036	129029
Anders M. Dale	156	823	133891
Ronald W. Davis	155	644	151276
Steven L. Salzberg	147	407	231756
Mark Raymond Adams	147	1187	135038
Nicholas J. Schork	125	587	62131
William R. Jacobs	118	490	48638
Ian T. Paulsen	112	354	69460
Michael B. Brenner	111	393	44771
Kenneth H. Nealson	108	483	51100
Claire M. Fraser	108	352	76292
Stephen L. Hoffman	104	458	38597
Michael J. Brownstein	102	274	47929
Amalio Telenti	102	421	40509
John Quackenbush	99	427	67029

Network Information

Related Institutions (5)

Institution	Papers	Citations	Related
Wellcome Trust Sanger Institute	9.6K	1.2M	94%
Broad Institute	11.6K	1.5M	92%
Cold Spring Harbor Laboratory	6.6K	1M	92%
Pasteur Institute	50.3K	2.5M	92%
Howard Hughes Medical Institute	34.6K	5.2M	92%

Performance

Metrics

Papers	2,313
Citations	349,017

No. of papers from the Institution in previous years
Year	Papers
2023	3
2022	11
2021	116
2020	141
2019	154
2018	157