Institution

J. Craig Venter Institute

Nonprofit • La Jolla, California, United States

About: J. Craig Venter Institute is a nonprofit organization based in La Jolla, California, United States. It is known for research contributions on the topics Genome and Gene. The organization has 1268 authors who have published 2300 publications receiving 304083 citations. The organization is also known as JCVI and The Institute for Genomic Research.

Topics: Genome, Gene, Genomics, Population, Microbiome

Papers

Journal Article

An improved genome release (version Mt4.0) for the model legume Medicago truncatula

Haibao Tang¹, Vivek Krishnakumar¹, Shelby L. Bidwell¹, Benjamin D. Rosen¹, Agnes P. Chan¹, Shiguo Zhou², Laurent Gentzbittel³, Kevin L. Childs⁴, Mark Yandell⁵, Heidrun Gundlach, Klaus F. X. Mayer, David C. Schwartz², Christopher D. Town¹

J. Craig Venter Institute¹, University of Wisconsin-Madison², University of Toulouse³, Michigan State University⁴, University of Utah⁵

27 Apr 2014-BMC Genomics

TL;DR: This work describes a further improved and refined version of the M. truncatula genome (Mt4.0) based on de novo whole genome shotgun assembly of a majority of Illumina and 454 reads using ALLPATHS-LG, and re-annotates the genome through the gene prediction pipeline, which integrates EST, RNA-seq, protein and gene prediction evidences.

Abstract: Medicago truncatula, a close relative of alfalfa, is a preeminent model for studying nitrogen fixation, symbiosis, and legume genomics. The Medicago sequencing project began in 2003 with the goal to decipher sequences originated from the euchromatic portion of the genome. The initial sequencing approach was based on a BAC tiling path, culminating in a BAC-based assembly (Mt3.5) as well as an in-depth analysis of the genome published in 2011. Here we describe a further improved and refined version of the M. truncatula genome (Mt4.0) based on de novo whole genome shotgun assembly of a majority of Illumina and 454 reads using ALLPATHS-LG. The ALLPATHS-LG scaffolds were anchored onto the pseudomolecules on the basis of alignments to both the optical map and the genotyping-by-sequencing (GBS) map. The Mt4.0 pseudomolecules encompass ~360 Mb of actual sequences spanning 390 Mb of which ~330 Mb align perfectly with the optical map, presenting a drastic improvement over the BAC-based Mt3.5 which only contained 70% sequences (~250 Mb) of the current version. Most of the sequences and genes that previously resided on the unanchored portion of Mt3.5 have now been incorporated into the Mt4.0 pseudomolecules, with the exception of ~28 Mb of unplaced sequences. With regard to gene annotation, the genome has been re-annotated through our gene prediction pipeline, which integrates EST, RNA-seq, protein and gene prediction evidences. A total of 50,894 genes (31,661 high confidence and 19,233 low confidence) are included in Mt4.0 which overlapped with ~82% of the gene loci annotated in Mt3.5. Of the remaining genes, 14% of the Mt3.5 genes have been deprecated to an “unsupported” status and 4% are absent from the Mt4.0 predictions. Mt4.0 and its associated resources, such as genome browsers, BLAST-able datasets and gene information pages, can be found on the JCVI Medicago web site ( http://www.jcvi.org/medicago ). 
The assembly and annotation has been deposited in GenBank (BioProject: PRJNA10791). The heavily curated chromosomal sequences and associated gene models of Medicago will serve as a better reference for legume biology and comparative genomics.

373 citations

Journal Article

Genome Project Standards in a New Era of Sequencing

Patrick S. G. Chain¹, Darren Grafham, Robert S. Fulton², Michael Fitzgerald³, Jessica B. Hostetler⁴, Donna M. Muzny⁵, Johar Ali⁶, Bruce W. Birren³, D. C. Bruce⁷, D. C. Bruce¹, Christian J. Buhay⁵, James R. Cole⁸, Yan Ding⁵, Shannon Dugan⁵, Dawn Field, George M. Garrity⁸, Richard A. Gibbs⁵, Tina Graves², Cliff S. Han⁷, Cliff S. Han¹, Scott H. Harrison⁸, Sarah K. Highlander⁵, Philip Hugenholtz¹, H. M. Khouri⁹, Chinnappa D. Kodira³, Eugene Kolker¹⁰, Nikos C. Kyrpides¹, D. Lang⁹, Alla Lapidus¹, S. A. Malfatti⁹, Victor Markowitz¹¹, T. Metha³, Karen E. Nelson⁴, Julian Parkhill, Samuel Pitluck¹, Xiang Qin⁵, Timothy D. Read¹², Jeremy Schmutz, Shanmuga Sozhamannan¹³, Peter Sterk, Robert L. Strausberg⁴, Granger G. Sutton⁴, Nicholas R. Thomson, James M. Tiedje⁸, George M. Weinstock², Aye Wollam², John C. Detter⁷

United States Department of Energy¹, Washington University in St. Louis², Broad Institute³, J. Craig Venter Institute⁴, Baylor College of Medicine⁵, Ontario Institute for Cancer Research⁶, Los Alamos National Laboratory⁷, Michigan State University⁸, National Institutes of Health⁹, University of Washington¹⁰, Lawrence Berkeley National Laboratory¹¹, Georgia Research Alliance¹², Naval Medical Research Center¹³

09 Oct 2009-Science

TL;DR: The authors argue that the quality of draft genomes can currently be inferred only by navigating the databases to find the number and type of reads deposited in sequence trace repositories (not available for all genomes) or the number of contigs or genome fragments deposited, and that there is an urgent need to distinguish good from poor data sets.

Abstract: For over a decade, genome sequences have adhered to only two standards that are relied on for purposes of sequence analysis by interested third parties (1, 2). However, ongoing developments in revolutionary sequencing technologies have resulted in a redefinition of traditional whole-genome sequencing that requires reevaluation of such standards. With commercially available 454 pyrosequencing (followed by Illumina, SOLiD, and now Helicos), there has been an explosion of genomes sequenced under the moniker “draft”; however, these can be very poor quality genomes (due to inherent errors in the sequencing technologies, and the inability of assembly programs to fully address these errors). Further, one can only infer that such draft genomes may be of poor quality by navigating through the databases to find the number and type of reads deposited in sequence trace repositories (and not all genomes have this available), or to identify the number of contigs or genome fragments deposited to the database. The difficulty in assessing the quality of such deposited genomes has created some havoc for genome analysis pipelines and has contributed to many wasted hours. Exponential leaps in raw sequencing capability and greatly reduced prices have further skewed the time- and cost-ratios of draft data generation versus the painstaking process of improving and finishing a genome. The result is an ever-widening gap between drafted and finished genomes that only promises to continue (see the figure, page 236); hence, there is an urgent need to distinguish good from poor data sets.

370 citations

Journal Article

The Natural Product Domain Seeker NaPDoS: A Phylogeny Based Bioinformatic Tool to Classify Secondary Metabolite Gene Diversity

Nadine Ziemert¹, Sheila Podell¹, Kevin Penn¹, Jonathan H. Badger², Eric E. Allen¹, Paul R. Jensen¹

University of California, San Diego¹, J. Craig Venter Institute²

29 Mar 2012-PLOS ONE

TL;DR: The web tool Natural Product Domain Seeker (NaPDoS) provides an automated method to assess the secondary metabolite biosynthetic gene diversity and novelty of strains or environments, and a rapid method to identify genes that may be associated with uncharacterized biochemistry.

Abstract: New bioinformatic tools are needed to analyze the growing volume of DNA sequence data. This is especially true in the case of secondary metabolite biosynthesis, where the highly repetitive nature of the associated genes creates major challenges for accurate sequence assembly and analysis. Here we introduce the web tool Natural Product Domain Seeker (NaPDoS), which provides an automated method to assess the secondary metabolite biosynthetic gene diversity and novelty of strains or environments. NaPDoS analyses are based on the phylogenetic relationships of sequence tags derived from polyketide synthase (PKS) and non-ribosomal peptide synthetase (NRPS) genes, respectively. The sequence tags correspond to PKS-derived ketosynthase domains and NRPS-derived condensation domains and are compared to an internal database of experimentally characterized biosynthetic genes. NaPDoS provides a rapid mechanism to extract and classify ketosynthase and condensation domains from PCR products, genomes, and metagenomic datasets. Close database matches provide a mechanism to infer the generalized structures of secondary metabolites while new phylogenetic lineages provide targets for the discovery of new enzyme architectures or mechanisms of secondary metabolite assembly. Here we outline the main features of NaPDoS and test it on four draft genome sequences and two metagenomic datasets. The results provide a rapid method to assess secondary metabolite biosynthetic gene diversity and richness in organisms or environments and a mechanism to identify genes that may be associated with uncharacterized biochemistry.

369 citations

Journal Article

Structural flexibility in the Burkholderia mallei genome

William C. Nierman, David DeShazer¹, H. Stanley Kim², Hervé Tettelin², Karen E. Nelson², Tamara Feldblyum², Ricky L. Ulrich¹, Catherine M. Ronning², Lauren M. Brinkac², Sean C. Daugherty², Tanja Davidsen², Robert T. DeBoy², George Dimitrov², Robert J. Dodson², A. Scott Durkin², Michelle L. Gwinn², Daniel H. Haft², Hoda Khouri², James F. Kolonay², Ramana Madupu², Yasmin Mohammoud², William C. Nelson², Diana Radune², Claudia M. Romero², Saul H Sarria², Jeremy D. Selengut², Christine Shamblin², Steven A. Sullivan², Owen White², Yan Yu², Nikhat Zafar², Liwei Zhou², Claire M. Fraser³, Claire M. Fraser²

United States Army Medical Research Institute of Infectious Diseases¹, J. Craig Venter Institute², George Washington University³

28 Sep 2004-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: Variation in simple sequence repeats in key genes can provide a mechanism for generating antigenic variation that may account for the mammalian host's inability to mount a durable adaptive immune response to a B. mallei infection.

Abstract: The complete genome sequence of Burkholderia mallei ATCC 23344 provides insight into this highly infectious bacterium's pathogenicity and evolutionary history. B. mallei, the etiologic agent of glanders, has come under renewed scientific investigation as a result of recent concerns about its past and potential future use as a biological weapon. Genome analysis identified a number of putative virulence factors whose function was supported by comparative genome hybridization and expression profiling of the bacterium in hamster liver in vivo. The genome contains numerous insertion sequence elements that have mediated extensive deletions and rearrangements of the genome relative to Burkholderia pseudomallei. The genome also contains a vast number (>12,000) of simple sequence repeats. Variation in simple sequence repeats in key genes can provide a mechanism for generating antigenic variation that may account for the mammalian host's inability to mount a durable adaptive immune response to a B. mallei infection.

369 citations

Journal Article

Single-nucleus and single-cell transcriptomes compared in matched cortical cell types

Trygve E. Bakken¹, Rebecca D. Hodge¹, Jeremy A. Miller¹, Zizhen Yao¹, Thuc Nghi Nguyen¹, Brian D. Aevermann², Eliza Barkan¹, Darren Bertagnolli¹, Tamara Casper¹, Nick Dee¹, Emma Garren¹, Jeff Goldy¹, Lucas T. Graybuck¹, Matthew Kroll¹, Roger S. Lasken², Kanan Lathia¹, Sheana Parry¹, Christine Rimorin¹, Richard H. Scheuermann², Nicholas J. Schork², Soraya I. Shehata¹, Michael Tieu¹, John W. Phillips¹, Amy Bernard¹, Kimberly A. Smith¹, Hongkui Zeng¹, Ed Lein¹, Bosiljka Tasic¹

Allen Institute for Brain Science¹, J. Craig Venter Institute²

26 Dec 2018-PLOS ONE

TL;DR: It is demonstrated that closely related neuronal cell types can be similarly discriminated with both methods if intronic sequences are included in snRNA-seq analysis, and the high information content of nuclear RNA for characterization of cellular diversity in brain tissues is illustrated.

Abstract: Transcriptomic profiling of complex tissues by single-nucleus RNA-sequencing (snRNA-seq) affords some advantages over single-cell RNA-sequencing (scRNA-seq). snRNA-seq provides less biased cellular coverage, does not appear to suffer cell isolation-based transcriptional artifacts, and can be applied to archived frozen specimens. We used well-matched snRNA-seq and scRNA-seq datasets from mouse visual cortex to compare cell type detection. Although more transcripts are detected in individual whole cells (~11,000 genes) than nuclei (~7,000 genes), we demonstrate that closely related neuronal cell types can be similarly discriminated with both methods if intronic sequences are included in snRNA-seq analysis. We estimate that the nuclear proportion of total cellular mRNA varies from 20% to over 50% for large and small pyramidal neurons, respectively. Together, these results illustrate the high information content of nuclear RNA for characterization of cellular diversity in brain tissues.

368 citations

Authors

Showing all 1274 results

Name	H-index	Papers	Citations
John R. Yates	177	1036	129029
Anders M. Dale	156	823	133891
Ronald W. Davis	155	644	151276
Steven L. Salzberg	147	407	231756
Mark Raymond Adams	147	1187	135038
Nicholas J. Schork	125	587	62131
William R. Jacobs	118	490	48638
Ian T. Paulsen	112	354	69460
Michael B. Brenner	111	393	44771
Kenneth H. Nealson	108	483	51100
Claire M. Fraser	108	352	76292
Stephen L. Hoffman	104	458	38597
Michael J. Brownstein	102	274	47929
Amalio Telenti	102	421	40509
John Quackenbush	99	427	67029