Institution
J. Craig Venter Institute
Nonprofit•La Jolla, California, United States•
About: J. Craig Venter Institute is a nonprofit organization based out in La Jolla, California, United States. It is known for research contribution in the topics: Genome & Gene. The organization has 1268 authors who have published 2300 publications receiving 304083 citations. The organization is also known as: JCVI & The Institute for Genomic Research.
Topics: Genome, Gene, Genomics, Population, Microbiome
Papers published on a yearly basis
Papers
More filters
••
TL;DR: New ‘massively parallel’ sequencing methods are greatly increasing sequencing capacity, but further innovations are needed to achieve the ‘thousand dollar genome’ that many feel is prerequisite to personalized genomic medicine.
Abstract: Fifteen years elapsed between the discovery of the double helix (1953) and the first DNA sequencing (1968). Modern DNA sequencing began in 1977, with development of the chemical method of Maxam and Gilbert and the dideoxy method of Sanger, Nicklen and Coulson, and with the first complete DNA sequence (phage rX174), which demonstrated that sequence could give profound insights into genetic organization. Incremental improvements allowed sequencing of molecules >200kb (human cytomegalovirus) leading to an avalanche of data that demanded computational analysis and spawned the field of bioinformatics. The US Human Genome Project spurred sequencing activity. By 1992 the first ‘sequencing factory’ was established, and others soon followed. The first complete cellular genome sequences, from bacteria, appeared in 1995 and other eubacterial, archaebacterial and eukaryotic genomes were soon sequenced. Competition between the public Human Genome Project and Celera Genomics produced working drafts of the human genome sequence, published in 2001, but refinement and analysis of the human genome sequence will continue for the foreseeable future. New ‘massively parallel’ sequencing methods are greatly increasing sequencing capacity, but further innovations are needed to achieve the ‘thousand dollar genome’ that many feel is prerequisite to personalized genomic medicine. These advances will also allow new approaches to a variety of problems in biology, evolution and the environment.
292 citations
••
TL;DR: This is the first glimpse of an individual's exome and a snapshot of the current state of personalized genomics, and presents an approach to analyze the coding variation in humans by proposing multiple bioinformatic methods to hone in on possible functional variation.
Abstract: There is much interest in characterizing the variation in a human individual, because this may elucidate what contributes significantly to a person's phenotype, thereby enabling personalized genomics. We focus here on the variants in a person's ‘exome,’ which is the set of exons in a genome, because the exome is believed to harbor much of the functional variation. We provide an analysis of the ∼12,500 variants that affect the protein coding portion of an individual's genome. We identified ∼10,400 nonsynonymous single nucleotide polymorphisms (nsSNPs) in this individual, of which ∼15–20% are rare in the human population. We predict ∼1,500 nsSNPs affect protein function and these tend be heterozygous, rare, or novel. Of the ∼700 coding indels, approximately half tend to have lengths that are a multiple of three, which causes insertions/deletions of amino acids in the corresponding protein, rather than introducing frameshifts. Coding indels also occur frequently at the termini of genes, so even if an indel causes a frameshift, an alternative start or stop site in the gene can still be used to make a functional protein. In summary, we reduced the set of ∼12,500 nonsilent coding variants by ∼8-fold to a set of variants that are most likely to have major effects on their proteins' functions. This is our first glimpse of an individual's exome and a snapshot of the current state of personalized genomics. The majority of coding variants in this individual are common and appear to be functionally neutral. Our results also indicate that some variants can be used to improve the current NCBI human reference genome. As more genomes are sequenced, many rare variants and non-SNP variants will be discovered. We present an approach to analyze the coding variation in humans by proposing multiple bioinformatic methods to hone in on possible functional variation.
291 citations
••
TL;DR: It is proposed that while members of a given marine Synechococcus lineage may have the same broad geographical distribution, local niche occupancy is facilitated by lateral gene transfers, a process in which genomic islands play a key role as a repository for transferred genes.
Abstract: Background
The picocyanobacterial genus Synechococcus occurs over wide oceanic expanses, having colonized most available niches in the photic zone. Large scale distribution patterns of the different Synechococcus clades (based on 16S rRNA gene markers) suggest the occurrence of two major lifestyles ('opportunists'/'specialists'), corresponding to two distinct broad habitats ('coastal'/'open ocean'). Yet, the genetic basis of niche partitioning is still poorly understood in this ecologically important group.
290 citations
••
TL;DR: This work reports on the sequencing of 10,545 human genomes at 30×–40× coverage with an emphasis on quality metrics and novel variant and sequence discovery and concludes that high-coverage genome sequencing provides accurate detail on human variation for discovery and clinical applications.
Abstract: We report on the sequencing of 10,545 human genomes at 30×-40× coverage with an emphasis on quality metrics and novel variant and sequence discovery. We find that 84% of an individual human genome can be sequenced confidently. This high-confidence region includes 91.5% of exon sequence and 95.2% of known pathogenic variant positions. We present the distribution of over 150 million single-nucleotide variants in the coding and noncoding genome. Each newly sequenced genome contributes an average of 8,579 novel variants. In addition, each genome carries on average 0.7 Mb of sequence that is not found in the main build of the hg38 reference genome. The density of this catalog of variation allowed us to construct high-resolution profiles that define genomic sites that are highly intolerant of genetic variation. These results indicate that the data generated by deep genome sequencing is of the quality necessary for clinical use.
288 citations
••
University of East Anglia1, Joint Genome Institute2, Norwich Research Park3, Stazione Zoologica Anton Dohrn4, J. Craig Venter Institute5, Scripps Institution of Oceanography6, Alfred Wegener Institute for Polar and Marine Research7, Université Paris-Saclay8, École Normale Supérieure9, University of Paris10, University of Cologne11, Leibniz Association12, University of Konstanz13, Medical University of South Carolina14, University of Duisburg-Essen15, Leibniz Institute for Neurobiology16, University of Washington17, University of Nevada, Las Vegas18, Monterey Bay Aquarium Research Institute19, University of British Columbia20, University of California, Berkeley21
TL;DR: In this article, the genome evolution of a cold-adapted diatom from the Southern Ocean, Fragilariopsis cylindrus, based on a comparison with temperate diatoms was studied.
Abstract: The Southern Ocean houses a diverse and productive community of organisms. Unicellular eukaryotic diatoms are the main primary producers in this environment, where photosynthesis is limited by low concentrations of dissolved iron and large seasonal fluctuations in light, temperature and the extent of sea ice. How diatoms have adapted to this extreme environment is largely unknown. Here we present insights into the genome evolution of a cold-adapted diatom from the Southern Ocean, Fragilariopsis cylindrus, based on a comparison with temperate diatoms. We find that approximately 24.7 per cent of the diploid F. cylindrus genome consists of genetic loci with alleles that are highly divergent (15.1 megabases of the total genome size of 61.1 megabases). These divergent alleles were differentially expressed across environmental conditions, including darkness, low iron, freezing, elevated temperature and increased CO2. Alleles with the largest ratio of non-synonymous to synonymous nucleotide substitutions also show the most pronounced condition-dependent expression, suggesting a correlation between diversifying selection and allelic differentiation. Divergent alleles may be involved in adaptation to environmental fluctuations in the Southern Ocean.
287 citations
Authors
Showing all 1274 results
Name | H-index | Papers | Citations |
---|---|---|---|
John R. Yates | 177 | 1036 | 129029 |
Anders M. Dale | 156 | 823 | 133891 |
Ronald W. Davis | 155 | 644 | 151276 |
Steven L. Salzberg | 147 | 407 | 231756 |
Mark Raymond Adams | 147 | 1187 | 135038 |
Nicholas J. Schork | 125 | 587 | 62131 |
William R. Jacobs | 118 | 490 | 48638 |
Ian T. Paulsen | 112 | 354 | 69460 |
Michael B. Brenner | 111 | 393 | 44771 |
Kenneth H. Nealson | 108 | 483 | 51100 |
Claire M. Fraser | 108 | 352 | 76292 |
Stephen L. Hoffman | 104 | 458 | 38597 |
Michael J. Brownstein | 102 | 274 | 47929 |
Amalio Telenti | 102 | 421 | 40509 |
John Quackenbush | 99 | 427 | 67029 |