Author

Andrew Zimmer

Bio: Andrew Zimmer is an academic researcher at the Broad Institute. The author has contributed to research on the topics Genome and Genomics. The author has an h-index of 11 and has co-authored 20 publications receiving 6,744 citations.

Papers
Journal Article
08 Dec 2005-Nature
TL;DR: A high-quality draft genome sequence of the domestic dog is reported, together with a dense map of single nucleotide polymorphisms (SNPs) across breeds, to shed light on the structure and evolution of genomes and genes.
Abstract: Here we report a high-quality draft genome sequence of the domestic dog (Canis familiaris), together with a dense map of single nucleotide polymorphisms (SNPs) across breeds. The dog is of particular interest because it provides important evolutionary information and because existing breeds show great phenotypic diversity for morphological, physiological and behavioural traits. We use sequence comparison with the primate and rodent lineages to shed light on the structure and evolution of genomes and genes. Notably, the majority of the most highly conserved non-coding sequences in mammalian genomes are clustered near a small subset of genes with important roles in development. Analysis of SNPs reveals long-range haplotypes across the entire dog genome, and defines the nature of genetic diversity within and across breeds. The current SNP map now makes it possible for genome-wide association studies to identify genes responsible for diseases and traits, with important consequences for human and companion animal health.

2,431 citations

Journal Article
Andrew G. Clark, Michael B. Eisen, Douglas Smith, +426 more (70 institutions)
08 Nov 2007-Nature
TL;DR: These genome sequences augment the formidable genetic tools that have made Drosophila melanogaster a pre-eminent model for animal genetics, and will further catalyse fundamental research on mechanisms of development, cell biology, genetics, disease, neurobiology, behaviour, physiology and evolution.
Abstract: Comparative analysis of multiple genomes in a phylogenetic framework dramatically improves the precision and sensitivity of evolutionary inference, producing more robust results than single-genome analyses can provide. The genomes of 12 Drosophila species, ten of which are presented here for the first time (sechellia, simulans, yakuba, erecta, ananassae, persimilis, willistoni, mojavensis, virilis and grimshawi), illustrate how rates and patterns of sequence divergence across taxa can illuminate evolutionary processes on a genomic scale. These genome sequences augment the formidable genetic tools that have made Drosophila melanogaster a pre-eminent model for animal genetics, and will further catalyse fundamental research on mechanisms of development, cell biology, genetics, disease, neurobiology, behaviour, physiology and evolution. Despite remarkable similarities among these Drosophila species, we identified many putatively non-neutral changes in protein-coding genes, non-coding RNA genes, and cis-regulatory regions. These may prove to underlie differences in the ecology and behaviour of these diverse species.

2,057 citations

Journal Article
10 May 2007-Nature
TL;DR: A high-quality draft of the genome sequence of the grey, short-tailed opossum is reported, indicating a strong influence of biased gene conversion on nucleotide sequence composition, and a relationship between chromosomal characteristics and X chromosome inactivation.
Abstract: We report a high-quality draft of the genome sequence of the grey, short-tailed opossum (Monodelphis domestica). As the first metatherian ('marsupial') species to be sequenced, the opossum provides a unique perspective on the organization and evolution of mammalian genomes. Distinctive features of the opossum chromosomes provide support for recent theories about genome evolution and function, including a strong influence of biased gene conversion on nucleotide sequence composition, and a relationship between chromosomal characteristics and X chromosome inactivation. Comparison of opossum and eutherian genomes also reveals a sharp difference in evolutionary innovation between protein-coding and non-coding functional elements. True innovation in protein-coding genes seems to be relatively rare, with lineage-specific differences being largely due to diversification and rapid turnover in gene families involved in environmental interactions. In contrast, about 20% of eutherian conserved non-coding elements (CNEs) are recent inventions that postdate the divergence of Eutheria and Metatheria. A substantial proportion of these eutherian-specific CNEs arose from sequence inserted by transposable elements, pointing to transposons as a major creative force in the evolution of mammalian gene regulation.

724 citations

Journal Article
TL;DR: The complete genome sequence of an acetate-utilizing methanogen, Methanosarcina acetivorans C2A, is reported, which indicates the likelihood of undiscovered natural energy sources for methanogenesis, whereas the presence of single-subunit carbon monoxide dehydrogenases raises the possibility of nonmethanogenic growth.
Abstract: The Archaea remain the most poorly understood domain of life despite their importance to the biosphere. Methanogenesis, which plays a pivotal role in the global carbon cycle, is unique to the Archaea. Each year, an estimated 900 million metric tons of methane are biologically produced, representing the major global source for this greenhouse gas and contributing significantly to global warming (Schlesinger 1997). Methanogenesis is critical to the waste-treatment industry and biologically produced methane also represents an important alternative fuel source. At least two-thirds of the methane in nature is derived from acetate, although only two genera of methanogens are known to be capable of utilizing this substrate. We report here the first complete genome sequence of an acetate-utilizing (acetoclastic) methanogen, Methanosarcina acetivorans C2A. The Methanosarcineae are metabolically and physiologically the most versatile methanogens. Only Methanosarcina species possess all three known pathways for methanogenesis (Fig. 1) and are capable of utilizing no less than nine methanogenic substrates, including acetate. In contrast, all other orders of methanogens possess a single pathway for methanogenesis, and many utilize no more than two substrates. Among methanogens, the Methanosarcineae also display extensive environmental diversity. Individual species of Methanosarcina have been found in freshwater and marine sediments, decaying leaves and garden soils, oil wells, sewage and animal waste digesters and lagoons, thermophilic digesters, feces of herbivorous animals, and the rumens of ungulates (Zinder 1993).
Figure 1. Three pathways for methanogenesis. Methanogenesis is a form of anaerobic respiration using a variety of one-carbon (C-1) compounds or acetic acid as a terminal electron acceptor. All three pathways converge on the reduction of methyl-CoM to methane (CH ...
The Methanosarcineae are unique among the Archaea in forming complex multicellular structures during different phases of growth and in response to environmental change (Fig. 2). Within the Methanosarcineae, a number of distinct morphological forms have been characterized, including single cells with and without a cell envelope, as well as multicellular packets and lamina (Macario and Conway de Macario 2001). Packets and lamina display internal morphological heterogeneity, suggesting the possibility of cellular differentiation. Moreover, it has been suggested that cells within lamina may display differential production of extracellular material, a potential form of cellular specialization (Macario and Conway de Macario 2001). The formation of multicellular structures has been proposed to act as an adaptation to stress and likely plays a role in the ability of Methanosarcina species to colonize diverse environments.
Figure 2. Different morphological forms of Methanosarcina acetivorans. Thin-section electron micrographs showing M. acetivorans growing as both single cells (center of micrograph) and within multicellular aggregates (top left, bottom right). Cells were harvested ...
Significantly, powerful methods for genetic analysis exist for Methanosarcina species. These tools include plasmid shuttle vectors (Metcalf et al. 1997), very high efficiency transformation (Metcalf et al. 1997), random in vivo transposon mutagenesis (Zhang et al. 2000), directed mutagenesis of specific genes (Zhang et al. 2000), multiple selectable markers (Boccazzi et al. 2000), reporter gene fusions (M. Pritchett and W. Metcalf, unpubl.), integration vectors (Conway de Macario et al. 1996), and anaerobic incubators for large-scale growth of methanogens on solid media (Metcalf et al. 1998).
Furthermore, and in contrast to other known methanogens, genetic analysis can be used to study the process of methanogenesis: Because Methanosarcina species are able to utilize each of the three known methanogenic pathways, mutants in a single pathway are viable (M. Pritchett and W. Metcalf, unpubl.). The availability of genetic methods allowing immediate exploitation of genomic sequence, coupled with the genetic, physiological, and environmental diversity of M. acetivorans make this species an outstanding model organism for the study of archaeal biology. For these reasons, we set out to study the genome of M. acetivorans.

626 citations

Journal Article
TL;DR: An automated, highly scalable method for carrying out the Solution Hybrid Selection capture approach that provides a dramatic increase in scale and throughput of sequence-ready libraries produced is presented.
Abstract: Genome targeting methods enable cost-effective capture of specific subsets of the genome for sequencing. We present here an automated, highly scalable method for carrying out the Solution Hybrid Selection capture approach that provides a dramatic increase in scale and throughput of sequence-ready libraries produced. Significant process improvements and a series of in-process quality control checkpoints are also added. These process improvements can also be used in a manual version of the protocol.

618 citations


Cited by
Journal Article
Adam Auton, Gonçalo R. Abecasis, David Altshuler, Richard Durbin, +514 more (90 institutions)
01 Oct 2015-Nature
TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping.
Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

12,661 citations

Journal Article
21 Apr 2006-Cell
TL;DR: It is proposed that bivalent domains silence developmental genes in ES cells while keeping them poised for activation, highlighting the importance of DNA sequence in defining the initial epigenetic landscape and suggesting a novel chromatin-based mechanism for maintaining pluripotency.

5,131 citations

Journal Article
14 Jun 2007-Nature
TL;DR: Functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project are reported, providing convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts.
Abstract: We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.

5,091 citations

Journal Article
10 Oct 2013-Cell
TL;DR: Correlative analyses confirm that the survival advantage of the proneural subtype is conferred by the G-CIMP phenotype, and MGMT DNA methylation may be a predictive biomarker for treatment response only in classical subtype GBM.

3,593 citations

01 Jan 2013
TL;DR: In this article, the landscape of somatic genomic alterations based on multidimensional and comprehensive characterization of more than 500 glioblastoma tumors (GBMs) was described, including several novel mutated genes as well as complex rearrangements of signature receptors, including EGFR and PDGFRA.
Abstract: We describe the landscape of somatic genomic alterations based on multidimensional and comprehensive characterization of more than 500 glioblastoma tumors (GBMs). We identify several novel mutated genes as well as complex rearrangements of signature receptors, including EGFR and PDGFRA. TERT promoter mutations are shown to correlate with elevated mRNA expression, supporting a role in telomerase reactivation. Correlative analyses confirm that the survival advantage of the proneural subtype is conferred by the G-CIMP phenotype, and MGMT DNA methylation may be a predictive biomarker for treatment response only in classical subtype GBM. Integrative analysis of genomic and proteomic profiles challenges the notion of therapeutic inhibition of a pathway as an alternative to inhibition of the target itself. These data will facilitate the discovery of therapeutic and diagnostic target candidates, the validation of research and clinical observations and the generation of unanticipated hypotheses that can advance our molecular understanding of this lethal cancer.

2,616 citations