scispace - formally typeset
Search or ask a question
Author

Samuel Deutsch

Bio: Samuel Deutsch is an academic researcher from Joint Genome Institute. The author has contributed to research in topics: Chromosome 21 & Gene. The author has an hindex of 37, co-authored 84 publications receiving 6139 citations. Previous affiliations of Samuel Deutsch include Lawrence Berkeley National Laboratory & United States Department of Energy.


Papers
More filters
Journal ArticleDOI
04 Sep 2009-Science
TL;DR: The data suggest that the complete regulatory variant repertoire can only be uncovered in the context of cell-type specificity, and identifies multiple expressive quantitative trait loci per gene, unique or shared among cell types and positively correlated with the number of transcripts per gene.
Abstract: Studies correlating genetic variation to gene expression facilitate the interpretation of common human phenotypes and disease. As functional variants may be operating in a tissue-dependent manner, we performed gene expression profiling and association with genetic variants (single-nucleotide polymorphisms) on three cell types of 75 individuals. We detected cell type-specific genetic effects, with 69 to 80% of regulatory variants operating in a cell type-specific manner, and identified multiple expressive quantitative trait loci (eQTLs) per gene, unique or shared among cell types and positively correlated with the number of transcripts per gene. Cell type-specific eQTLs were found at larger distances from genes and at lower effect size, similar to known enhancers. These data suggest that the complete regulatory variant repertoire can only be uncovered in the context of cell-type specificity.

785 citations

Journal ArticleDOI
TL;DR: Comparative genomics is beginning to identify the functional components of the chromosome and that in turn will set the stage for the functional characterization of the sequences.
Abstract: The sequence of chromosome 21 was a turning point for the understanding of Down syndrome. Comparative genomics is beginning to identify the functional components of the chromosome and that in turn will set the stage for the functional characterization of the sequences. Animal models combined with genome-wide analytical methods have proved indispensable for unravelling the mysteries of gene dosage imbalance.

613 citations

Journal ArticleDOI
TL;DR: The results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I) HapMap has sufficient density to enable linkage disequilibrium mapping in humans.
Abstract: The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12–13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs) with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis-) to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I) HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.

553 citations

Journal ArticleDOI
08 May 2009-PLOS ONE
TL;DR: This analysis revealed that the genetic structure of the European population correlates closely with geography, and allows the creation of a comprehensive European genetic map that will greatly facilitate inter-population genetic studies including genome wide association studies (GWAS).
Abstract: Using principal component (PC) analysis, we studied the genetic constitution of 3,112 individuals from Europe as portrayed by more than 270,000 single nucleotide polymorphisms (SNPs) genotyped with the Illumina Infinium platform. In cohorts where the sample size was >100, one hundred randomly chosen samples were used for analysis to minimize the sample size effect, resulting in a total of 1,564 samples. This analysis revealed that the genetic structure of the European population correlates closely with geography. The first two PCs highlight the genetic diversity corresponding to the northwest to southeast gradient and position the populations according to their approximate geographic origin. The resulting genetic map forms a triangular structure with a) Finland, b) the Baltic region, Poland and Western Russia, and c) Italy as its vertexes, and with d) Central- and Western Europe in its centre. Inter- and intra- population genetic differences were quantified by the inflation factor lambda (lambda) (ranging from 1.00 to 4.21), fixation index (F(st)) (ranging from 0.000 to 0.023), and by the number of markers exhibiting significant allele frequency differences in pair-wise population comparisons. The estimated lambda was used to assess the real diminishing impact to association statistics when two distinct populations are merged directly in an analysis. When the PC analysis was confined to the 1,019 Estonian individuals (0.1% of the Estonian population), a fine structure emerged that correlated with the geography of individual counties. With at least two cohorts available from several countries, genetic substructures were investigated in Czech, Finnish, German, Estonian and Italian populations. Together with previously published data, our results allow the creation of a comprehensive European genetic map that will greatly facilitate inter-population genetic studies including genome wide association studies (GWAS).

440 citations

Journal ArticleDOI
TL;DR: The 1166C allele may be functionally associated with hypertension by abrogating regulation by hsa-miR-155, thereby elevating AGTR1 levels, and mapping annotated SNPs onto a collection of experimentally supported human miRNA targets found one of these target sites containing SNP rs5186.
Abstract: Animal microRNAs (miRNAs) regulate gene expression through base pairing to their targets within the 3′ untranslated region (UTR) of protein-coding genes. Single-nucleotide polymorphisms (SNPs) located within such target sites can affect miRNA regulation. We mapped annotated SNPs onto a collection of experimentally supported human miRNA targets. Of the 143 experimentally supported human target sites, 9 contain 12 SNPs. We further experimentally investigated one of these target sites for hsa-miR-155, within the 3′ UTR of the human AGTR1 gene that contains SNP rs5186. Using reporter silencing assays, we show that hsa-miR-155 down-regulates the expression of only the 1166A, and not the 1166C, allele of rs5186. Remarkably, the 1166C allele has been associated with hypertension in many studies. Thus, the 1166C allele may be functionally associated with hypertension by abrogating regulation by hsa-miR-155, thereby elevating AGTR1 levels. Since hsa-miR-155 is on chromosome 21, we hypothesize that the observed lower blood pressure in trisomy 21 is partially caused by the overexpression of hsa-miR-155 leading to allele-specific underexpression of AGTR1. Indeed, we have shown in fibroblasts from monozygotic twins discordant for trisomy 21 that levels of AGTR1 protein are lower in trisomy 21.

355 citations


Cited by
More filters
01 Jun 2012
TL;DR: SPAdes as mentioned in this paper is a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler and on popular assemblers Velvet and SoapDeNovo (for multicell data).
Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.

10,124 citations

Journal ArticleDOI
Robert H. Waterston1, Kerstin Lindblad-Toh2, Ewan Birney, Jane Rogers3  +219 moreInstitutions (26)
05 Dec 2002-Nature
TL;DR: The results of an international collaboration to produce a high-quality draft sequence of the mouse genome are reported and an initial comparative analysis of the Mouse and human genomes is presented, describing some of the insights that can be gleaned from the two sequences.
Abstract: The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of the genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.

6,643 citations

Journal ArticleDOI
John T. Lonsdale, Jeffrey Thomas, Mike Salvatore, Rebecca Phillips, Edmund Lo, Saboor Shad, Richard Hasz, Gary Walters, Fernando U. Garcia1, Nancy Young2, Barbara A. Foster3, Mike Moser3, Ellen Karasik3, Bryan Gillard3, Kimberley Ramsey3, Susan L. Sullivan, Jason Bridge, Harold Magazine, John Syron, Johnelle Fleming, Laura A. Siminoff4, Heather M. Traino4, Maghboeba Mosavel4, Laura Barker4, Scott D. Jewell5, Daniel C. Rohrer5, Dan Maxim5, Dana Filkins5, Philip Harbach5, Eddie Cortadillo5, Bree Berghuis5, Lisa Turner5, Eric Hudson5, Kristin Feenstra5, Leslie H. Sobin6, James A. Robb6, Phillip Branton, Greg E. Korzeniewski6, Charles Shive6, David Tabor6, Liqun Qi6, Kevin Groch6, Sreenath Nampally6, Steve Buia6, Angela Zimmerman6, Anna M. Smith6, Robin Burges6, Karna Robinson6, Kim Valentino6, Deborah Bradbury6, Mark Cosentino6, Norma Diaz-Mayoral6, Mary Kennedy6, Theresa Engel6, Penelope Williams6, Kenyon Erickson, Kristin G. Ardlie7, Wendy Winckler7, Gad Getz8, Gad Getz7, David S. DeLuca7, MacArthur Daniel MacArthur7, MacArthur Daniel MacArthur8, Manolis Kellis7, Alexander Thomson7, Taylor Young7, Ellen Gelfand7, Molly Donovan7, Yan Meng7, George B. Grant7, Deborah C. Mash9, Yvonne Marcus9, Margaret J. Basile9, Jun Liu8, Jun Zhu10, Zhidong Tu10, Nancy J. Cox11, Dan L. Nicolae11, Eric R. Gamazon11, Hae Kyung Im11, Anuar Konkashbaev11, Jonathan K. Pritchard11, Jonathan K. Pritchard12, Matthew Stevens11, Timothée Flutre11, Xiaoquan Wen11, Emmanouil T. Dermitzakis13, Tuuli Lappalainen13, Roderic Guigó, Jean Monlong, Michael Sammeth, Daphne Koller14, Alexis Battle14, Sara Mostafavi14, Mark I. McCarthy15, Manual Rivas15, Julian Maller15, Ivan Rusyn16, Andrew B. Nobel16, Fred A. Wright16, Andrey A. Shabalin16, Mike Feolo17, Nataliya Sharopova17, Anne Sturcke17, Justin Paschal17, James M. Anderson17, Elizabeth L. Wilder17, Leslie Derr17, Eric D. Green17, Jeffery P. Struewing17, Gary F. Temple17, Simona Volpi17, Joy T. Boyer17, Elizabeth J. Thomson17, Mark S. Guyer17, Cathy Ng17, Assya Abdallah17, Deborah Colantuoni17, Thomas R. Insel17, Susan E. Koester17, Roger Little17, Patrick Bender17, Thomas Lehner17, Yin Yao17, Carolyn C. Compton17, Jimmie B. Vaught17, Sherilyn Sawyer17, Nicole C. Lockhart17, Joanne P. Demchok17, Helen F. Moore17 
TL;DR: The Genotype-Tissue Expression (GTEx) project is described, which will establish a resource database and associated tissue bank for the scientific community to study the relationship between genetic variation and gene expression in human tissues.
Abstract: Genome-wide association studies have identified thousands of loci for common diseases, but, for the majority of these, the mechanisms underlying disease susceptibility remain unknown. Most associated variants are not correlated with protein-coding changes, suggesting that polymorphisms in regulatory regions probably contribute to many disease phenotypes. Here we describe the Genotype-Tissue Expression (GTEx) project, which will establish a resource database and associated tissue bank for the scientific community to study the relationship between genetic variation and gene expression in human tissues.

6,545 citations

Journal ArticleDOI
John W. Belmont1, Andrew Boudreau, Suzanne M. Leal1, Paul Hardenbol  +229 moreInstitutions (40)
27 Oct 2005
TL;DR: A public database of common variation in the human genome: more than one million single nucleotide polymorphisms for which accurate and complete genotypes have been obtained in 269 DNA samples from four populations, including ten 500-kilobase regions in which essentially all information about common DNA variation has been extracted.
Abstract: Inherited genetic variation has a critical but as yet largely uncharacterized role in human disease. Here we report a public database of common variation in the human genome: more than one million single nucleotide polymorphisms (SNPs) for which accurate and complete genotypes have been obtained in 269 DNA samples from four populations, including ten 500-kilobase regions in which essentially all information about common DNA variation has been extracted. These data document the generality of recombination hotspots, a block-like structure of linkage disequilibrium and low haplotype diversity, leading to substantial correlations of SNPs with many of their neighbours. We show how the HapMap resource can guide the design and analysis of genetic association studies, shed light on structural variation and recombination, and identify loci that may have been subject to natural selection during human evolution.

5,479 citations

Journal ArticleDOI
14 Jun 2007-Nature
TL;DR: Functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project are reported, providing convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts.
Abstract: We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.

5,091 citations