scispace - formally typeset
Search or ask a question
Author

Daniel Bates

Bio: Daniel Bates is an academic researcher from University of Washington. The author has contributed to research in topics: Human genome & Gene. The author has an hindex of 16, co-authored 24 publications receiving 11337 citations.

Papers
More filters
Journal ArticleDOI
07 Sep 2012-Science
TL;DR: P pervasive involvement of regulatory DNA variation in common human disease and provide pathogenic insights into diverse disorders are suggested.
Abstract: Genome-wide association studies have identified many noncoding variants associated with common diseases and traits. We show that these variants are concentrated in regulatory DNA marked by deoxyribonuclease I (DNase I) hypersensitive sites (DHSs). Eighty-eight percent of such DHSs are active during fetal development and are enriched in variants associated with gestational exposure–related phenotypes. We identified distant gene targets for hundreds of variant-containing DHSs that may explain phenotype associations. Disease-associated variants systematically perturb transcription factor recognition sequences, frequently alter allelic chromatin states, and form regulatory networks. We also demonstrated tissue-selective enrichment of more weakly disease-associated variants within DHSs and the de novo identification of pathogenic cell types for Crohn’s disease, multiple sclerosis, and an electrocardiogram trait, without prior knowledge of physiological mechanisms. Our results suggest pervasive involvement of regulatory DNA variation in common human disease and provide pathogenic insights into diverse disorders.

3,177 citations

01 Sep 2012
TL;DR: The Encyclopedia of DNA Elements project provides new insights into the organization and regulation of the authors' genes and genome, and is an expansive resource of functional annotations for biomedical research.

2,767 citations

Journal ArticleDOI
06 Sep 2012-Nature
TL;DR: The first extensive map of human DHSs identified through genome-wide profiling in 125 diverse cell and tissue types is presented, revealing novel relationships between chromatin accessibility, transcription, DNA methylation and regulatory factor occupancy patterns.
Abstract: DNase I hypersensitive sites (DHSs) are markers of regulatory DNA and have underpinned the discovery of all classes of cis-regulatory elements including enhancers, promoters, insulators, silencers and locus control regions. Here we present the first extensive map of human DHSs identified through genome-wide profiling in 125 diverse cell and tissue types. We identify ∼2.9 million DHSs that encompass virtually all known experimentally validated cis-regulatory sequences and expose a vast trove of novel elements, most with highly cell-selective regulation. Annotating these elements using ENCODE data reveals novel relationships between chromatin accessibility, transcription, DNA methylation and regulatory factor occupancy patterns. We connect ∼580,000 distal DHSs with their target promoters, revealing systematic pairing of different classes of distal DHSs and specific promoter types. Patterning of chromatin accessibility at many regulatory regions is organized with dozens to hundreds of co-activated elements, and the transcellular DNase I sensitivity pattern at a given region can predict cell-type-specific functional behaviours. The DHS landscape shows signatures of recent functional evolutionary constraint. However, the DHS compartment in pluripotent and immortalized cells exhibits higher mutation rates than that in highly differentiated cells, exposing an unexpected link between chromatin accessibility, proliferative potential and patterns of human variation. An extensive map of human DNase I hypersensitive sites, markers of regulatory DNA, in 125 diverse cell and tissue types is described; integration of this information with other ENCODE-generated data sets identifies new relationships between chromatin accessibility, transcription, DNA methylation and regulatory factor occupancy patterns. This paper describes the first extensive map of human DNaseI hypersensitive sites — markers of regulatory DNA — in 125 diverse cell and tissue types. Integration of this information with other data sets generated by ENCODE (Encyclopedia of DNA Elements) identified new relationships between chromatin accessibility, transcription, DNA methylation and regulatory-factor occupancy patterns. Evolutionary-conservation analysis revealed signatures of recent functional constraint within DNaseI hypersensitive sites.

2,628 citations

Journal ArticleDOI
Feng Yue1, Feng Yue2, Yong Cheng3, Alessandra Breschi, Jeff Vierstra4, Weisheng Wu5, Weisheng Wu1, Tyrone Ryba6, Tyrone Ryba7, Richard Sandstrom4, Zhihai Ma3, Carrie A. Davis8, Benjamin D. Pope6, Yin Shen2, Dmitri D. Pervouchine, Sarah Djebali, Robert E. Thurman4, Rajinder Kaul4, Eric Rynes4, Anthony Kirilusha9, Georgi K. Marinov9, Brian A. Williams9, Diane Trout9, Henry Amrhein9, Katherine I. Fisher-Aylor9, Igor Antoshechkin9, Gilberto DeSalvo9, Lei Hoon See8, Meagan Fastuca8, Jorg Drenkow8, Chris Zaleski8, Alexander Dobin8, Pablo Prieto, Julien Lagarde, Giovanni Bussotti, Andrea Tanzer10, Olgert Denas11, Kanwei Li11, M. A. Bender12, M. A. Bender4, Miaohua Zhang12, Rachel Byron12, Mark Groudine4, Mark Groudine12, David McCleary2, Long Pham2, Zhen Ye2, Samantha Kuan2, Lee Edsall2, Yi-Chieh Wu13, Matthew D. Rasmussen13, Mukul S. Bansal13, Manolis Kellis13, Manolis Kellis14, Cheryl A. Keller1, Christapher S. Morrissey1, Tejaswini Mishra1, Deepti Jain1, Nergiz Dogan1, Robert S. Harris1, Philip Cayting3, Trupti Kawli3, Alan P. Boyle3, Alan P. Boyle5, Ghia Euskirchen3, Anshul Kundaje3, Shin Lin3, Yiing Lin3, Camden Jansen15, Venkat S. Malladi3, Melissa S. Cline16, Drew T. Erickson3, Vanessa M. Kirkup16, Katrina Learned16, Cricket A. Sloan3, Kate R. Rosenbloom16, Beatriz Lacerda de Sousa17, Kathryn Beal, Miguel Pignatelli, Paul Flicek, Jin Lian18, Tamer Kahveci19, Dongwon Lee20, W. James Kent16, Miguel Santos17, Javier Herrero21, Cedric Notredame, Audra K. Johnson4, Shinny Vong4, Kristen Lee4, Daniel Bates4, Fidencio Neri4, Morgan Diegel4, Theresa K. Canfield4, Peter J. Sabo4, Matthew S. Wilken4, Thomas A. Reh4, Erika Giste4, Anthony Shafer4, Tanya Kutyavin4, Eric Haugen4, Douglas Dunn4, Alex Reynolds4, Shane Neph4, Richard Humbert4, R. Scott Hansen4, Marella F. T. R. de Bruijn22, Licia Selleri23, Alexander Y. Rudensky24, Steven Z. Josefowicz24, Robert M. Samstein24, Evan E. Eichler4, Stuart H. Orkin25, Dana N. Levasseur26, Thalia Papayannopoulou4, Kai Hsin Chang4, Arthur I. Skoultchi27, Srikanta Gosh27, Christine M. Disteche4, Piper M. Treuting4, Yanli Wang1, Mitchell J. Weiss, Gerd A. Blobel28, Xiaoyi Cao2, Sheng Zhong2, Ting Wang29, Peter J. Good30, Rebecca F. Lowdon29, Rebecca F. Lowdon30, Leslie B. Adams31, Leslie B. Adams30, Xiao Qiao Zhou30, Michael J. Pazin30, Elise A. Feingold30, Barbara J. Wold9, James Taylor11, Ali Mortazavi15, Sherman M. Weissman18, John A. Stamatoyannopoulos4, Michael Snyder3, Roderic Guigó, Thomas R. Gingeras8, David M. Gilbert6, Ross C. Hardison1, Michael A. Beer20, Bing Ren2 
20 Nov 2014-Nature
TL;DR: The mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types as mentioned in this paper.
Abstract: The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases

1,335 citations

Journal ArticleDOI
06 Sep 2012-Nature
TL;DR: A stereotyped 50-base-pair footprint is identified that precisely defines the site of transcript origination within thousands of human promoters, and a large collection of novel regulatory factor recognition motifs that are highly conserved in both sequence and function are described.
Abstract: Regulatory factor binding to genomic DNA protects the underlying sequence from cleavage by DNase I, leaving nucleotide-resolution footprints. Using genomic DNase I footprinting across 41 diverse cell and tissue types, we detected 45 million transcription factor occupancy events within regulatory regions, representing differential binding to 8.4 million distinct short sequence elements. Here we show that this small genomic sequence compartment, roughly twice the size of the exome, encodes an expansive repertoire of conserved recognition sequences for DNA-binding proteins that nearly doubles the size of the human cis-regulatory lexicon. We find that genetic variants affecting allelic chromatin states are concentrated in footprints, and that these elements are preferentially sheltered from DNA methylation. High-resolution DNase I cleavage patterns mirror nucleotide-level evolutionary conservation and track the crystallographic topography of protein-DNA interfaces, indicating that transcription factor structure has been evolutionarily imprinted on the human genome sequence. We identify a stereotyped 50-base-pair footprint that precisely defines the site of transcript origination within thousands of human promoters. Finally, we describe a large collection of novel regulatory factor recognition motifs that are highly conserved in both sequence and function, and exhibit cell-selective occupancy patterns that closely parallel major regulators of development, differentiation and pluripotency.

846 citations


Cited by
More filters
Journal ArticleDOI
06 Sep 2012-Nature
TL;DR: The Encyclopedia of DNA Elements project provides new insights into the organization and regulation of the authors' genes and genome, and is an expansive resource of functional annotations for biomedical research.
Abstract: The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall, the project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.

13,548 citations

Journal ArticleDOI
23 Jan 2015-Science
TL;DR: In this paper, a map of the human tissue proteome based on an integrated omics approach that involves quantitative transcriptomics at the tissue and organ level, combined with tissue microarray-based immunohistochemistry, to achieve spatial localization of proteins down to the single-cell level.
Abstract: Resolving the molecular details of proteome variation in the different tissues and organs of the human body will greatly increase our knowledge of human biology and disease. Here, we present a map of the human tissue proteome based on an integrated omics approach that involves quantitative transcriptomics at the tissue and organ level, combined with tissue microarray-based immunohistochemistry, to achieve spatial localization of proteins down to the single-cell level. Our tissue-based analysis detected more than 90% of the putative protein-coding genes. We used this approach to explore the human secretome, the membrane proteome, the druggable proteome, the cancer proteome, and the metabolic functions in 32 different tissues and organs. All the data are integrated in an interactive Web-based database that allows exploration of individual proteins, as well as navigation of global expression patterns, in all major tissues and organs in the human body.

9,745 citations

Journal ArticleDOI
15 Feb 2013-Science
TL;DR: The type II bacterial CRISPR system is engineer to function with custom guide RNA (gRNA) in human cells to establish an RNA-guided editing tool for facile, robust, and multiplexable human genome engineering.
Abstract: Bacteria and archaea have evolved adaptive immune defenses, termed clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) systems, that use short RNA to direct degradation of foreign nucleic acids. Here, we engineer the type II bacterial CRISPR system to function with custom guide RNA (gRNA) in human cells. For the endogenous AAVS1 locus, we obtained targeting rates of 10 to 25% in 293T cells, 13 to 8% in K562 cells, and 2 to 4% in induced pluripotent stem cells. We show that this process relies on CRISPR components; is sequence-specific; and, upon simultaneous introduction of multiple gRNAs, can effect multiplex editing of target loci. We also compute a genome-wide resource of ~190 K unique gRNAs targeting ~40.5% of human exons. Our results establish an RNA-guided editing tool for facile, robust, and multiplexable human genome engineering.

8,197 citations

Journal Article
01 Jan 2012-Nature
TL;DR: The Encyclopedia of DNA Elements project provides new insights into the organization and regulation of the authors' genes and genome, and is an expansive resource of functional annotations for biomedical research.
Abstract: The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall, the project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.

8,106 citations

Journal ArticleDOI
Stephan Ripke1, Stephan Ripke2, Benjamin M. Neale2, Benjamin M. Neale1  +351 moreInstitutions (102)
24 Jul 2014-Nature
TL;DR: Associations at DRD2 and several genes involved in glutamatergic neurotransmission highlight molecules of known and potential therapeutic relevance to schizophrenia, and are consistent with leading pathophysiological hypotheses.
Abstract: Schizophrenia is a highly heritable disorder. Genetic risk is conferred by a large number of alleles, including common alleles of small effect that might be detected by genome-wide association studies. Here we report a multi-stage schizophrenia genome-wide association study of up to 36,989 cases and 113,075 controls. We identify 128 independent associations spanning 108 conservatively defined loci that meet genome-wide significance, 83 of which have not been previously reported. Associations were enriched among genes expressed in brain, providing biological plausibility for the findings. Many findings have the potential to provide entirely new insights into aetiology, but associations at DRD2 and several genes involved in glutamatergic neurotransmission highlight molecules of known and potential therapeutic relevance to schizophrenia, and are consistent with leading pathophysiological hypotheses. Independent of genes expressed in brain, associations were enriched among genes expressed in tissues that have important roles in immunity, providing support for the speculated link between the immune system and schizophrenia.

6,809 citations