Showing papers by "Jorja G. Henikoff published in 2010"
••
Broad Institute1, Harvard University2, University of Chicago3, Duke University4, Lawrence Berkeley National Laboratory5, Massachusetts Institute of Technology6, Ontario Institute for Cancer Research7, University of Florida8, University of Liège9, Memorial Sloan Kettering Cancer Center10, Utrecht University11, University of Cambridge12, National Institutes of Health13, University of California, Berkeley14, Cold Spring Harbor Laboratory15, University of Connecticut16, Washington University in St. Louis17, Affymetrix18, Indiana University19, University of California, Santa Cruz20, Rutgers University21, University of California, San Diego22, Fred Hutchinson Cancer Research Center23
TL;DR: The Drosophila Encyclopedia of DNA Elements (modENCODE) project as mentioned in this paper has been used to map transcripts, histone modifications, chromosomal proteins, transcription factors, replication proteins and intermediates, and nucleosome properties across a developmental time course and in multiple cell lines.
Abstract: To gain insight into how genomic information is translated into cellular and developmental programs, the Drosophila model organism Encyclopedia of DNA Elements (modENCODE) project is comprehensively mapping transcripts, histone modifications, chromosomal proteins, transcription factors, replication proteins and intermediates, and nucleosome properties across a developmental time course and in multiple cell lines. We have generated more than 700 data sets and discovered protein-coding, noncoding, RNA regulatory, replication, and chromatin elements, more than tripling the annotated portion of the Drosophila genome. Correlated activity patterns of these elements reveal a functional regulatory network, which predicts putative new functions for genes, reveals stage- and tissue-specific regulators, and enables gene-expression prediction. Our results provide a foundation for directed experimental and computational studies in Drosophila and related species and also a model for systematic data integration toward comprehensive genomic and functional annotation.
1,102 citations
••
Yale University1, Stanford University2, University of Toronto3, Harvard University4, University of California, Santa Cruz5, University of North Carolina at Chapel Hill6, Ontario Institute for Cancer Research7, New York University8, Stony Brook University9, University of Cambridge10, Hoffmann-La Roche11, Lawrence Berkeley National Laboratory12, University of California, San Diego13, Rockefeller University14, University of Washington15, National Institutes of Health16, University of Michigan17, Fred Hutchinson Cancer Research Center18, Max Planck Society19, Memorial Sloan Kettering Cancer Center20, Weizmann Institute of Science21, Max Delbrück Center for Molecular Medicine22, Cold Spring Harbor Laboratory23, Vanderbilt University24, New York University Abu Dhabi25
TL;DR: These studies identified regions of the nematode and fly genomes that show highly occupied targets (or HOT) regions where DNA was bound by more than 15 of the transcription factors analyzed and the expression of related genes were characterized, providing insights into the organization, structure, and function of the two genomes.
Abstract: We systematically generated large-scale data sets to improve genome annotation for the nematode Caenorhabditis elegans, a key model organism. These data sets include transcriptome profiling across a developmental time course, genome-wide identification of transcription factor-binding sites, and maps of chromatin organization. From this, we created more complete and accurate gene models, including alternative splice forms and candidate noncoding RNAs. We constructed hierarchical networks of transcription factor-binding and microRNA interactions and discovered chromosomal locations bound by an unusually large number of transcription factors. Different patterns of chromatin composition and histone modification were revealed between chromosome arms and centers, with similarly prominent differences between autosomes and the X chromosome. Integrating data types, we built statistical models relating chromatin, transcription factor binding, and gene expression. Overall, our analyses ascribed putative functions to most of the conserved genome.
978 citations
••
TL;DR: It is found that nucleosome turnover is most rapid over active gene bodies, epigenetic regulatory elements, and replication origins in Drosophila cells, and nucleosom turnover differences underlie their opposing activities and challenging models for epigenetic inheritance.
Abstract: Nucleosome disruption and replacement are crucial activities that maintain epigenomes, but these highly dynamic processes have been difficult to study. Here, we describe a direct method for measuring nucleosome turnover dynamics genome-wide. We found that nucleosome turnover is most rapid over active gene bodies, epigenetic regulatory elements, and replication origins in Drosophila cells. Nucleosomes turn over faster at sites for trithorax-group than polycomb-group protein binding, suggesting that nucleosome turnover differences underlie their opposing activities and challenging models for epigenetic inheritance that rely on stability of histone marks. Our results establish a general strategy for studying nucleosome dynamics and uncover nucleosome turnover differences across the genome that are likely to have functional importance for epigenome maintenance, gene regulation, and control of DNA replication.
462 citations
01 Jan 2010
TL;DR: A map demarcating the boundaries of gene regulatory units and a framework for understanding insulator function during the development and evolution of Drosophila are provided.
Abstract: Insulators are DNA sequences that control the interactions among genomic regulatory elements and act as chromatin boundaries. A thorough understanding of their location and function is necessary to address the complexities of metazoan gene regulation. We studied by ChIP–chip the genome-wide binding sites of 6 insulator-associated proteins—dCTCF, CP190, BEAF-32, Su(Hw), Mod(mdg4), and GAF—to obtain the first comprehensive map of insulator elements in Drosophila embryos. We identify over 14,000 putative insulators, including all classically defined insulators. We find two major classes of insulators defined by dCTCF/CP190/BEAF-32 and Su(Hw), respectively. Distributional analyses of insulators revealed that particular sub-classes of insulator elements are excluded between cis-regulatory elements and their target promoters; divide differentially expressed, alternative, and divergent promoters; act as chromatin boundaries; are associated with chromosomal breakpoints among species; and are embedded within active chromatin domains. Together, these results provide a map demarcating the boundaries of gene regulatory units and a framework for understanding insulator function during the development and evolution of Drosophila.
316 citations
••
TL;DR: In this paper, the genome-wide binding sites of 6 insulator-associated proteins (dCTCF, CP190, BEAF-32, Su(Hw), Mod(mdg4), and GAF) were studied to obtain the first comprehensive map of insulator elements in Drosophila embryos.
Abstract: Insulators are DNA sequences that control the interactions among genomic regulatory elements and act as chromatin boundaries. A thorough understanding of their location and function is necessary to address the complexities of metazoan gene regulation. We studied by ChIP–chip the genome-wide binding sites of 6 insulator-associated proteins—dCTCF, CP190, BEAF-32, Su(Hw), Mod(mdg4), and GAF—to obtain the first comprehensive map of insulator elements in Drosophila embryos. We identify over 14,000 putative insulators, including all classically defined insulators. We find two major classes of insulators defined by dCTCF/CP190/BEAF-32 and Su(Hw), respectively. Distributional analyses of insulators revealed that particular sub-classes of insulator elements are excluded between cis-regulatory elements and their target promoters; divide differentially expressed, alternative, and divergent promoters; act as chromatin boundaries; are associated with chromosomal breakpoints among species; and are embedded within active chromatin domains. Together, these results provide a map demarcating the boundaries of gene regulatory units and a framework for understanding insulator function during the development and evolution of Drosophila.
309 citations
••
TL;DR: The results suggest that H2A.Z enrichment patterns result from intrinsic structural differences between heterotypic and homotypic H2a.Z nucleosomes that follow disruption during transcriptional elongation.
Abstract: Nucleosomes that contain the histone variant H2A.Z are enriched around transcriptional start sites, but the mechanistic basis for this enrichment is unknown. A single octameric nucleosome can contain two H2A.Z histones (homotypic) or one H2A.Z and one canonical H2A (heterotypic). To elucidate the function of H2A.Z, we generated high-resolution maps of homotypic and heterotypic Drosophila H2A.Z (H2Av) nucleosomes. Although homotypic and heterotypic H2A.Z nucleosomes mapped throughout most of the genome, homotypic nucleosomes were enriched and heterotypic nucleosomes were depleted downstream of active promoters and intron-exon junctions. The distribution of homotypic H2A.Z nucleosomes resembled that of classical active chromatin and showed evidence of disruption during transcriptional elongation. Both homotypic H2A.Z nucleosomes and classical active chromatin were depleted downstream of paused polymerases. Our results suggest that H2A.Z enrichment patterns result from intrinsic structural differences between heterotypic and homotypic H2A.Z nucleosomes that follow disruption during transcriptional elongation.
131 citations
•
TL;DR: The modENCODE project as mentioned in this paper has been used to study the functional regulatory network of Drosophila melanogaster and Caenorhabditis elegans worms, including the binding sites of transcription factors.
Abstract: From Genome to Regulatory Networks For biologists, having a genome in hand is only the beginning—much more investigation is still needed to characterize how the genome is used to help to produce a functional organism (see the Perspective by Blaxter). In this vein, Gerstein et al. (p. 1775) summarize for the Caenorhabditis elegans genome, and The modENCODE Consortium (p. 1787) summarize for the Drosophila melanogaster genome, full transcriptome analyses over developmental stages, genome-wide identification of transcription factor binding sites, and high-resolution maps of chromatin organization. Both studies identified regions of the nematode and fly genomes that show highly occupied targets (or HOT) regions where DNA was bound by more than 15 of the transcription factors analyzed and the expression of related genes were characterized. Overall, the studies provide insights into the organization, structure, and function of the two genomes and provide basic information needed to guide and correlate both focused and genome-wide studies. The Drosophila modENCODE project demonstrates the functional regulatory network of flies. To gain insight into how genomic information is translated into cellular and developmental programs, the Drosophila model organism Encyclopedia of DNA Elements (modENCODE) project is comprehensively mapping transcripts, histone modifications, chromosomal proteins, transcription factors, replication proteins and intermediates, and nucleosome properties across a developmental time course and in multiple cell lines. We have generated more than 700 data sets and discovered protein-coding, noncoding, RNA regulatory, replication, and chromatin elements, more than tripling the annotated portion of the Drosophila genome. Correlated activity patterns of these elements reveal a functional regulatory network, which predicts putative new functions for genes, reveals stage- and tissue-specific regulators, and enables gene-expression prediction. Our results provide a foundation for directed experimental and computational studies in Drosophila and related species and also a model for systematic data integration toward comprehensive genomic and functional annotation.
81 citations
••
TL;DR: An in vivo tagging system coupled to chromatin purification for genome-wide epigenetic profiling in Caenorhabditis elegans is described and it is found that embryonic chromatin is differentially extracted with increasing salt concentrations.
Abstract: High-resolution mapping of chromatin features has emerged as an important strategy for understanding gene regulation and epigenetic inheritance. We describe an in vivo tagging system coupled to chromatin purification for genome-wide epigenetic profiling in Caenorhabditis elegans. In this system, we coexpressed the Escherichia coli biotin ligase enzyme (BirA), together with the C. elegans H3.3 gene fused to BioTag, a 23-amino-acid peptide serving as a biotinylation substrate for BirA, in vivo in worms. We found that the fusion BioTag::H3.3 was efficiently biotinylated in vivo. We developed methods to isolate chromatin under different salt extraction conditions, followed by affinity purification of biotinylated chromatin with streptavidin and genome-wide profiling with microarrays. We found that embryonic chromatin is differentially extracted with increasing salt concentrations. Interestingly, chromatin that remains insoluble after washing in 600 mM salt is enriched at 5' and 3' ends, suggesting the presence of large protein complexes that render chromatin insoluble at transcriptional initiation and termination sites. We also found that H3.3 landscapes from these salt fractions display consistent features that correlate with gene activity: the most highly expressed genes contain the most H3.3. This versatile two-component approach has the potential of facilitating genome-wide chromatin dynamics and regulatory site identification in C. elegans.
66 citations