Epigenetic memory at embryonic enhancers identified in DNA methylation maps from adult mouse tissues
TL;DR: By mapping base-resolution methylomes in 17 adult mouse tissues at shallow coverage, this work identifies 302,864 tissue-specific differentially methylated regions (tsDMRs), estimates that >6.7% of the mouse genome is variably methylated, and suggests that epigenetic memory of embryonic development may be retained in adult tissues.
Abstract: Mammalian development requires cytosine methylation, a heritable epigenetic mark of cellular memory believed to maintain a cell's unique gene expression pattern. However, it remains unclear how dynamic DNA methylation relates to cell type-specific gene expression and animal development. Here, by mapping base-resolution methylomes in 17 adult mouse tissues at shallow coverage, we identify 302,864 tissue-specific differentially methylated regions (tsDMRs) and estimate that >6.7% of the mouse genome is variably methylated. Supporting a prominent role for DNA methylation in gene regulation, most tsDMRs occur at distal cis-regulatory elements. Unexpectedly, some tsDMRs mark enhancers that are dormant in adult tissues but active in embryonic development. These 'vestigial' enhancers are hypomethylated and lack active histone modifications in adult tissues but nevertheless exhibit activity during embryonic development. Our results provide new insights into the role of DNA methylation at tissue-specific enhancers and suggest that epigenetic memory of embryonic development may be retained in adult tissues.
Citations
••
TL;DR: Genomic maps of DNA methylation indicate that the underlying DNA sequence largely accounts for local patterns of methylation. As a result, this mark is highly informative when studying gene regulation in normal and diseased cells, and it can potentially function as a biomarker.
Abstract: Cytosine methylation is a DNA modification generally associated with transcriptional silencing. Factors that regulate methylation have been linked to human disease, yet how they contribute to malignancies remains largely unknown. Genomic maps of DNA methylation have revealed unexpected dynamics at gene regulatory regions, including active demethylation by TET proteins at binding sites for transcription factors. These observations indicate that the underlying DNA sequence largely accounts for local patterns of methylation. As a result, this mark is highly informative when studying gene regulation in normal and diseased cells, and it can potentially function as a biomarker. Although these findings challenge the view that methylation is generally instructive for gene silencing, several open questions remain, including how methylation is targeted and recognized and in what context it affects genome readout.
1,564 citations
••
TL;DR: Mapping genome-wide chromatin interactions in human embryonic stem cells and four human ES-cell-derived lineages reveals extensive chromatin reorganization during lineage specification, providing a global view of chromatin dynamics and a resource for studying long-range control of gene expression in distinct human cell lineages.
Abstract: Higher-order chromatin structure is emerging as an important regulator of gene expression. Although dynamic chromatin structures have been identified in the genome, the full scope of chromatin dynamics during mammalian development and lineage specification remains to be determined. By mapping genome-wide chromatin interactions in human embryonic stem (ES) cells and four human ES-cell-derived lineages, we uncover extensive chromatin reorganization during lineage specification. We observe that although self-associating chromatin domains are stable during differentiation, chromatin interactions both within and between domains change in a striking manner, altering 36% of active and inactive chromosomal compartments throughout the genome. By integrating chromatin interaction maps with haplotype-resolved epigenome and transcriptome data sets, we find widespread allelic bias in gene expression correlated with allele-biased chromatin states of linked promoters and distal enhancers. Our results therefore provide a global view of chromatin dynamics and a resource for studying long-range control of gene expression in distinct human cell lineages.
1,393 citations
••
TL;DR: The mechanisms and functions of DNA methylation and demethylation in both mice and humans at CpG-rich promoters, gene bodies and transposable elements are discussed and the dynamic erasure and re-establishment in embryonic, germline and somatic cell development is highlighted.
Abstract: DNA methylation is of paramount importance for mammalian embryonic development. DNA methylation has numerous functions: it is implicated in the repression of transposons and genes, but is also associated with actively transcribed gene bodies and, in some cases, with gene activation per se. In recent years, sensitive technologies have been developed that allow the interrogation of DNA methylation patterns from a small number of cells. The use of these technologies has greatly improved our knowledge of DNA methylation dynamics and heterogeneity in embryos and in specific tissues. Combined with genetic analyses, it is increasingly apparent that regulation of DNA methylation erasure and (re-)establishment varies considerably between different developmental stages. In this Review, we discuss the mechanisms and functions of DNA methylation and demethylation in both mice and humans at CpG-rich promoters, gene bodies and transposable elements. We highlight the dynamic erasure and re-establishment of DNA methylation in embryonic, germline and somatic cell development. Finally, we provide insights into DNA methylation gained from studying genetic diseases. DNA methylation is essential for mammalian embryogenesis owing to its repression of transposons and genes, but it is also associated with gene activation. The recent use of sensitive technologies has revealed that DNA methylation dynamics vary considerably between embryonic, germline and somatic cell development, with implications for genetic diseases and cancer.
1,039 citations
••
TL;DR: This article reports a single-cell bisulfite sequencing (scBS-seq) method that can be used to accurately measure DNA methylation at up to 48.4% of CpG sites.
Abstract: We report a single-cell bisulfite sequencing (scBS-seq) method that can be used to accurately measure DNA methylation at up to 48.4% of CpG sites. Embryonic stem cells grown in serum or in 2i medium displayed epigenetic heterogeneity, with '2i-like' cells present in serum culture. Integration of 12 individual mouse oocyte datasets largely recapitulated the whole DNA methylome, which makes scBS-seq a versatile tool to explore DNA methylation in rare cells and heterogeneous populations.
899 citations
••
TL;DR: This work systematically analyzed binding specificities of full-length transcription factors and extended DNA binding domains to unmethylated and CpG-methylated DNA by using methylation-sensitive SELEX (systematic evolution of ligands by exponential enrichment).
Abstract: INTRODUCTION Nearly all cells in the human body share the same primary genome sequence consisting of four nucleotide bases. One of the bases, cytosine, is commonly modified by methylation of its 5 position in CpG dinucleotides (mCpG). Most CpG dinucleotides in the human genome are methylated, but the level of CpG methylation varies with genetic location (promoter versus gene body), whether genes are active versus silenced, and cell type. Research has shown that the maintenance of a particular cellular state after cell division is dependent on faithful transmission of methylated CpGs, as well as inheritance of the mother cells’ repertoire of transcription factors by the daughter cells. These two mechanisms of epigenetic inheritance are linked to each other; the binding of transcription factors can be affected by cytosine methylation, and cytosine methylation can, in turn, be added or removed by proteins that associate with transcription factors. RATIONALE The genetic and epigenetic language, which imparts when and where genes are expressed, is understood at a conceptual level. However, a more detailed understanding is needed of the genomic regulatory mechanism by which methylated cytosines affect transcription factor binding. Because cytosine methylation changes DNA structure, it has the potential to affect binding of all transcription factors. However, a systematic analysis of binding of a large collection of transcription factors to all possible DNA sequences has not previously been conducted. RESULTS To globally characterize the effect of cytosine methylation on transcription factor binding, we systematically analyzed binding specificities of full-length transcription factors and extended DNA binding domains to unmethylated and CpG-methylated DNA by using methylation-sensitive SELEX (systematic evolution of ligands by exponential enrichment). 
We evaluated binding of 542 transcription factors and identified a large number of previously uncharacterized transcription factor recognition motifs. Binding of most major classes of transcription factors, including bHLH, bZIP, and ETS, was inhibited by mCpG. In contrast, transcription factors such as homeodomain, POU, and NFAT proteins preferred to bind methylated DNA. This class of binding was enriched in factors with central roles in embryonic and organismal development. The observed binding preferences were validated using several orthogonal methods, including bisulfite-SELEX and protein-binding microarrays. In addition, the preference of the pluripotency factor OCT4 to bind to a mCpG-containing motif was confirmed by chromatin immunoprecipitation analysis in mouse embryonic stem cells with low or high levels of CpG methylation (due to deficiency in all enzymes that methylate cytosines or contribute to their removal, respectively). Crystal structure analysis of the homeodomain proteins HOXB13, CDX1, CDX2, and LHX4 revealed three key residues that contribute to the preference of this developmentally important family of transcription factors for mCpG. The preference for binding to mCpG was due to direct hydrophobic interactions with the 5-methyl group of methylcytosine. In contrast, inhibition of binding of other transcription factors to methylated sequences was found to be caused by steric hindrance. CONCLUSION Our work constitutes a global analysis of the effect of cytosine methylation on DNA binding specificities of human transcription factors. CpG methylation can influence binding of most transcription factors to DNA—in some cases negatively and in others positively. Our finding that many developmentally important transcription factors prefer to bind to mCpG sites can inform future analyses of the role of DNA methylation on cell differentiation, chromatin reprogramming, and transcriptional regulation.
846 citations
References
••
TL;DR: Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches; multiple processor cores can be used simultaneously to achieve even greater alignment speeds.
Abstract: Bowtie is an ultrafast, memory-efficient alignment program for aligning short DNA sequence reads to large genomes. For the human genome, Burrows-Wheeler indexing allows Bowtie to align more than 25 million reads per CPU hour with a memory footprint of approximately 1.3 gigabytes. Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches. Multiple processor cores can be used simultaneously to achieve even greater alignment speeds. Bowtie is open source and available at http://bowtie.cbcb.umd.edu.
20,335 citations
••
TL;DR: The Encyclopedia of DNA Elements project provides new insights into the organization and regulation of human genes and the human genome, and is an expansive resource of functional annotations for biomedical research.
Abstract: The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall, the project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.
13,548 citations
••
TL;DR: It is demonstrated in macrophages and B cells that collaborative interactions of the common factor PU.1 with small sets of macrophage- or B cell lineage-determining transcription factors establish cell-specific binding sites that are associated with the majority of promoter-distal H3K4me1-marked genomic regions.
9,620 citations
••
TL;DR: The heritability of methylation states and the secondary nature of the decision to invite or exclude methylation support the idea that DNA methylation is adapted for a specific cellular memory function in development.
Abstract: The character of a cell is defined by its constituent proteins, which are the result of specific patterns of gene expression. Crucial determinants of gene expression patterns are DNA-binding transcription factors that choose genes for transcriptional activation or repression by recognizing the sequence of DNA bases in their promoter regions. Interaction of these factors with their cognate sequences triggers a chain of events, often involving changes in the structure of chromatin, that leads to the assembly of an active transcription complex (e.g., Cosma et al. 1999). But the types of transcription factors present in a cell are not alone sufficient to define its spectrum of gene activity, as the transcriptional potential of a genome can become restricted in a stable manner during development. The constraints imposed by developmental history probably account for the very low efficiency of cloning animals from the nuclei of differentiated cells (Rideout et al. 2001; Wakayama and Yanagimachi 2001). A “transcription factors only” model would predict that the gene expression pattern of a differentiated nucleus would be completely reversible upon exposure to a new spectrum of factors. Although many aspects of expression can be reprogrammed in this way (Gurdon 1999), some marks of differentiation are evidently so stable that immersion in an alien cytoplasm cannot erase the memory. The genomic sequence of a differentiated cell is thought to be identical in most cases to that of the zygote from which it is descended (mammalian B and T cells being an obvious exception). This means that the marks of developmental history are unlikely to be caused by widespread somatic mutation. Processes less irrevocable than mutation fall under the umbrella term “epigenetic” mechanisms. A current definition of epigenetics is: “The study of mitotically and/or meiotically heritable changes in gene function that cannot be explained by changes in DNA sequence” (Russo et al. 1996). 
There are two epigenetic systems that affect animal development and fulfill the criterion of heritability: DNA methylation and the Polycomb-trithorax group (Pc-G/trx) protein complexes. (Histone modification has some attributes of an epigenetic process, but the issue of heritability has yet to be resolved.) This review concerns DNA methylation, focusing on the generation, inheritance, and biological significance of genomic methylation patterns in the development of mammals. Data will be discussed favoring the notion that DNA methylation may only affect genes that are already silenced by other mechanisms in the embryo. Embryonic transcription, on the other hand, may cause the exclusion of the DNA methylation machinery. The heritability of methylation states and the secondary nature of the decision to invite or exclude methylation support the idea that DNA methylation is adapted for a specific cellular memory function in development. Indeed, the possibility will be discussed that DNA methylation and Pc-G/trx may represent alternative systems of epigenetic memory that have been interchanged over evolutionary time. Animal DNA methylation has been the subject of several recent reviews (Bird and Wolffe 1999; Bestor 2000; Hsieh 2000; Costello and Plass 2001; Jones and Takai 2001). For recent reviews of plant and fungal DNA methylation, see Finnegan et al. (2000), Martienssen and Colot (2001), and Matzke et al. (2001).
6,691 citations
••
TL;DR: It is found that the boundaries of topological domains are enriched for the insulator binding protein CTCF, housekeeping genes, transfer RNAs and short interspersed element (SINE) retrotransposons, indicating that these factors may have a role in establishing the topological domain structure of the genome.
Abstract: The spatial organization of the genome is intimately linked to its biological function, yet our understanding of higher order genomic structure is coarse, fragmented and incomplete. In the nucleus of eukaryotic cells, interphase chromosomes occupy distinct chromosome territories, and numerous models have been proposed for how chromosomes fold within chromosome territories. These models, however, provide only few mechanistic details about the relationship between higher order chromatin structure and genome function. Recent advances in genomic technologies have led to rapid advances in the study of three-dimensional genome organization. In particular, Hi-C has been introduced as a method for identifying higher order chromatin interactions genome wide. Here we investigate the three-dimensional organization of the human and mouse genomes in embryonic stem cells and terminally differentiated cell types at unprecedented resolution. We identify large, megabase-sized local chromatin interaction domains, which we term 'topological domains', as a pervasive structural feature of the genome organization. These domains correlate with regions of the genome that constrain the spread of heterochromatin. The domains are stable across different cell types and highly conserved across species, indicating that topological domains are an inherent property of mammalian genomes. Finally, we find that the boundaries of topological domains are enriched for the insulator binding protein CTCF, housekeeping genes, transfer RNAs and short interspersed element (SINE) retrotransposons, indicating that these factors may have a role in establishing the topological domain structure of the genome.
5,774 citations