scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Comprehensive mapping of long-range interactions reveals folding principles of the human genome.

TL;DR: Hi-C is described, a method that probes the three-dimensional architecture of whole genomes by coupling proximity-based ligation with massively parallel sequencing and demonstrates the power of Hi-C to map the dynamic conformations of entire genomes.
Abstract: We describe Hi-C, a method that probes the three-dimensional architecture of whole genomes by coupling proximity-based ligation with massively parallel sequencing. We constructed spatial proximity maps of the human genome with Hi-C at a resolution of 1 megabase. These maps confirm the presence of chromosome territories and the spatial proximity of small, gene-rich chromosomes. We identified an additional level of genome organization that is characterized by the spatial segregation of open and closed chromatin to form two genome-wide compartments. At the megabase scale, the chromatin conformation is consistent with a fractal globule, a knot-free, polymer conformation that enables maximally dense packing while preserving the ability to easily fold and unfold any genomic locus. The fractal globule is distinct from the more commonly used globular equilibrium model. Our results demonstrate the power of Hi-C to map the dynamic conformations of whole genomes.

Content maybe subject to copyright    Report

Citations
More filters
28 Jul 2005
TL;DR: PfPMP1)与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作�ly.
Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1(PfPMP1)与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员,通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

18,940 citations

Journal ArticleDOI
18 Dec 2014-Cell
TL;DR: In situ Hi-C is used to probe the 3D architecture of genomes, constructing haploid and diploid maps of nine cell types, identifying ∼10,000 loops that frequently link promoters and enhancers, correlate with gene activation, and show conservation across cell types and species.

5,945 citations


Cites background or methods or result from "Comprehensive mapping of long-range..."

  • ...In our earlier experiments at 1 Mb map resolution (Lieberman-Aiden et al., 2009), we saw large squares of enhanced contact frequency tiling the diagonal of the contact matrices....

    [...]

  • ...C showed strong consistency between our loop calls and all six previously published Hi-C experiments in lymphoblastoid cell lines (Lieberman-Aiden et al., 2009; Kalhor et al., 2012) (Figure 3D; Data S2, I.E; Table S6)....

    [...]

  • ...Dilution Hi-C was performed as in Lieberman-Aiden et al. (2009)....

    [...]

  • ...(C) We compare our map of chromosome 7 in GM12878 (last column) to earlier Hi-Cmaps: Lieberman-Aiden et al. (2009), Kalhor et al. (2012), and Jin et al. (2013)....

    [...]

  • ...To interrogate all loci at once, we developed Hi-C, which combines DNA proximity ligation with high-throughput sequencing in a genome-wide fashion (Lieberman-Aiden et al., 2009)....

    [...]

Journal ArticleDOI
17 May 2012-Nature
TL;DR: It is found that the boundaries of topological domains are enriched for the insulator binding protein CTCF, housekeeping genes, transfer RNAs and short interspersed element (SINE) retrotransposons, indicating that these factors may have a role in establishing the topological domain structure of the genome.
Abstract: The spatial organization of the genome is intimately linked to its biological function, yet our understanding of higher order genomic structure is coarse, fragmented and incomplete. In the nucleus of eukaryotic cells, interphase chromosomes occupy distinct chromosome territories, and numerous models have been proposed for how chromosomes fold within chromosome territories. These models, however, provide only few mechanistic details about the relationship between higher order chromatin structure and genome function. Recent advances in genomic technologies have led to rapid advances in the study of three-dimensional genome organization. In particular, Hi-C has been introduced as a method for identifying higher order chromatin interactions genome wide. Here we investigate the three-dimensional organization of the human and mouse genomes in embryonic stem cells and terminally differentiated cell types at unprecedented resolution. We identify large, megabase-sized local chromatin interaction domains, which we term 'topological domains', as a pervasive structural feature of the genome organization. These domains correlate with regions of the genome that constrain the spread of heterochromatin. The domains are stable across different cell types and highly conserved across species, indicating that topological domains are an inherent property of mammalian genomes. Finally, we find that the boundaries of topological domains are enriched for the insulator binding protein CTCF, housekeeping genes, transfer RNAs and short interspersed element (SINE) retrotransposons, indicating that these factors may have a role in establishing the topological domain structure of the genome.

5,774 citations

Journal ArticleDOI
Anshul Kundaje1, Wouter Meuleman2, Wouter Meuleman1, Jason Ernst3, Misha Bilenky4, Angela Yen1, Angela Yen2, Alireza Heravi-Moussavi4, Pouya Kheradpour1, Pouya Kheradpour2, Zhizhuo Zhang1, Zhizhuo Zhang2, Jianrong Wang1, Jianrong Wang2, Michael J. Ziller2, Viren Amin5, John W. Whitaker, Matthew D. Schultz6, Lucas D. Ward1, Lucas D. Ward2, Abhishek Sarkar1, Abhishek Sarkar2, Gerald Quon2, Gerald Quon1, Richard Sandstrom7, Matthew L. Eaton1, Matthew L. Eaton2, Yi-Chieh Wu1, Yi-Chieh Wu2, Andreas R. Pfenning1, Andreas R. Pfenning2, Xinchen Wang1, Xinchen Wang2, Melina Claussnitzer1, Melina Claussnitzer2, Yaping Liu1, Yaping Liu2, Cristian Coarfa5, R. Alan Harris5, Noam Shoresh2, Charles B. Epstein2, Elizabeta Gjoneska2, Elizabeta Gjoneska1, Danny Leung8, Wei Xie8, R. David Hawkins8, Ryan Lister6, Chibo Hong9, Philippe Gascard9, Andrew J. Mungall4, Richard A. Moore4, Eric Chuah4, Angela Tam4, Theresa K. Canfield7, R. Scott Hansen7, Rajinder Kaul7, Peter J. Sabo7, Mukul S. Bansal2, Mukul S. Bansal1, Mukul S. Bansal10, Annaick Carles4, Jesse R. Dixon8, Kai How Farh2, Soheil Feizi2, Soheil Feizi1, Rosa Karlic11, Ah Ram Kim2, Ah Ram Kim1, Ashwinikumar Kulkarni12, Daofeng Li13, Rebecca F. Lowdon13, Ginell Elliott13, Tim R. Mercer14, Shane Neph7, Vitor Onuchic5, Paz Polak15, Paz Polak2, Nisha Rajagopal8, Pradipta R. Ray12, Richard C Sallari1, Richard C Sallari2, Kyle Siebenthall7, Nicholas A Sinnott-Armstrong1, Nicholas A Sinnott-Armstrong2, Michael Stevens13, Robert E. Thurman7, Jie Wu16, Bo Zhang13, Xin Zhou13, Arthur E. Beaudet5, Laurie A. Boyer1, Philip L. De Jager15, Philip L. De Jager2, Peggy J. Farnham17, Susan J. Fisher9, David Haussler18, Steven J.M. Jones19, Steven J.M. Jones4, Wei Li5, Marco A. Marra4, Michael T. McManus9, Shamil R. Sunyaev15, Shamil R. Sunyaev2, James A. Thomson20, Thea D. Tlsty9, Li-Huei Tsai2, Li-Huei Tsai1, Wei Wang, Robert A. Waterland5, Michael Q. Zhang21, Lisa Helbling Chadwick22, Bradley E. Bernstein6, Bradley E. Bernstein2, Bradley E. Bernstein15, Joseph F. Costello9, Joseph R. Ecker11, Martin Hirst4, Alexander Meissner2, Aleksandar Milosavljevic5, Bing Ren8, John A. Stamatoyannopoulos7, Ting Wang13, Manolis Kellis1, Manolis Kellis2 
19 Feb 2015-Nature
TL;DR: It is shown that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease.
Abstract: The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but epigenomic studies lack a similar reference. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection so far of human epigenomes for primary cells and tissues. Here we describe the integrative analysis of 111 reference human epigenomes generated as part of the programme, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation and human disease.

5,037 citations

01 Feb 2015
TL;DR: In this article, the authors describe the integrative analysis of 111 reference human epigenomes generated as part of the NIH Roadmap Epigenomics Consortium, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression.
Abstract: The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but epigenomic studies lack a similar reference. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection so far of human epigenomes for primary cells and tissues. Here we describe the integrative analysis of 111 reference human epigenomes generated as part of the programme, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation and human disease.

4,409 citations

References
More filters
Journal ArticleDOI
TL;DR: This work described a high-resolution, genome-scale approach for quantifying chromatin accessibility by measuring DNase I sensitivity as a continuous function of genome position using tiling DNA microarrays (DNase-array), and developed a computational approach for visualizing higher-order features of chromatin structure.
Abstract: Localized accessibility of critical DNA sequences to the regulatory machinery is a key requirement for regulation of human genes. Here we describe a high-resolution, genome-scale approach for quantifying chromatin accessibility by measuring DNase I sensitivity as a continuous function of genome position using tiling DNA microarrays (DNase-array). We demonstrate this approach across 1% ( approximately 30 Mb) of the human genome, wherein we localized 2,690 classical DNase I hypersensitive sites with high sensitivity and specificity, and also mapped larger-scale patterns of chromatin architecture. DNase I hypersensitive sites exhibit marked aggregation around transcriptional start sites (TSSs), though the majority mark nonpromoter functional elements. We also developed a computational approach for visualizing higher-order features of chromatin structure. This revealed that human chromatin organization is dominated by large (100-500 kb) 'superclusters' of DNase I hypersensitive sites, which encompass both gene-rich and gene-poor regions. DNase-array is a powerful and straightforward approach for systematic exposition of the cis-regulatory architecture of complex genomes.

349 citations

Journal ArticleDOI
TL;DR: It is suggested that determining the relationship between nuclear organization and the linear arrangement of genes will lead to a greater understanding of how transcriptomes, dedicated to a particular cellular function or fate, are coordinately regulated.
Abstract: The extent to which the nucleus is functionally organized has broad biological implications. Evidence supports the idea that basic nuclear functions, such as transcription, are structurally integrated within the nucleus. Moreover, recent studies indicate that the linear arrangement of genes within eukaryotic genomes is nonrandom. We suggest that determining the relationship between nuclear organization and the linear arrangement of genes will lead to a greater understanding of how transcriptomes, dedicated to a particular cellular function or fate, are coordinately regulated. Current network theories may provide a useful framework for modeling the inherent complexity the functional organization of the nucleus.

308 citations

Journal ArticleDOI
TL;DR: The folding of chromosomes in interphase nuclei is determined by measuring the distance between points on the same chromosome by the simplest model that fully describes the observed large-scale spatial relationships.
Abstract: We determined the folding of chromosomes in interphase nuclei by measuring the distance between points on the same chromosome. Over 25,000 measurements were made in G0/G1 nuclei between DNA sequences separated by 0.15-190 megabase pairs (Mbp) on three human chromosomes. The DNA sequences were specifically labeled by fluorescence in situ hybridization. The relationship between mean-square interphase distance and genomic separation has two linear phases, with a transition at approximately 2 Mbp. This biphasic relationship indicates the existence of two organizational levels at scales > 100 kbp. On one level, chromatin appears to be arranged in large loops several Mbp in size. Within each loop, chromatin is randomly folded. On the second level, specific loop-attachment sites are arranged to form a supple, backbonelike structure, which also shows characteristic random walk behavior. This random walk/giant loop model is the simplest model that fully describes the observed large-scale spatial relationships. Additional evidence for large loops comes from measurements among probes in Xq28, where interphase distance increases and then locally decreases with increasing genomic separation.

308 citations

Journal ArticleDOI
10 Aug 1993-EPL
TL;DR: It is argued that in order to maintain the biological function of DNA confined inside the cell nucleus, its spatial structure has to be unknotted, of the so-called crumpled globule type.
Abstract: We argue that in order to maintain the biological function of DNA confined inside the cell nucleus, its spatial structure has to be unknotted, of the so-called crumpled globule type. The fixation of a particular realization of this non-equilibrium structure by attractive interactions between specific units imposes a connection between the spatial structure of DNA and the statistical distribution of these units along the chain contour. This suggests that both primary sequence and spatial structure of native DNA were formed simultaneously by a self-similar evolution process. The predictions of our model are compared with recent observations of long-range correlations in intron-containing genes and non-transcribed regulatory elements and further experimental tests are proposed.

294 citations

Journal ArticleDOI
TL;DR: A polymer model able to describe key properties of chromatin over length scales ranging from 0.5 to 75 Mb is presented and can explain the observed data and suggests that on the tens-of-megabases length scale P is small, i.e., 10–30 loops per 100 Mb, sufficient to enforce folding inside the confined space of a chromosome territory.
Abstract: Genome function in higher eukaryotes involves major changes in the spatial organization of the chromatin fiber. Nevertheless, our understanding of chromatin folding is remarkably limited. Polymer models have been used to describe chromatin folding. However, none of the proposed models gives a satisfactory explanation of experimental data. In particularly, they ignore that each chromosome occupies a confined space, i.e., the chromosome territory. Here, we present a polymer model that is able to describe key properties of chromatin over length scales ranging from 0.5 to 75 Mb. This random loop (RL) model assumes a self-avoiding random walk folding of the polymer backbone and defines a probability P for 2 monomers to interact, creating loops of a broad size range. Model predictions are compared with systematic measurements of chromatin folding of the q-arms of chromosomes 1 and 11. The RL model can explain our observed data and suggests that on the tens-of-megabases length scale P is small, i.e., 10-30 loops per 100 Mb. This is sufficient to enforce folding inside the confined space of a chromosome territory. On the 0.5- to 3-Mb length scale chromatin compaction differs in different subchromosomal domains. This aspect of chromatin structure is incorporated in the RL model by introducing heterogeneity along the fiber contour length due to different local looping probabilities. The RL model creates a quantitative and predictive framework for the identification of nuclear components that are responsible for chromatin-chromatin interactions and determine the 3-dimensional organization of the chromatin fiber.

264 citations

Related Papers (5)