scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Genomics of long-range regulatory elements.

07 Sep 2010-Annual Review of Genomics and Human Genetics (Annu Rev Genomics Hum Genet)-Vol. 11, Iss: 1, pp 1-23
TL;DR: The advantages and limitations of sequence conservation as a predictor of regulatory function are discussed and complementary emerging technologies now being applied to annotate regulatory elements in vertebrate genomes are presented.
Abstract: Transcriptional regulation of gene expression plays a significant role in establishing the diversity of human cell types and biological functions from a common set of genes. The components of regulatory control in the human genome include cis-acting elements that act across immense genomic distances to influence the spatial and temporal distribution of gene expression. Here we review the established categories of distant-acting regulatory elements, discussing the classical and contemporary evidence of their regulatory potential and clinical importance. Current efforts to identify regulatory sequences throughout the genome and elucidate their biological significance depend heavily on advances in sequence conservation-based analyses and on increasingly large-scale efforts applying transgenic technologies in model organisms. We discuss the advantages and limitations of sequence conservation as a predictor of regulatory function and present complementary emerging technologies now being applied to annotate regulatory elements in vertebrate genomes.
Citations
More filters
Journal ArticleDOI
04 Feb 2011-Cell
TL;DR: In this article, the authors show that gene-distal enhancers are key contributors to gene expression patterns, exhibiting both sequence diversity and cell type specificity, contributing to a general model for enhancer function that involves direct enhancer-promoter contact.

756 citations

Journal ArticleDOI
TL;DR: The strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies are reviewed.
Abstract: With the completion of the human genome sequence, attention turned to identifying and annotating its functional DNA elements. As a complement to genetic and comparative genomics approaches, the Encyclopedia of DNA Elements Project was launched to contribute maps of RNA transcripts, transcriptional regulator binding sites, and chromatin states in many cell types. The resulting genome-wide data reveal sites of biochemical activity with high positional resolution and cell type specificity that facilitate studies of gene regulation and interpretation of noncoding variants associated with human disease. However, the biochemically active regions cover a much larger fraction of the genome than do evolutionarily conserved regions, raising the question of whether nonconserved but biochemically active regions are truly functional. Here, we review the strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies. We also analyze the relationship between signal intensity, genomic coverage, and evolutionary conservation. Our results reinforce the principle that each approach provides complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease.

691 citations

Journal ArticleDOI
TL;DR: Several, but not all, positions with higher effects showed evidence for purifying selection, or co-localized with known liver-associated transcription factor binding sites, demonstrating the value of empirical high-resolution functional analysis.
Abstract: The functional consequences of genetic variation in mammalian regulatory elements are poorly understood. We report the in vivo dissection of three mammalian enhancers at single-nucleotide resolution through a massively parallel reporter assay. For each enhancer, we synthesized a library of >100,000 mutant haplotypes with 2-3% divergence from the wild-type sequence. Each haplotype was linked to a unique sequence tag embedded within a transcriptional cassette. We introduced each enhancer library into mouse liver and measured the relative activities of individual haplotypes en masse by sequencing the transcribed tags. Linear regression analysis yielded highly reproducible estimates of the effect of every possible single-nucleotide change on enhancer activity. The functional consequence of most mutations was modest, with ∼22% affecting activity by >1.2-fold and ∼3% by >2-fold. Several, but not all, positions with higher effects showed evidence for purifying selection, or co-localized with known liver-associated transcription factor binding sites, demonstrating the value of empirical high-resolution functional analysis.

510 citations


Additional excerpts

  • ...6μM of relevant reverse primer (BARCODE_PE_R_ILMN_INDEX[1-8]), 0....

    [...]

Journal ArticleDOI
TL;DR: Advances in predictive modeling can enable data-set integration to reveal pathways shared across loci and alleles, and richer regulatory models can guide the search for epistatic interactions, and new massively parallel reporter experiments can systematically validate regulatory predictions.
Abstract: Association studies provide genome-wide information about the genetic basis of complex disease, but medical research has focused primarily on protein-coding variants, owing to the difficulty of interpreting noncoding mutations. This picture has changed with advances in the systematic annotation of functional noncoding elements. Evolutionary conservation, functional genomics, chromatin state, sequence motifs and molecular quantitative trait loci all provide complementary information about the function of noncoding sequences. These functional maps can help with prioritizing variants on risk haplotypes, filtering mutations encountered in the clinic and performing systems-level analyses to reveal processes underlying disease associations. Advances in predictive modeling can enable data-set integration to reveal pathways shared across loci and alleles, and richer regulatory models can guide the search for epistatic interactions. Lastly, new massively parallel reporter experiments can systematically validate regulatory predictions. Ultimately, advances in regulatory and systems genomics can help unleash the value of whole-genome sequencing for personalized genomic risk assessment, diagnosis and treatment.

470 citations

Journal ArticleDOI
TL;DR: It is found that inactivation of enhancer chromatin by TAL effector repeat domains fused to the LSD1 histone demethylase frequently causes downregulation of proximal genes, revealing enhancer target genes.
Abstract: The function of specific enhancers is studied using TAL effectors fused to a histone demethylase

379 citations

References
More filters
Journal ArticleDOI
Eric S. Lander1, Lauren Linton1, Bruce W. Birren1, Chad Nusbaum1  +245 moreInstitutions (29)
15 Feb 2001-Nature
TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.
Abstract: The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.

22,269 citations


"Genomics of long-range regulatory e..." refers background in this paper

  • ...In fact, sequence characteristics of composition and structure have already demonstrated some success in discriminating functional from nonfunctional sequences in vertebrate systems (63, 80, 90)....

    [...]

Journal ArticleDOI
08 Oct 2009-Nature
TL;DR: This paper examined potential sources of missing heritability and proposed research strategies, including and extending beyond current genome-wide association approaches, to illuminate the genetics of complex diseases and enhance its potential to enable effective disease prevention or treatment.
Abstract: Genome-wide association studies have identified hundreds of genetic variants associated with complex human diseases and traits, and have provided valuable insights into their genetic architecture. Most variants identified so far confer relatively small increments in risk, and explain only a small proportion of familial clustering, leading many to question how the remaining, 'missing' heritability can be explained. Here we examine potential sources of missing heritability and propose research strategies, including and extending beyond current genome-wide association approaches, to illuminate the genetics of complex diseases and enhance its potential to enable effective disease prevention or treatment.

7,797 citations


"Genomics of long-range regulatory e..." refers background in this paper

  • ...The rapid evolution of genotyping technologies has resulted in a glut of GWAS (genome-wide association study) data that collectively imply a significant role for regulatory variation in common genetic disorders (75)....

    [...]

  • ...Consequently, an increasingly prominent role has been suggested for regulatory mutations in disease risk (26, 33, 75, 91)....

    [...]

Journal ArticleDOI
09 Oct 2009-Science
TL;DR: Hi-C is described, a method that probes the three-dimensional architecture of whole genomes by coupling proximity-based ligation with massively parallel sequencing and demonstrates the power of Hi-C to map the dynamic conformations of entire genomes.
Abstract: We describe Hi-C, a method that probes the three-dimensional architecture of whole genomes by coupling proximity-based ligation with massively parallel sequencing. We constructed spatial proximity maps of the human genome with Hi-C at a resolution of 1 megabase. These maps confirm the presence of chromosome territories and the spatial proximity of small, gene-rich chromosomes. We identified an additional level of genome organization that is characterized by the spatial segregation of open and closed chromatin to form two genome-wide compartments. At the megabase scale, the chromatin conformation is consistent with a fractal globule, a knot-free, polymer conformation that enables maximally dense packing while preserving the ability to easily fold and unfold any genomic locus. The fractal globule is distinct from the more commonly used globular equilibrium model. Our results demonstrate the power of Hi-C to map the dynamic conformations of whole genomes.

7,180 citations


Additional excerpts

  • ...enrichment and identification by massively parallel sequencing (68)....

    [...]

Journal ArticleDOI
Robert H. Waterston1, Kerstin Lindblad-Toh2, Ewan Birney, Jane Rogers3  +219 moreInstitutions (26)
05 Dec 2002-Nature
TL;DR: The results of an international collaboration to produce a high-quality draft sequence of the mouse genome are reported and an initial comparative analysis of the Mouse and human genomes is presented, describing some of the insights that can be gleaned from the two sequences.
Abstract: The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of the genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.

6,643 citations


"Genomics of long-range regulatory e..." refers background in this paper

  • ...Initial comparisons of the human genome with genomes of other vertebrate species, such as mouse, indicated that CNSs were abundant in the genome (12, 20, 27, 84, 120)....

    [...]

Journal ArticleDOI
21 Jun 2002-Science
TL;DR: It is shown that the human genome can be parsed objectively into haplotype blocks: sizable regions over which there is little evidence for historical recombination and within which only a few common haplotypes are observed.
Abstract: Haplotype-based methods offer a powerful approach to disease gene mapping, based on the association between causal mutations and the ancestral haplotypes on which they arose. As part of The SNP Consortium Allele Frequency Projects, we characterized haplotype patterns across 51 autosomal regions (spanning 13 megabases of the human genome) in samples from Africa, Europe, and Asia. We show that the human genome can be parsed objectively into haplotype blocks: sizable regions over which there is little evidence for historical recombination and within which only a few common haplotypes are observed. The boundaries of blocks and specific haplotypes they contain are highly correlated across populations. We demonstrate that such haplotype frameworks provide substantial statistical power in association studies of common genetic variation across each region. Our results provide a foundation for the construction of a haplotype map of the human genome, facilitating comprehensive genetic association studies of human disease.

5,634 citations


"Genomics of long-range regulatory e..." refers background in this paper

  • ...Although the average extent of LD in the human genome is approximately 15 kb (39), many LD blocks extend over much longer distances and may thus uncover many variants with equivalent evidence of association with disease....

    [...]

Related Papers (5)