scispace - formally typeset
Search or ask a question

Showing papers in "Nature Reviews Genetics in 2009"


Journal ArticleDOI
TL;DR: The RNA-Seq approach to transcriptome profiling that uses deep-sequencing technologies provides a far more precise measurement of levels of transcripts and their isoforms than other methods.
Abstract: RNA-Seq is a recently developed approach to transcriptome profiling that uses deep-sequencing technologies. Studies using this method have already altered our view of the extent and complexity of eukaryotic transcriptomes. RNA-Seq also provides a far more precise measurement of levels of transcripts and their isoforms than other methods. This article describes the RNA-Seq approach, the challenges associated with its application, and the advances made so far in characterizing several eukaryote transcriptomes.

11,528 citations


Journal ArticleDOI
TL;DR: The rapidly advancing field of long ncRNAs is reviewed, describing their conservation, their organization in the genome and their roles in gene regulation, and the medical implications.
Abstract: In mammals and other eukaryotes most of the genome is transcribed in a developmentally regulated manner to produce large numbers of long non-coding RNAs (ncRNAs). Here we review the rapidly advancing field of long ncRNAs, describing their conservation, their organization in the genome and their roles in gene regulation. We also consider the medical implications, and the emerging recognition that any transcript, regardless of coding potential, can have an intrinsic function as an RNA.

4,911 citations


Journal ArticleDOI
TL;DR: Because malignant cells show dependence on the dysregulated expression of miRNA genes, which in turn control or are controlled by the dysregulation of multiple protein-coding oncogenes or tumour suppressor genes, these small RNAs provide important opportunities for the development of future miRNA-based therapies.
Abstract: Over the past several years it has become clear that alterations in the expression of microRNA (miRNA) genes contribute to the pathogenesis of most — if not all — human malignancies. These alterations can be caused by various mechanisms, including deletions, amplifications or mutations involving miRNA loci, epigenetic silencing or the dysregulation of transcription factors that target specific miRNAs. Because malignant cells show dependence on the dysregulated expression of miRNA genes, which in turn control or are controlled by the dysregulation of multiple protein-coding oncogenes or tumour suppressor genes, these small RNAs provide important opportunities for the development of future miRNA-based therapies.

2,873 citations


Journal ArticleDOI
TL;DR: Since the discovery in 1993 of the first small silencing RNA, a dizzying number of small RNA classes have been identified, including microRNAs (miRNAs), small interfering RNAs (siRNAs) and Piwi-interacting RNAs.
Abstract: Since the discovery in 1993 of the first small silencing RNA, a dizzying number of small RNA classes have been identified, including microRNAs (miRNAs), small interfering RNAs (siRNAs) and Piwi-interacting RNAs (piRNAs). These classes differ in their biogenesis, their modes of target regulation and in the biological pathways they regulate. There is a growing realization that, despite their differences, these distinct small RNA pathways are interconnected, and that small RNA pathways compete and collaborate as they regulate genes and protect the genome from external and internal threats.

2,266 citations


Journal ArticleDOI
TL;DR: In this article, the expression of many HDAC isoforms in eukaryotic cells raises questions about their possible specificity or redundancy, and whether they control global or specific programs of gene expression.
Abstract: Histone deacetylases (HDACs) are part of a vast family of enzymes that have crucial roles in numerous biological processes, largely through their repressive influence on transcription. The expression of many HDAC isoforms in eukaryotic cells raises questions about their possible specificity or redundancy, and whether they control global or specific programmes of gene expression. Recent analyses of HDAC knockout mice have revealed highly specific functions of individual HDACs in development and disease. Mutant mice lacking individual HDACs are a powerful tool for defining the functions of HDACs in vivo and the molecular targets of HDAC inhibitors in disease.

2,265 citations


Journal ArticleDOI
TL;DR: Relationships between DNA methylation and histone modification have implications for understanding normal development as well as somatic cell reprogramming and tumorigenesis.
Abstract: Both DNA methylation and histone modification are involved in establishing patterns of gene repression during development. Certain forms of histone methylation cause local formation of heterochromatin, which is readily reversible, whereas DNA methylation leads to stable long-term repression. It has recently become apparent that DNA methylation and histone modification pathways can be dependent on one another, and that this crosstalk can be mediated by biochemical interactions between SET domain histone methyltransferases and DNA methyltransferases. Relationships between DNA methylation and histone modification have implications for understanding normal development as well as somatic cell reprogramming and tumorigenesis.

2,108 citations


Journal ArticleDOI
Peter J. Park1
TL;DR: The benefits and challenges in harnessing ChIP–seq are described with an emphasis on issues related to experimental design and data analysis, and effective computational analysis will be crucial for uncovering biological mechanisms.
Abstract: Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is a technique for genome-wide profiling of DNA-binding proteins, histone modifications or nucleosomes. Owing to the tremendous progress in next-generation sequencing technology, ChIP-seq offers higher resolution, less noise and greater coverage than its array-based predecessor ChIP-chip. With the decreasing cost of sequencing, ChIP-seq has become an indispensable tool for studying gene regulation and epigenetic mechanisms. In this Review, I describe the benefits and challenges in harnessing this technique with an emphasis on issues related to experimental design and data analysis. ChIP-seq experiments generate large quantities of data, and effective computational analysis will be crucial for uncovering biological mechanisms.

1,934 citations


Journal ArticleDOI
TL;DR: The genetic basis of inbreeding depression and of the related phenomenon, heterosis, has been a puzzle for many decades as mentioned in this paper, based on recent studies in many species, the authors argue that both phenomena are predominantly caused by the presence of recessive deleterious mutations in populations.
Abstract: The genetic basis of inbreeding depression and of the related phenomenon, heterosis, has been a puzzle for many decades. Based on recent studies in many species, the authors argue that both phenomena are predominantly caused by the presence of recessive deleterious mutations in populations. Inbreeding depression — the reduced survival and fertility of offspring of related individuals — occurs in wild animal and plant populations as well as in humans, indicating that genetic variation in fitness traits exists in natural populations. Inbreeding depression is important in the evolution of outcrossing mating systems and, because intercrossing inbred strains improves yield (heterosis), which is important in crop breeding, the genetic basis of these effects has been debated since the early twentieth century. Classical genetic studies and modern molecular evolutionary approaches now suggest that inbreeding depression and heterosis are predominantly caused by the presence of recessive deleterious mutations in populations.

1,604 citations


Journal ArticleDOI
TL;DR: An analysis of 1,391 manually curated sequence-specific DNA-binding transcription factors, their functions, genomic organization and evolutionary conservation provides a solid foundation for future investigations to elucidate regulatory mechanisms underlying diverse mammalian biological processes.
Abstract: Transcription factors are key cellular components that control gene expression: their activities determine how cells function and respond to the environment. Currently, there is great interest in research into human transcriptional regulation. However, surprisingly little is known about these regulators themselves. For example, how many transcription factors does the human genome contain? How are they expressed in different tissues? Are they evolutionarily conserved? Here, we present an analysis of 1,391 manually curated sequence-specific DNA-binding transcription factors, their functions, genomic organization and evolutionary conservation. Much remains to be explored, but this study provides a solid foundation for future investigations to elucidate regulatory mechanisms underlying diverse mammalian biological processes.

1,489 citations


Journal ArticleDOI
TL;DR: The properties of Ne are reviewed in a variety of different situations of biological interest, and advances in genomic techniques are giving new insights into how selection shapes Ne.
Abstract: The effective size of a population, N e , determines the rate of change in the composition of a population caused by genetic drift, which is the random sampling of genetic variants in a finite population. N e is crucial in determining the level of variability in a population, and the effectiveness of selection relative to drift. This article reviews the properties of N e in a variety of different situations of biological interest, and the factors that influence it. In particular, the action of selection means that N e varies across the genome, and advances in genomic techniques are giving new insights into how selection shapes N e .

1,452 citations


Journal ArticleDOI
TL;DR: This Review focuses on non-long terminal repeat (LTR) retrotransposons, and discusses the many ways that they affect the human genome: from generating insertion mutations and genomic instability to altering gene expression and contributing to genetic innovation.
Abstract: Their ability to move within genomes gives transposable elements an intrinsic propensity to affect genome evolution. Non-long terminal repeat (LTR) retrotransposons — including LINE-1, Alu and SVA elements — have proliferated over the past 80 million years of primate evolution and now account for approximately one-third of the human genome. In this Review, we focus on this major class of elements and discuss the many ways that they affect the human genome: from generating insertion mutations and genomic instability to altering gene expression and contributing to genetic innovation. Increasingly detailed analyses of human and other primate genomes are revealing the scale and complexity of the past and current contributions of non-LTR retrotransposons to genomic change in the human lineage.

Journal ArticleDOI
TL;DR: A critical survey of the methods and related software packages currently used to detect the interactions between genetic loci that contribute to human genetic disease is provided.
Abstract: Following the identification of several disease-associated polymorphisms by genome-wide association (GWA) analysis, interest is now focusing on the detection of effects that, owing to their interaction with other genetic or environmental factors, might not be identified by using standard single-locus tests. In addition to increasing the power to detect associations, it is hoped that detecting interactions between loci will allow us to elucidate the biological and biochemical pathways that underpin disease. Here I provide a critical survey of the methods and related software packages currently used to detect the interactions between genetic loci that contribute to human genetic disease. I also discuss the difficulties in determining the biological relevance of statistical interactions.

Journal ArticleDOI
TL;DR: This Review clarifies how FST is defined, how it should be estimated,How it is related to similar statistics and how estimates of FST should be interpreted.
Abstract: FST describes the processes that lead to genetic differentiation among and within populations and is widely used in population and evolutionary genetics. This article describes the meaning of FST and how it should be estimated and interpreted.

Journal ArticleDOI
TL;DR: Current models of the mechanisms that cause copy number variation focus on perturbation of DNA replication and replication of non-contiguous DNA segments and cellular stress might induce repair of broken replication forks to switch from high-fidelity homologous recombination to non-homologous repair, thus promoting copy number change.
Abstract: Deletions and duplications of chromosomal segments (copy number variants, CNVs) are a major source of variation between individual humans and are an underlying factor in human evolution and in many diseases, including mental illness, developmental disorders and cancer CNVs form at a faster rate than other types of mutation, and seem to do so by similar mechanisms in bacteria, yeast and humans Here we review current models of the mechanisms that cause copy number variation Non-homologous end-joining mechanisms are well known, but recent models focus on perturbation of DNA replication and replication of non-contiguous DNA segments For example, cellular stress might induce repair of broken replication forks to switch from high-fidelity homologous recombination to non-homologous repair, thus promoting copy number change

Journal ArticleDOI
TL;DR: The principles of quantitative trait locus mapping are reviewed and insights about the genetic architecture of quantitative traits that have been obtained over the past decades are summarized.
Abstract: A major challenge in current biology is to understand the genetic basis of variation for quantitative traits. We review the principles of quantitative trait locus mapping and summarize insights about the genetic architecture of quantitative traits that have been obtained over the past decades. We are currently in the midst of a genomic revolution, which enables us to incorporate genetic variation in transcript abundance and other intermediate molecular phenotypes into a quantitative trait locus mapping framework. This systems genetics approach enables us to understand the biology inside the 'black box' that lies between genotype and phenotype in terms of causal networks of interacting genes.

Journal ArticleDOI
TL;DR: Most common SNPs have now been assessed in genome-wide studies for statistical associations with many complex traits, including many important common diseases, and only a limited amount of the heritable component of any complex trait has been identified.
Abstract: The last few years have seen extensive efforts to catalogue human genetic variation and correlate it with phenotypic differences. Most common SNPs have now been assessed in genome-wide studies for statistical associations with many complex traits, including many important common diseases. Although these studies have provided new biological insights, only a limited amount of the heritable component of any complex trait has been identified and it remains a challenge to elucidate the functional link between associated variants and phenotypic traits. Technological advances, such as the ability to detect rare and structural variants, and a clear understanding of the challenges in linking different types of variation with phenotype, will be essential for future progress.

Journal ArticleDOI
TL;DR: What high-resolution genome-wide maps of nucleosomes positions have taught us about how nucleosome positioning demarcates promoter regions and transcriptional start sites and how the composition and structure of promoter nucleosites facilitate or inhibit transcription is discussed.
Abstract: Knowing the precise locations of nucleosomes in a genome is key to understanding how genes are regulated. Recent 'next generation' ChIP-chip and ChIP-Seq technologies have accelerated our understanding of the basic principles of chromatin organization. Here we discuss what high-resolution genome-wide maps of nucleosome positions have taught us about how nucleosome positioning demarcates promoter regions and transcriptional start sites, and how the composition and structure of promoter nucleosomes facilitate or inhibit transcription. A detailed picture is starting to emerge of how diverse factors, including underlying DNA sequences and chromatin remodelling complexes, influence nucleosome positioning.

Journal ArticleDOI
TL;DR: This work uses genome-wide SNP panels as markers and statistical methods that capture the effects of large numbers of SNPs simultaneously to double the rate of genetic improvement per year in many livestock systems.
Abstract: Genome-wide panels of SNPs have recently been used in domestic animal species to map and identify genes for many traits and to select genetically desirable livestock. This has led to the discovery of the causal genes and mutations for several single-gene traits but not for complex traits. However, the genetic merit of animals can still be estimated by genomic selection, which uses genome-wide SNP panels as markers and statistical methods that capture the effects of large numbers of SNPs simultaneously. This approach is expected to double the rate of genetic improvement per year in many livestock systems.

Journal ArticleDOI
TL;DR: It is argued that ancient genome doublings could probably have survived only under very specific conditions, but that, whenever established, they might have had a pronounced impact on species diversification, and led to an increase in biological complexity and the origin of evolutionary novelties.
Abstract: Many organisms are currently polyploid, or have a polyploid ancestry and now have secondarily 'diploidized' genomes. This finding is surprising because retained whole-genome duplications (WGDs) are exceedingly rare, suggesting that polyploidy is usually an evolutionary dead end. We argue that ancient genome doublings could probably have survived only under very specific conditions, but that, whenever established, they might have had a pronounced impact on species diversification, and led to an increase in biological complexity and the origin of evolutionary novelties.

Journal ArticleDOI
TL;DR: The availability of systematically generated eQTL information could provide immediate insight into a biological basis for disease associations identified through genome-wide association studies, and can help to identify networks of genes involved in disease pathogenesis.
Abstract: Variation in gene expression is an important mechanism underlying susceptibility to complex disease. The simultaneous genome-wide assay of gene expression and genetic variation allows the mapping of the genetic factors that underpin individual differences in quantitative levels of expression (expression QTLs; eQTLs). The availability of systematically generated eQTL information could provide immediate insight into a biological basis for disease associations identified through genome-wide association (GWA) studies, and can help to identify networks of genes involved in disease pathogenesis. Although there are limitations to current eQTL maps, understanding of disease will be enhanced with novel technologies and international efforts that extend to a wide range of new samples and tissues.

Journal ArticleDOI
TL;DR: A new model is proposed that suggests that the maintenance of DNA methylation relies not only on the recognition of hemimethylated DNA by DNA methyltransferase 1 (DNMT1) but also on the localization of the DNMT3A andDNMT3B enzymes to specific chromatin regions that contain methylated DNA.
Abstract: DNA methylation patterns are set up early in mammalian development and are then copied during the division of somatic cells. A long-established model for the maintenance of these patterns explains some, but not all, of the data that are now available. We propose a new model that suggests that the maintenance of DNA methylation relies not only on the recognition of hemimethylated DNA by DNA methyltransferase 1 (DNMT1) but also on the localization of the DNMT3A and DNMT3B enzymes to specific chromatin regions that contain methylated DNA.

Journal ArticleDOI
TL;DR: A polygenic framework is highlighted, supported by recent GWA research, in which qualitative disorders can be interpreted simply as being the extremes of quantitative dimensions, which has far-reaching implications for the diagnosis, treatment and prevention of the problematic extremes of these traits.
Abstract: After drifting apart for 100 years, the two worlds of genetics - quantitative genetics and molecular genetics - are finally coming together in genome-wide association (GWA) research, which shows that the heritability of complex traits and common disorders is due to multiple genes of small effect size. We highlight a polygenic framework, supported by recent GWA research, in which qualitative disorders can be interpreted simply as being the extremes of quantitative dimensions. Research that focuses on quantitative traits - including the low and high ends of normal distributions - could have far-reaching implications for the diagnosis, treatment and prevention of the problematic extremes of these traits.

Journal ArticleDOI
TL;DR: This Review explains how evolutionary geneticists use fitness to predict changes in the genetic composition of populations through time and distinguishes among these meanings (for example, individual fitness, absolute fitness and relative fitness).
Abstract: Although the operation of natural selection requires that genotypes differ in fitness, some geneticists may find it easier to understand natural selection than fitness. Partly this reflects the fact that the word 'fitness' has been used to mean subtly different things. In this Review I distinguish among these meanings (for example, individual fitness, absolute fitness and relative fitness) and explain how evolutionary geneticists use fitness to predict changes in the genetic composition of populations through time. I also review the empirical study of fitness, emphasizing approaches that take advantage of recent genetic and genomic data, and I highlight important unresolved problems in understanding fitness.

Journal ArticleDOI
TL;DR: Stochastic models are being used increasingly in preference to deterministic models to describe biochemical network dynamics at the single-cell level to adequately describe observed noise, variability and heterogeneity of biological systems over a range of scales of biological organization.
Abstract: Two related developments are currently changing traditional approaches to computational systems biology modelling. First, stochastic models are being used increasingly in preference to deterministic models to describe biochemical network dynamics at the single-cell level. Second, sophisticated statistical methods and algorithms are being used to fit both deterministic and stochastic models to time course and other experimental data. Both frameworks are needed to adequately describe observed noise, variability and heterogeneity of biological systems over a range of scales of biological organization.

Journal ArticleDOI
TL;DR: The salient features of the evolution of the Fox gene family are summarized and the diverse contribution of various Fox subfamilies to developmental processes, from organogenesis to speech acquisition are highlighted.
Abstract: The forkhead box (Fox) family of transcription factors, which originated in unicellular eukaryotes, has expanded over time through multiple duplication events, and sometimes through gene loss, to over 40 members in mammals. Fox genes have evolved to acquire a specialized function in many key biological processes. Mutations in Fox genes have a profound effect on human disease, causing phenotypes as varied as cancer, glaucoma and language disorders. We summarize the salient features of the evolution of the Fox gene family and highlight the diverse contribution of various Fox subfamilies to developmental processes, from organogenesis to speech acquisition.

Journal ArticleDOI
TL;DR: This Review outlines the questions that can be answered using viral evolutionary analysis across a wide range of biological scales and describes the current state of the art in such analysis.
Abstract: Many organisms that cause infectious diseases, particularly RNA viruses, mutate so rapidly that their evolutionary and ecological behaviours are inextricably linked. Consequently, aspects of the transmission and epidemiology of these pathogens are imprinted on the genetic diversity of their genomes. Large-scale empirical analyses of the evolutionary dynamics of important pathogens are now feasible owing to the increasing availability of pathogen sequence data and the development of new computational and statistical methods of analysis. In this Review, we outline the questions that can be answered using viral evolutionary analysis across a wide range of biological scales.

Journal ArticleDOI
TL;DR: How recent genomic profiling of transcription factors gives insight into how binding specificity is achieved and what features of chromatin influence the ability of transcription factor to interact with the genome is discussed.
Abstract: A crucial question in the field of gene regulation is whether the location at which a transcription factor binds influences its effectiveness or the mechanism by which it regulates transcription. Comprehensive transcription factor binding maps are needed to address these issues, and genome-wide mapping is now possible thanks to the technological advances of ChIP-chip and ChIP-seq. This Review discusses how recent genomic profiling of transcription factors gives insight into how binding specificity is achieved and what features of chromatin influence the ability of transcription factors to interact with the genome. It also suggests future experiments that may further our understanding of the causes and consequences of transcription factor-genome interactions.

Journal ArticleDOI
TL;DR: It is discussed how identifying new genes that are associated with more than one autoimmune or chronic inflammatory disorder could explain the genetic basis of the shared pathogenesis of immune-related diseases.
Abstract: Recent genetic studies have revealed shared immunological mechanisms in several immune-related disorders that further our understanding of the development and concomitance of these diseases. Our Review focuses on these shared aspects, using the novel findings of recently performed genome-wide association studies and non-synonymous SNP scans as a starting point. We discuss how identifying new genes that are associated with more than one autoimmune or chronic inflammatory disorder could explain the genetic basis of the shared pathogenesis of immune-related diseases. This analysis helps to highlight the key molecular pathways that are involved in these disorders and the potential roles of novel genes in immune-related diseases.

Journal ArticleDOI
TL;DR: These methods are reviewed, focusing on single-SNP tests in genome-wide association studies, and the use of Bayesian methods for fine mapping in candidate regions is demonstrated, and guidance for refereeing manuscripts that contain Bayesian analyses is provided.
Abstract: Bayesian statistical methods have recently made great inroads into many areas of science, and this advance is now extending to the assessment of association between genetic variants and disease or other phenotypes. We review these methods, focusing on single-SNP tests in genome-wide association studies. We discuss the advantages of the Bayesian approach over classical (frequentist) approaches in this setting and provide a tutorial on basic analysis steps, including practical guidelines for appropriate prior specification. We demonstrate the use of Bayesian methods for fine mapping in candidate regions, discuss meta-analyses and provide guidance for refereeing manuscripts that contain Bayesian analyses.

Journal ArticleDOI
TL;DR: This work reviews studies of error frequencies, the cellular and organismal consequences of errors and the attendant long-range evolutionary responses to errors, and emphasizes major areas in which little is known, such as the failure rates of protein folding.
Abstract: Errors in protein synthesis disrupt cellular fitness, cause disease phenotypes and shape gene and genome evolution. Experimental and theoretical results on this topic have accumulated rapidly in disparate fields, such as neurobiology, protein biosynthesis and degradation and molecular evolution, but with limited communication among disciplines. Here, we review studies of error frequencies, the cellular and organismal consequences of errors and the attendant long-range evolutionary responses to errors. We emphasize major areas in which little is known, such as the failure rates of protein folding, in addition to areas in which technological innovations may enable imminent gains, such as the elucidation of translational missense error frequencies. Evolutionary responses to errors fall into two broad categories: adaptations that minimize errors and their attendant costs and adaptations that exploit errors for the organism's benefit.