Showing papers in "Nature Genetics in 2013"
••
Drexel University1, Yeshiva University2, Roswell Park Cancer Institute3, Virginia Commonwealth University4, Van Andel Institute5, Science Applications International Corporation6, Massachusetts Institute of Technology7, Harvard University8, University of Miami9, Icahn School of Medicine at Mount Sinai10, University of Chicago11, Howard Hughes Medical Institute12, University of Geneva13, Stanford University14, University of Oxford15, University of North Carolina at Chapel Hill16, National Institutes of Health17
TL;DR: The Genotype-Tissue Expression (GTEx) project is described, which will establish a resource database and associated tissue bank for the scientific community to study the relationship between genetic variation and gene expression in human tissues.
Abstract: Genome-wide association studies have identified thousands of loci for common diseases, but, for the majority of these, the mechanisms underlying disease susceptibility remain unknown. Most associated variants are not correlated with protein-coding changes, suggesting that polymorphisms in regulatory regions probably contribute to many disease phenotypes. Here we describe the Genotype-Tissue Expression (GTEx) project, which will establish a resource database and associated tissue bank for the scientific community to study the relationship between genetic variation and gene expression in human tissues.
6,545 citations
••
John N. Weinstein1, John N. Weinstein2, Eric A. Collisson3, Gordon B. Mills2 +376 more•Institutions (31)
TL;DR: The Pan-Cancer initiative compares the first 12 tumor types profiled by TCGA with a major opportunity to develop an integrated picture of commonalities, differences and emergent themes across tumor lineages.
Abstract: The Cancer Genome Atlas (TCGA) Research Network has profiled and analyzed large numbers of human tumors to discover molecular aberrations at the DNA, RNA, protein and epigenetic levels. The resulting rich data provide a major opportunity to develop an integrated picture of commonalities, differences and emergent themes across tumor lineages. The Pan-Cancer initiative compares the first 12 tumor types profiled by TCGA. Analysis of the molecular aberrations and their functional roles across tumor types will teach us how to extend therapies effective in one cancer type to others with a similar genomic profile.
5,294 citations
••
Jean-Charles Lambert1, Jean-Charles Lambert2, Jean-Charles Lambert3, Carla A. Ibrahim-Verbaas4 +212 more•Institutions (75)
TL;DR: In addition to the APOE locus (encoding apolipoprotein E), 19 loci reached genome-wide significance (P < 5 × 10−8) in the combined stage 1 and stage 2 analysis, of which 11 are newly associated with Alzheimer's disease.
Abstract: Eleven susceptibility loci for late-onset Alzheimer's disease (LOAD) were identified by previous studies; however, a large portion of the genetic risk for this disease remains unexplained. We conducted a large, two-stage meta-analysis of genome-wide association studies (GWAS) in individuals of European ancestry. In stage 1, we used genotyped and imputed data (7,055,881 SNPs) to perform meta-analysis on 4 previously published GWAS data sets consisting of 17,008 Alzheimer's disease cases and 37,154 controls. In stage 2, 11,632 SNPs were genotyped and tested for association in an independent set of 8,572 Alzheimer's disease cases and 11,312 controls. In addition to the APOE locus (encoding apolipoprotein E), 19 loci reached genome-wide significance (P < 5 × 10−8) in the combined stage 1 and stage 2 analysis, of which 11 are newly associated with Alzheimer's disease.
3,726 citations
••
Cristen J. Willer1, Ellen M. Schmidt1, Sebanti Sengupta1, Gina M. Peloso2 +316 more•Institutions (87)
TL;DR: It is found that loci associated with blood lipid levels are often associated with cardiovascular and metabolic traits, including coronary artery disease, type 2 diabetes, blood pressure, waist-hip ratio and body mass index.
Abstract: Levels of low-density lipoprotein (LDL) cholesterol, high-density lipoprotein (HDL) cholesterol, triglycerides and total cholesterol are heritable, modifiable risk factors for coronary artery disease. To identify new loci and refine known loci influencing these lipids, we examined 188,577 individuals using genome-wide and custom genotyping arrays. We identify and annotate 157 loci associated with lipid levels at P < 5 × 10(-8), including 62 loci not previously associated with lipid levels in humans. Using dense genotyping in individuals of European, East Asian, South Asian and African ancestry, we narrow association signals in 12 loci. We find that loci associated with blood lipid levels are often associated with cardiovascular and metabolic traits, including coronary artery disease, type 2 diabetes, blood pressure, waist-hip ratio and body mass index. Our results demonstrate the value of using genetic data from individuals of diverse ancestry and provide insights into the biological mechanisms regulating blood lipids to guide future genetic, biological and therapeutic research.
2,585 citations
••
TL;DR: Empirical evidence of shared genetic etiology for psychiatric disorders can inform nosology and encourages the investigation of common pathophysiologies for related disorders.
Abstract: Most psychiatric disorders are moderately to highly heritable. The degree to which genetic variation is unique to individual disorders or shared across disorders is unclear. To examine shared genetic etiology, we use genome-wide genotype data from the Psychiatric Genomics Consortium (PGC) for cases and controls in schizophrenia, bipolar disorder, major depressive disorder, autism spectrum disorders (ASD) and attention-deficit/hyperactivity disorder (ADHD). We apply univariate and bivariate methods for the estimation of genetic variation within and covariation between disorders. SNPs explained 17-29% of the variance in liability. The genetic correlation calculated using common SNPs was high between schizophrenia and bipolar disorder (0.68 ± 0.04 s.e.), moderate between schizophrenia and major depressive disorder (0.43 ± 0.06 s.e.), bipolar disorder and major depressive disorder (0.47 ± 0.06 s.e.), and ADHD and major depressive disorder (0.32 ± 0.07 s.e.), low between schizophrenia and ASD (0.16 ± 0.06 s.e.) and non-significant for other pairs of disorders as well as between psychiatric disorders and the negative control of Crohn's disease. This empirical evidence of shared genetic etiology for psychiatric disorders can inform nosology and encourages the investigation of common pathophysiologies for related disorders.
2,058 citations
••
University Medical Center Groningen1, University of Tartu2, University of Exeter3, Greifswald University Hospital4, National Institute for Health and Welfare5, University of Washington6, Churchill Hospital7, Technische Universität München8, University of Queensland9, University Medical Center Utrecht10, Erasmus University Rotterdam11, Leiden University Medical Center12, National Institutes of Health13, University College London14, University of Auckland15, QIMR Berghofer Medical Research Institute16, Wellcome Trust Centre for Human Genetics17, University of Düsseldorf18, Ludwig Maximilian University of Munich19, University of Groningen20, Group Health Cooperative21, University of Helsinki22, Wellcome Trust Sanger Institute23
TL;DR: Variants associated with cholesterol metabolism and type 1 diabetes showed similar phenomena, indicating that large-scale eQTL mapping provides insight into the downstream effects of many trait-associated variants.
Abstract: Identifying the downstream effects of disease-associated SNPs is challenging. To help overcome this problem, we performed expression quantitative trait locus (eQTL) meta-analysis in non-transformed peripheral blood samples from 5,311 individuals with replication in 2,775 individuals. We identified and replicated trans eQTLs for 233 SNPs (reflecting 103 independent loci) that were previously associated with complex traits at genome-wide significance. Some of these SNPs affect multiple genes in trans that are known to be altered in individuals with disease: rs4917014, previously associated with systemic lupus erythematosus (SLE), altered gene expression of C1QB and five type I interferon response genes, both hallmarks of SLE. DeepSAGE RNA sequencing showed that rs4917014 strongly alters the 3' UTR levels of IKZF1 in cis, and chromatin immunoprecipitation and sequencing analysis of the trans-regulated genes implicated IKZF1 as the causal gene. Variants associated with cholesterol metabolism and type 1 diabetes showed similar phenomena, indicating that large-scale eQTL mapping provides insight into the downstream effects of many trait-associated variants.
1,627 citations
••
TL;DR: The Cancer Genome Atlas Pan-Cancer data set was used in this article to investigate the role of SCNAs in cancer-related SCNA patterns, including whole-genome doubling, TP53 mutations, CCNE1 amplifications and alterations of PPP2R complex.
Abstract: Determining how somatic copy number alterations (SCNAs) promote cancer is an important goal. We characterized SCNA patterns in 4,934 cancers from The Cancer Genome Atlas Pan-Cancer data set. Whole-genome doubling, observed in 37% of cancers, was associated with higher rates of every other type of SCNA, TP53 mutations, CCNE1 amplifications and alterations of the PPP2R complex. SCNAs that were internal to chromosomes tended to be shorter than telomere-bounded SCNAs, suggesting different mechanisms underlying their generation. Significantly recurrent focal SCNAs were observed in 140 regions, including 102 without known oncogene or tumor suppressor gene targets and 50 with significantly mutated genes. Amplified regions without known oncogenes were enriched for genes involved in epigenetic regulation. When levels of genomic disruption were accounted for, 7% of region pairs were anticorrelated, and these regions tended to encompass genes whose proteins physically interact, suggesting related functions. These results provide insights into mechanisms of generation and functional consequences of cancer-related SCNAs.
1,597 citations
••
Panos Deloukas1, Stavroula Kanoni1, Christina Willenborg2, Martin Farrall3 +201 more•Institutions (64)
TL;DR: An association analysis in CAD cases and controls identifies 15 loci reaching genome-wide significance, taking the number of susceptibility loci for CAD to 46, and a further 104 independent variants strongly associated with CAD at a 5% false discovery rate (FDR).
Abstract: Coronary artery disease (CAD) is the commonest cause of death. Here, we report an association analysis in 63,746 CAD cases and 130,681 controls identifying 15 loci reaching genome-wide significance, taking the number of susceptibility loci for CAD to 46, and a further 104 independent variants (r(2) < 0.2) strongly associated with CAD at a 5% false discovery rate (FDR). Together, these variants explain approximately 10.6% of CAD heritability. Of the 46 genome-wide significant lead SNPs, 12 show a significant association with a lipid trait, and 5 show a significant association with blood pressure, but none is significantly associated with diabetes. Network analysis with 233 candidate genes (loci at 10% FDR) generated 5 interaction networks comprising 85% of these putative genes involved in CAD. The four most significant pathways mapping to these networks are linked to lipid metabolism and inflammation, underscoring the causal role of these activities in the genetic etiology of CAD. Our study provides insights into the genetic basis of CAD and identifies key biological pathways.
1,518 citations
••
Broad Institute1, Harvard University2, Karolinska Institutet3, University of North Carolina at Chapel Hill4, Oslo University Hospital5, Icahn School of Medicine at Mount Sinai6, University of Queensland7, deCODE genetics8, Aarhus University Hospital9, Lundbeck10, Aarhus University11, Trinity College, Dublin12, Cardiff University13, Radboud University Nijmegen14, VU University Amsterdam15, Russian Academy16, Statens Serum Institut17, Virginia Commonwealth University18, King's College London19, Queen's University Belfast20, University of Belgrade21, Erasmus University Rotterdam22, Martin Luther University of Halle-Wittenberg23, Ludwig Maximilian University of Munich24, University of Iceland25, Tbilisi State Medical University26, National Institutes of Health27, University of Verona28, University College London29
TL;DR: The authors conducted a multi-stage genome-wide association study (GWAS) for schizophrenia and found that 8,300 independent, mostly common SNPs (95% credible interval of 6,300-10,200 SNPs) contribute to risk for schizophrenia.
Abstract: Schizophrenia is an idiopathic mental disorder with a heritable component and a substantial public health impact. We conducted a multi-stage genome-wide association study (GWAS) for schizophrenia beginning with a Swedish national sample (5,001 cases and 6,243 controls) followed by meta-Analysis with previous schizophrenia GWAS (8,832 cases and 12,067 controls) and finally by replication of SNPs in 168 genomic regions in independent samples (7,413 cases, 19,762 controls and 581 parent-offspring trios). We identified 22 loci associated at genome-wide significance; 13 of these are new, and 1 was previously implicated in bipolar disorder. Examination of candidate genes at these loci suggests the involvement of neuronal calcium signaling. We estimate that 8,300 independent, mostly common SNPs (95% credible interval of 6,300-10,200 SNPs) contribute to risk for schizophrenia and that these collectively account for at least 32% of the variance in liability. Common genetic variation has an important role in the etiology of schizophrenia, and larger studies will allow more detailed understanding of this disorder.
1,343 citations
••
TL;DR: This work distilled thousands of genetic and epigenetic features altered in cancers to ∼500 selected functional events (SFEs) and derived a hierarchical classification of 3,299 TCGA tumors from 12 cancer types, indicating the presence of different oncogenic processes.
Abstract: Cancer therapy is challenged by the diversity of molecular implementations of oncogenic processes and by the resulting variation in therapeutic responses. Projects such as The Cancer Genome Atlas (TCGA) provide molecular tumor maps in unprecedented detail. The interpretation of these maps remains a major challenge. Here we distilled thousands of genetic and epigenetic features altered in cancers to ~500 selected functional events (SFEs). Using this simplified description, we derived a hierarchical classification of 3,299 TCGA tumors from 12 cancer types. The top classes are dominated by either mutations (M class) or copy number changes (C class). This distinction is clearest at the extremes of genomic instability, indicating the presence of different oncogenic processes. The full hierarchy shows functional event patterns characteristic of multiple cross-tissue groups of tumors, termed oncogenic signature classes. Targetable functional events in a tumor class are suggestive of class-specific combination therapy. These results may assist in the definition of clinical trials to match actionable oncogenic signatures with personalized therapies.
1,212 citations
••
Ashley Beecham1, Nikolaos A. Patsopoulos2, Nikolaos A. Patsopoulos3, Dionysia K. Xifara4 +203 more•Institutions (73)
TL;DR: This study enhances the catalog of multiple sclerosis risk variants and illustrates the value of fine mapping in the resolution of GWAS signals.
Abstract: Using the ImmunoChip custom genotyping array, we analyzed 14,498 subjects with multiple sclerosis and 24,091 healthy controls for 161,311 autosomal variants and identified 135 potentially associated regions (P < 10 × 10(-4)) In a replication phase, we combined these data with previous genome-wide association study (GWAS) data from an independent 14,802 subjects with multiple sclerosis and 26,703 healthy controls In these 80,094 individuals of European ancestry, we identified 48 new susceptibility variants (P < 50 × 10(-8)), 3 of which we found after conditioning on previously identified variants Thus, there are now 110 established multiple sclerosis risk variants at 103 discrete loci outside of the major histocompatibility complex With high-resolution Bayesian fine mapping, we identified five regions where one variant accounted for more than 50% of the posterior probability of association This study enhances the catalog of multiple sclerosis risk variants and illustrates the value of fine mapping in the resolution of GWAS signals
••
TL;DR: A two-stage genome-wide association study of Alzheimer's disease involving over 16,000 individuals, the most powerful AD GWAS to date, produced compelling evidence for association with Alzheimer’s disease in the combined dataset.
Abstract: Nature GeNetics ADVANCE ONLINE PUBLICATION We undertook a two-stage genome-wide association study (GWAS) of Alzheimer’s disease (AD) involving over 16,000 individuals, the most powerful AD GWAS to date. In stage 1 (3,941 cases and 7,848 controls), we replicated the established association with the apolipoprotein E (APOE) locus (most significant SNP, rs2075650, P = 1.8 × 10−157) and observed genome-wide significant association with SNPs at two loci not previously associated with the disease: at the CLU (also known as APOJ) gene (rs11136000, P = 1.4 × 10−9) and 5 to the PICALM gene (rs3851179, P = 1.9 × 10−8). These associations were replicated in stage 2 (2,023 cases and 2,340 controls), producing compelling evidence for association with Alzheimer’s disease in the combined dataset (rs11136000, P = 8.5 × 10−10, odds ratio = 0.86; rs3851179, P = 1.3 × 10−9, odds ratio = 0.86).
••
TL;DR: A meta-analysis of 9 genome-wide association studies, including 10,052 breast cancer cases and 12,575 controls of European ancestry, and identified 29,807 SNPs for further genotyping suggests that more than 1,000 additional loci are involved in breast cancer susceptibility.
Abstract: Breast cancer is the most common cancer among women Common variants at 27 loci have been identified as associated with susceptibility to breast cancer, and these account for ∼9% of the familial risk of the disease We report here a meta-analysis of 9 genome-wide association studies, including 10,052 breast cancer cases and 12,575 controls of European ancestry, from which we selected 29,807 SNPs for further genotyping These SNPs were genotyped in 45,290 cases and 41,880 controls of European ancestry from 41 studies in the Breast Cancer Association Consortium (BCAC) The SNPs were genotyped as part of a collaborative genotyping experiment involving four consortia (Collaborative Oncological Gene-environment Study, COGS) and used a custom Illumina iSelect genotyping array, iCOGS, comprising more than 200,000 SNPs We identified SNPs at 41 new breast cancer susceptibility loci at genome-wide significance (P < 5 × 10(-8)) Further analyses suggest that more than 1,000 additional loci are involved in breast cancer susceptibility
••
TL;DR: It is demonstrated that mSWI/SNF is the most frequently mutated chromatin-regulatory complex (CRC) in human cancer, exhibiting a broad mutation pattern, similar to that of TP53, and proper functioning of polymorphic BAF complexes may constitute a major mechanism of tumor suppression.
Abstract: Subunits of mammalian SWI/SNF (mSWI/SNF or BAF) complexes have recently been implicated as tumor suppressors in human malignancies. To understand the full extent of their involvement, we conducted a proteomic analysis of endogenous mSWI/SNF complexes, which identified several new dedicated, stable subunits not found in yeast SWI/SNF complexes, including BCL7A, BCL7B and BCL7C, BCL11A and BCL11B, BRD9 and SS18. Incorporating these new members, we determined mSWI/SNF subunit mutation frequency in exome and whole-genome sequencing studies of primary human tumors. Notably, mSWI/SNF subunits are mutated in 19.6% of all human tumors reported in 44 studies. Our analysis suggests that specific subunits protect against cancer in specific tissues. In addition, mutations affecting more than one subunit, defined here as compound heterozygosity, are prevalent in certain cancers. Our studies demonstrate that mSWI/SNF is the most frequently mutated chromatin-regulatory complex (CRC) in human cancer, exhibiting a broad mutation pattern, similar to that of TP53. Thus, proper functioning of polymorphic BAF complexes may constitute a major mechanism of tumor suppression.
••
TL;DR: It is demonstrated that alteration of root system architecture improves drought avoidance through the cloning and characterization of DEEPER ROOTING 1 (DRO1), a rice quantitative trait locus controlling root growth angle.
Abstract: The genetic improvement of drought resistance is essential for stable and adequate crop production in drought-prone areas. Here we demonstrate that alteration of root system architecture improves drought avoidance through the cloning and characterization of DEEPER ROOTING 1 (DRO1), a rice quantitative trait locus controlling root growth angle. DRO1 is negatively regulated by auxin and is involved in cell elongation in the root tip that causes asymmetric root growth and downward bending of the root in response to gravity. Higher expression of DRO1 increases the root growth angle, whereby roots grow in a more downward direction. Introducing DRO1 into a shallow-rooting rice cultivar by backcrossing enabled the resulting line to avoid drought by increasing deep rooting, which maintained high yield performance under drought conditions relative to the recipient cultivar. Our experiments suggest that control of root system architecture will contribute to drought avoidance in crops.
••
TL;DR: It is shown that throughout cancer genomes APOBEC-mediated mutagenesis is pervasive and correlates withAPOBEC mRNA levels, and that ubiquitous APOBec-mediated Mutagenesis are carcinogenic.
Abstract: Recent studies indicate that a subclass of APOBEC cytidine deaminases, which convert cytosine to uracil during RNA editing and retrovirus or retrotransposon restriction, may induce mutation clusters in human tumors. We show here that throughout cancer genomes APOBEC-mediated mutagenesis is pervasive and correlates with APOBEC mRNA levels. Mutation clusters in whole-genome and exome data sets conformed to the stringent criteria indicative of an APOBEC mutation pattern. Applying these criteria to 954,247 mutations in 2,680 exomes from 14 cancer types, mostly from The Cancer Genome Atlas (TCGA), showed a significant presence of the APOBEC mutation pattern in bladder, cervical, breast, head and neck, and lung cancers, reaching 68% of all mutations in some samples. Within breast cancer, the HER2-enriched subtype was clearly enriched for tumors with the APOBEC mutation pattern, suggesting that this type of mutagenesis is functionally linked with cancer development. The APOBEC mutation pattern also extended to cancer-associated genes, implying that ubiquitous APOBEC-mediated mutagenesis is carcinogenic.
••
TL;DR: The authors reported a low median exonic mutation frequency of 0.60 per Mb (0.48 nonsilent) and notably few recurrently mutated genes in high-risk neuroblastoma.
Abstract: Neuroblastoma is a malignancy of the developing sympathetic nervous system that often presents with widespread metastatic disease, resulting in survival rates of less than 50%. To determine the spectrum of somatic mutation in high-risk neuroblastoma, we studied 240 affected individuals (cases) using a combination of whole-exome, genome and transcriptome sequencing as part of the Therapeutically Applicable Research to Generate Effective Treatments (TARGET) initiative. Here we report a low median exonic mutation frequency of 0.60 per Mb (0.48 nonsilent) and notably few recurrently mutated genes in these tumors. Genes with significant somatic mutation frequencies included ALK (9.2% of cases), PTPN11 (2.9%), ATRX (2.5%, and an additional 7.1% had focal deletions), MYCN (1.7%, causing a recurrent p.Pro44Leu alteration) and NRAS (0.83%). Rare, potentially pathogenic germline variants were significantly enriched in ALK, CHEK2, PINK1 and BARD1. The relative paucity of recurrent somatic mutations in neuroblastoma challenges current therapeutic strategies that rely on frequently altered oncogenic drivers.
••
TL;DR: A comprehensive genetic analysis of two independent cohorts of metastatic ER-positive breast tumors and identified mutations in ESR1 affecting the ligand-binding domain (LBD) in 14 of 80 cases that implicate LBD-mutant forms of ER in mediating clinical resistance to hormonal therapy and suggest that more potent ER antagonists may be of substantial therapeutic benefit.
Abstract: Sarat Chandarlapaty and colleagues report the identification of mutations in the ESR1 gene affecting the ligand-binding domain of the encoded estrogen receptor in 20% of metastatic hormone-resistant breast cancers. They determine that the mutant receptor has a hormone-independent active state that likely promotes resistance to estrogen-depriving therapies. Seventy percent of breast cancers express estrogen receptor (ER), and most of these are sensitive to ER inhibition. However, many such tumors for unknown reasons become refractory to inhibition of estrogen action in the metastatic setting. We conducted a comprehensive genetic analysis of two independent cohorts of metastatic ER-positive breast tumors and identified mutations in ESR1 affecting the ligand-binding domain (LBD) in 14 of 80 cases. These included highly recurrent mutations encoding p.Tyr537Ser, p.Tyr537Asn and p.Asp538Gly alterations. Molecular dynamics simulations suggest that the structures of the Tyr537Ser and Asp538Gly mutants involve hydrogen bonding of the mutant amino acids with Asp351, thus favoring the agonist conformation of the receptor. Consistent with this model, mutant receptors drive ER-dependent transcription and proliferation in the absence of hormone and reduce the efficacy of ER antagonists. These data implicate LBD-mutant forms of ER in mediating clinical resistance to hormonal therapy and suggest that more potent ER antagonists may be of substantial therapeutic benefit.
••
TL;DR: This integrated molecular analysis of clear-cell renal cell carcinoma unmasked new correlations between DNA methylation, gene mutation and/or gene expression and copy number profiles, enabling the stratification of clinical risks for patients with ccRCC.
Abstract: Clear-cell renal cell carcinoma (ccRCC) is the most prevalent kidney cancer and its molecular pathogenesis is incompletely understood. Here we report an integrated molecular study of ccRCC in which ≥100 ccRCC cases were fully analyzed by whole-genome and/or whole-exome and RNA sequencing as well as by array-based gene expression, copy number and/or methylation analyses. We identified a full spectrum of genetic lesions and analyzed gene expression and DNA methylation signatures and determined their impact on tumor behavior. Defective VHL-mediated proteolysis was a common feature of ccRCC, which was caused not only by VHL inactivation but also by new hotspot TCEB1 mutations, which abolished Elongin C-VHL binding, leading to HIF accumulation. Other newly identified pathways and components recurrently mutated in ccRCC included PI3K-AKT-mTOR signaling, the KEAP1-NRF2-CUL3 apparatus, DNA methylation, p53-related pathways and mRNA processing. This integrated molecular analysis unmasked new correlations between DNA methylation, gene mutation and/or gene expression and copy number profiles, enabling the stratification of clinical risks for patients with ccRCC.
••
Centra1, Clemson University2, Washington State University3, United States Department of Energy4, Parco Tecnologico Padano5, University of Chile6, North Carolina State University7, University of California, Berkeley8, University of Bologna9, Andrés Bello National University10, University of Milan11, University of Udine12, University of Barcelona13
TL;DR: Comparisons showed that peach has not undergone recent whole-genome duplication, and even though the ancestral triplicated blocks in peach are fragmentary compared to those in grape, all seven paleosets of paralogs from the putative paleoancestor are detectable.
Abstract: Rosaceae is the most important fruit-producing clade, and its key commercially relevant genera (Fragaria, Rosa, Rubus and Prunus) show broadly diverse growth habits, fruit types and compact diploid genomes. Peach, a diploid Prunus species, is one of the best genetically characterized deciduous trees. Here we describe the high-quality genome sequence of peach obtained from a completely homozygous genotype. We obtained a complete chromosome-scale assembly using Sanger whole-genome shotgun methods. We predicted 27,852 protein-coding genes, as well as noncoding RNAs. We investigated the path of peach domestication through whole-genome resequencing of 14 Prunus accessions. The analyses suggest major genetic bottlenecks that have substantially shaped peach genome diversity. Furthermore, comparative analyses showed that peach has not undergone recent whole-genome duplication, and even though the ancestral triplicated blocks in peach are fragmentary compared to those in grape, all seven paleosets of paralogs from the putative paleoancestor are detectable.
••
TL;DR: Five new LBD-localized ESR1 mutations identified here were shown to result in constitutive activity and continued responsiveness to anti-estrogen therapies in vitro, suggesting that activating mutations in E SR1 are a key mechanism in acquired endocrine resistance in breast cancer therapy.
Abstract: Arul Chinnaiyan and colleagues report the results of prospective clinical sequencing of 11 estrogen receptor–positive metastatic breast cancers. They identify ESR1 mutations affecting the ligand-binding domain in six hormone-resistant metastatic breast cancers and show that the mutant estrogen receptors are constitutively active and continue to be responsive to anti-estrogen therapies in vitro. Breast cancer is the most prevalent cancer in women, and over two-thirds of cases express estrogen receptor-α (ER-α, encoded by ESR1). Through a prospective clinical sequencing program for advanced cancers, we enrolled 11 patients with ER-positive metastatic breast cancer. Whole-exome and transcriptome analysis showed that six cases harbored mutations of ESR1 affecting its ligand-binding domain (LBD), all of whom had been treated with anti-estrogens and estrogen deprivation therapies. A survey of The Cancer Genome Atlas (TCGA) identified four endometrial cancers with similar mutations of ESR1. The five new LBD-localized ESR1 mutations identified here (encoding p.Leu536Gln, p.Tyr537Ser, p.Tyr537Cys, p.Tyr537Asn and p.Asp538Gly) were shown to result in constitutive activity and continued responsiveness to anti-estrogen therapies in vitro. Taken together, these studies suggest that activating mutations in ESR1 are a key mechanism in acquired endocrine resistance in breast cancer therapy.
••
Swiss Tropical and Public Health Institute1, University of Basel2, Fudan University3, University of Melbourne4, University of California, San Francisco5, Wellcome Trust Sanger Institute6, Veterinary Laboratories Agency7, King's College London8, University of Ghana9, Centers for Disease Control and Prevention10, Addis Ababa University11, Imperial College London12, National Institute for Medical Research13
TL;DR: Coalescent analyses indicate that MTBC emerged about 70,000 years ago, accompanied migrations of anatomically modern humans out of Africa and expanded as a consequence of increases in human population density during the Neolithic period, consistent with MTBC displaying characteristics indicative of adaptation to both low and high host densities.
Abstract: Tuberculosis caused 20% of all human deaths in the Western world between the seventeenth and nineteenth centuries and remains a cause of high mortality in developing countries. In analogy to other crowd diseases, the origin of human tuberculosis has been associated with the Neolithic Demographic Transition, but recent studies point to a much earlier origin. We analyzed the whole genomes of 259 M. tuberculosis complex (MTBC) strains and used this data set to characterize global diversity and to reconstruct the evolutionary history of this pathogen. Coalescent analyses indicate that MTBC emerged about 70,000 years ago, accompanied migrations of anatomically modern humans out of Africa and expanded as a consequence of increases in human population density during the Neolithic period. This long coevolutionary history is consistent with MTBC displaying characteristics indicative of adaptation to both low and high host densities.
••
National Institutes of Health1, Center for Drug Evaluation and Research2, Silver Spring Networks3, Johns Hopkins University4, Carolinas Medical Center5, Cornell University6, National Development and Research Institutes7, University of Maryland, Baltimore8, Veterans Health Administration9, University of California, Irvine10
TL;DR: RNA sequencing in primary human hepatocytes activated with synthetic double-stranded RNA to mimic HCV infection provides new insights into the genetic regulation of HCV clearance and its clinical management.
Abstract: Chronic infection with hepatitis C virus (HCV) is a common cause of liver cirrhosis and cancer. We performed RNA sequencing in primary human hepatocytes activated with synthetic double-stranded RNA to mimic HCV infection. Upstream of IFNL3 (IL28B) on chromosome 19q13.13, we discovered a new transiently induced region that harbors a dinucleotide variant ss469415590 (TT or ΔG), which is in high linkage disequilibrium with rs12979860, a genetic marker strongly associated with HCV clearance. ss469415590[ΔG] is a frameshift variant that creates a novel gene, designated IFNL4, encoding the interferon-λ4 protein (IFNL4), which is moderately similar to IFNL3. Compared to rs12979860, ss469415590 is more strongly associated with HCV clearance in individuals of African ancestry, although it provides comparable information in Europeans and Asians. Transient overexpression of IFNL4 in a hepatoma cell line induced STAT1 and STAT2 phosphorylation and the expression of interferon-stimulated genes. Our findings provide new insights into the genetic regulation of HCV clearance and its clinical management.
••
TL;DR: A recently described group of hypermutant, microsatellite-stable CRCs is likely to be caused by somatic POLE mutations affecting the exonuclease domain, predicted to cause a defect in the correction of mispaired bases inserted during DNA replication.
Abstract: Many individuals with multiple or large colorectal adenomas or early-onset colorectal cancer (CRC) have no detectable germline mutations in the known cancer predisposition genes. Using whole-genome sequencing, supplemented by linkage and association analysis, we identified specific heterozygous POLE or POLD1 germline variants in several multiple-adenoma and/or CRC cases but in no controls. The variants associated with susceptibility, POLE p.Leu424Val and POLD1 p.Ser478Asn, have high penetrance, and POLD1 mutation was also associated with endometrial cancer predisposition. The mutations map to equivalent sites in the proofreading (exonuclease) domain of DNA polymerases ɛ and δ and are predicted to cause a defect in the correction of mispaired bases inserted during DNA replication. In agreement with this prediction, the tumors from mutation carriers were microsatellite stable but tended to acquire base substitution mutations, as confirmed by yeast functional assays. Further analysis of published data showed that the recently described group of hypermutant, microsatellite-stable CRCs is likely to be caused by somatic POLE mutations affecting the exonuclease domain.
••
TL;DR: It is suggested that triglyceride-rich lipoproteins causally influence risk for CAD, and the strength of a polymorphism's effect on triglyceride levels is correlated with the magnitude of its effect on CAD risk.
Abstract: Triglycerides are transported in plasma by specific triglyceride-rich lipoproteins; in epidemiological studies, increased triglyceride levels correlate with higher risk for coronary artery disease (CAD). However, it is unclear whether this association reflects causal processes. We used 185 common variants recently mapped for plasma lipids (P < 5 × 10(-8) for each) to examine the role of triglycerides in risk for CAD. First, we highlight loci associated with both low-density lipoprotein cholesterol (LDL-C) and triglyceride levels, and we show that the direction and magnitude of the associations with both traits are factors in determining CAD risk. Second, we consider loci with only a strong association with triglycerides and show that these loci are also associated with CAD. Finally, in a model accounting for effects on LDL-C and/or high-density lipoprotein cholesterol (HDL-C) levels, the strength of a polymorphism's effect on triglyceride levels is correlated with the magnitude of its effect on CAD risk. These results suggest that triglyceride-rich lipoproteins causally influence risk for CAD.
••
TL;DR: Focused analysis on genes involved in vitamin C metabolism showed that GalUR, encoding the rate-limiting enzyme of the galacturonate pathway, is significantly upregulated in orange fruit, and the recent expansion of this gene family may provide a genomic basis.
Abstract: Oranges are an important nutritional source for human health and have immense economic value Here we present a comprehensive analysis of the draft genome of sweet orange (Citrus sinensis) The assembled sequence covers 873% of the estimated orange genome, which is relatively compact, as 20% is composed of repetitive elements We predicted 29,445 protein-coding genes, half of which are in the heterozygous state With additional sequencing of two more citrus species and comparative analyses of seven citrus genomes, we present evidence to suggest that sweet orange originated from a backcross hybrid between pummelo and mandarin Focused analysis on genes involved in vitamin C metabolism showed that GalUR, encoding the rate-limiting enzyme of the galacturonate pathway, is significantly upregulated in orange fruit, and the recent expansion of this gene family may provide a genomic basis This draft genome represents a valuable resource for understanding and improving many important citrus traits in the future
••
TL;DR: A collaborative genome-wide association study, including >17,100 advanced AMD cases and >60,000 controls of European and Asian ancestry, identifies 19 loci associated at P < 5 × 10−8, which show enrichment for genes involved in the regulation of complement activity, lipid metabolism, extracellular matrix remodeling and angiogenesis.
Abstract: Age-related macular degeneration (AMD) is a common cause of blindness in older individuals To accelerate the understanding of AMD biology and help design new therapies, we executed a collaborative genome-wide association study, including >17,100 advanced AMD cases and >60,000 controls of European and Asian ancestry We identified 19 loci associated at P < 5 × 10(-8) These loci show enrichment for genes involved in the regulation of complement activity, lipid metabolism, extracellular matrix remodeling and angiogenesis Our results include seven loci with associations reaching P < 5 × 10(-8) for the first time, near the genes COL8A1-FILIP1L, IER3-DDR1, SLC16A8, TGFBR1, RAD51B, ADAMTS9 and B3GALTL A genetic risk score combining SNP genotypes from all loci showed similar ability to distinguish cases and controls in all samples examined Our findings provide new directions for biological, genetic and therapeutic studies of AMD
••
University of Leicester1, King's College London2, Leiden University3, Imperial College London4, VU University Amsterdam5, University of Tartu6, University of Helsinki7, QIMR Berghofer Medical Research Institute8, University of Groningen9, Karolinska Institutet10, University College London11, University of Cambridge12, Erasmus University Rotterdam13, University of Leeds14, University of Georgia15, University of Oulu16, University of Oxford17, University of Washington18, National Institute for Health and Welfare19, National Institutes of Health20, Wellcome Trust Centre for Human Genetics21, National Institute for Health Research22, Prevention Institute23, Georgia Regents University24, Ludwig Maximilian University of Munich25, University of North Carolina at Chapel Hill26, University of Texas MD Anderson Cancer Center27, University of Milan28, University of Pennsylvania29, Harvard University30, Broad Institute31, Glenfield Hospital32
TL;DR: In this paper, a genome-wide meta-analysis of 37,684 individuals with replication of selected variants in an additional 10,739 individuals was carried out to identify seven loci, including five new loci associated with mean leukocyte telomere length (LTL) (P < 5 × 10−8).
Abstract: Interindividual variation in mean leukocyte telomere length (LTL) is associated with cancer and several age-associated diseases. We report here a genome-wide meta-analysis of 37,684 individuals with replication of selected variants in an additional 10,739 individuals. We identified seven loci, including five new loci, associated with mean LTL (P < 5 × 10(-8)). Five of the loci contain candidate genes (TERC, TERT, NAF1, OBFC1 and RTEL1) that are known to be involved in telomere biology. Lead SNPs at two loci (TERC and TERT) associate with several cancers and other diseases, including idiopathic pulmonary fibrosis. Moreover, a genetic risk score analysis combining lead variants at all 7 loci in 22,233 coronary artery disease cases and 64,762 controls showed an association of the alleles associated with shorter LTL with increased risk of coronary artery disease (21% (95% confidence interval, 5-35%) per standard deviation in LTL, P = 0.014). Our findings support a causal role of telomere-length variation in some age-related diseases.
••
TL;DR: Focusing on the therapeutically challenging diffuse LGGs, this study of 151 tumors has discovered genetic alterations and potential therapeutic targets across the entire range of pediatric L GGs and LGGNTs.
Abstract: The most common pediatric brain tumors are low-grade gliomas (LGGs) We used whole-genome sequencing to identify multiple new genetic alterations involving BRAF, RAF1, FGFR1, MYB, MYBL1 and genes with histone-related functions, including H3F3A and ATRX, in 39 LGGs and low-grade glioneuronal tumors (LGGNTs) Only a single non-silent somatic alteration was detected in 24 of 39 (62%) tumors Intragenic duplications of the portion of FGFR1 encoding the tyrosine kinase domain (TKD) and rearrangements of MYB were recurrent and mutually exclusive in 53% of grade II diffuse LGGs Transplantation of Trp53-null neonatal astrocytes expressing FGFR1 with the duplication involving the TKD into the brains of nude mice generated high-grade astrocytomas with short latency and 100% penetrance FGFR1 with the duplication induced FGFR1 autophosphorylation and upregulation of the MAPK/ERK and PI3K pathways, which could be blocked by specific inhibitors Focusing on the therapeutically challenging diffuse LGGs, our study of 151 tumors has discovered genetic alterations and potential therapeutic targets across the entire range of pediatric LGGs and LGGNTs
••
Abstract: The incidence of esophageal adenocarcinoma (EAC) has risen 600% over the last 30 years. With a 5-year survival rate of ~15%, the identification of new therapeutic targets for EAC is greatly important. We analyze the mutation spectra from whole-exome sequencing of 149 EAC tumor-normal pairs, 15 of which have also been subjected to whole-genome sequencing. We identify a mutational signature defined by a high prevalence of A>C transversions at AA dinucleotides. Statistical analysis of exome data identified 26 significantly mutated genes. Of these genes, five (TP53, CDKN2A, SMAD4, ARID1A and PIK3CA) have previously been implicated in EAC. The new significantly mutated genes include chromatin-modifying factors and candidate contributors SPG20, TLR4, ELMO1 and DOCK2. Functional analyses of EAC-derived mutations in ELMO1 identifies increased cellular invasion. Therefore, we suggest the potential activation of the RAC1 pathway as a contributor to EAC tumorigenesis.