scispace - formally typeset
Search or ask a question

Showing papers by "Adam E. Locke published in 2017"


01 Jan 2017
TL;DR: The results demonstrate that sufficiently large sample sizes can uncover rare and low-frequency variants of moderate-to-large effect associated with polygenic human phenotypes, and that these variants implicate relevant genes and pathways.
Abstract: Height is a highly heritable, classic polygenic trait with approximately 700 common associated variants identified through genome-wide association studies so far. Here, we report 83 height-associated coding variants with lower minor-allele frequencies (in the range of 0.1–4.8%) and effects of up to 2 centimetres per allele (such as those in IHH, STC2, AR and CRISPLD2), greater than ten times the average effect of common variants. In functional follow-up studies, rare height-increasing alleles of STC2 (giving an increase of 1–2 centimetres per allele) compromised proteolytic inhibition of PAPP-A and increased cleavage of IGFBP-4 in vitro, resulting in higher bioavailability of insulin-like growth factors. These 83 height-associated variants overlap genes that are mutated in monogenic growth disorders and highlight new biological candidates (such as ADAMTS3, IL11RA and NOX4) and pathways (such as proteoglycan and glycosaminoglycan synthesis) involved in growth. Our results demonstrate that sufficiently large sample sizes can uncover rare and low-frequency variants of moderate-to-large effect associated with polygenic human phenotypes, and that these variants implicate relevant genes and pathways.

407 citations


Journal ArticleDOI
Ioanna Tachmazidou1, Daniel Suveges1, Josine L. Min2, Graham R. S. Ritchie1, Graham R. S. Ritchie3, Julia Steinberg1, Klaudia Walter1, Valentina Iotchkova1, Valentina Iotchkova4, Jeremy Schwartzentruber1, Jie Huang, Yasin Memari1, Shane A. McCarthy1, Andrew A Crawford, Cristina Bombieri5, Massimiliano Cocca6, Aliki-Eleni Farmaki7, Tom R. Gaunt2, Pekka Jousilahti8, Marjolein N. Kooijman9, Benjamin Lehne10, Giovanni Malerba5, Satu Männistö8, Angela Matchan1, Carolina Medina-Gomez9, Sarah Metrustry11, Abhishek Nag11, Ioanna Ntalla12, Lavinia Paternoster2, Nigel W. Rayner13, Nigel W. Rayner1, Nigel W. Rayner14, Cinzia Sala15, William R. Scott10, William R. Scott16, Hashem A. Shihab2, Lorraine Southam1, Lorraine Southam13, Beate St Pourcain2, Michela Traglia15, Katerina Trajanoska9, Gialuigi Zaza, Weihua Zhang10, Weihua Zhang16, María Soler Artigas17, Narinder Bansal18, Marianne Benn19, Marianne Benn20, Zhongsheng Chen21, Petr Danecek20, Petr Danecek19, Wei-Yu Lin18, Adam E. Locke21, Adam E. Locke22, Jian'an Luan18, Alisa K. Manning23, Alisa K. Manning24, Antonella Mulas25, Carlo Sidore, Anne Tybjærg-Hansen19, Anne Tybjærg-Hansen20, Anette Varbo19, Anette Varbo20, Magdalena Zoledziewska, Chris Finan26, Konstantinos Hatzikotoulas1, Audrey E. Hendricks27, Audrey E. Hendricks1, John P. Kemp2, Alireza Moayyeri26, Alireza Moayyeri11, Kalliope Panoutsopoulou1, Michal Szpak1, Scott Wilson11, Scott Wilson28, Scott Wilson29, Michael Boehnke21, Francesco Cucca25, Emanuele Di Angelantonio30, Emanuele Di Angelantonio18, Claudia Langenberg18, Cecilia M. Lindgren14, Cecilia M. Lindgren13, Mark I. McCarthy14, Mark I. McCarthy13, Mark I. McCarthy31, Andrew P. Morris32, Andrew P. Morris33, Andrew P. Morris13, Børge G. Nordestgaard20, Børge G. Nordestgaard19, Robert A. Scott18, Martin D. Tobin30, Martin D. Tobin17, Nicholas J. Wareham18, Paul Burton2, John C. Chambers34, John C. Chambers16, John C. Chambers10, George Davey Smith2, George Dedoussis7, Janine F. Felix9, Oscar H. Franco9, Giovanni Gambaro35, Paolo Gasparini6, Christopher J Hammond11, Albert Hofman9, Vincent W. V. Jaddoe9, Marcus E. Kleber36, Jaspal S. Kooner8, Jaspal S. Kooner16, Jaspal S. Kooner34, Markus Perola32, Markus Perola8, Markus Perola37, Caroline L Relton2, Susan M. Ring2, Fernando Rivadeneira9, Veikko Salomaa8, Tim D. Spector11, Oliver Stegle4, Daniela Toniolo15, André G. Uitterlinden9, Inês Barroso1, Inês Barroso18, Celia M. T. Greenwood38, Celia M. T. Greenwood39, John R. B. Perry18, John R. B. Perry11, Brian R. Walker3, Adam S. Butterworth18, Adam S. Butterworth30, Yali Xue1, Richard Durbin1, Kerrin S. Small11, Nicole Soranzo2, Nicholas J. Timpson2, Eleftheria Zeggini1 
TL;DR: This work applied a hybrid whole-genome sequencing (WGS) and deep imputation approach to examine the broader allelic architecture of 12 anthropometric traits associated with height, body mass, and fat distribution in up to 267,616 individuals to report 106 genome-wide significant signals that have not been previously identified.
Abstract: Deep sequence-based imputation can enhance the discovery power of genome-wide association studies by assessing previously unexplored variation across the common- and low-frequency spectra We applied a hybrid whole-genome sequencing (WGS) and deep imputation approach to examine the broader allelic architecture of 12 anthropometric traits associated with height, body mass, and fat distribution in up to 267,616 individuals We report 106 genome-wide significant signals that have not been previously identified, including 9 low-frequency variants pointing to functional candidates Of the 106 signals, 6 are in genomic regions that have not been implicated with related traits before, 28 are independent signals at previously reported regions, and 72 represent previously reported signals for a different anthropometric trait 71% of signals reside within genes and fine mapping resolves 23 signals to one or two likely causal variants We confirm genetic overlap between human monogenic and polygenic anthropometric traits and find signal enrichment in cis expression QTLs in relevant tissues Our results highlight the potential of WGS strategies to enhance biologically relevant discoveries across the frequency spectrum

121 citations


Journal ArticleDOI
01 Jul 2017-Diabetes
TL;DR: The allelic spectrum for coding variants in AKT2 associated with disorders of glucose homeostasis is extended and bidirectional effects of variants within the pleckstrin homology domain ofAKT2 are demonstrated.
Abstract: To identify novel coding association signals and facilitate characterization of mechanisms influencing glycemic traits and type 2 diabetes risk, we analyzed 109,215 variants derived from exome array genotyping together with an additional 390,225 variants from exome sequence in up to 39,339 normoglycemic individuals from five ancestry groups. We identified a novel association between the coding variant (p.Pro50Thr) in AKT2 and fasting plasma insulin (FI), a gene in which rare fully penetrant mutations are causal for monogenic glycemic disorders. The low-frequency allele is associated with a 12% increase in FI levels. This variant is present at 1.1% frequency in Finns but virtually absent in individuals from other ancestries. Carriers of the FI-increasing allele had increased 2-h insulin values, decreased insulin sensitivity, and increased risk of type 2 diabetes (odds ratio 1.05). In cellular studies, the AKT2-Thr50 protein exhibited a partial loss of function. We extend the allelic spectrum for coding variants in AKT2 associated with disorders of glucose homeostasis and demonstrate bidirectional effects of variants within the pleckstrin homology domain of AKT2.

53 citations


Journal ArticleDOI
TL;DR: Gene-based associations (P<10−10) support a role for coding variants in LIPC and LIPG with lipoprotein subclass traits and novel association signals provide further insight into the molecular basis of dyslipidemia and the etiology of metabolic disorders.
Abstract: Lipid and lipoprotein subclasses are associated with metabolic and cardiovascular diseases, yet the genetic contributions to variability in subclass traits are not fully understood. We conducted single-variant and gene-based association tests between 15.1M variants from genome-wide and exome array and imputed genotypes and 72 lipid and lipoprotein traits in 8,372 Finns. After accounting for 885 variants at 157 previously identified lipid loci, we identified five novel signals near established loci at HIF3A, ADAMTS3, PLTP, LCAT, and LIPG. Four of the signals were identified with a low-frequency (0.005

53 citations


Journal ArticleDOI
TL;DR: This meta-analysis provides additional support for a significant interaction between FTO, depression and BMI, indicating that depression increases the effect of FTO on BMI.
Abstract: Background Depression and obesity are highly prevalent, and major impacts on public health frequently co-occur. Recently, we reported that having depression moderates the effect of the FTO gene, suggesting its implication in the association between depression and obesity. Aims To confirm these findings by investigating the FTO polymorphism rs9939609 in new cohorts, and subsequently in a meta-analysis. Method The sample consists of 6902 individuals with depression and 6799 controls from three replication cohorts and two original discovery cohorts. Linear regression models were performed to test for association between rs9939609 and body mass index (BMI), and for the interaction between rs9939609 and depression status for an effect on BMI. Fixed and random effects meta-analyses were performed using METASOFT. Results In the replication cohorts, we observed a significant interaction between FTO, BMI and depression with fixed effects meta-analysis (β=0.12, P = 2.7 × 10−4) and with the Han/Eskin random effects method (P = 1.4 × 10−7) but not with traditional random effects (β = 0.1, P = 0.35). When combined with the discovery cohorts, random effects meta-analysis also supports the interaction (β = 0.12, P = 0.027) being highly significant based on the Han/Eskin model (P = 6.9 × 10−8). On average, carriers of the risk allele who have depression have a 2.2% higher BMI for each risk allele, over and above the main effect of FTO. Conclusions This meta-analysis provides additional support for a significant interaction between FTO, depression and BMI, indicating that depression increases the effect of FTO on BMI. The findings provide a useful starting point in understanding the biological mechanism involved in the association between obesity and depression.

50 citations


Posted ContentDOI
14 Feb 2017-bioRxiv
TL;DR: This work uses ∼36 million singleton variants from 3,560 whole-genome sequences to infer fine-scale patterns of mutation rate heterogeneity and provides the most refined portrait to date of the factors contributing to genome-wide variability of the human germline mutation rate.
Abstract: Precise estimates of the single-nucleotide mutation rate and its variability are essential to the study of human genome evolution and genetic diseases. Here we use ~36 million singleton variants observed in 3,716 whole-genome sequences to characterize the heterogeneity of germline mutation rates across the genome. Adjacent-nucleotide context is the strongest predictor of mutability, with mutation rates varying by >650-fold depending on the identity of three bases upstream or downstream of the mutated site. Histone modifications, replication timing, recombination rate, and other local genomic features further modify mutability; magnitude and direction of this modification varies with the sequence context. Compared to estimates based on common variants used in previous approaches, singleton-based estimates provide a more accurate prediction of the mutation patterns seen in an independent dataset of ~46,000 de novo mutations; and incorporating the effects of genomic features further improves the prediction. The effects of sequence contexts, genomic features, and their interactions reported here capture the most refined portrait to date of the germline mutation patterns in humans.

47 citations


Journal ArticleDOI
Jason Flannick1, Jason Flannick2, Christian Fuchsberger3, Anubha Mahajan4  +349 moreInstitutions (77)
TL;DR: The GoT2D and T2D-GENES consortia catalogued variation from whole-genome sequencing and exome sequencing of 12,940 individuals of multiple ancestries as mentioned in this paper.
Abstract: To investigate the genetic basis of type 2 diabetes (T2D) to high resolution, the GoT2D and T2D-GENES consortia catalogued variation from whole-genome sequencing of 2,657 European individuals and exome sequencing of 12,940 individuals of multiple ancestries. Over 27M SNPs, indels, and structural variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1-5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D.

32 citations


Journal ArticleDOI
TL;DR: This study proposes four metrics and visualization tools for GWAMA, using summary statistics from cohort-level GWASs, and proposes methods to examine the concordance between demographic information, and summary statistics and methods to investigate sample overlap.
Abstract: Genome-wide association studies (GWASs) have been successful in discovering SNP trait associations for many quantitative traits and common diseases. Typically, the effect sizes of SNP alleles are very small and this requires large genome-wide association meta-analyses (GWAMAs) to maximize statistical power. A trend towards ever-larger GWAMA is likely to continue, yet dealing with summary statistics from hundreds of cohorts increases logistical and quality control problems, including unknown sample overlap, and these can lead to both false positive and false negative findings. In this study, we propose four metrics and visualization tools for GWAMA, using summary statistics from cohort-level GWASs. We propose methods to examine the concordance between demographic information, and summary statistics and methods to investigate sample overlap. (I) We use the population genetics Fst statistic to verify the genetic origin of each cohort and their geographic location, and demonstrate using GWAMA data from the GIANT Consortium that geographic locations of cohorts can be recovered and outlier cohorts can be detected. (II) We conduct principal component analysis based on reported allele frequencies, and are able to recover the ancestral information for each cohort. (III) We propose a new statistic that uses the reported allelic effect sizes and their standard errors to identify significant sample overlap or heterogeneity between pairs of cohorts. (IV) To quantify unknown sample overlap across all pairs of cohorts, we propose a method that uses randomly generated genetic predictors that does not require the sharing of individual-level genotype data and does not breach individual privacy.

19 citations


Posted ContentDOI
09 Jun 2017-bioRxiv
TL;DR: This work provides the most thorough investigation to date of the impact of rare deleterious coding variants on complex traits, suggesting widespread pleiotropic risk.
Abstract: Protein truncating variants (PTVs) are likely to modify gene function and have been linked to hundreds of Mendelian disorders. However, the impact of PTVs on complex traits has been limited by the available sample size of whole-exome sequencing studies (WES). Here we assemble WES data from 100,304 individuals to quantify the impact of rare PTVs on 13 quantitative traits and 10 diseases. We focus on those PTVs that occur in PTV-intolerant (PI) genes, as these are more likely to be pathogenic. Carriers of at least one PI-PTV were found to have an increased risk of autism, schizophrenia, bipolar disorder, intellectual disability and ADHD (P-value (p) range: 5x10-3-9x10-12). In controls, without these disorders, we found that this burden associated with increased risk of mental, behavioral and neurodevelopmental disorders as captured by electronic health record information. Furthermore, carriers of PI-PTVs tended to be shorter (p=2x10-5), have fewer years of education (p=2x10-4) and be younger (p=2x10-7); the latter observation possibly reflecting reduced survival or study participation. While other gene-sets derived from in vivo experiments did not show any associations with PTV-burden, gene sets implicated in GWAS of cardiovascular-related traits and inflammatory bowel disease showed a significant PTV-burden with corresponding traits, mainly driven by established genes involved in familial forms of these disorders. We leveraged population health registries from 14,117 individuals to study the phenome-wide impact of PI-PTVs and identified an increase in the number of hospital visits among PI-PTV carriers. In conclusion, we provide the most thorough investigation to date of the impact of rare deleterious coding variants on complex traits, suggesting widespread pleiotropic risk.

4 citations