Showing papers by "Adam Auton published in 2018"

PDF

Open Access

Posted Content•DOI•

Modeling functional enrichment improves polygenic prediction accuracy in UK Biobank and 23andMe data sets

[...]

Carla Marquez-Luna¹, Steven Gazal¹, Po-Ru Loh², Nicholas A. Furlotte, Adam Auton, Alkes L. Price¹ - Show less +2 more•Institutions (2)

Harvard University¹, Brigham and Women's Hospital²

24 Jul 2018-bioRxiv

TL;DR: This work introduces a new method for polygenic prediction, LDpred-funct, that leverages trait-specific functional enrichments to increase prediction accuracy andfits priors using the recently developed baseline-LD model.

...read moreread less

Abstract: Genetic variants in functional regions of the genome are enriched for complex trait heritability. Here, we introduce a new method for polygenic prediction, LDpred-funct, that leverages trait-specific functional enrichments to increase prediction accuracy. We fit priors using the recently developed baseline-LD model, which includes coding, conserved, regulatory and LD-related annotations. We analytically estimate posterior mean causal effect sizes and then use cross-validation to regularize these estimates, improving prediction accuracy for sparse architectures. LDpred-funct attained higher prediction accuracy than other polygenic prediction methods in simulations using real genotypes. We applied LDpred-funct to predict 16 highly heritable traits in the UK Biobank. We used association statistics from British-ancestry samples as training data (avg N=365K) and samples of other European ancestries as validation data (avg N=22K), to minimize confounding. LDpred-funct attained a +27% relative improvement in prediction accuracy (avg prediction R 2 =0.173; highest R 2 =0.417 for height) compared to existing methods that do not incorporate functional information, consistent with simulations. For height, meta-analyzing training data from UK Biobank and 23andMe cohorts (total N=1107K; higher heritability in UK Biobank cohort) increased prediction R 2 to 0.429. Our results show that modeling functional enrichment substantially improves polygenic prediction accuracy, bringing polygenic prediction of complex traits closer to clinical utility.

...read moreread less

57 citations

Posted Content•DOI•

LDpred-funct: incorporating functional priors improves polygenic prediction accuracy in UK Biobank and 23andMe data sets

[...]

Carla Marquez-Luna¹, Carla Marquez-Luna², Carla Marquez-Luna³, Steven Gazal¹, Steven Gazal², Po-Ru Loh⁴, Po-Ru Loh¹, Po-Ru Loh², Samuel S. Kim⁵, Samuel S. Kim¹, Furlotte N, Adam Auton, Alkes L. Price², Alkes L. Price¹ - Show less +10 more•Institutions (5)

Harvard University¹, Broad Institute², Icahn School of Medicine at Mount Sinai³, Brigham and Women's Hospital⁴, Massachusetts Institute of Technology⁵

24 Jul 2018-bioRxiv

TL;DR: In this article, a new method for polygenic prediction, LDpred-funct, that leverages trait-specific functional priors to increase prediction accuracy was introduced. But the method was not applied to predict 21 highly heritable traits in the UK Biobank.

...read moreread less

Abstract: Genetic variants in functional regions of the genome are enriched for complex trait heritability. Here, we introduce a new method for polygenic prediction, LDpred-funct, that leverages trait-specific functional priors to increase prediction accuracy. We fit priors using the recently developed baseline-LD model, which includes coding, conserved, regulatory and LD-related annotations. We analytically estimate posterior mean causal effect sizes and then use cross-validation to regularize these estimates, improving prediction accuracy for sparse architectures. LDpred-funct attained higher prediction accuracy than other polygenic prediction methods in simulations using real genotypes. We applied LDpred-funct to predict 21 highly heritable traits in the UK Biobank. We used association statistics from British-ancestry samples as training data (avg N=373K) and samples of other European ancestries as validation data (avg N=22K), to minimize confounding. LDpred-funct attained a +4.6% relative improvement in average prediction accuracy (avg prediction R2=0.144; highest R2=0.413 for height) compared to SBayesR (the best method that does not incorporate functional information). For height, meta-analyzing training data from UK Biobank and 23andMe cohorts (total N=1107K; higher heritability in UK Biobank cohort) increased prediction R2 to 0.431. Our results show that incorporating functional priors improves polygenic prediction accuracy, consistent with the functional architecture of complex traits.

...read moreread less

49 citations

Journal Article•DOI•

Genome-wide association study identifies nine novel loci for 2D:4D finger ratio: a putative retrospective biomarker of testosterone exposure in utero

[...]

Nicole M. Warrington, Enisa Shevroja¹, Gibran Hemani², Pirro G. Hysi³, Yunxuan Jiang, Adam Auton⁴, Adam Auton⁵, Cindy G. Boer¹, Massimo Mangino, Carol A. Wang⁶, Carol A. Wang⁷, John P. Kemp², George McMahon², Carolina Medina-Gomez¹, Martha Hickey⁸, Katerina Trajanoska¹, Dieter Wolke⁹, M. Arfan Ikram¹, Grant W. Montgomery¹⁰, Janine F. Felix¹, Margaret J. Wright, David A. Mackey, Vincent W. V. Jaddoe, Nicholas G. Martin¹⁰, Nicholas G. Martin¹¹, Joyce Y. Tung, George Davey Smith, Craig E. Pennell, Tim D. Spector, Joyce B. J. van Meurs¹, Fernando Rivadeneira, Sarah E. Medland¹⁰, David M. Evans² - Show less +29 more•Institutions (11)

Erasmus University Rotterdam¹, University of Bristol², King's College London³, Wellcome Trust Centre for Human Genetics⁴, University of Oxford⁵, University of Western Australia⁶, University of Newcastle⁷, University of Melbourne⁸, University of Warwick⁹, QIMR Berghofer Medical Research Institute¹⁰, University College London¹¹

01 Jun 2018-Human Molecular Genetics

TL;DR: Although the hypothesis that 2D:4D ratio is a direct biomarker of prenatal exposure to androgens in healthy individuals, the findings do not explicitly exclude this possibility, and pathways involving testosterone may become apparent as the size of the discovery sample increases further.

...read moreread less

Abstract: The ratio of the length of the index finger to that of the ring finger (2D:4D) is sexually dimorphic and is commonly used as a non-invasive biomarker of prenatal androgen exposure. Most association studies of 2D:4D ratio with a diverse range of sex-specific traits have typically involved small sample sizes and have been difficult to replicate, raising questions around the utility and precise meaning of the measure. In the largest genome-wide association meta-analysis of 2D:4D ratio to date (N = 15 661, with replication N = 75 821), we identified 11 loci (9 novel) explaining 3.8% of the variance in mean 2D:4D ratio. We also found weak evidence for association (β = 0.06; P = 0.02) between 2D:4D ratio and sensitivity to testosterone [length of the CAG microsatellite repeat in the androgen receptor (AR) gene] in females only. Furthermore, genetic variants associated with (adult) testosterone levels and/or sex hormone-binding globulin were not associated with 2D:4D ratio in our sample. Although we were unable to find strong evidence from our genetic study to support the hypothesis that 2D:4D ratio is a direct biomarker of prenatal exposure to androgens in healthy individuals, our findings do not explicitly exclude this possibility, and pathways involving testosterone may become apparent as the size of the discovery sample increases further. Our findings provide new insight into the underlying biology shaping 2D:4D variation in the general population.

...read moreread less

41 citations

Posted Content•DOI•

Genome-wide study identifies 611 loci associated with risk tolerance and risky behaviors

[...]

Richard Karlsson Linnér¹, Richard Karlsson Linnér², Pietro Biroli³, Edward Kong⁴, S. Fleur W. Meddens¹, S. Fleur W. Meddens², Robbee Wedow⁵, Mark Alan Fontana⁶, Mark Alan Fontana⁷, Maël Lebreton⁸, Abdel Abdellaoui¹, Anke R. Hammerschlag¹, Michel G. Nivard¹, Aysu Okbay¹, Cornelius A. Rietveld², Pascal Timshel⁹, Pascal Timshel¹⁰, Stephen P. Tino¹¹, Maciej Trzaskowski¹², Ronald de Vlaming², Ronald de Vlaming¹, Christian L. Zund³, Yanchun Bao¹³, Laura Buzdugan³, Ann H. Caplin, Chia-Yen Chen¹⁴, Chia-Yen Chen⁴, Peter Eibich¹⁵, Peter Eibich¹⁶, Peter Eibich¹⁷, Pierre Fontanillas, Juan R. González¹⁸, Peter K. Joshi¹⁹, Ville Karhunen²⁰, Aaron Kleinman, Remy Z. Levin²¹, Christina M. Lill²², Gerardus A. Meddens, Gerard Muntané¹⁸, Sandra Sanchez-Roige²¹, Frank J. A. van Rooij², Erdogan Taskesen¹, Yang Wu¹², Futao Zhang¹², Adam Auton, Jason D. Boardman⁵, David W. Clark¹⁹, Andrew Conlin²⁰, Conor C. Dolan¹, Urs Fischbacher²³, Patrick J. F. Groenen², Kathleen Mullan Harris²⁴, Gregor Hasler²⁵, Albert Hofman², Albert Hofman⁴, Mohammad Arfan Ikram², Sonia Jain²¹, Robert Karlsson²⁶, Ronald C. Kessler⁴, Maarten Kooyman, James MacKillop²⁷, Minna Männikkö²⁰, Carlos Morcillo-Suarez¹⁸, Matthew B. McQueen⁵, Klaus M. Schmidt²⁸, Melissa C. Smart¹³, Matthias Sutter¹⁷, Matthias Sutter²⁹, Roy Thurik², André G. Uitterlinden², Jon White³⁰, Harriet de Wit³¹, Jian Yang¹², Lars Bertram²², Lars Bertram³², Dorret I. Boomsma¹, Tõnu Esko³³, Ernst Fehr³, David A. Hinds, Magnus Johannesson³⁴, Meena Kumari¹³, David Laibson⁴, Patrik K. E. Magnusson²⁶, Michelle N. Meyer, Arcadi Navarro¹⁸, Arcadi Navarro³⁵, Abraham A. Palmer²¹, Tune H. Pers⁹, Tune H. Pers¹⁰, Danielle Posthuma¹, Daniel Schunk³⁶, Murray B. Stein²¹, Rauli Svento²⁰, Henning Tiemeier², Paul R. H. J. Timmers¹⁹, Patrick Turley¹⁴, Patrick Turley⁷, Patrick Turley⁴, Robert J. Ursano³⁷, Gert G. Wagner¹⁶, Gert G. Wagner¹⁷, James F. Wilson¹⁹, James F. Wilson³⁸, Jacob Gratten¹², James J. Lee³⁹, David Cesarini⁴⁰, Daniel J. Benjamin⁷, Daniel J. Benjamin⁶, Daniel J. Benjamin⁴¹, Philipp Koellinger¹⁶, Philipp Koellinger¹, Jonathan P. Beauchamp¹¹ - Show less +108 more•Institutions (41)

VU University Amsterdam¹, Erasmus University Rotterdam², University of Zurich³, Harvard University⁴, University of Colorado Boulder⁵, Hospital for Special Surgery⁶, University of Southern California⁷, University of Amsterdam⁸, University of Copenhagen⁹, Statens Serum Institut¹⁰, University of Toronto¹¹, University of Queensland¹², University of Essex¹³, Broad Institute¹⁴, University of Oxford¹⁵, German Institute for Economic Research¹⁶, Max Planck Society¹⁷, Pompeu Fabra University¹⁸, University of Edinburgh¹⁹, University of Oulu²⁰, University of California, San Diego²¹, University of Lübeck²², University of Konstanz²³, University of North Carolina at Chapel Hill²⁴, University of Bern²⁵, Karolinska Institutet²⁶, St. Joseph's Healthcare Hamilton²⁷, Ludwig Maximilian University of Munich²⁸, University of Cologne²⁹, University College London³⁰, University of Chicago³¹, Imperial College London³², University of Tartu³³, Stockholm School of Economics³⁴, Catalan Institution for Research and Advanced Studies³⁵, University of Mainz³⁶, Uniformed Services University of the Health Sciences³⁷, Western General Hospital³⁸, University of Minnesota³⁹, New York University⁴⁰, National Bureau of Economic Research⁴¹

08 Feb 2018-bioRxiv

TL;DR: Bioinformatics analyses imply that genes near general-risk-tolerance-associated SNPs are highly expressed in brain tissues and point to a role for glutamatergic and GABAergic neurotransmission.

...read moreread less

Abstract: Humans vary substantially in their willingness to take risks. In a combined sample of over one million individuals, we conducted genome-wide association studies (GWAS) of general risk tolerance, adventurousness, and risky behaviors in the driving, drinking, smoking, and sexual domains. We identified 611 approximately independent genetic loci associated with at least one of our phenotypes, including 124 with general risk tolerance. We report evidence of substantial shared genetic influences across general risk tolerance and risky behaviors: 72 of the 124 general risk tolerance loci contain a lead SNP for at least one of our other GWAS, and general risk tolerance is moderately to strongly genetically correlated (|rˆ g | ~ 0.25 to 0.50) with a range of risky behaviors. Bioinformatics analyses imply that genes near general-risk-tolerance-associated SNPs are highly expressed in brain tissues and point to a role for glutamatergic and GABAergic neurotransmission. We find no evidence of enrichment for genes previously hypothesized to relate to risk tolerance.

...read moreread less

19 citations

Posted Content•DOI•

Estimating heritability of complex traits in admixed populations with summary statistics

[...]

Yang Luo¹, Xinyi Li¹, Xin Wang, Steven Gazal², Josep M. Mercader³, Benjamin M. Neale², Jose C Florenz², Adam Auton, Alkes L. Price², Hilary K. Finucane³, Soumya Raychaudhuri³ - Show less +7 more•Institutions (3)

Brigham and Women's Hospital¹, Harvard University², Broad Institute³

20 Dec 2018-bioRxiv

TL;DR: Cov-LDSC is introduced, a method to provide robust h g 2 estimates from GWAS summary statistics and in-sample LD estimates in admixed populations and is robust to all simulation parameters.

...read moreread less

Abstract: All summary statistics-based methods to estimate the heritability of SNPs (h g 2 ) rely on accurate linkage disequilibrium (LD) calculations. In admixed populations, such as African Americans and Latinos, LD estimates are influenced by admixture and can result in biased h g 2 estimates. Here, we introduce covariate-adjusted LD score regression (cov-LDSC), a method to provide robust h g 2 estimates from GWAS summary statistics and in-sample LD estimates in admixed populations. In simulations, we observed that unadjusted LDSC underestimates h g 2 by 10%- 60%; in contrast, cov-LDSC is robust to all simulation parameters. We applied cov-LDSC to approximately 170,000 Latino, 47,000 African American 135,000 European individuals in three quantitative and five dichotomous phenotypes. Our results show that most traits have high concordance of h g 2 between ethnic groups; for example in the 23andMe cohort, estimates of h g 2 for BMI are 0.22 ± 0.01, 0.23 ± 0.03 and 0.22 ± 0.01 in Latino, African American and European populations respectively. However, for age at menarche, we observe population specific heritability differences with estimates of h g 2 of 0.10 ± 0.03, 0.33 ± 0.13 and 0.19 ± 0.01 in Latino, African American and European populations respectively.

...read moreread less

7 citations