scispace - formally typeset
Search or ask a question

Showing papers by "David Altshuler published in 2012"


Journal ArticleDOI
Nichole D. Palmer1, Caitrin W. McDonough1, Pamela J. Hicks1, B H Roh1  +381 moreInstitutions (6)
04 Jan 2012-PLOS ONE
TL;DR: It is suggested that multiple loci underlie T2DM susceptibility in the African-American population and that these loci are distinct from those identified in other ethnic populations.
Abstract: African Americans are disproportionately affected by type 2 diabetes (T2DM) yet few studies have examined T2DM using genome-wide association approaches in this ethnicity. The aim of this study was to identify genes associated with T2DM in the African American population. We performed a Genome Wide Association Study (GWAS) using the Affymetrix 6.0 array in 965 African-American cases with T2DM and end-stage renal disease (T2DM-ESRD) and 1029 population-based controls. The most significant SNPs (n = 550 independent loci) were genotyped in a replication cohort and 122 SNPs (n = 98 independent loci) were further tested through genotyping three additional validation cohorts followed by meta-analysis in all five cohorts totaling 3,132 cases and 3,317 controls. Twelve SNPs had evidence of association in the GWAS (P<0.0071), were directionally consistent in the Replication cohort and were associated with T2DM in subjects without nephropathy (P<0.05). Meta-analysis in all cases and controls revealed a single SNP reaching genome-wide significance (P<2.5×10(-8)). SNP rs7560163 (P = 7.0×10(-9), OR (95% CI) = 0.75 (0.67-0.84)) is located intergenically between RND3 and RBM43. Four additional loci (rs7542900, rs4659485, rs2722769 and rs7107217) were associated with T2DM (P<0.05) and reached more nominal levels of significance (P<2.5×10(-5)) in the overall analysis and may represent novel loci that contribute to T2DM. We have identified novel T2DM-susceptibility variants in the African-American population. Notably, T2DM risk was associated with the major allele and implies an interesting genetic architecture in this population. These results suggest that multiple loci underlie T2DM susceptibility in the African-American population and that these loci are distinct from those identified in other ethnic populations.

1,957 citations


Journal ArticleDOI
TL;DR: This article conducted a meta-analysis of genetic variants on the Metabochip, including 34,840 cases and 114,981 controls, overwhelmingly of European descent, and identified ten previously unreported T2D susceptibility loci, including two showing sex-differentiated association.
Abstract: To extend understanding of the genetic architecture and molecular basis of type 2 diabetes (T2D), we conducted a meta-analysis of genetic variants on the Metabochip, including 34,840 cases and 114,981 controls, overwhelmingly of European descent. We identified ten previously unreported T2D susceptibility loci, including two showing sex-differentiated association. Genome-wide analyses of these data are consistent with a long tail of additional common variant loci explaining much of the variation in susceptibility to T2D. Exploration of the enlarged set of susceptibility loci implicates several processes, including CREBBP-related transcription, adipocytokine signaling and cell cycle regulation, in diabetes pathogenesis.

1,899 citations


Journal ArticleDOI
Benjamin F. Voight1, Benjamin F. Voight2, Benjamin F. Voight3, Gina M. Peloso4, Gina M. Peloso5, Marju Orho-Melander6, Ruth Frikke-Schmidt7, Maja Barbalić8, Majken K. Jensen3, George Hindy6, Hilma Holm9, Eric L. Ding3, Toby Johnson10, Heribert Schunkert11, Nilesh J. Samani12, Nilesh J. Samani13, Robert Clarke14, Jemma C. Hopewell14, John F. Thompson12, Mingyao Li1, Gudmar Thorleifsson9, Christopher Newton-Cheh, Kiran Musunuru3, Kiran Musunuru2, James P. Pirruccello3, James P. Pirruccello2, Danish Saleheen15, Li Chen16, Alexandre F.R. Stewart16, Arne Schillert11, Unnur Thorsteinsdottir17, Unnur Thorsteinsdottir9, Gudmundur Thorgeirsson17, Sonia S. Anand18, James C. Engert19, Thomas M. Morgan20, John A. Spertus21, Monika Stoll22, Klaus Berger22, Nicola Martinelli23, Domenico Girelli23, Pascal P. McKeown24, Christopher Patterson24, Stephen E. Epstein25, Joseph M. Devaney25, Mary Susan Burnett25, Vincent Mooser26, Samuli Ripatti27, Ida Surakka27, Markku S. Nieminen27, Juha Sinisalo27, Marja-Liisa Lokki27, Markus Perola4, Aki S. Havulinna4, Ulf de Faire28, Bruna Gigante28, Erik Ingelsson28, Tanja Zeller29, Philipp S. Wild29, Paul I.W. de Bakker, Olaf H. Klungel30, Anke-Hilse Maitland-van der Zee30, Bas J M Peters30, Anthonius de Boer30, Diederick E. Grobbee30, Pieter Willem Kamphuisen31, Vera H.M. Deneer, Clara C. Elbers30, N. Charlotte Onland-Moret30, Marten H. Hofker31, Cisca Wijmenga31, W. M. Monique Verschuren, Jolanda M. A. Boer, Yvonne T. van der Schouw30, Asif Rasheed, Philippe M. Frossard, Serkalem Demissie4, Serkalem Demissie5, Cristen J. Willer32, Ron Do3, Jose M. Ordovas33, Jose M. Ordovas34, Gonçalo R. Abecasis32, Michael Boehnke32, Karen L. Mohlke35, Mark J. Daly2, Mark J. Daly3, Candace Guiducci2, Noël P. Burtt2, Aarti Surti2, Elena Gonzalez2, Shaun Purcell2, Shaun Purcell3, Stacey Gabriel2, Jaume Marrugat, John F. Peden14, Jeanette Erdmann11, Patrick Diemert11, Christina Willenborg11, Inke R. König11, Marcus Fischer36, Christian Hengstenberg36, Andreas Ziegler11, Ian Buysschaert37, Diether Lambrechts37, Frans Van de Werf37, Keith A.A. Fox38, Nour Eddine El Mokhtari39, Diana Rubin, Jürgen Schrezenmeir, Stefan Schreiber39, Arne Schäfer39, John Danesh15, Stefan Blankenberg29, Robert Roberts16, Ruth McPherson16, Hugh Watkins14, Alistair S. Hall40, Kim Overvad41, Eric B. Rimm3, Eric Boerwinkle8, Anne Tybjærg-Hansen7, L. Adrienne Cupples5, L. Adrienne Cupples4, Muredach P. Reilly1, Olle Melander6, Pier Mannuccio Mannucci42, Diego Ardissino, David S. Siscovick43, Roberto Elosua, Kari Stefansson17, Kari Stefansson9, Christopher J. O'Donnell4, Christopher J. O'Donnell3, Veikko Salomaa4, Daniel J. Rader1, Leena Peltonen44, Leena Peltonen27, Stephen M. Schwartz43, David Altshuler, Sekar Kathiresan 
11 Aug 2012
TL;DR: In this paper, a Mendelian randomisation analysis was performed to compare the effect of HDL cholesterol, LDL cholesterol, and genetic score on risk of myocardial infarction.
Abstract: Methods We performed two mendelian randomisation analyses. First, we used as an instrument a single nucleotide polymorphism (SNP) in the endothelial lipase gene (LIPG Asn396Ser) and tested this SNP in 20 studies (20 913 myocardial infarction cases, 95 407 controls). Second, we used as an instrument a genetic score consisting of 14 common SNPs that exclusively associate with HDL cholesterol and tested this score in up to 12 482 cases of myocardial infarction and 41 331 controls. As a positive control, we also tested a genetic score of 13 common SNPs exclusively associated with LDL cholesterol. – ¹³) but similar levels of other lipid and non-lipid risk factors for myocardial infarction compared with noncarriers. This diff erence in HDL cholesterol is expected to decrease risk of myocardial infarction by 13% (odds ratio [OR] 0·87, 95% CI 0·84–0·91). However, we noted that the 396Ser allele was not associated with risk of myocardial infarction (OR 0·99, 95% CI 0·88–1·11, p=0·85). From observational epidemiology, an increase of 1 SD in HDL cholesterol was associated with reduced risk of myocardial infarction (OR 0·62, 95% CI 0·58–0·66). However, a 1 SD increase in HDL cholesterol due to genetic score was not associated with risk of myocardial infarction (OR 0·93, 95% CI 0·68–1·26, p=0·63). For LDL cholesterol, the estimate from observational epidemiology (a 1 SD increase in LDL cholesterol associated with OR 1·54, 95% CI 1·45–1·63) was concordant with that from genetic score (OR 2·13, 95% CI 1·69–2·69, p=2×10

1,878 citations


Journal ArticleDOI
06 Jul 2012-Science
TL;DR: The findings suggest that most human variation is rare, not shared between populations, and that rare variants are likely to play a role in human health, and show that large sample sizes will be required to associate rare variants with complex traits.
Abstract: As a first step toward understanding how rare variants contribute to risk for complex diseases, we sequenced 15,585 human protein-coding genes to an average median depth of 111× in 2440 individuals of European (n = 1351) and African (n = 1088) ancestry. We identified over 500,000 single-nucleotide variants (SNVs), the majority of which were rare (86% with a minor allele frequency less than 0.5%), previously unknown (82%), and population-specific (82%). On average, 2.3% of the 13,595 SNVs each person carried were predicted to affect protein function of ~313 genes per genome, and ~95.7% of SNVs predicted to be functionally important were rare. This excess of rare functional variants is due to the combined effects of explosive, recent accelerated population growth and weak purifying selection. Furthermore, we show that large sample sizes will be required to associate rare variants with complex traits.

1,680 citations


Benjamin F. Voight, Gina M. Peloso, Marju Orho-Melander, Ruth Frikke-Schmidt, Maja Barbalić, Majken K. Jensen, George Hindy, Hilma Holm, Eric L. Ding, Toby Johnson, Heribert Schunkert, Nilesh J. Samani, Robert Clarke, Jemma C. Hopewell, John F. Thompson, Mingyao Li, Gudmar Thorleifsson, Christopher Newton-Cheh, Kiran Musunuru, James P. Pirruccello, Danish Saleheen, Li Chen, Alexandre F.R. Stewart, Arne Schillert, Unnur Thorsteinsdottir, Gudmundur Thorgeirsson, Sonia S. Anand, James C. Engert, Thomas M. Morgan, John A. Spertus, Monika Stoll, Klaus Berger, Nicola Martinelli, Domenico Girelli, Pascal P. McKeown, Christopher Patterson, Stephen E. Epstein, Joseph M. Devaney, Mary-Susan Burnett, Vincent Mooser, Samuli Ripatti, Ida Surakka, Markku S. Nieminen, Juha Sinisalo, Marja-Liisa Lokki, Markus Perola, Aki S. Havulinna, Ulf de Faire, Bruna Gigante, Erik Ingelsson, Tanja Zeller, Philipp S. Wild, Paul I.W. de Bakker, Olaf H. Klungel, Anke-Hilse Maitland-van der Zee, Bas J M Peters, Anthonius de Boer, Diederick E. Grobbee, Pieter Willem Kamphuisen, Vera H.M. Deneer, Clara C. Elbers, N. Charlotte Onland-Moret, Marten H. Hofker, Cisca Wijmenga, W. M. Monique Verschuren, Jolanda M. A. Boer, Yvonne T. van der Schouw, Asif Rasheed, Philippe M. Frossard, Serkalem Demissie, Cristen J. Willer, Ron Do, Jose M. Ordovas, Gonçalo R. Abecasis, Michael Boehnke, Karen L. Mohlke, Mark J. Daly, Candace Guiducci, Noël P. Burtt, Aarti Surti, Elena Gonzalez, Shaun Purcell, Stacey Gabriel, Jaume Marrugat, John F. Peden, Jeanette Erdmann, Patrick Diemert, Christina Willenborg, Inke R. Koenig, Marcus Fischer, Christian Hengstenberg, Andreas Ziegler, Ian Buysschaert, Diether Lambrechts, Frans Van de Werf, Keith A.A. Fox, Nour Eddine El Mokhtari, Diana Rubin, Juergen Schrezenmeir, Stefan Schreiber, Arne S. Schaefer, John Danesh, Stefan Blankenberg, Robert Roberts, Ruth McPherson, Hugh Watkins, Alistair S. Hall, Kim Overvad, Eric B. Rimm, Eric Boerwinkle, Anne Tybjærg-Hansen, L. Adrienne Cupples, Muredach P. Reilly, Olle Melander, Pier Mannuccio Mannucci, Diego Ardissino, David S. Siscovick, Roberto Elosua, Kari Stefansson, Christopher J. O'Donnell, Veikko Salomaa, Daniel J. Rader, Leena Peltonen, Stephen M. Schwartz, David Altshuler, Sekar Kathiresan 
01 Jan 2012
TL;DR: Mendelian randomisation analyses challenge the concept that raising of plasma HDL cholesterol will uniformly translate into reductions in risk of myocardial infarction.
Abstract: Summary Background High plasma HDL cholesterol is associated with reduced risk of myocardial infarction, but whether this association is causal is unclear. Exploiting the fact that genotypes are randomly assigned at meiosis, are independent of non-genetic confounding, and are unmodified by disease processes, mendelian randomisation can be used to test the hypothesis that the association of a plasma biomarker with disease is causal. Methods We performed two mendelian randomisation analyses. First, we used as an instrument a single nucleotide polymorphism (SNP) in the endothelial lipase gene (LIPG Asn396Ser) and tested this SNP in 20 studies (20 913 myocardial infarction cases, 95 407 controls). Second, we used as an instrument a genetic score consisting of 14 common SNPs that exclusively associate with HDL cholesterol and tested this score in up to 12 482 cases of myocardial infarction and 41 331 controls. As a positive control, we also tested a genetic score of 13 common SNPs exclusively associated with LDL cholesterol. Findings Carriers of the LIPG 396Ser allele (2·6% frequency) had higher HDL cholesterol (0·14 mmol/L higher, p=8×10−13) but similar levels of other lipid and non-lipid risk factors for myocardial infarction compared with non-carriers. This difference in HDL cholesterol is expected to decrease risk of myocardial infarction by 13% (odds ratio [OR] 0·87, 95% CI 0·84–0·91). However, we noted that the 396Ser allele was not associated with risk of myocardial infarction (OR 0·99, 95% CI 0·88–1·11, p=0·85). From observational epidemiology, an increase of 1 SD in HDL cholesterol was associated with reduced risk of myocardial infarction (OR 0·62, 95% CI 0·58–0·66). However, a 1 SD increase in HDL cholesterol due to genetic score was not associated with risk of myocardial infarction (OR 0·93, 95% CI 0·68–1·26, p=0·63). For LDL cholesterol, the estimate from observational epidemiology (a 1 SD increase in LDL cholesterol associated with OR 1·54, 95% CI 1·45–1·63) was concordant with that from genetic score (OR 2·13, 95% CI 1·69–2·69, p=2×10−10). Interpretation Some genetic mechanisms that raise plasma HDL cholesterol do not seem to lower risk of myocardial infarction. These data challenge the concept that raising of plasma HDL cholesterol will uniformly translate into reductions in risk of myocardial infarction. Funding US National Institutes of Health, The Wellcome Trust, European Union, British Heart Foundation, and the German Federal Ministry of Education and Research.

1,550 citations


Journal ArticleDOI
TL;DR: The Metabochip and its component SNP sets are described and evaluated, its performance in capturing variation across the allele-frequency spectrum is evaluated, solutions to methodological challenges commonly encountered in its analysis are described, and its performance as a platform for genotype imputation is evaluated.
Abstract: Genome-wide association studies have identified hundreds of loci for type 2 diabetes, coronary artery disease and myocardial infarction, as well as for related traits such as body mass index, glucose and insulin levels, lipid levels, and blood pressure. These studies also have pointed to thousands of loci with promising but not yet compelling association evidence. To establish association at additional loci and to characterize the genome-wide significant loci by fine-mapping, we designed the ‘‘Metabochip,’’ a custom genotyping array that assays nearly 200,000 SNP markers. Here, we describe the Metabochip and its component SNP sets, evaluate its performance in capturing variation across the allele-frequency spectrum, describe solutions to methodological challenges commonly encountered in its analysis, and evaluate its performance as a platform for genotype imputation. The metabochip achieves dramatic cost efficiencies compared to designing single-trait follow-up reagents, and provides the opportunity to compare results across a range of related traits. The metabochip and similar custom genotyping arrays offer a powerful and cost-effective approach to follow-up large-scale genotyping and sequencing studies and advance our understanding of the genetic basis of complex human diseases and traits.

516 citations


Journal ArticleDOI
Zari Dastani1, Hivert M-F.2, Hivert M-F.3, N J Timpson4  +615 moreInstitutions (128)
TL;DR: A meta-analysis of genome-wide association studies in 39,883 individuals of European ancestry to identify genes associated with metabolic disease identifies novel genetic determinants of adiponectin levels, which, taken together, influence risk of T2D and markers of insulin resistance.
Abstract: Circulating levels of adiponectin, a hormone produced predominantly by adipocytes, are highly heritable and are inversely associated with type 2 diabetes mellitus (T2D) and other metabolic traits. We conducted a meta-analysis of genome-wide association studies in 39,883 individuals of European ancestry to identify genes associated with metabolic disease. We identified 8 novel loci associated with adiponectin levels and confirmed 2 previously reported loci (P = 4.5×10(-8)-1.2×10(-43)). Using a novel method to combine data across ethnicities (N = 4,232 African Americans, N = 1,776 Asians, and N = 29,347 Europeans), we identified two additional novel loci. Expression analyses of 436 human adipocyte samples revealed that mRNA levels of 18 genes at candidate regions were associated with adiponectin concentrations after accounting for multiple testing (p<3×10(-4)). We next developed a multi-SNP genotypic risk score to test the association of adiponectin decreasing risk alleles on metabolic traits and diseases using consortia-level meta-analytic data. This risk score was associated with increased risk of T2D (p = 4.3×10(-3), n = 22,044), increased triglycerides (p = 2.6×10(-14), n = 93,440), increased waist-to-hip ratio (p = 1.8×10(-5), n = 77,167), increased glucose two hours post oral glucose tolerance testing (p = 4.4×10(-3), n = 15,234), increased fasting insulin (p = 0.015, n = 48,238), but with lower in HDL-cholesterol concentrations (p = 4.5×10(-13), n = 96,748) and decreased BMI (p = 1.4×10(-4), n = 121,335). These findings identify novel genetic determinants of adiponectin levels, which, taken together, influence risk of T2D and markers of insulin resistance.

456 citations


Journal ArticleDOI
Richa Saxena1, Richa Saxena2, Clara C. Elbers3, Clara C. Elbers4  +160 moreInstitutions (54)
09 Mar 2012
TL;DR: Large-scale meta-analysis involving a dense gene-centric approach has uncovered additional loci and variants that contribute to type 2 diabetes risk and suggests substantial overlap of T2D association signals across multiple ethnic groups.
Abstract: To identify genetic factors contributing to type 2 diabetes (T2D), we performed large-scale meta-analyses by using a custom similar to 50,000 SNP genotyping array (the ITMAT-Broad-CARe array) with similar to 2000 candidate genes in 39 multiethnic population-based studies, case-control studies, and clinical trials totaling 17,418 cases and 70,298 controls. First, meta-analysis of 25 studies comprising 14,073 cases and 57,489 controls of European descent confirmed eight established T2D loci at genome-wide significance. In silico follow-up analysis of putative association signals found in independent genome-wide association studies (including 8,130 cases and 38,987 controls) performed by the DIAGRAM consortium identified a T2D locus at genome-wide significance (GATAD2A/CILP2/PBX4; p = 5.7 x 10(-9)) and two loci exceeding study-wide significance (SREBF1, and TH/INS; p < 2.4 x 10(-6)). Second, meta-analyses of 1,986 cases and 7,695 controls from eight African-American studies identified study-wide-significant (p = 2.4 x 10(-7)) variants in HMGA2 and replicated variants in TCF7L2 (p = 5.1 x 10(-15)). Third, conditional analysis revealed multiple known and novel independent signals within five T2D-associated genes in samples of European ancestry and within HMGA2 in African-American samples. Fourth, a multiethnic meta-analysis of all 39 studies identified T2D-associated variants in BCL2 (p = 2.1 x 10(-8)). Finally, a composite genetic score of SNPs from new and established T2D signals was significantly associated with increased risk of diabetes in African-American, Hispanic, and Asian populations. In summary, large-scale meta-analysis involving a dense gene-centric approach has uncovered additional loci and variants that contribute to T2D risk and suggests substantial overlap of T2D association signals across multiple ethnic groups.

275 citations


Journal ArticleDOI
TL;DR: Rare sarcomere protein variants cause dominant hypertrophic and dilated cardiomyopathies and were associated with an increased risk for adverse cardiovascular events in the FHS cohort, suggesting that cardiovascular risk assessment in the general population can benefit from rare variant analysis.
Abstract: Rare sarcomere protein variants cause dominant hypertrophic and dilated cardiomyopathies. To evaluate whether allelic variants in eight sarcomere genes are associated with cardiac morphology and function in the community, we sequenced 3,600 individuals from the Framingham Heart Study (FHS) and Jackson Heart Study (JHS) cohorts. Out of the total, 11.2% of individuals had one or more rare nonsynonymous sarcomere variants. The prevalence of likely pathogenic sarcomere variants was 0.6%, twice the previous estimates; however, only four of the 22 individuals had clinical manifestations of hypertrophic cardiomyopathy. Rare sarcomere variants were associated with an increased risk for adverse cardiovascular events (hazard ratio: 2.3) in the FHS cohort, suggesting that cardiovascular risk assessment in the general population can benefit from rare variant analysis.

127 citations


Journal ArticleDOI
TL;DR: It is shown that an informed conditioning approach, based on the liability threshold model with parameters informed by external epidemiological information, fully accounts for disease prevalence and non-random ascertainment of phenotype as well as covariates and provides a substantial increase in power while maintaining a properly controlled false-positive rate.
Abstract: Genetic case-control association studies often include data on clinical covariates, such as body mass index (BMI), smoking status, or age, that may modify the underlying genetic risk of case or control samples. For example, in type 2 diabetes, odds ratios for established variants estimated from low–BMI cases are larger than those estimated from high–BMI cases. An unanswered question is how to use this information to maximize statistical power in case-control studies that ascertain individuals on the basis of phenotype (case-control ascertainment) or phenotype and clinical covariates (case-controlcovariate ascertainment). While current approaches improve power in studies with random ascertainment, they often lose power under case-control ascertainment and fail to capture available power increases under case-control-covariate ascertainment. We show that an informed conditioning approach, based on the liability threshold model with parameters informed by external epidemiological information, fully accounts for disease prevalence and non-random ascertainment of phenotype as well as covariates and provides a substantial increase in power while maintaining a properly controlled falsepositive rate. Our method outperforms standard case-control association tests with or without covariates, tests of gene x covariate interaction, and previously proposed tests for dealing with covariates in ascertained data, with especially large improvements in the case of case-control-covariate ascertainment. We investigate empirical case-control studies of type 2 diabetes, prostate cancer, lung cancer, breast cancer, rheumatoid arthritis, age-related macular degeneration, and end-stage kidney disease over a total of 89,726 samples. In these datasets, informed conditioning outperforms logistic regression for 115 of the 157 known associated variants investigated (P-value=1610 29 ). The improvement varied across diseases with a 16% median increase in x 2 test statistics and a commensurate increase in power. This suggests that applying our method to

87 citations


Journal ArticleDOI
16 Mar 2012-PLOS ONE
TL;DR: Genetic ancestry has a significant association with type 2 diabetes above and beyond its association with non-genetic risk factors for type 1 diabetes in African Americans, but no single gene with a major effect is sufficient to explain a large portion of the observed population difference in risk of diabetes.
Abstract: The risk of type 2 diabetes is approximately 2-fold higher in African Americans than in European Americans even after adjusting for known environmental risk factors, including socioeconomic status (SES), suggesting that genetic factors may explain some of this population difference in disease risk. However, relatively few genetic studies have examined this hypothesis in a large sample of African Americans with and without diabetes. Therefore, we performed an admixture analysis using 2,189 ancestry-informative markers in 7,021 African Americans (2,373 with type 2 diabetes and 4,648 without) from the Atherosclerosis Risk in Communities Study, the Jackson Heart Study, and the Multiethnic Cohort to 1) determine the association of type 2 diabetes and its related quantitative traits with African ancestry controlling for measures of SES and 2) identify genetic loci for type 2 diabetes through a genome-wide admixture mapping scan. The median percentage of African ancestry of diabetic participants was slightly greater than that of non-diabetic participants (study-adjusted difference=1.6%, P,0.001). The odds ratio for diabetes comparing participants in the highest vs. lowest tertile of African ancestry was 1.33 (95% confidence interval 1.13–1.55), after adjustment for age, sex, study, body mass index (BMI), and SES. Admixture scans identified two potential loci for diabetes at 12p13.31 (LOD=4.0) and 13q14.3 (Z score=4.5, P=6.6610 26 ). In conclusion, genetic ancestry has a significant association with type 2 diabetes above and beyond its association with nongenetic risk factors for type 2 diabetes in African Americans, but no single gene with a major effect is sufficient to explain a large portion of the observed population difference in risk of diabetes. There undoubtedly is a complex interplay among specific genetic loci and non-genetic factors, which may both be associated with overall admixture, leading to the observed ethnic differences in diabetes risk.

Journal ArticleDOI
TL;DR: This work proposes a new conditioning approach, which is based in part on the classical technique of liability threshold modeling, and shows that it outperforms both the no conditioning strategy and the standard conditioning strategy, with a properly controlled false-positive rate.
Abstract: Motivation: The question of how to best use information from known associated variants when conducting disease association studies has yet to be answered. Some studies compute a marginal P-value for each Several Nucleotide Polymorphisms independently, ignoring previously discovered variants. Other studies include known variants as covariates in logistic regression, but a weakness of this standard conditioning strategy is that it does not account for disease prevalence and non-random ascertainment, which can induce a correlation structure between candidate variants and known associated variants even if the variants lie on different chromosomes. Here, we propose a new conditioning approach, which is based in part on the classical technique of liability threshold modeling. Roughly, this method estimates model parameters for each known variant while accounting for the published disease prevalence from the epidemiological literature. Results: We show via simulation and application to empirical datasets that our approach outperforms both the no conditioning strategy and the standard conditioning strategy, with a properly controlled false-positive rate. Furthermore, in multiple data sets involving diseases of low prevalence, standard conditioning produces a severe drop in test statistics whereas our approach generally performs as well or better than no conditioning. Our approach may substantially improve disease gene discovery for diseases with many known risk variants. Availability: LTSOFT software is available online http://www.hsph.harvard.edu/faculty/alkes-price/software/ Contact:nzaitlen@hsph.harvard.edu; aprice@hsph.harvard.edu Supplementary information: Supplementary data are available at Bioinformatics online.

Journal ArticleDOI
TL;DR: The findings suggest that a high genetic burden confers an adverse lipid profile and predicts attenuated response in LDL-C levels and small LDL particle number to dietary and physical activity interventions aimed at weight loss.
Abstract: Weight-loss interventions generally improve lipid profiles and reduce cardiovascular disease risk, but effects are variable and may depend on genetic factors. We performed a genetic association analysis of data from 2,993 participants in the Diabetes Prevention Program to test the hypotheses that a genetic risk score (GRS) based on deleterious alleles at 32 lipid-associated single-nucleotide polymorphisms modifies the effects of lifestyle and/or metformin interventions on lipid levels and nuclear magnetic resonance (NMR) lipoprotein subfraction size and number. Twenty-three loci previously associated with fasting LDL-C, HDL-C, or triglycerides replicated (P = 0.04-1 × 10(-17)). Except for total HDL particles (r = -0.03, P = 0.26), all components of the lipid profile correlated with the GRS (partial |r| = 0.07-0.17, P = 5 × 10(-5)-1 10(-19)). The GRS was associated with higher baseline-adjusted 1-year LDL cholesterol levels (β = +0.87, SEE ± 0.22 mg/dl/allele, P = 8 × 10(-5), P(interaction) = 0.02) in the lifestyle intervention group, but not in the placebo (β = +0.20, SEE ± 0.22 mg/dl/allele, P = 0.35) or metformin (β = -0.03, SEE ± 0.22 mg/dl/allele, P = 0.90; P(interaction) = 0.64) groups. Similarly, a higher GRS predicted a greater number of baseline-adjusted small LDL particles at 1 year in the lifestyle intervention arm (β = +0.30, SEE ± 0.012 ln nmol/L/allele, P = 0.01, P(interaction) = 0.01) but not in the placebo (β = -0.002, SEE ± 0.008 ln nmol/L/allele, P = 0.74) or metformin (β = +0.013, SEE ± 0.008 nmol/L/allele, P = 0.12; P(interaction) = 0.24) groups. Our findings suggest that a high genetic burden confers an adverse lipid profile and predicts attenuated response in LDL-C levels and small LDL particle number to dietary and physical activity interventions aimed at weight loss.

Journal ArticleDOI
TL;DR: In this article, a statistical framework was developed to estimate genotypes jointly from sequence reads, array intensities, and imputation, and they found similar sensitivity (89% and specificity (99.6%) from imputation with either 16 sequencing or 1 M SNP arrays.
Abstract: High coverage whole genome sequencing provides near complete information about genetic variation. However, other technologies can be more efficient in some settings by (a) reducing redundant coverage within samples and (b) exploiting patterns of genetic variation across samples. To characterize as many samples as possible, many genetic studies therefore employ lower coverage sequencing or SNP array genotyping coupled to statistical imputation. To compare these approaches individually and in conjunction, we developed a statistical framework to estimate genotypes jointly from sequence reads, array intensities, and imputation. In European samples, we find similar sensitivity (89%) and specificity (99.6%) from imputation with either 16sequencing or 1 M SNP arrays. Sensitivity is increased, particularly for low-frequency polymorphisms (MAFv5%), when low coverage sequence reads are added to dense genome-wide SNP arrays — the converse, however, is not true. At sites where sequence reads and array intensities produce different sample genotypes, joint analysis reduces genotype errors and identifies novel error modes. Our joint framework informs the use of nextgeneration sequencing in genome wide association studies and supports development of improved methods for genotype calling.

Journal ArticleDOI
02 Aug 2012-PLOS ONE
TL;DR: The results do not support the existence of strong interaction effects as a common risk factor for MI, and a modest upper limit on the magnitude that epistatic risk effects are likely to have at the population level is placed.
Abstract: The genetic loci that have been found by genome-wide association studies to modulate risk of coronary heart disease explain only a fraction of its total variance, and gene-gene interactions have been proposed as a potential source of the remaining heritability. Given the potentially large testing burden, we sought to enrich our search space with real interactions by analyzing variants that may be more likely to interact on the basis of two distinct hypotheses: a biological hypothesis, under which MI risk is modulated by interactions between variants that are known to be relevant for its risk factors; and a statistical hypothesis, under which interacting variants individually show weak marginal association with MI. In a discovery sample of 2,967 cases of early-onset myocardial infarction (MI) and 3,075 controls from the MIGen study, we performed pair-wise SNP interaction testing using a logistic regression framework. Despite having reasonable power to detect interaction effects of plausible magnitudes, we observed no statistically significant evidence of interaction under these hypotheses, and no clear consistency between the top results in our discovery sample and those in a large validation sample of 1,766 cases of coronary heart disease and 2,938 controls from the Wellcome Trust Case-Control Consortium. Our results do not support the existence of strong interaction effects as a common risk factor for MI. Within the scope of the hypotheses we have explored, this study places a modest upper limit on the magnitude that epistatic risk effects are likely to have at the population level (odds ratio for MI risk 1.3-2.0, depending on allele frequency and interaction model).

Anders Albrechtsen, Niels Grarup, Yun Li, Thomas Sparsø, G. Tian, H. Cao, T. Jiang, S. Y. Kim, Thorfinn Sand Korneliussen, Q. Li, Chao Nie, R. Wu, Line Skotte, Andrew P. Morris, Claes Ladenvall, Stéphane Cauchi, A. Stancáková, Gregers S. Andersen, Arne Astrup, Karina Banasik, Amanda J. Bennett, Lars Bolund, Guillaume Charpentier, Yi Chen, J. M. Dekker, Alex S. F. Doney, Mozhgan Dorkhan, Tom Forsén, Timothy M. Frayling, C J Groves, Y. Gui, Göran Hallmans, Andrew T. Hattersley, Kunlun He, Graham A. Hitman, Johan Holmkvist, S. Huang, H. Jiang, Xin Jin, Johanne Marie Justesen, Karsten Kristiansen, Johanna Kuusisto, Maria Lajer, Olivier Lantieri, Weijing Li, H. Liang, Q. Liao, X. Liu, T. Ma, X. Ma, M. P. Manijak, Michel Marre, Jacek Mokrosinski, Andrew D. Morris, B. Mu, Aneta Aleksandra Nielsen, Giel Nijpels, Peter M. Nilsson, Colin N. A. Palmer, Nigel W. Rayner, Frida Renström, Rasmus Ribel-Madsen, Neil Robertson, Olov Rolandsson, Peter Rossing, Thue W. Schwartz, P.E. Slagboom, Maria Sterner, M. Tang, Lise Tarnow, Tiinamaija Tuomi, Esther van 't Riet, Nienke van Leeuwen, Tibor V. Varga, Marie A. Vestmar, Mark Walker, B. Wang, Y. Wang, H. Wu, F. Xi, Loic Yengo, Chang Yu, Xiaoming Zhang, J. Zhang, Q. Zhang, Weihua Zhang, H. Zheng, Y. Zhou, David Altshuler, Leen M 't Hart, Paul W. Franks, B. Balkau, Philippe Froguel, Mark I. McCarthy, Markku Laakso, Leif Groop, Cramer Christensen, Ivan Brandslund, Torsten Lauritzen, Daniel R. Witte, Allan Linneberg, Torben Jørgensen, Torben Hansen, Jun Wang, Rasmus Nielsen, Oluf Pedersen 
01 Nov 2012
TL;DR: In this paper, the authors applied exome sequencing to identify novel associations of coding polymorphisms at minor allele frequencies (MAFs) > 1% with common metabolic phenotypes, including type 2 diabetes, BMI >27.5 kg/m2 and hypertension.
Abstract: Aims/hypothesisHuman complex metabolic traits are in part regulated by genetic determinants. Here we applied exome sequencing to identify novel associations of coding polymorphisms at minor allele frequencies (MAFs) >1% with common metabolic phenotypes.MethodsThe study comprised three stages. We performed medium-depth (8×) whole exome sequencing in 1,000 cases with type 2 diabetes, BMI >27.5 kg/m2 and hypertension and in 1,000 controls (stage 1). We selected 16,192 polymorphisms nominally associated (p < 0.05) with case–control status, from four selected annotation categories or from loci reported to associate with metabolic traits. These variants were genotyped in 15,989 Danes to search for association with 12 metabolic phenotypes (stage 2). In stage 3, polymorphisms showing potential associations were genotyped in a further 63,896 Europeans.ResultsExome sequencing identified 70,182 polymorphisms with MAF >1%. In stage 2 we identified 51 potential associations with one or more of eight metabolic phenotypes covered by 45 unique polymorphisms. In meta-analyses of stage 2 and stage 3 results, we demonstrated robust associations for coding polymorphisms in CD300LG (fasting HDL-cholesterol: MAF 3.5%, p = 8.5 × 10−14), COBLL1 (type 2 diabetes: MAF 12.5%, OR 0.88, p = 1.2 × 10−11) and MACF1 (type 2 diabetes: MAF 23.4%, OR 1.10, p = 8.2 × 10−10).Conclusions/interpretationWe applied exome sequencing as a basis for finding genetic determinants of metabolic traits and show the existence of low-frequency and common coding polymorphisms with impact on common metabolic traits. Based on our study, coding polymorphisms with MAF above 1% do not seem to have particularly high effect sizes on the measured metabolic traits.


Journal ArticleDOI
TL;DR: In this case study, an early-career clinician-investigator discusses impending and future career issues in an interview with an established translational scientist, who then shares ideas about pursuing clinically informed research questions.
Abstract: In this case study, an early-career clinician-investigator discusses impending and future career issues in an interview with an established translational scientist, who then shares ideas about pursuing clinically informed research questions.

Patent
07 Nov 2012
TL;DR: In this paper, a nucleic acid constructs that encode fusion peptides comprising a bioluminescent protein and a precursor of a secreted peptide or protein expressed at the cell surface and high throughput screening assays using same.
Abstract: The present invention provides nucleic acid constructs that encode fusion peptides comprising a bioluminescent protein and a precursor of a secreted peptide or protein expressed at the cell surface and high throughput screening assays using same.

01 Jan 2012
TL;DR: A meta-analysis of genetic variants on the Metabochip, including 34,840 cases and 114,981 controls, finds a long tail of additional common variant loci explaining much of the variation in susceptibility to type 2 diabetes.

Journal ArticleDOI
TL;DR: I first want to thank the nominators and the Society for this wonderful award, and it is an honor and more than a bit humbling to be in the company of the previous winners.
Abstract: I first want to thank the nominators and the Society for this wonderful award. It is an honor and more than a bit humbling to be in the company of the previous winners.