Author

Nilesh J. Samani

Bio: Nilesh J. Samani is an academic researcher at the University of Leicester. The author has contributed to research on the topics Genome-wide association study and Population. The author has an h-index of 149 and has co-authored 779 publications receiving 113,545 citations. Previous affiliations of Nilesh J. Samani include University Hospitals of Leicester NHS Trust and Glenfield Hospital.


Papers
Journal Article
TL;DR: The value of sex-specific GWAS to unravel the sexually dimorphic genetic underpinning of complex traits is demonstrated, with no evidence for genetic effects with opposite directions in men versus women.
Abstract: Given the anthropometric differences between men and women and previous evidence of sex-difference in genetic effects, we conducted a genome-wide search for sexually dimorphic associations with height, weight, body mass index, waist circumference, hip circumference, and waist-to-hip-ratio (133,723 individuals) and took forward 348 SNPs into follow-up (additional 137,052 individuals) in a total of 94 studies. Seven loci displayed significant sex-difference (FDR<5%), including four previously established (near GRB14/COBLL1, LYPLAL1/SLC30A10, VEGFA, ADAMTS9) and three novel anthropometric trait loci (near MAP3K1, HSD17B4, PPARG), all of which were genome-wide significant in women (P<5×10⁻⁸), but not in men. Sex-differences were apparent only for waist phenotypes, not for height, weight, BMI, or hip circumference. Moreover, we found no evidence for genetic effects with opposite directions in men versus women. The PPARG locus is of specific interest due to its role in diabetes genetics and therapy. Our results demonstrate the value of sex-specific GWAS to unravel the sexually dimorphic genetic underpinning of complex traits.

402 citations

Journal Article
31 Oct 2008-PLOS ONE
TL;DR: A gene-centric 50 K single nucleotide polymorphism (SNP) array was designed to assess potentially relevant loci across a range of cardiovascular, metabolic and inflammatory syndromes, and the IBC array is shown to complement GWAS, increasing coverage in high priority CVD-related loci across all major HapMap populations.
Abstract: A wealth of genetic associations for cardiovascular and metabolic phenotypes in humans has been accumulating over the last decade, in particular a large number of loci derived from recent genome wide association studies (GWAS). True complex disease-associated loci often exert modest effects, so their delineation currently requires integration of diverse phenotypic data from large studies to ensure robust meta-analyses. We have designed a gene-centric 50 K single nucleotide polymorphism (SNP) array to assess potentially relevant loci across a range of cardiovascular, metabolic and inflammatory syndromes. The array utilizes a “cosmopolitan” tagging approach to capture the genetic diversity across ~2,000 loci in populations represented in the HapMap and SeattleSNPs projects. The array content is informed by GWAS of vascular and inflammatory disease, expression quantitative trait loci implicated in atherosclerosis, pathway based approaches and comprehensive literature searching. The custom flexibility of the array platform facilitated interrogation of loci at differing stringencies, according to a gene prioritization strategy that allows saturation of high priority loci with a greater density of markers than the existing GWAS tools, particularly in African HapMap samples. We also demonstrate that the IBC array can be used to complement GWAS, increasing coverage in high priority CVD-related loci across all major HapMap populations. DNA from over 200,000 extensively phenotyped individuals will be genotyped with this array, with a significant portion of the generated data being released into the academic domain, facilitating in silico replication attempts, analyses of rare variants and cross-cohort meta-analyses in diverse populations. These datasets will also facilitate more robust secondary analyses, such as explorations with alternative genetic models, epistasis and gene-environment interactions.

400 citations

Journal Article
TL;DR: A broad replication provides unprecedented evidence for association between genetic variants at chromosome 9p21.3 and risk of coronary artery disease (CAD).
Abstract: Background— Recently, genome-wide association studies identified variants on chromosome 9p21.3 as affecting the risk of coronary artery disease (CAD). We investigated the association of this locus with CAD in 7 case-control studies and undertook a meta-analysis. Methods and Results— A single-nucleotide polymorphism (SNP), rs1333049, representing the 9p21.3 locus, was genotyped in 7 case-control studies involving a total of 4645 patients with myocardial infarction or CAD and 5177 controls. The mode of inheritance was determined. In addition, in 5 of the 7 studies, we genotyped 3 additional SNPs to assess a risk-associated haplotype (ACAC). Finally, a meta-analysis of the present data and previously published samples was conducted. A limited fine mapping of the locus was performed. The risk allele (C) of the lead SNP, rs1333049, was uniformly associated with CAD in each study (P<0.05). In a pooled analysis, the odds ratio per copy of the risk allele was 1.29 (95% confidence interval, 1.22 to 1.37; P=0.0001)...

393 citations

Journal Article
Sandosh Padmanabhan1, Olle Melander2, Toby Johnson3, Anna Maria Di Blasio, Wai K. Lee1, Davide Gentilini, Claire E. Hastie1, Cristina Menni1, Cristina Menni4, Maria Cristina Monti5, Christian Delles1, Stewart Laing1, Barbara Corso5, Gerjan Navis6, Arjan J. Kwakernaak6, Pim van der Harst6, Murielle Bochud7, Marc Maillard7, Michel Burnier7, Thomas Hedner8, Sverre E. Kjeldsen9, Björn Wahlstrand8, Marketa Sjögren2, Cristiano Fava10, Cristiano Fava2, Martina Montagnana2, Martina Montagnana10, Elisa Danese10, Elisa Danese2, Ole Torffvit, Bo Hedblad2, Harold Snieder6, John M. C. Connell11, Morris Brown12, Nilesh J. Samani13, Martin Farrall14, Giancarlo Cesana4, Giuseppe Mancia4, Stefano Signorini, Guido Grassi4, Susana Eyheramendy15, H.-Erich Wichmann16, Maris Laan17, David P. Strachan18, Peter S. Sever19, Denis C. Shields20, Alice Stanton21, Peter Vollenweider7, Alexander Teumer22, Henry Völzke22, Rainer Rettig22, Christopher Newton-Cheh23, Christopher Newton-Cheh24, Pankaj Arora23, Pankaj Arora24, Feng Zhang25, Nicole Soranzo26, Nicole Soranzo25, Tim D. Spector25, Gavin Lucas, Sekar Kathiresan24, Sekar Kathiresan23, David S. Siscovick27, Jian'an Luan, Ruth J. F. Loos, Nicholas J. Wareham, Brenda W.J.H. Penninx28, Brenda W.J.H. Penninx6, Brenda W.J.H. Penninx29, Ilja M. Nolte6, Martin W. McBride1, William H. Miller1, Stuart A. Nicklin1, Andrew H. Baker1, Delyth Graham1, Robert A. McDonald1, Jill P. Pell1, Naveed Sattar1, Paul Welsh1, Patricia B. Munroe3, Mark J. Caulfield3, Alberto Zanchetti30, Anna F. Dominiczak1 
TL;DR: The newly discovered UMOD locus for hypertension has the potential to give new insights into the role of uromodulin in BP regulation and to identify novel druggable targets for reducing cardiovascular risk.
Abstract: Hypertension is heritable and a major contributor to the global burden of disease. The sum of rare and common genetic variants robustly identified so far explains only 1%-2% of the population variation in BP and hypertension. This suggests the existence of more undiscovered common variants. We conducted a genome-wide association study in 1,621 hypertensive cases and 1,699 controls and follow-up validation analyses in 19,845 cases and 16,541 controls using an extreme case-control design. We identified a locus on chromosome 16 in the 5′ region of Uromodulin (UMOD; rs13333226, combined P value of 3.6×10⁻¹¹). The minor G allele is associated with a lower risk of hypertension (OR [95% CI]: 0.87 [0.84-0.91]), reduced urinary uromodulin excretion, and better renal function; each copy of the G allele is associated with a 7.7% reduction in risk of CVD events after adjusting for age, sex, BMI, and smoking status (HR = 0.923, 95% CI 0.860-0.991; p = 0.027). In a subset of 13,446 individuals with estimated glomerular filtration rate (eGFR) measurements, we show that rs13333226 is independently associated with hypertension (unadjusted for eGFR: 0.89 [0.83-0.96], p = 0.004; after eGFR adjustment: 0.89 [0.83-0.96], p = 0.003). In clinical functional studies, we also consistently show the minor G allele is associated with lower urinary uromodulin excretion. The exclusive expression of uromodulin in the thick portion of the ascending limb of Henle suggests a putative role of this variant in hypertension through an effect on sodium homeostasis. The newly discovered UMOD locus for hypertension has the potential to give new insights into the role of uromodulin in BP regulation and to identify novel druggable targets for reducing cardiovascular risk.

378 citations
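
The UMOD abstract above equates a hazard ratio of 0.923 with a 7.7% reduction in CVD risk per copy of the G allele. As a minimal sketch of that arithmetic only (the function name `percent_risk_reduction` is hypothetical and not taken from the paper), the conversion is (1 - HR) × 100, applied here to the point estimate and to the quoted confidence limits:

```python
def percent_risk_reduction(hazard_ratio: float) -> float:
    """Convert a hazard ratio below 1 into a percent reduction in risk."""
    return (1.0 - hazard_ratio) * 100.0


# Values quoted in the abstract: HR = 0.923, 95% CI 0.860-0.991.
point = percent_risk_reduction(0.923)

# A lower hazard ratio means a larger reduction, so the limits swap order.
reduction_interval = (
    percent_risk_reduction(0.991),  # smallest reduction
    percent_risk_reduction(0.860),  # largest reduction
)

print(
    f"{point:.1f}% reduction "
    f"(95% CI {reduction_interval[0]:.1f}% to {reduction_interval[1]:.1f}%)"
)
```

Under this reading, the quoted interval corresponds to a reduction of roughly 0.9% to 14.0%, which is why the estimate only narrowly excludes no effect.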

Journal Article
Philip C Haycock1, Stephen Burgess2, Aayah Nounu1, Jie Zheng1 +194 more, Institutions (88)
TL;DR: It is likely that longer telomeres increase risk for several cancers but reduce risk for some non-neoplastic diseases, including cardiovascular diseases.
Abstract: IMPORTANCE: The causal direction and magnitude of the association between telomere length and incidence of cancer and non-neoplastic diseases is uncertain owing to the susceptibility of observational studies to confounding and reverse causation. OBJECTIVE: To conduct a Mendelian randomization study, using germline genetic variants as instrumental variables, to appraise the causal relevance of telomere length for risk of cancer and non-neoplastic diseases. DATA SOURCES: Genomewide association studies (GWAS) published up to January 15, 2015. STUDY SELECTION: GWAS of noncommunicable diseases that assayed germline genetic variation and did not select cohort or control participants on the basis of preexisting diseases. Of 163 GWAS of noncommunicable diseases identified, summary data from 103 were available. DATA EXTRACTION AND SYNTHESIS: Summary association statistics for single nucleotide polymorphisms (SNPs) that are strongly associated with telomere length in the general population. MAIN OUTCOMES AND MEASURES: Odds ratios (ORs) and 95% confidence intervals (CIs) for disease per standard deviation (SD) higher telomere length due to germline genetic variation. RESULTS: Summary data were available for 35 cancers and 48 non-neoplastic diseases, corresponding to 420 081 cases (median cases, 2526 per disease) and 1 093 105 controls (median, 6789 per disease). Increased telomere length due to germline genetic variation was generally associated with increased risk for site-specific cancers. The strongest associations (ORs [95% CIs] per 1-SD change in genetically increased telomere length) were observed for glioma, 5.27 (3.15-8.81); serous low-malignant-potential ovarian cancer, 4.35 (2.39-7.94); lung adenocarcinoma, 3.19 (2.40-4.22); neuroblastoma, 2.98 (1.92-4.62); bladder cancer, 2.19 (1.32-3.66); melanoma, 1.87 (1.55-2.26); testicular cancer, 1.76 (1.02-3.04); kidney cancer, 1.55 (1.08-2.23); and endometrial cancer, 1.31 (1.07-1.61). 
Associations were stronger for rarer cancers and at tissue sites with lower rates of stem cell division. There was generally little evidence of association between genetically increased telomere length and risk of psychiatric, autoimmune, inflammatory, diabetic, and other non-neoplastic diseases, except for coronary heart disease (OR, 0.78 [95% CI, 0.67-0.90]), abdominal aortic aneurysm (OR, 0.63 [95% CI, 0.49-0.81]), celiac disease (OR, 0.42 [95% CI, 0.28-0.61]) and interstitial lung disease (OR, 0.09 [95% CI, 0.05-0.15]). CONCLUSIONS AND RELEVANCE: It is likely that longer telomeres increase risk for several cancers but reduce risk for some non-neoplastic diseases, including cardiovascular diseases.

376 citations


Cited by
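28 Jul 2005
TL;DR: PfEMP1 interacts with one or more receptors on infected erythrocytes, dendritic cells, and the placenta, and plays a key role in adhesion and immune evasion.
Abstract: Antigenic variation allows many pathogenic microorganisms to evade the host immune response. Plasmodium falciparum erythrocyte membrane protein 1 (PfEMP1), expressed on the surface of infected erythrocytes, interacts with one or more receptors on infected erythrocytes, endothelial cells, dendritic cells, and the placenta, and plays a key role in adhesion and immune evasion. The var gene family of each haploid genome encodes about 60 members, and switching transcription among different var gene variants provides the molecular basis for antigenic variation.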

18,940 citations

Journal Article
Giuseppe Mancia1, Robert Fagard, Krzysztof Narkiewicz, Josep Redon, Alberto Zanchetti, Michael Böhm, Thierry Christiaens, Renata Cifkova, Guy De Backer, Anna F. Dominiczak, Maurizio Galderisi, Diederick E. Grobbee, Tiny Jaarsma, Paulus Kirchhof, Sverre E. Kjeldsen, Stéphane Laurent, Athanasios J. Manolis, Peter M. Nilsson, Luis M. Ruilope, Roland E. Schmieder, Per Anton Sirnes, Peter Sleight, Margus Viigimaa, Bernard Waeber, Faiez Zannad, Michel Burnier, Ettore Ambrosioni, Mark Caufield, Antonio Coca, Michael H. Olsen, Costas Tsioufis, Philippe van de Borne, José Luis Zamorano, Stephan Achenbach, Helmut Baumgartner, Jeroen J. Bax, Héctor Bueno, Veronica Dean, Christi Deaton, Çetin Erol, Roberto Ferrari, David Hasdai, Arno W. Hoes, Juhani Knuuti, Philippe Kolh2, Patrizio Lancellotti, Aleš Linhart, Petros Nihoyannopoulos, Massimo F Piepoli, Piotr Ponikowski, Juan Tamargo, Michal Tendera, Adam Torbicki, William Wijns, Stephan Windecker, Denis Clement, Thierry C. Gillebert, Enrico Agabiti Rosei, Stefan D. Anker, Johann Bauersachs, Jana Brguljan Hitij, Mark J. Caulfield, Marc De Buyzere, Sabina De Geest, Geneviève Derumeaux, Serap Erdine, Csaba Farsang, Christian Funck-Brentano, Vjekoslav Gerc, Giuseppe Germanò, Stephan Gielen, Herman Haller, Jens Jordan, Thomas Kahan, Michel Komajda, Dragan Lovic, Heiko Mahrholdt, Jan Östergren, Gianfranco Parati, Joep Perk, Jorge Polónia, Bogdan A. Popescu, Zeljko Reiner, Lars Rydén, Yuriy Sirenko, Alice Stanton, Harry A.J. Struijker-Boudier, Charalambos Vlachopoulos, Massimo Volpe, David A. Wood 
TL;DR: The 2013 guidelines on hypertension of the European Society of Hypertension (ESH) and the European Society of Cardiology (ESC); the excerpt given as the abstract reproduces the guidelines' list of abbreviations and the opening of section 1.1.
Abstract: ABCD: Appropriate Blood pressure Control in Diabetes; ABI: ankle–brachial index; ABPM: ambulatory blood pressure monitoring; ACCESS: Acute Candesartan Cilexetil Therapy in Stroke Survival; ACCOMPLISH: Avoiding Cardiovascular Events in Combination Therapy in Patients Living with Systolic Hypertension; ACCORD: Action to Control Cardiovascular Risk in Diabetes; ACE: angiotensin-converting enzyme; ACTIVE I: Atrial Fibrillation Clopidogrel Trial with Irbesartan for Prevention of Vascular Events; ADVANCE: Action in Diabetes and Vascular Disease: Preterax and Diamicron-MR Controlled Evaluation; AHEAD: Action for HEAlth in Diabetes; ALLHAT: Antihypertensive and Lipid-Lowering Treatment to Prevent Heart ATtack; ALTITUDE: ALiskiren Trial In Type 2 Diabetes Using Cardio-renal Endpoints; ANTIPAF: ANgioTensin II Antagonist In Paroxysmal Atrial Fibrillation; APOLLO: A Randomized Controlled Trial of Aliskiren in the Prevention of Major Cardiovascular Events in Elderly People; ARB: angiotensin receptor blocker; ARIC: Atherosclerosis Risk In Communities; ARR: aldosterone renin ratio; ASCOT: Anglo-Scandinavian Cardiac Outcomes Trial; ASCOT-LLA: Anglo-Scandinavian Cardiac Outcomes Trial - Lipid Lowering Arm; ASTRAL: Angioplasty and STenting for Renal Artery Lesions; A-V: atrioventricular; BB: beta-blocker; BMI: body mass index; BP: blood pressure; BSA: body surface area; CA: calcium antagonist; CABG: coronary artery bypass graft; CAPPP: CAPtopril Prevention Project; CAPRAF: CAndesartan in the Prevention of Relapsing Atrial Fibrillation; CHD: coronary heart disease; CHHIPS: Controlling Hypertension and Hypertension Immediately Post-Stroke; CKD: chronic kidney disease; CKD-EPI: Chronic Kidney Disease - EPIdemiology collaboration; CONVINCE: Controlled ONset Verapamil INvestigation of CV Endpoints; CT: computed tomography; CV: cardiovascular; CVD: cardiovascular disease; D: diuretic; DASH: Dietary Approaches to Stop Hypertension; DBP: diastolic blood pressure; DCCT: Diabetes Control and Complications Study; DIRECT: DIabetic REtinopathy Candesartan Trials; DM: diabetes mellitus; DPP-4: dipeptidyl peptidase 4; EAS: European Atherosclerosis Society; EASD: European Association for the Study of Diabetes; ECG: electrocardiogram; EF: ejection fraction; eGFR: estimated glomerular filtration rate; ELSA: European Lacidipine Study on Atherosclerosis; ESC: European Society of Cardiology; ESH: European Society of Hypertension; ESRD: end-stage renal disease; EXPLOR: Amlodipine–Valsartan Combination Decreases Central Systolic Blood Pressure more Effectively than the Amlodipine–Atenolol Combination; FDA: U.S. Food and Drug Administration; FEVER: Felodipine EVent Reduction study; GISSI-AF: Gruppo Italiano per lo Studio della Sopravvivenza nell'Infarto Miocardico-Atrial Fibrillation; HbA1c: glycated haemoglobin; HBPM: home blood pressure monitoring; HOPE: Heart Outcomes Prevention Evaluation; HOT: Hypertension Optimal Treatment; HRT: hormone replacement therapy; HT: hypertension; HYVET: HYpertension in the Very Elderly Trial; IMT: intima-media thickness; I-PRESERVE: Irbesartan in Heart Failure with Preserved Systolic Function; INTERHEART: Effect of Potentially Modifiable Risk Factors associated with Myocardial Infarction in 52 Countries; INVEST: INternational VErapamil SR/T Trandolapril; ISH: Isolated systolic hypertension; JNC: Joint National Committee; JUPITER: Justification for the Use of Statins in Primary Prevention: an Intervention Trial Evaluating Rosuvastatin; LAVi: left atrial volume index; LIFE: Losartan Intervention For Endpoint Reduction in Hypertensives; LV: left ventricle/left ventricular; LVH: left ventricular hypertrophy; LVM: left ventricular mass; MDRD: Modification of Diet in Renal Disease; MRFIT: Multiple Risk Factor Intervention Trial; MRI: magnetic resonance imaging; NORDIL: The Nordic Diltiazem Intervention study; OC: oral contraceptive; OD: organ damage; ONTARGET: ONgoing Telmisartan Alone and in Combination with Ramipril Global Endpoint Trial; PAD: peripheral artery disease; PATHS: Prevention And Treatment of Hypertension Study; PCI: percutaneous coronary intervention; PPAR: peroxisome proliferator-activated receptor; PREVEND: Prevention of REnal and Vascular ENdstage Disease; PROFESS: Prevention Regimen for Effectively Avoiding Secondary Strokes; PROGRESS: Perindopril Protection Against Recurrent Stroke Study; PWV: pulse wave velocity; QALY: Quality adjusted life years; RAA: renin-angiotensin-aldosterone; RAS: renin-angiotensin system; RCT: randomized controlled trials; RF: risk factor; ROADMAP: Randomized Olmesartan And Diabetes MicroAlbuminuria Prevention; SBP: systolic blood pressure; SCAST: Angiotensin-Receptor Blocker Candesartan for Treatment of Acute STroke; SCOPE: Study on COgnition and Prognosis in the Elderly; SCORE: Systematic COronary Risk Evaluation; SHEP: Systolic Hypertension in the Elderly Program; STOP: Swedish Trials in Old Patients with Hypertension; STOP-2: The second Swedish Trial in Old Patients with Hypertension; SYSTCHINA: SYSTolic Hypertension in the Elderly: Chinese trial; SYSTEUR: SYSTolic Hypertension in Europe; TIA: transient ischaemic attack; TOHP: Trials Of Hypertension Prevention; TRANSCEND: Telmisartan Randomised AssessmeNt Study in ACE iNtolerant subjects with cardiovascular Disease; UKPDS: United Kingdom Prospective Diabetes Study; VADT: Veterans' Affairs Diabetes Trial; VALUE: Valsartan Antihypertensive Long-term Use Evaluation; WHO: World Health Organization.
### 1.1 Principles
The 2013 guidelines on hypertension of the European Society of Hypertension (ESH) and the European Society of Cardiology …

14,173 citations

Journal Article
Adam Auton1, Gonçalo R. Abecasis2, David Altshuler3, Richard Durbin4 +514 more, Institutions (90)
01 Oct 2015-Nature
TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping.
Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

12,661 citations

Journal Article
Paul Burton1, David Clayton2, Lon R. Cardon, Nicholas John Craddock3 +192 more, Institutions (4)
07 Jun 2007-Nature
TL;DR: This study has demonstrated that careful use of a shared control group represents a safe and effective approach to GWA analyses of multiple disease phenotypes; generated a genome-wide genotype database for future studies of common diseases in the British population; and shown that, provided individuals with non-European ancestry are excluded, the extent of population stratification in the British population is generally modest.
Abstract: There is increasing evidence that genome-wide association (GWA) studies represent a powerful approach to the identification of genes involved in common human diseases. We describe a joint GWA study (using the Affymetrix GeneChip 500K Mapping Array Set) undertaken in the British population, which has examined ~2,000 individuals for each of 7 major diseases and a shared set of ~3,000 controls. Case-control comparisons identified 24 independent association signals at P < 5×10⁻⁷: 1 in bipolar disorder, 1 in coronary artery disease, 9 in Crohn's disease, 3 in rheumatoid arthritis, 7 in type 1 diabetes and 3 in type 2 diabetes. On the basis of prior findings and replication studies thus far completed, almost all of these signals reflect genuine susceptibility effects. We observed association at many previously identified loci, and found compelling evidence that some loci confer risk for more than one of the diseases studied. Across all diseases, we identified a large number of further signals (including 58 loci with single-point P values between 10⁻⁵ and 5×10⁻⁷) likely to yield additional susceptibility loci. The importance of appropriately large samples was confirmed by the modest effect sizes observed at most loci identified. This study thus represents a thorough validation of the GWA approach. It has also demonstrated that careful use of a shared control group represents a safe and effective approach to GWA analyses of multiple disease phenotypes; has generated a genome-wide genotype database for future studies of common diseases in the British population; and shown that, provided individuals with non-European ancestry are excluded, the extent of population stratification in the British population is generally modest. Our findings offer new avenues for exploring the pathophysiology of these important disorders.
We anticipate that our data, results and software, which will be widely available to other investigators, will provide a powerful resource for human genetics research.

9,244 citations

Journal Article
Monkol Lek, Konrad J. Karczewski1, Konrad J. Karczewski2, Eric Vallabh Minikel2, Eric Vallabh Minikel1, Kaitlin E. Samocha, Eric Banks1, Timothy Fennell1, Anne H. O’Donnell-Luria2, Anne H. O’Donnell-Luria3, Anne H. O’Donnell-Luria1, James S. Ware, Andrew J. Hill4, Andrew J. Hill2, Andrew J. Hill1, Beryl B. Cummings1, Beryl B. Cummings2, Taru Tukiainen2, Taru Tukiainen1, Daniel P. Birnbaum1, Jack A. Kosmicki, Laramie E. Duncan1, Laramie E. Duncan2, Karol Estrada2, Karol Estrada1, Fengmei Zhao2, Fengmei Zhao1, James Zou1, Emma Pierce-Hoffman2, Emma Pierce-Hoffman1, Joanne Berghout5, David Neil Cooper6, Nicole A. Deflaux7, Mark A. DePristo1, Ron Do, Jason Flannick2, Jason Flannick1, Menachem Fromer, Laura D. Gauthier1, Jackie Goldstein2, Jackie Goldstein1, Namrata Gupta1, Daniel P. Howrigan2, Daniel P. Howrigan1, Adam Kiezun1, Mitja I. Kurki1, Mitja I. Kurki2, Ami Levy Moonshine1, Pradeep Natarajan, Lorena Orozco, Gina M. Peloso1, Gina M. Peloso2, Ryan Poplin1, Manuel A. Rivas1, Valentin Ruano-Rubio1, Samuel A. Rose1, Douglas M. Ruderfer8, Khalid Shakir1, Peter D. Stenson6, Christine Stevens1, Brett Thomas1, Brett Thomas2, Grace Tiao1, María Teresa Tusié-Luna, Ben Weisburd1, Hong-Hee Won9, Dongmei Yu, David Altshuler1, David Altshuler10, Diego Ardissino, Michael Boehnke11, John Danesh12, Stacey Donnelly1, Roberto Elosua, Jose C. Florez2, Jose C. Florez1, Stacey Gabriel1, Gad Getz1, Gad Getz2, Stephen J. Glatt13, Christina M. Hultman14, Sekar Kathiresan, Markku Laakso15, Steven A. McCarroll1, Steven A. McCarroll2, Mark I. McCarthy16, Mark I. McCarthy17, Dermot P.B. McGovern18, Ruth McPherson19, Benjamin M. Neale1, Benjamin M. Neale2, Aarno Palotie, Shaun Purcell8, Danish Saleheen20, Jeremiah M. Scharf, Pamela Sklar, Patrick F. Sullivan21, Patrick F. Sullivan14, Jaakko Tuomilehto22, Ming T. Tsuang23, Hugh Watkins16, Hugh Watkins17, James G. Wilson24, Mark J. Daly1, Mark J. Daly2, Daniel G. MacArthur2, Daniel G. MacArthur1 
18 Aug 2016-Nature
TL;DR: The aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC) provides direct evidence for the presence of widespread mutational recurrence.
Abstract: Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.

8,758 citations