scispace - formally typeset
Search or ask a question
Author

Nilesh J. Samani

Bio: Nilesh J. Samani is an academic researcher from University of Leicester. The author has contributed to research in topics: Genome-wide association study & Population. The author has an hindex of 149, co-authored 779 publications receiving 113545 citations. Previous affiliations of Nilesh J. Samani include University Hospitals of Leicester NHS Trust & Glenfield Hospital.


Papers
More filters
Journal ArticleDOI
TL;DR: Investigation of the association between LTL and serum metabolite levels in 7,853 individuals found longer LTL associated with higher levels of lysoPC a C17:0 and PC ae C38:4, and with lower levels of methionine, tyrosine, PC aa C32:1, and C3-OH.
Abstract: Telomere shortening has been associated with multiple age-related diseases such as cardiovascular disease, diabetes, and dementia. However, the biological mechanisms responsible for these associations remain largely unknown. In order to gain insight into the metabolic processes driving the association of leukocyte telomere length (LTL) with age-related diseases, we investigated the association between LTL and serum metabolite levels in 7,853 individuals from seven independent cohorts. LTL was determined by quantitative polymerase chain reaction and the levels of 131 serum metabolites were measured with mass spectrometry in biological samples from the same blood draw. With partial correlation analysis, we identified six metabolites that were significantly associated with LTL after adjustment for multiple testing: lysophosphatidylcholine acyl C17:0 (lysoPC a C17:0, p-value = 7.1 × 10−6), methionine (p-value = 9.2 × 10−5), tyrosine (p-value = 2.1 × 10−4), phosphatidylcholine diacyl C32:1 (PC aa C32:1, p-value = 2.4 × 10−4), hydroxypropionylcarnitine (C3-OH, p-value = 2.6 × 10−4), and phosphatidylcholine acyl-alkyl C38:4 (PC ae C38:4, p-value = 9.0 × 10−4). Pathway analysis showed that the three phosphatidylcholines and methionine are involved in homocysteine metabolism and we found supporting evidence for an association of lipid metabolism with LTL. In conclusion, we found longer LTL associated with higher levels of lysoPC a C17:0 and PC ae C38:4, and with lower levels of methionine, tyrosine, PC aa C32:1, and C3-OH. These metabolites have been implicated in inflammation, oxidative stress, homocysteine metabolism, and in cardiovascular disease and diabetes, two major drivers of morbidity and mortality.

10 citations

Journal ArticleDOI
06 Feb 2015-PLOS ONE
TL;DR: Investigation of gene-gene interactions in common variants in candidate cardiovascular genes in CAD found Moderately large additive interactions between common SNPs in genes relevant to cardiovascular disease do not appear to play a major role in genetic predisposition to CAD.
Abstract: Objective Only a small fraction of coronary artery disease (CAD) heritability has been explained by common variants identified to date. Interactions between genes of importance to cardiovascular regulation may account for some of the missing heritability of CAD. This study aimed to investigate the role of gene-gene interactions in common variants in candidate cardiovascular genes in CAD. Approach and Results 2,101 patients with CAD from the British Heart Foundation Family Heart Study and 2,426 CAD-free controls were included in the discovery cohort. All subjects were genotyped with the Illumina HumanCVD BeadChip enriched for genes and pathways relevant to the cardiovascular system and disease. The primary analysis in the discovery cohort examined pairwise interactions among 913 common (minor allele frequency >0.1) independent single nucleotide polymorphisms (SNPs) with at least nominal association with CAD in single locus analysis. A secondary exploratory interaction analysis was performed among all 11,332 independent common SNPs surviving quality control criteria. Replication analyses were conducted in 2,967 patients and 3,075 controls from the Myocardial Infarction Genetics Consortium. None of the interactions amongst 913 SNPs analysed in the primary analysis was statistically significant after correction for multiple testing (required P 1.7 for common variants in the primary analysis. Conclusions Moderately large additive interactions between common SNPs in genes relevant to cardiovascular disease do not appear to play a major role in genetic predisposition to CAD. The role of genetic interactions amongst less common SNPs and with medium and small magnitude effects remain to be investigated.

10 citations

Journal ArticleDOI
TL;DR: It is found that RA-associated SNP as a group are not associated with coronary artery disease, and shared nongenetic factors may more plausibly explain the observed coincidence of the 2 conditions.
Abstract: Objective Inflammatory diseases, specifically rheumatoid arthritis (RA), are assumed to increase the risk of coronary artery disease (CAD). More recently, multiple single-nucleotide polymorphisms (SNP) associated with RA risk were identified. If causal mechanisms affecting risks of RA and CAD are overlapping, risk alleles for RA might also increase the risk of CAD. Methods Sixty-one SNP associating with RA in genome-wide significant analyses were tested for association with CAD in CARDIoGRAM (Coronary ARtery DIsease Genome wide Replication and Meta-analysis), a metaanalysis including genome-wide association data (22,233 CAD cases, 64,762 controls). In parallel, a set of SNP being associated with low-density lipoprotein cholesterol (LDL-C) was tested as a positive control. Results Twenty-nine RA-associated SNP displayed a directionality-consistent association with CAD (OR range 1.002–1.073), whereas 32 RA-associated SNP were not associated with CAD (OR range 0.96–0.99 per RA risk-increasing allele). The proportion (48%) of directionality-consistent associated SNP equaled the proportion expected by chance (50%, p = 0.09). Of only 5 RA-associated SNP showing p values for CAD Conclusion We found no evidence that RA-associated SNP as a group are associated with CAD. Even though we were not able to study potential effects of all genetic variants individually, shared nongenetic factors may more plausibly explain the observed coincidence of the 2 conditions.

10 citations

Journal ArticleDOI
TL;DR: In this article, the authors investigated the association of urinary creatinine with body composition changes and outcomes in patients with new-onset or worsening heart failure (WHF) and found that lower spot urinary concentrations were associated with poorer exercise capacity and quality of life.
Abstract: Background: Muscle wasting and unintentional weight loss (cachexia) have been associated with worse outcomes in heart failure (HF), but timely identification of these adverse phenomena is difficult. Spot urinary creatinine may be an easily accessible marker to assess muscle loss and cachexia. This study investigated the association of urinary creatinine with body composition changes and outcomes in patients with new‐onset or worsening HF (WHF). Methods: In BIOSTAT‐CHF, baseline spot urinary creatinine measurements were available in 2315 patients with new‐onset or WHF in an international cohort (index cohort) and a validation cohort of 1431 similar patients from Scotland. Results: Median spot urinary creatinine concentrations were 5.2 [2.7–9.6] mmol/L in the index cohort. Median age was 69 ± 12 years and 73% were men. Lower spot urinary creatinine was associated with older age, lower height and weight, worse renal function, more severe HF, and a higher risk of >5% weight loss from baseline to 9 months (odds ratio = 1.23, 95% CI = 1.09–1.39 per log decrease; P = 0.001). Spot urinary creatinine was associated with Evans criteria of cachexia (OR = 1.26 per log decrease, 95% CI = 1.04–1.49; P = 0.016) and clustered with markers of heart failure severity in hierarchical cluster analyses. Lower urinary creatinine was associated with poorer exercise capacity and quality of life (both P < 0.001) and predicted a higher rate for all‐cause mortality [hazard ratio (HR) = 1.27, 95% CI = 1.17–1.38 per log decrease; P < 0.001] and the combined endpoints HF hospitalization or all‐cause mortality (HR = 1.23, 95% CI = 1.15–1.31 per log decrease; P < 0.001). Significance was lost after addition of the BIOSTAT risk model. Analyses of the validation cohort yielded similar findings. Conclusions: Lower spot urinary creatinine is associated with smaller body dimensions, renal dysfunction, and more severe HF in patients with new‐onset/WHF. Additionally, lower spot urinary creatinine is associated with an increased risk of weight loss and a poorer exercise capacity/quality of life. Urinary creatinine could therefore be a novel, easily obtainable marker to assess (risk of) muscle wasting in HF patients.

10 citations

Journal ArticleDOI
TL;DR: Loss of function mutations in NOTCH1 (OMIM 190198) has previously been associated with congenital heart disease involving the aortic valve, left ventricle outflow tract, and mitral valve that segregates in affected pedigrees as an autosomal dominant trait with variable expressivity.
Abstract: Background Bicuspid aortic valve is the most common congenital valvular heart defect in the general population. BAV is associated with significant morbidity due to valve failure, formation of thoracic aortic aneurysm, and increased risk of infective endocarditis and aortic dissection. Loss of function mutations in NOTCH1 (OMIM 190198) has previously been associated with congenital heart disease involving the aortic valve, left ventricle outflow tract, and mitral valve that segregates in affected pedigrees as an autosomal dominant trait with variable expressivity. Methods We performed whole-exome sequencing in four members of a three-generational family (three affected and one unaffected subject) with clinical phenotypes including aortic valve stenosis, thoracic aortic aneurysm, and ventricular septal defect. Results We identified 16 potentially damaging genetic variants (one stop variant, one splice variant, and 14 missense variants) cosegregating with the phenotype. Of these variants, the nonsense mutation (p.Tyr291*) in NOTCH1 was the most deleterious variant identified and the most likely variant causing the disease. Conclusion Inactivating NOTCH1 mutations are a rare cause of familial heart disease involving predominantly left ventricular outflow tract lesions and characterized by the heterogeneity of clinical phenotype.

10 citations


Cited by
More filters
28 Jul 2005
TL;DR: PfPMP1)与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作�ly.
Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1(PfPMP1)与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员,通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

18,940 citations

Journal ArticleDOI
Giuseppe Mancia1, Robert Fagard, Krzysztof Narkiewicz, Josep Redon, Alberto Zanchetti, Michael Böhm, Thierry Christiaens, Renata Cifkova, Guy De Backer, Anna F. Dominiczak, Maurizio Galderisi, Diederick E. Grobbee, Tiny Jaarsma, Paulus Kirchhof, Sverre E. Kjeldsen, Stéphane Laurent, Athanasios J. Manolis, Peter M. Nilsson, Luis M. Ruilope, Roland E. Schmieder, Per Anton Sirnes, Peter Sleight, Margus Viigimaa, Bernard Waeber, Faiez Zannad, Michel Burnier, Ettore Ambrosioni, Mark Caufield, Antonio Coca, Michael H. Olsen, Costas Tsioufis, Philippe van de Borne, José Luis Zamorano, Stephan Achenbach, Helmut Baumgartner, Jeroen J. Bax, Héctor Bueno, Veronica Dean, Christi Deaton, Çetin Erol, Roberto Ferrari, David Hasdai, Arno W. Hoes, Juhani Knuuti, Philippe Kolh2, Patrizio Lancellotti, Aleš Linhart, Petros Nihoyannopoulos, Massimo F Piepoli, Piotr Ponikowski, Juan Tamargo, Michal Tendera, Adam Torbicki, William Wijns, Stephan Windecker, Denis Clement, Thierry C. Gillebert, Enrico Agabiti Rosei, Stefan D. Anker, Johann Bauersachs, Jana Brguljan Hitij, Mark J. Caulfield, Marc De Buyzere, Sabina De Geest, Geneviève Derumeaux, Serap Erdine, Csaba Farsang, Christian Funck-Brentano, Vjekoslav Gerc, Giuseppe Germanò, Stephan Gielen, Herman Haller, Jens Jordan, Thomas Kahan, Michel Komajda, Dragan Lovic, Heiko Mahrholdt, Jan Östergren, Gianfranco Parati, Joep Perk, Jorge Polónia, Bogdan A. Popescu, Zeljko Reiner, Lars Rydén, Yuriy Sirenko, Alice Stanton, Harry A.J. Struijker-Boudier, Charalambos Vlachopoulos, Massimo Volpe, David A. Wood 
TL;DR: In this article, a randomized controlled trial of Aliskiren in the Prevention of Major Cardiovascular Events in Elderly people was presented. But the authors did not discuss the effect of the combination therapy in patients living with systolic hypertension.
Abstract: ABCD : Appropriate Blood pressure Control in Diabetes ABI : ankle–brachial index ABPM : ambulatory blood pressure monitoring ACCESS : Acute Candesartan Cilexetil Therapy in Stroke Survival ACCOMPLISH : Avoiding Cardiovascular Events in Combination Therapy in Patients Living with Systolic Hypertension ACCORD : Action to Control Cardiovascular Risk in Diabetes ACE : angiotensin-converting enzyme ACTIVE I : Atrial Fibrillation Clopidogrel Trial with Irbesartan for Prevention of Vascular Events ADVANCE : Action in Diabetes and Vascular Disease: Preterax and Diamicron-MR Controlled Evaluation AHEAD : Action for HEAlth in Diabetes ALLHAT : Antihypertensive and Lipid-Lowering Treatment to Prevent Heart ATtack ALTITUDE : ALiskiren Trial In Type 2 Diabetes Using Cardio-renal Endpoints ANTIPAF : ANgioTensin II Antagonist In Paroxysmal Atrial Fibrillation APOLLO : A Randomized Controlled Trial of Aliskiren in the Prevention of Major Cardiovascular Events in Elderly People ARB : angiotensin receptor blocker ARIC : Atherosclerosis Risk In Communities ARR : aldosterone renin ratio ASCOT : Anglo-Scandinavian Cardiac Outcomes Trial ASCOT-LLA : Anglo-Scandinavian Cardiac Outcomes Trial—Lipid Lowering Arm ASTRAL : Angioplasty and STenting for Renal Artery Lesions A-V : atrioventricular BB : beta-blocker BMI : body mass index BP : blood pressure BSA : body surface area CA : calcium antagonist CABG : coronary artery bypass graft CAPPP : CAPtopril Prevention Project CAPRAF : CAndesartan in the Prevention of Relapsing Atrial Fibrillation CHD : coronary heart disease CHHIPS : Controlling Hypertension and Hypertension Immediately Post-Stroke CKD : chronic kidney disease CKD-EPI : Chronic Kidney Disease—EPIdemiology collaboration CONVINCE : Controlled ONset Verapamil INvestigation of CV Endpoints CT : computed tomography CV : cardiovascular CVD : cardiovascular disease D : diuretic DASH : Dietary Approaches to Stop Hypertension DBP : diastolic blood pressure DCCT : Diabetes Control and Complications Study DIRECT : DIabetic REtinopathy Candesartan Trials DM : diabetes mellitus DPP-4 : dipeptidyl peptidase 4 EAS : European Atherosclerosis Society EASD : European Association for the Study of Diabetes ECG : electrocardiogram EF : ejection fraction eGFR : estimated glomerular filtration rate ELSA : European Lacidipine Study on Atherosclerosis ESC : European Society of Cardiology ESH : European Society of Hypertension ESRD : end-stage renal disease EXPLOR : Amlodipine–Valsartan Combination Decreases Central Systolic Blood Pressure more Effectively than the Amlodipine–Atenolol Combination FDA : U.S. Food and Drug Administration FEVER : Felodipine EVent Reduction study GISSI-AF : Gruppo Italiano per lo Studio della Sopravvivenza nell'Infarto Miocardico-Atrial Fibrillation HbA1c : glycated haemoglobin HBPM : home blood pressure monitoring HOPE : Heart Outcomes Prevention Evaluation HOT : Hypertension Optimal Treatment HRT : hormone replacement therapy HT : hypertension HYVET : HYpertension in the Very Elderly Trial IMT : intima-media thickness I-PRESERVE : Irbesartan in Heart Failure with Preserved Systolic Function INTERHEART : Effect of Potentially Modifiable Risk Factors associated with Myocardial Infarction in 52 Countries INVEST : INternational VErapamil SR/T Trandolapril ISH : Isolated systolic hypertension JNC : Joint National Committee JUPITER : Justification for the Use of Statins in Primary Prevention: an Intervention Trial Evaluating Rosuvastatin LAVi : left atrial volume index LIFE : Losartan Intervention For Endpoint Reduction in Hypertensives LV : left ventricle/left ventricular LVH : left ventricular hypertrophy LVM : left ventricular mass MDRD : Modification of Diet in Renal Disease MRFIT : Multiple Risk Factor Intervention Trial MRI : magnetic resonance imaging NORDIL : The Nordic Diltiazem Intervention study OC : oral contraceptive OD : organ damage ONTARGET : ONgoing Telmisartan Alone and in Combination with Ramipril Global Endpoint Trial PAD : peripheral artery disease PATHS : Prevention And Treatment of Hypertension Study PCI : percutaneous coronary intervention PPAR : peroxisome proliferator-activated receptor PREVEND : Prevention of REnal and Vascular ENdstage Disease PROFESS : Prevention Regimen for Effectively Avoiding Secondary Strokes PROGRESS : Perindopril Protection Against Recurrent Stroke Study PWV : pulse wave velocity QALY : Quality adjusted life years RAA : renin-angiotensin-aldosterone RAS : renin-angiotensin system RCT : randomized controlled trials RF : risk factor ROADMAP : Randomized Olmesartan And Diabetes MicroAlbuminuria Prevention SBP : systolic blood pressure SCAST : Angiotensin-Receptor Blocker Candesartan for Treatment of Acute STroke SCOPE : Study on COgnition and Prognosis in the Elderly SCORE : Systematic COronary Risk Evaluation SHEP : Systolic Hypertension in the Elderly Program STOP : Swedish Trials in Old Patients with Hypertension STOP-2 : The second Swedish Trial in Old Patients with Hypertension SYSTCHINA : SYSTolic Hypertension in the Elderly: Chinese trial SYSTEUR : SYSTolic Hypertension in Europe TIA : transient ischaemic attack TOHP : Trials Of Hypertension Prevention TRANSCEND : Telmisartan Randomised AssessmeNt Study in ACE iNtolerant subjects with cardiovascular Disease UKPDS : United Kingdom Prospective Diabetes Study VADT : Veterans' Affairs Diabetes Trial VALUE : Valsartan Antihypertensive Long-term Use Evaluation WHO : World Health Organization ### 1.1 Principles The 2013 guidelines on hypertension of the European Society of Hypertension (ESH) and the European Society of Cardiology …

14,173 citations

Journal ArticleDOI
Adam Auton1, Gonçalo R. Abecasis2, David Altshuler3, Richard Durbin4  +514 moreInstitutions (90)
01 Oct 2015-Nature
TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-generation sequencing, deep exome sequencing, and dense microarray genotyping.
Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

12,661 citations

Journal ArticleDOI
Paul Burton1, David Clayton2, Lon R. Cardon, Nicholas John Craddock3  +192 moreInstitutions (4)
07 Jun 2007-Nature
TL;DR: This study has demonstrated that careful use of a shared control group represents a safe and effective approach to GWA analyses of multiple disease phenotypes; generated a genome-wide genotype database for future studies of common diseases in the British population; and shown that, provided individuals with non-European ancestry are excluded, the extent of population stratification in theBritish population is generally modest.
Abstract: There is increasing evidence that genome-wide association ( GWA) studies represent a powerful approach to the identification of genes involved in common human diseases. We describe a joint GWA study ( using the Affymetrix GeneChip 500K Mapping Array Set) undertaken in the British population, which has examined similar to 2,000 individuals for each of 7 major diseases and a shared set of similar to 3,000 controls. Case-control comparisons identified 24 independent association signals at P < 5 X 10(-7): 1 in bipolar disorder, 1 in coronary artery disease, 9 in Crohn's disease, 3 in rheumatoid arthritis, 7 in type 1 diabetes and 3 in type 2 diabetes. On the basis of prior findings and replication studies thus-far completed, almost all of these signals reflect genuine susceptibility effects. We observed association at many previously identified loci, and found compelling evidence that some loci confer risk for more than one of the diseases studied. Across all diseases, we identified a large number of further signals ( including 58 loci with single-point P values between 10(-5) and 5 X 10(-7)) likely to yield additional susceptibility loci. The importance of appropriately large samples was confirmed by the modest effect sizes observed at most loci identified. This study thus represents a thorough validation of the GWA approach. It has also demonstrated that careful use of a shared control group represents a safe and effective approach to GWA analyses of multiple disease phenotypes; has generated a genome-wide genotype database for future studies of common diseases in the British population; and shown that, provided individuals with non-European ancestry are excluded, the extent of population stratification in the British population is generally modest. Our findings offer new avenues for exploring the pathophysiology of these important disorders. We anticipate that our data, results and software, which will be widely available to other investigators, will provide a powerful resource for human genetics research.

9,244 citations

Journal ArticleDOI
Monkol Lek, Konrad J. Karczewski1, Konrad J. Karczewski2, Eric Vallabh Minikel2, Eric Vallabh Minikel1, Kaitlin E. Samocha, Eric Banks1, Timothy Fennell1, Anne H. O’Donnell-Luria2, Anne H. O’Donnell-Luria3, Anne H. O’Donnell-Luria1, James S. Ware, Andrew J. Hill4, Andrew J. Hill2, Andrew J. Hill1, Beryl B. Cummings1, Beryl B. Cummings2, Taru Tukiainen2, Taru Tukiainen1, Daniel P. Birnbaum1, Jack A. Kosmicki, Laramie E. Duncan1, Laramie E. Duncan2, Karol Estrada2, Karol Estrada1, Fengmei Zhao2, Fengmei Zhao1, James Zou1, Emma Pierce-Hoffman2, Emma Pierce-Hoffman1, Joanne Berghout5, David Neil Cooper6, Nicole A. Deflaux7, Mark A. DePristo1, Ron Do, Jason Flannick2, Jason Flannick1, Menachem Fromer, Laura D. Gauthier1, Jackie Goldstein2, Jackie Goldstein1, Namrata Gupta1, Daniel P. Howrigan2, Daniel P. Howrigan1, Adam Kiezun1, Mitja I. Kurki1, Mitja I. Kurki2, Ami Levy Moonshine1, Pradeep Natarajan, Lorena Orozco, Gina M. Peloso1, Gina M. Peloso2, Ryan Poplin1, Manuel A. Rivas1, Valentin Ruano-Rubio1, Samuel A. Rose1, Douglas M. Ruderfer8, Khalid Shakir1, Peter D. Stenson6, Christine Stevens1, Brett Thomas1, Brett Thomas2, Grace Tiao1, María Teresa Tusié-Luna, Ben Weisburd1, Hong-Hee Won9, Dongmei Yu, David Altshuler1, David Altshuler10, Diego Ardissino, Michael Boehnke11, John Danesh12, Stacey Donnelly1, Roberto Elosua, Jose C. Florez2, Jose C. Florez1, Stacey Gabriel1, Gad Getz1, Gad Getz2, Stephen J. Glatt13, Christina M. Hultman14, Sekar Kathiresan, Markku Laakso15, Steven A. McCarroll1, Steven A. McCarroll2, Mark I. McCarthy16, Mark I. McCarthy17, Dermot P.B. McGovern18, Ruth McPherson19, Benjamin M. Neale1, Benjamin M. Neale2, Aarno Palotie, Shaun Purcell8, Danish Saleheen20, Jeremiah M. Scharf, Pamela Sklar, Patrick F. Sullivan21, Patrick F. Sullivan14, Jaakko Tuomilehto22, Ming T. Tsuang23, Hugh Watkins16, Hugh Watkins17, James G. Wilson24, Mark J. Daly1, Mark J. Daly2, Daniel G. MacArthur2, Daniel G. MacArthur1 
18 Aug 2016-Nature
TL;DR: The aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC) provides direct evidence for the presence of widespread mutational recurrence.
Abstract: Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.

8,758 citations