scispace - formally typeset
Search or ask a question
Author

Nilesh J. Samani

Bio: Nilesh J. Samani is an academic researcher from University of Leicester. The author has contributed to research in topics: Genome-wide association study & Population. The author has an hindex of 149, co-authored 779 publications receiving 113545 citations. Previous affiliations of Nilesh J. Samani include University Hospitals of Leicester NHS Trust & Glenfield Hospital.


Papers
More filters
Journal ArticleDOI
31 Dec 2013-PLOS ONE
TL;DR: The data suggest that urotensin-II system genes are unlikely to play a major role in genetic control of human blood pressure or renal function, and indicate that it may have evolved towards loss of function since the divergence of primates.
Abstract: Urotensin-II controls ion/water homeostasis in fish and vascular tone in rodents. We hypothesised that common genetic variants in urotensin-II pathway genes are associated with human blood pressure or renal function. We performed family-based analysis of association between blood pressure, glomerular filtration and genes of the urotensin-II pathway (urotensin-II, urotensin-II related peptide, urotensin-II receptor) saturated with 28 tagging single nucleotide polymorphisms in 2024 individuals from 520 families; followed by an independent replication in 420 families and 7545 unrelated subjects. The expression studies of the urotensin-II pathway were carried out in 97 human kidneys. Phylogenetic evolutionary analysis was conducted in 17 vertebrate species. One single nucleotide polymorphism (rs531485 in urotensin-II gene) was associated with adjusted estimated glomerular filtration rate in the discovery cohort (p = 0.0005). It showed no association with estimated glomerular filtration rate in the combined replication resource of 8724 subjects from 6 populations. Expression of urotensin-II and its receptor showed strong linear correlation (r = 0.86, p<0.0001). There was no difference in renal expression of urotensin-II system between hypertensive and normotensive subjects. Evolutionary analysis revealed accumulation of mutations in urotensin-II since the divergence of primates and weaker conservation of urotensin-II receptor in primates than in lower vertebrates. Our data suggest that urotensin-II system genes are unlikely to play a major role in genetic control of human blood pressure or renal function. The signatures of evolutionary forces acting on urotensin-II system indicate that it may have evolved towards loss of function since the divergence of primates.

12 citations

Journal ArticleDOI
TL;DR: To study the association between an atrial fibrillation genetic risk score with prevalent AF and all‐cause mortality in patients with heart failure, a genome-wide association study is conducted.
Abstract: Aims To study the association between an atrial fibrillation (AF) genetic risk score with prevalent AF and all-cause mortality in patients with heart failure. Methods and results An AF genetic risk score was calculated in 3759 European ancestry individuals (1783 with sinus rhythm, 1976 with AF) from the BIOlogy Study to TAilored Treatment in Chronic Heart Failure (BIOSTAT-CHF) by summing 97 single nucleotide polymorphism (SNP) alleles (ranging from 0-2) weighted by the natural logarithm of the relative SNP risk from the latest AF genome-wide association study. Further, we assessed AF risk variance explained by additive SNP variation, and performance of clinical or genetic risk factors, and the combination in classifying AF prevalence. AF was classified as AF or atrial flutter (AFL) at baseline electrocardiogram and/or a history of AF or AFL. The genetic risk score was associated with AF after multivariable adjustment. Odds ratio for AF prevalence per 1-unit increase genetic risk score was 2.12 (95% confidence interval 1.84-2.45, P = 2.15 × 10-24 ) in the total cohort, 2.08 (1.72-2.50, P = 1.30 × 10-14 ) in heart failure with reduced ejection fraction (HFrEF) and 2.02 (1.37-2.99, P = 4.37 × 10-4 ) in heart failure with preserved ejection fraction (HFpEF). AF-associated loci explained 22.9% of overall AF SNP heritability. Addition of the genetic risk score to clinical risk factors increased the C-index by 2.2% to 0.721. Conclusions The AF genetic risk score was associated with increased AF prevalence in HFrEF and HFpEF. Genetic variation accounted for 22.9% of overall AF SNP heritability. Addition of genetic risk to clinical risk improved model performance in classifying AF prevalence.

12 citations

14 Sep 2017
TL;DR: It is shown that genetic studies may be useful to explore the clinical relevance of off-target drug effects and points to further mechanisms, which are potentially responsible for coxib-associated CAD risk.
Abstract: Cyclooxygenase-2 inhibitors (coxibs) are characterized by multiple molecular off-target effects and increased coronary artery disease (CAD) risk. Here, we systematically explored common variants of genes representing molecular targets of coxibs for association with CAD. Given a broad spectrum of pleiotropic effects of coxibs, our intention was to narrow potential mechanisms affecting CAD risk as we hypothesized that the affected genes may also display genomic signals of coronary disease risk. A Drug Gene Interaction Database search identified 47 gene products to be affected by coxibs. We traced association signals in 200-kb regions surrounding these genes in 84,813 CAD cases and 202,543 controls. Based on a threshold of 1 × 10−5 (Bonferroni correction for 3131 haplotype blocks), four gene loci yielded significant associations. The lead SNPs were rs7270354 (MMP9), rs4888383 (BCAR1), rs6905288 (VEGFA1), and rs556321 (CACNA1E). By additional genotyping, rs7270354 at MMP9 and rs4888383 at BCAR1 also reached the established GWAS threshold for genome-wide significance. The findings demonstrate overlap of genes affected by coxibs and those mediating CAD risk and points to further mechanisms, which are potentially responsible for coxib-associated CAD risk. The novel approach furthermore suggests that genetic studies may be useful to explore the clinical relevance of off-target drug effects.

12 citations

Journal ArticleDOI
TL;DR: Two studies examine the role of variation in the gene for transforming growth factor-β1 (TGF- β1) and serve to highlight a more general dilemma facing researchers in this area, namely which CAD phenotype to look at and how best to measure it.
Abstract: Conventional risk factors explain only about half the risk of coronary artery disease (CAD) [1]. The strong familial predisposition to CAD [2], combined with advances in DNA analysis, has led to a proliferation of studies in recent years attempting to identify genetic factors that influence risk. The approach taken by most studies has been to examine the association of naturally occurring genetic variation (polymorphisms) in candidate genes with risk of or severity of CAD. Given the complex pathophysiology of coronary atherosclerosis and the processes that lead to the clinical syndromes [3], there is no lack of candidate genes worthy of study. In this issue of Clinical Science, two studies examine the role of variation in the gene for transforming growth factor-β1 (TGF-β1). TGF-β1 is a cytokine with several functions which could both inhibit and advance the development and progression of atherosclerosis. However, in 655 patients with CAD confirmed angiographically and 244 angiographically normal individuals, Syrris et al. [4] found no association of five different polymorphisms in the gene with CAD. Haplotypes constructed from linkage of individual polymorphisms also did not associate with CAD. Wang et al. [5], on the other hand, examined whether a polymorphism (C509T) in the promoter region of the gene, one of those also studied by Syrris et al. [4], influenced the severity of coronary disease, as judged by the number of significantly diseased vessels at angiography in 371 patients. Once again, no association was found of the polymorphism with this phenotype, or indeed with a prior history of myocardial infarction in the cohort. In addtion, the polymorphism did not have any discernible effect on the circulating level of TGF-β1. This is of particular relevance as circulating TGF-β1 levels have been found to be altered in subjects with severe coronary disease, although the data are conflicting with both depressed [6] and elevated [7] levels being reported. Association studies examine the role of individual polymorphisms and, by inference, any variations that are in significant linkage disequilibrium. However, when negative as in the studies of Syrris et al. and Wang et al., they cannot exclude an effect of a gene locus. Construction of haplotypes as done by Syrris et al. brings association studies closer to the type of locus exclusion permitted by linkage analysis, although in this case the study was probably underpowered to do this adequately. Thus it remains possible that other variations at the TGFβ1 locus still play a role in the pathogenesis of CAD. However, the two studies usefully serve to highlight a more general dilemma facing researchers in this area, namely which CAD phenotype to look at and how best to measure it. Because of both clinical relevance and patient access, two CAD phenotypes have been most commonly studied – myocardial infarction and angiographically documented coronary disease. Myocardial infarction has the great advantage of having well-defined criteria for diagnosis. An important consequence of this is that it is also relatively easy to identify controls who have not suffered an event, although as the condition can occur without any prior symptoms, and is an ageand gender-dependent process, it is important that controls are well matched for these criteria, in addition to others such as ethnicity which can impact on allele frequencies. The other major advantage of myocardial infarction as a phenotype is that in most cases it can be fairly accurately timed. This means that, in addition to any overall association, one can also examine whether an allele or genotype is associated more strongly with myocardial infarction in younger subjects or with a younger mean age at time of myocardial infarction, features one may expect of a risk factor. The main disadvantage of studying myocardial infarction is the unavoidable loss of subjects through fatality when subjects are recruited after the incident event, even if this is done in admission wards or coronary care units, as more than half the acute deaths occur before subjects even reach hospitals. This could become of paramount importance if a genetic factor, unknowingly, also influences survival after myocardial infarction. This is of course the major reason why findings from prospective studies carry extra weight. Use of the presence of coronary disease documented angiographically as a phenotype poses different problems. First, it is a chronic process and the onset is difficult to define. Secondly, although angiography is useful in identifying the presence of haemodynamically significant coronary stenoses (often defined as stenoses " 50%). it is an insensitive technique for quantifying the extent of atheroma. Patterns vary from focal lesions to diffuse involvement of large segments of vessels. Indeed, an interesting but unresolved question is whether different biological processes influence the pattern of coronary atheroma deposition. Furthermore, recent studies using intravascular coronary ultrasound [8,9] have shown that considerable atheroma may already exist in even angiographically ‘normal ’ coronary segments due to extrinsically directed remodelling [10]. Therefore, classification of severity of coronary disease simply on the basis of the

12 citations

Journal ArticleDOI
01 May 1997
TL;DR: In this article, the role of renin Ren-1d in regulating cardiovascular homeostasis was investigated in mice with a disrupted renin 1d gene and showed that Ren-2 transcripts were present.
Abstract: Renin is an aspartyl protease that is involved in the conversion of angiotensinogen to angitensin II and hence participates in the regulation of blood pressure. Mice are polymorphic for the number of renin genes with some strains harbouring two renin genes, Ren-1d and Ren-2. To study the role of renin Ren-1d in regulating cardiovascular homeostasis, mice with a disrupted Ren-1d gene were created. Analyses of kidney renin mRNA expression in Ren-1d−/−/Ren-2+/+ mice demonstrated that only Ren-2 transcripts were present. Mean arterial blood pressures of Ren-1d+/+/Ren-2+/+, Ren- 1d+/−/Ren-2+/+ and Ren-1d−/−/Ren-2+/+ mice showed no significant differences. These observations demonstrate that the Ren-1d gene product is not essential for normal blood pressure maintenance under normal physiological conditions

11 citations


Cited by
More filters
28 Jul 2005
TL;DR: PfPMP1)与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作�ly.
Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1(PfPMP1)与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员,通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

18,940 citations

Journal ArticleDOI
Giuseppe Mancia1, Robert Fagard, Krzysztof Narkiewicz, Josep Redon, Alberto Zanchetti, Michael Böhm, Thierry Christiaens, Renata Cifkova, Guy De Backer, Anna F. Dominiczak, Maurizio Galderisi, Diederick E. Grobbee, Tiny Jaarsma, Paulus Kirchhof, Sverre E. Kjeldsen, Stéphane Laurent, Athanasios J. Manolis, Peter M. Nilsson, Luis M. Ruilope, Roland E. Schmieder, Per Anton Sirnes, Peter Sleight, Margus Viigimaa, Bernard Waeber, Faiez Zannad, Michel Burnier, Ettore Ambrosioni, Mark Caufield, Antonio Coca, Michael H. Olsen, Costas Tsioufis, Philippe van de Borne, José Luis Zamorano, Stephan Achenbach, Helmut Baumgartner, Jeroen J. Bax, Héctor Bueno, Veronica Dean, Christi Deaton, Çetin Erol, Roberto Ferrari, David Hasdai, Arno W. Hoes, Juhani Knuuti, Philippe Kolh2, Patrizio Lancellotti, Aleš Linhart, Petros Nihoyannopoulos, Massimo F Piepoli, Piotr Ponikowski, Juan Tamargo, Michal Tendera, Adam Torbicki, William Wijns, Stephan Windecker, Denis Clement, Thierry C. Gillebert, Enrico Agabiti Rosei, Stefan D. Anker, Johann Bauersachs, Jana Brguljan Hitij, Mark J. Caulfield, Marc De Buyzere, Sabina De Geest, Geneviève Derumeaux, Serap Erdine, Csaba Farsang, Christian Funck-Brentano, Vjekoslav Gerc, Giuseppe Germanò, Stephan Gielen, Herman Haller, Jens Jordan, Thomas Kahan, Michel Komajda, Dragan Lovic, Heiko Mahrholdt, Jan Östergren, Gianfranco Parati, Joep Perk, Jorge Polónia, Bogdan A. Popescu, Zeljko Reiner, Lars Rydén, Yuriy Sirenko, Alice Stanton, Harry A.J. Struijker-Boudier, Charalambos Vlachopoulos, Massimo Volpe, David A. Wood 
TL;DR: In this article, a randomized controlled trial of Aliskiren in the Prevention of Major Cardiovascular Events in Elderly people was presented. But the authors did not discuss the effect of the combination therapy in patients living with systolic hypertension.
Abstract: ABCD : Appropriate Blood pressure Control in Diabetes ABI : ankle–brachial index ABPM : ambulatory blood pressure monitoring ACCESS : Acute Candesartan Cilexetil Therapy in Stroke Survival ACCOMPLISH : Avoiding Cardiovascular Events in Combination Therapy in Patients Living with Systolic Hypertension ACCORD : Action to Control Cardiovascular Risk in Diabetes ACE : angiotensin-converting enzyme ACTIVE I : Atrial Fibrillation Clopidogrel Trial with Irbesartan for Prevention of Vascular Events ADVANCE : Action in Diabetes and Vascular Disease: Preterax and Diamicron-MR Controlled Evaluation AHEAD : Action for HEAlth in Diabetes ALLHAT : Antihypertensive and Lipid-Lowering Treatment to Prevent Heart ATtack ALTITUDE : ALiskiren Trial In Type 2 Diabetes Using Cardio-renal Endpoints ANTIPAF : ANgioTensin II Antagonist In Paroxysmal Atrial Fibrillation APOLLO : A Randomized Controlled Trial of Aliskiren in the Prevention of Major Cardiovascular Events in Elderly People ARB : angiotensin receptor blocker ARIC : Atherosclerosis Risk In Communities ARR : aldosterone renin ratio ASCOT : Anglo-Scandinavian Cardiac Outcomes Trial ASCOT-LLA : Anglo-Scandinavian Cardiac Outcomes Trial—Lipid Lowering Arm ASTRAL : Angioplasty and STenting for Renal Artery Lesions A-V : atrioventricular BB : beta-blocker BMI : body mass index BP : blood pressure BSA : body surface area CA : calcium antagonist CABG : coronary artery bypass graft CAPPP : CAPtopril Prevention Project CAPRAF : CAndesartan in the Prevention of Relapsing Atrial Fibrillation CHD : coronary heart disease CHHIPS : Controlling Hypertension and Hypertension Immediately Post-Stroke CKD : chronic kidney disease CKD-EPI : Chronic Kidney Disease—EPIdemiology collaboration CONVINCE : Controlled ONset Verapamil INvestigation of CV Endpoints CT : computed tomography CV : cardiovascular CVD : cardiovascular disease D : diuretic DASH : Dietary Approaches to Stop Hypertension DBP : diastolic blood pressure DCCT : Diabetes Control and Complications Study DIRECT : DIabetic REtinopathy Candesartan Trials DM : diabetes mellitus DPP-4 : dipeptidyl peptidase 4 EAS : European Atherosclerosis Society EASD : European Association for the Study of Diabetes ECG : electrocardiogram EF : ejection fraction eGFR : estimated glomerular filtration rate ELSA : European Lacidipine Study on Atherosclerosis ESC : European Society of Cardiology ESH : European Society of Hypertension ESRD : end-stage renal disease EXPLOR : Amlodipine–Valsartan Combination Decreases Central Systolic Blood Pressure more Effectively than the Amlodipine–Atenolol Combination FDA : U.S. Food and Drug Administration FEVER : Felodipine EVent Reduction study GISSI-AF : Gruppo Italiano per lo Studio della Sopravvivenza nell'Infarto Miocardico-Atrial Fibrillation HbA1c : glycated haemoglobin HBPM : home blood pressure monitoring HOPE : Heart Outcomes Prevention Evaluation HOT : Hypertension Optimal Treatment HRT : hormone replacement therapy HT : hypertension HYVET : HYpertension in the Very Elderly Trial IMT : intima-media thickness I-PRESERVE : Irbesartan in Heart Failure with Preserved Systolic Function INTERHEART : Effect of Potentially Modifiable Risk Factors associated with Myocardial Infarction in 52 Countries INVEST : INternational VErapamil SR/T Trandolapril ISH : Isolated systolic hypertension JNC : Joint National Committee JUPITER : Justification for the Use of Statins in Primary Prevention: an Intervention Trial Evaluating Rosuvastatin LAVi : left atrial volume index LIFE : Losartan Intervention For Endpoint Reduction in Hypertensives LV : left ventricle/left ventricular LVH : left ventricular hypertrophy LVM : left ventricular mass MDRD : Modification of Diet in Renal Disease MRFIT : Multiple Risk Factor Intervention Trial MRI : magnetic resonance imaging NORDIL : The Nordic Diltiazem Intervention study OC : oral contraceptive OD : organ damage ONTARGET : ONgoing Telmisartan Alone and in Combination with Ramipril Global Endpoint Trial PAD : peripheral artery disease PATHS : Prevention And Treatment of Hypertension Study PCI : percutaneous coronary intervention PPAR : peroxisome proliferator-activated receptor PREVEND : Prevention of REnal and Vascular ENdstage Disease PROFESS : Prevention Regimen for Effectively Avoiding Secondary Strokes PROGRESS : Perindopril Protection Against Recurrent Stroke Study PWV : pulse wave velocity QALY : Quality adjusted life years RAA : renin-angiotensin-aldosterone RAS : renin-angiotensin system RCT : randomized controlled trials RF : risk factor ROADMAP : Randomized Olmesartan And Diabetes MicroAlbuminuria Prevention SBP : systolic blood pressure SCAST : Angiotensin-Receptor Blocker Candesartan for Treatment of Acute STroke SCOPE : Study on COgnition and Prognosis in the Elderly SCORE : Systematic COronary Risk Evaluation SHEP : Systolic Hypertension in the Elderly Program STOP : Swedish Trials in Old Patients with Hypertension STOP-2 : The second Swedish Trial in Old Patients with Hypertension SYSTCHINA : SYSTolic Hypertension in the Elderly: Chinese trial SYSTEUR : SYSTolic Hypertension in Europe TIA : transient ischaemic attack TOHP : Trials Of Hypertension Prevention TRANSCEND : Telmisartan Randomised AssessmeNt Study in ACE iNtolerant subjects with cardiovascular Disease UKPDS : United Kingdom Prospective Diabetes Study VADT : Veterans' Affairs Diabetes Trial VALUE : Valsartan Antihypertensive Long-term Use Evaluation WHO : World Health Organization ### 1.1 Principles The 2013 guidelines on hypertension of the European Society of Hypertension (ESH) and the European Society of Cardiology …

14,173 citations

Journal ArticleDOI
Adam Auton1, Gonçalo R. Abecasis2, David Altshuler3, Richard Durbin4  +514 moreInstitutions (90)
01 Oct 2015-Nature
TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-generation sequencing, deep exome sequencing, and dense microarray genotyping.
Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

12,661 citations

Journal ArticleDOI
Paul Burton1, David Clayton2, Lon R. Cardon, Nicholas John Craddock3  +192 moreInstitutions (4)
07 Jun 2007-Nature
TL;DR: This study has demonstrated that careful use of a shared control group represents a safe and effective approach to GWA analyses of multiple disease phenotypes; generated a genome-wide genotype database for future studies of common diseases in the British population; and shown that, provided individuals with non-European ancestry are excluded, the extent of population stratification in theBritish population is generally modest.
Abstract: There is increasing evidence that genome-wide association ( GWA) studies represent a powerful approach to the identification of genes involved in common human diseases. We describe a joint GWA study ( using the Affymetrix GeneChip 500K Mapping Array Set) undertaken in the British population, which has examined similar to 2,000 individuals for each of 7 major diseases and a shared set of similar to 3,000 controls. Case-control comparisons identified 24 independent association signals at P < 5 X 10(-7): 1 in bipolar disorder, 1 in coronary artery disease, 9 in Crohn's disease, 3 in rheumatoid arthritis, 7 in type 1 diabetes and 3 in type 2 diabetes. On the basis of prior findings and replication studies thus-far completed, almost all of these signals reflect genuine susceptibility effects. We observed association at many previously identified loci, and found compelling evidence that some loci confer risk for more than one of the diseases studied. Across all diseases, we identified a large number of further signals ( including 58 loci with single-point P values between 10(-5) and 5 X 10(-7)) likely to yield additional susceptibility loci. The importance of appropriately large samples was confirmed by the modest effect sizes observed at most loci identified. This study thus represents a thorough validation of the GWA approach. It has also demonstrated that careful use of a shared control group represents a safe and effective approach to GWA analyses of multiple disease phenotypes; has generated a genome-wide genotype database for future studies of common diseases in the British population; and shown that, provided individuals with non-European ancestry are excluded, the extent of population stratification in the British population is generally modest. Our findings offer new avenues for exploring the pathophysiology of these important disorders. We anticipate that our data, results and software, which will be widely available to other investigators, will provide a powerful resource for human genetics research.

9,244 citations

Journal ArticleDOI
Monkol Lek, Konrad J. Karczewski1, Konrad J. Karczewski2, Eric Vallabh Minikel2, Eric Vallabh Minikel1, Kaitlin E. Samocha, Eric Banks1, Timothy Fennell1, Anne H. O’Donnell-Luria2, Anne H. O’Donnell-Luria3, Anne H. O’Donnell-Luria1, James S. Ware, Andrew J. Hill4, Andrew J. Hill2, Andrew J. Hill1, Beryl B. Cummings1, Beryl B. Cummings2, Taru Tukiainen2, Taru Tukiainen1, Daniel P. Birnbaum1, Jack A. Kosmicki, Laramie E. Duncan1, Laramie E. Duncan2, Karol Estrada2, Karol Estrada1, Fengmei Zhao2, Fengmei Zhao1, James Zou1, Emma Pierce-Hoffman2, Emma Pierce-Hoffman1, Joanne Berghout5, David Neil Cooper6, Nicole A. Deflaux7, Mark A. DePristo1, Ron Do, Jason Flannick2, Jason Flannick1, Menachem Fromer, Laura D. Gauthier1, Jackie Goldstein2, Jackie Goldstein1, Namrata Gupta1, Daniel P. Howrigan2, Daniel P. Howrigan1, Adam Kiezun1, Mitja I. Kurki1, Mitja I. Kurki2, Ami Levy Moonshine1, Pradeep Natarajan, Lorena Orozco, Gina M. Peloso1, Gina M. Peloso2, Ryan Poplin1, Manuel A. Rivas1, Valentin Ruano-Rubio1, Samuel A. Rose1, Douglas M. Ruderfer8, Khalid Shakir1, Peter D. Stenson6, Christine Stevens1, Brett Thomas1, Brett Thomas2, Grace Tiao1, María Teresa Tusié-Luna, Ben Weisburd1, Hong-Hee Won9, Dongmei Yu, David Altshuler1, David Altshuler10, Diego Ardissino, Michael Boehnke11, John Danesh12, Stacey Donnelly1, Roberto Elosua, Jose C. Florez2, Jose C. Florez1, Stacey Gabriel1, Gad Getz1, Gad Getz2, Stephen J. Glatt13, Christina M. Hultman14, Sekar Kathiresan, Markku Laakso15, Steven A. McCarroll1, Steven A. McCarroll2, Mark I. McCarthy16, Mark I. McCarthy17, Dermot P.B. McGovern18, Ruth McPherson19, Benjamin M. Neale1, Benjamin M. Neale2, Aarno Palotie, Shaun Purcell8, Danish Saleheen20, Jeremiah M. Scharf, Pamela Sklar, Patrick F. Sullivan21, Patrick F. Sullivan14, Jaakko Tuomilehto22, Ming T. Tsuang23, Hugh Watkins16, Hugh Watkins17, James G. Wilson24, Mark J. Daly1, Mark J. Daly2, Daniel G. MacArthur2, Daniel G. MacArthur1 
18 Aug 2016-Nature
TL;DR: The aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC) provides direct evidence for the presence of widespread mutational recurrence.
Abstract: Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.

8,758 citations