scispace - formally typeset
Search or ask a question
Author

Nilesh J. Samani

Bio: Nilesh J. Samani is an academic researcher from University of Leicester. The author has contributed to research in topics: Genome-wide association study & Population. The author has an hindex of 149, co-authored 779 publications receiving 113545 citations. Previous affiliations of Nilesh J. Samani include University Hospitals of Leicester NHS Trust & Glenfield Hospital.


Papers
More filters
Journal ArticleDOI
Veryan Codd1, Christopher P. Nelson1, Eva Albrecht, Massimo Mangino2, Joris Deelen3, Jessica L. Buxton4, Jouke-Jan Hottenga5, Krista Fischer6, Tõnu Esko6, Ida Surakka7, Linda Broer, Dale R. Nyholt8, Irene Mateo Leach9, Perttu Salo, Sara Hägg10, Mary K. Matthews1, Jutta Palmen11, Giuseppe Danilo Norata, Paul F. O'Reilly4, Danish Saleheen12, Najaf Amin13, Anthony J. Balmforth14, Marian Beekman3, Rudolf A. de Boer9, Stefan Böhringer3, Peter S. Braund1, Paul Burton1, Anton J. M. de Craen3, Matthew Denniff1, Yanbin Dong15, Konstantinos Douroudis6, Elena Dubinina1, Johan G. Eriksson, Katia Garlaschelli, Dehuang Guo15, Anna-Liisa Hartikainen16, Anjali K. Henders8, Jeanine J. Houwing-Duistermaat3, Laura Kananen7, Lennart C. Karssen13, Johannes Kettunen7, Norman Klopp, Vasiliki Lagou17, Elisabeth M. van Leeuwen13, Pamela A. F. Madden18, Reedik Mägi6, Patrik K. E. Magnusson10, Satu Männistö19, Satu Männistö20, Mark I. McCarthy21, Mark I. McCarthy17, Mark I. McCarthy22, Sarah E. Medland8, Evelin Mihailov6, Grant W. Montgomery8, Ben A. Oostra13, Aarno Palotie, Annette Peters, Helen Pollard1, Anneli Pouta16, Anneli Pouta20, Inga Prokopenko17, Samuli Ripatti, Veikko Salomaa19, Veikko Salomaa20, H. Eka D. Suchiman3, Ana M. Valdes2, Niek Verweij9, Ana Viñuela2, Xiaoling Wang23, Xiaoling Wang24, H-Erich Wichmann25, Elisabeth Widen7, Gonneke Willemsen5, Margaret J. Wright8, Kai Xia26, Xiangjun Xiao27, Dirk J. van Veldhuisen9, Alberico L. Catapano28, Martin D. Tobin1, Alistair S. Hall14, Alexandra I. F. Blakemore4, Wiek H. van Gilst9, Haidong Zhu24, Haidong Zhu23, Jeanette Erdmann, Muredach P. Reilly29, Sekar Kathiresan30, Sekar Kathiresan31, Heribert Schunkert, Philippa J. Talmud11, Nancy L. Pedersen10, Markus Perola6, Markus Perola7, Markus Perola20, Willem H. Ouwehand, Jaakko Kaprio, Nicholas G. Martin8, Cornelia M. van Duijn, Iiris Hovatta20, Iiris Hovatta7, Christian Gieger11, Andres Metspalu6, Dorret I. Boomsma5, Marjo-Riitta Järvelin, P. Eline Slagboom3, John R Thompson1, Tim D. Spector2, Pim van der Harst1, Nilesh J. Samani1, Nilesh J. Samani32 
TL;DR: In this paper, a genome-wide meta-analysis of 37,684 individuals with replication of selected variants in an additional 10,739 individuals was carried out to identify seven loci, including five new loci associated with mean leukocyte telomere length (LTL) (P < 5 × 10−8).
Abstract: Interindividual variation in mean leukocyte telomere length (LTL) is associated with cancer and several age-associated diseases. We report here a genome-wide meta-analysis of 37,684 individuals with replication of selected variants in an additional 10,739 individuals. We identified seven loci, including five new loci, associated with mean LTL (P < 5 × 10(-8)). Five of the loci contain candidate genes (TERC, TERT, NAF1, OBFC1 and RTEL1) that are known to be involved in telomere biology. Lead SNPs at two loci (TERC and TERT) associate with several cancers and other diseases, including idiopathic pulmonary fibrosis. Moreover, a genetic risk score analysis combining lead variants at all 7 loci in 22,233 coronary artery disease cases and 64,762 controls showed an association of the alleles associated with shorter LTL with increased risk of coronary artery disease (21% (95% confidence interval, 5-35%) per standard deviation in LTL, P = 0.014). Our findings support a causal role of telomere-length variation in some age-related diseases.

703 citations

Journal ArticleDOI
TL;DR: In this article, the authors investigated whether mean leucocyte telomere length is a predictor of the development of coronary heart disease and found that individuals in the middle and lowest tertiles of telomeres were more at risk of developing a heart disease event than those in the highest tertile.

693 citations

Journal ArticleDOI
TL;DR: Increased BMI in adults of European origin is associated with increased methylation at the HIF3A locus in blood cells and in adipose tissue, and perturbation of hypoxia inducible transcription factor pathways could have an important role in the response to increased weight in people.

690 citations

Journal ArticleDOI
John F. Peden1, Jemma C. Hopewell1, Danish Saleheen2, John C. Chambers3, Jorg Hager4, Nicole Soranzo5, Rory Collins1, John Danesh2, Paul Elliott3, Martin Farrall1, Kathy Stirrups5, Weihua Zhang3, Anders Hamsten6, Anders Hamsten7, Sarah Parish1, Mark Lathrop4, Hugh Watkins1, Robert Clarke1, Panos Deloukas5, Jaspal S. Kooner3, Anuj Goel1, Halit Ongen1, Rona J. Strawbridge6, Rona J. Strawbridge7, Simon Heath4, Anders Mälarstig7, Anders Mälarstig6, Anna Helgadottir1, John Öhrvik6, John Öhrvik7, Muhammed Murtaza5, Simon C. Potter5, Sarah E. Hunt5, Marc Delepine4, Shapour Jalilzadeh1, Tomas Axelsson8, Ann-Christine Syvänen8, Rhian Gwilliam5, Suzannah Bumpstead5, Emma Gray5, Sarah Edkins5, Lasse Folkersen7, Lasse Folkersen6, Theodosios Kyriakou1, Anders Franco-Cereceda6, Anders Gabrielsen6, Udo Seedorf9, Per Eriksson6, Per Eriksson7, Alison Offer1, Louise Bowman1, Peter Sleight1, Jane Armitage1, Richard Peto1, Gonçalo R. Abecasis10, Nabeel Ahmed, Mark J. Caulfield11, Peter Donnelly1, Philippe Froguel3, Angad S. Kooner, Mark I. McCarthy1, Nilesh J. Samani12, James Scott3, Joban Sehmi3, Angela Silveira6, Angela Silveira7, Mai-Lis Hellénius6, Ferdinand M. van't Hooft6, Ferdinand M. van't Hooft7, Gunnar O Olsson13, Stephan Rust9, Gerd Assmann9, Simona Barlera, Gianni Tognoni, Maria Grazia Franzosi, Pamela Linksted1, Fiona Green14, Asif Rasheed, Moazzam Zaidi, Nabi Shah, Maria Samuel, Nadeem Hayat Mallick, Muhammad Azhar, Khan Shah Zaman, Abdus Samad, M. Ishaq, Ali Raza Gardezi, Fazal-ur-Rehman Memon, Philippe M. Frossard, Tim D. Spector, Leena Peltonen15, Leena Peltonen5, Markku S. Nieminen, Juha Sinisalo, Veikko Salomaa, Samuli Ripatti15, Derrick A Bennett1, Karin Leander6, Bruna Gigante6, Ulf de Faire6, Silvia Pietri, Francesca Gori, Roberto Marchioli, Suthesh Sivapalaratnam16, John J.P. Kastelein16, Mieke D. Trip16, Eirini V. Theodoraki17, George V. Dedoussis17, Engert Jc18, Salim Yusuf19, Sonia S. Anand19 
TL;DR: Genome-wide association studies have identified 11 common variants convincingly associated with coronary artery disease (CAD), a modest number considering the apparent heritability of CAD(8) as mentioned in this paper.
Abstract: Genome-wide association studies have identified 11 common variants convincingly associated with coronary artery disease (CAD)(1-7), a modest number considering the apparent heritability of CAD(8). ...

654 citations

Journal ArticleDOI
Cecilia M. Lindgren1, Iris M. Heid2, Joshua C. Randall1, Claudia Lamina3  +152 moreInstitutions (36)
TL;DR: By focusing on anthropometric measures of central obesity and fat distribution, a meta-analysis of 16 genome-wide association studies informative for adult waist circumference and waist–hip ratio identified three loci implicated in the regulation of human adiposity.
Abstract: To identify genetic loci influencing central obesity and fat distribution, we performed a meta-analysis of 16 genome-wide association studies (GWAS, N = 38,580) informative for adult waist circumference (WC) and waist-hip ratio (WHR). We selected 26 SNPs for follow-up, for which the evidence of association with measures of central adiposity (WC and/or WHR) was strong and disproportionate to that for overall adiposity or height. Follow-up studies in a maximum of 70,689 individuals identified two loci strongly associated with measures of central adiposity; these map near TFAP2B (WC, P = 1.9x10(-11)) and MSRA (WC, P = 8.9x10(-9)). A third locus, near LYPLAL1, was associated with WHR in women only (P = 2.6x10(-8)). The variants near TFAP2B appear to influence central adiposity through an effect on overall obesity/fat-mass, whereas LYPLAL1 displays a strong female-only association with fat distribution. By focusing on anthropometric measures of central obesity and fat distribution, we have identified three loci implicated in the regulation of human adiposity.

648 citations


Cited by
More filters
28 Jul 2005
TL;DR: PfPMP1)与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作�ly.
Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1(PfPMP1)与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员,通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

18,940 citations

Journal ArticleDOI
Giuseppe Mancia1, Robert Fagard, Krzysztof Narkiewicz, Josep Redon, Alberto Zanchetti, Michael Böhm, Thierry Christiaens, Renata Cifkova, Guy De Backer, Anna F. Dominiczak, Maurizio Galderisi, Diederick E. Grobbee, Tiny Jaarsma, Paulus Kirchhof, Sverre E. Kjeldsen, Stéphane Laurent, Athanasios J. Manolis, Peter M. Nilsson, Luis M. Ruilope, Roland E. Schmieder, Per Anton Sirnes, Peter Sleight, Margus Viigimaa, Bernard Waeber, Faiez Zannad, Michel Burnier, Ettore Ambrosioni, Mark Caufield, Antonio Coca, Michael H. Olsen, Costas Tsioufis, Philippe van de Borne, José Luis Zamorano, Stephan Achenbach, Helmut Baumgartner, Jeroen J. Bax, Héctor Bueno, Veronica Dean, Christi Deaton, Çetin Erol, Roberto Ferrari, David Hasdai, Arno W. Hoes, Juhani Knuuti, Philippe Kolh2, Patrizio Lancellotti, Aleš Linhart, Petros Nihoyannopoulos, Massimo F Piepoli, Piotr Ponikowski, Juan Tamargo, Michal Tendera, Adam Torbicki, William Wijns, Stephan Windecker, Denis Clement, Thierry C. Gillebert, Enrico Agabiti Rosei, Stefan D. Anker, Johann Bauersachs, Jana Brguljan Hitij, Mark J. Caulfield, Marc De Buyzere, Sabina De Geest, Geneviève Derumeaux, Serap Erdine, Csaba Farsang, Christian Funck-Brentano, Vjekoslav Gerc, Giuseppe Germanò, Stephan Gielen, Herman Haller, Jens Jordan, Thomas Kahan, Michel Komajda, Dragan Lovic, Heiko Mahrholdt, Jan Östergren, Gianfranco Parati, Joep Perk, Jorge Polónia, Bogdan A. Popescu, Zeljko Reiner, Lars Rydén, Yuriy Sirenko, Alice Stanton, Harry A.J. Struijker-Boudier, Charalambos Vlachopoulos, Massimo Volpe, David A. Wood 
TL;DR: In this article, a randomized controlled trial of Aliskiren in the Prevention of Major Cardiovascular Events in Elderly people was presented. But the authors did not discuss the effect of the combination therapy in patients living with systolic hypertension.
Abstract: ABCD : Appropriate Blood pressure Control in Diabetes ABI : ankle–brachial index ABPM : ambulatory blood pressure monitoring ACCESS : Acute Candesartan Cilexetil Therapy in Stroke Survival ACCOMPLISH : Avoiding Cardiovascular Events in Combination Therapy in Patients Living with Systolic Hypertension ACCORD : Action to Control Cardiovascular Risk in Diabetes ACE : angiotensin-converting enzyme ACTIVE I : Atrial Fibrillation Clopidogrel Trial with Irbesartan for Prevention of Vascular Events ADVANCE : Action in Diabetes and Vascular Disease: Preterax and Diamicron-MR Controlled Evaluation AHEAD : Action for HEAlth in Diabetes ALLHAT : Antihypertensive and Lipid-Lowering Treatment to Prevent Heart ATtack ALTITUDE : ALiskiren Trial In Type 2 Diabetes Using Cardio-renal Endpoints ANTIPAF : ANgioTensin II Antagonist In Paroxysmal Atrial Fibrillation APOLLO : A Randomized Controlled Trial of Aliskiren in the Prevention of Major Cardiovascular Events in Elderly People ARB : angiotensin receptor blocker ARIC : Atherosclerosis Risk In Communities ARR : aldosterone renin ratio ASCOT : Anglo-Scandinavian Cardiac Outcomes Trial ASCOT-LLA : Anglo-Scandinavian Cardiac Outcomes Trial—Lipid Lowering Arm ASTRAL : Angioplasty and STenting for Renal Artery Lesions A-V : atrioventricular BB : beta-blocker BMI : body mass index BP : blood pressure BSA : body surface area CA : calcium antagonist CABG : coronary artery bypass graft CAPPP : CAPtopril Prevention Project CAPRAF : CAndesartan in the Prevention of Relapsing Atrial Fibrillation CHD : coronary heart disease CHHIPS : Controlling Hypertension and Hypertension Immediately Post-Stroke CKD : chronic kidney disease CKD-EPI : Chronic Kidney Disease—EPIdemiology collaboration CONVINCE : Controlled ONset Verapamil INvestigation of CV Endpoints CT : computed tomography CV : cardiovascular CVD : cardiovascular disease D : diuretic DASH : Dietary Approaches to Stop Hypertension DBP : diastolic blood pressure DCCT : Diabetes Control and Complications Study DIRECT : DIabetic REtinopathy Candesartan Trials DM : diabetes mellitus DPP-4 : dipeptidyl peptidase 4 EAS : European Atherosclerosis Society EASD : European Association for the Study of Diabetes ECG : electrocardiogram EF : ejection fraction eGFR : estimated glomerular filtration rate ELSA : European Lacidipine Study on Atherosclerosis ESC : European Society of Cardiology ESH : European Society of Hypertension ESRD : end-stage renal disease EXPLOR : Amlodipine–Valsartan Combination Decreases Central Systolic Blood Pressure more Effectively than the Amlodipine–Atenolol Combination FDA : U.S. Food and Drug Administration FEVER : Felodipine EVent Reduction study GISSI-AF : Gruppo Italiano per lo Studio della Sopravvivenza nell'Infarto Miocardico-Atrial Fibrillation HbA1c : glycated haemoglobin HBPM : home blood pressure monitoring HOPE : Heart Outcomes Prevention Evaluation HOT : Hypertension Optimal Treatment HRT : hormone replacement therapy HT : hypertension HYVET : HYpertension in the Very Elderly Trial IMT : intima-media thickness I-PRESERVE : Irbesartan in Heart Failure with Preserved Systolic Function INTERHEART : Effect of Potentially Modifiable Risk Factors associated with Myocardial Infarction in 52 Countries INVEST : INternational VErapamil SR/T Trandolapril ISH : Isolated systolic hypertension JNC : Joint National Committee JUPITER : Justification for the Use of Statins in Primary Prevention: an Intervention Trial Evaluating Rosuvastatin LAVi : left atrial volume index LIFE : Losartan Intervention For Endpoint Reduction in Hypertensives LV : left ventricle/left ventricular LVH : left ventricular hypertrophy LVM : left ventricular mass MDRD : Modification of Diet in Renal Disease MRFIT : Multiple Risk Factor Intervention Trial MRI : magnetic resonance imaging NORDIL : The Nordic Diltiazem Intervention study OC : oral contraceptive OD : organ damage ONTARGET : ONgoing Telmisartan Alone and in Combination with Ramipril Global Endpoint Trial PAD : peripheral artery disease PATHS : Prevention And Treatment of Hypertension Study PCI : percutaneous coronary intervention PPAR : peroxisome proliferator-activated receptor PREVEND : Prevention of REnal and Vascular ENdstage Disease PROFESS : Prevention Regimen for Effectively Avoiding Secondary Strokes PROGRESS : Perindopril Protection Against Recurrent Stroke Study PWV : pulse wave velocity QALY : Quality adjusted life years RAA : renin-angiotensin-aldosterone RAS : renin-angiotensin system RCT : randomized controlled trials RF : risk factor ROADMAP : Randomized Olmesartan And Diabetes MicroAlbuminuria Prevention SBP : systolic blood pressure SCAST : Angiotensin-Receptor Blocker Candesartan for Treatment of Acute STroke SCOPE : Study on COgnition and Prognosis in the Elderly SCORE : Systematic COronary Risk Evaluation SHEP : Systolic Hypertension in the Elderly Program STOP : Swedish Trials in Old Patients with Hypertension STOP-2 : The second Swedish Trial in Old Patients with Hypertension SYSTCHINA : SYSTolic Hypertension in the Elderly: Chinese trial SYSTEUR : SYSTolic Hypertension in Europe TIA : transient ischaemic attack TOHP : Trials Of Hypertension Prevention TRANSCEND : Telmisartan Randomised AssessmeNt Study in ACE iNtolerant subjects with cardiovascular Disease UKPDS : United Kingdom Prospective Diabetes Study VADT : Veterans' Affairs Diabetes Trial VALUE : Valsartan Antihypertensive Long-term Use Evaluation WHO : World Health Organization ### 1.1 Principles The 2013 guidelines on hypertension of the European Society of Hypertension (ESH) and the European Society of Cardiology …

14,173 citations

Journal ArticleDOI
Adam Auton1, Gonçalo R. Abecasis2, David Altshuler3, Richard Durbin4  +514 moreInstitutions (90)
01 Oct 2015-Nature
TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-generation sequencing, deep exome sequencing, and dense microarray genotyping.
Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

12,661 citations

Journal ArticleDOI
Paul Burton1, David Clayton2, Lon R. Cardon, Nicholas John Craddock3  +192 moreInstitutions (4)
07 Jun 2007-Nature
TL;DR: This study has demonstrated that careful use of a shared control group represents a safe and effective approach to GWA analyses of multiple disease phenotypes; generated a genome-wide genotype database for future studies of common diseases in the British population; and shown that, provided individuals with non-European ancestry are excluded, the extent of population stratification in theBritish population is generally modest.
Abstract: There is increasing evidence that genome-wide association ( GWA) studies represent a powerful approach to the identification of genes involved in common human diseases. We describe a joint GWA study ( using the Affymetrix GeneChip 500K Mapping Array Set) undertaken in the British population, which has examined similar to 2,000 individuals for each of 7 major diseases and a shared set of similar to 3,000 controls. Case-control comparisons identified 24 independent association signals at P < 5 X 10(-7): 1 in bipolar disorder, 1 in coronary artery disease, 9 in Crohn's disease, 3 in rheumatoid arthritis, 7 in type 1 diabetes and 3 in type 2 diabetes. On the basis of prior findings and replication studies thus-far completed, almost all of these signals reflect genuine susceptibility effects. We observed association at many previously identified loci, and found compelling evidence that some loci confer risk for more than one of the diseases studied. Across all diseases, we identified a large number of further signals ( including 58 loci with single-point P values between 10(-5) and 5 X 10(-7)) likely to yield additional susceptibility loci. The importance of appropriately large samples was confirmed by the modest effect sizes observed at most loci identified. This study thus represents a thorough validation of the GWA approach. It has also demonstrated that careful use of a shared control group represents a safe and effective approach to GWA analyses of multiple disease phenotypes; has generated a genome-wide genotype database for future studies of common diseases in the British population; and shown that, provided individuals with non-European ancestry are excluded, the extent of population stratification in the British population is generally modest. Our findings offer new avenues for exploring the pathophysiology of these important disorders. We anticipate that our data, results and software, which will be widely available to other investigators, will provide a powerful resource for human genetics research.

9,244 citations

Journal ArticleDOI
Monkol Lek, Konrad J. Karczewski1, Konrad J. Karczewski2, Eric Vallabh Minikel2, Eric Vallabh Minikel1, Kaitlin E. Samocha, Eric Banks1, Timothy Fennell1, Anne H. O’Donnell-Luria2, Anne H. O’Donnell-Luria3, Anne H. O’Donnell-Luria1, James S. Ware, Andrew J. Hill4, Andrew J. Hill2, Andrew J. Hill1, Beryl B. Cummings1, Beryl B. Cummings2, Taru Tukiainen2, Taru Tukiainen1, Daniel P. Birnbaum1, Jack A. Kosmicki, Laramie E. Duncan1, Laramie E. Duncan2, Karol Estrada2, Karol Estrada1, Fengmei Zhao2, Fengmei Zhao1, James Zou1, Emma Pierce-Hoffman2, Emma Pierce-Hoffman1, Joanne Berghout5, David Neil Cooper6, Nicole A. Deflaux7, Mark A. DePristo1, Ron Do, Jason Flannick2, Jason Flannick1, Menachem Fromer, Laura D. Gauthier1, Jackie Goldstein2, Jackie Goldstein1, Namrata Gupta1, Daniel P. Howrigan2, Daniel P. Howrigan1, Adam Kiezun1, Mitja I. Kurki1, Mitja I. Kurki2, Ami Levy Moonshine1, Pradeep Natarajan, Lorena Orozco, Gina M. Peloso1, Gina M. Peloso2, Ryan Poplin1, Manuel A. Rivas1, Valentin Ruano-Rubio1, Samuel A. Rose1, Douglas M. Ruderfer8, Khalid Shakir1, Peter D. Stenson6, Christine Stevens1, Brett Thomas1, Brett Thomas2, Grace Tiao1, María Teresa Tusié-Luna, Ben Weisburd1, Hong-Hee Won9, Dongmei Yu, David Altshuler1, David Altshuler10, Diego Ardissino, Michael Boehnke11, John Danesh12, Stacey Donnelly1, Roberto Elosua, Jose C. Florez2, Jose C. Florez1, Stacey Gabriel1, Gad Getz1, Gad Getz2, Stephen J. Glatt13, Christina M. Hultman14, Sekar Kathiresan, Markku Laakso15, Steven A. McCarroll1, Steven A. McCarroll2, Mark I. McCarthy16, Mark I. McCarthy17, Dermot P.B. McGovern18, Ruth McPherson19, Benjamin M. Neale1, Benjamin M. Neale2, Aarno Palotie, Shaun Purcell8, Danish Saleheen20, Jeremiah M. Scharf, Pamela Sklar, Patrick F. Sullivan21, Patrick F. Sullivan14, Jaakko Tuomilehto22, Ming T. Tsuang23, Hugh Watkins16, Hugh Watkins17, James G. Wilson24, Mark J. Daly1, Mark J. Daly2, Daniel G. MacArthur2, Daniel G. MacArthur1 
18 Aug 2016-Nature
TL;DR: The aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC) provides direct evidence for the presence of widespread mutational recurrence.
Abstract: Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.

8,758 citations