PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses
Shaun Purcell,Shaun Purcell,Benjamin M. Neale,Benjamin M. Neale,Kathe Todd-Brown,Lori Thomas,Manuel A. R. Ferreira,David Bender,David Bender,Julian Maller,Julian Maller,Pamela Sklar,Pamela Sklar,Paul I.W. de Bakker,Paul I.W. de Bakker,Mark J. Daly,Mark J. Daly,Pak C. Sham +17 more
Reads0
Chats0
TLDR
This work introduces PLINK, an open-source C/C++ WGAS tool set, and describes the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation, which focuses on the estimation and use of identity- by-state and identity/descent information in the context of population-based whole-genome studies.Abstract:
Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.read more
Citations
More filters
Journal ArticleDOI
Investigation of dyslexia and SLI risk variants in reading- and language-impaired subjects
Dianne F. Newbury,Silvia Paracchini,Thomas S. Scerri,Laura Winchester,Laura Addis,Alex J. Richardson,J. Walter,John F. Stein,Joel B. Talcott,Anthony P. Monaco +9 more
TL;DR: The role of variants in these genes (namely MRPL19/C20RF3,ROBO1,DCDC2, KIAA0319, DYX1C1, CNTNAP2, ATP2C2 and CMIP) in the aetiology of SLI and dyslexia is investigated.
Journal ArticleDOI
An analytical framework for whole-genome sequence association studies and its implications for autism spectrum disorder.
Donna M. Werling,Harrison Brand,Harrison Brand,Joon Yong An,Matthew R. Stone,Lingxue Zhu,Joseph T. Glessner,Joseph T. Glessner,Ryan L. Collins,Shan Dong,Ryan M. Layer,Eirene Markenscoff-Papadimitriou,Andrew Farrell,Grace Schwartz,Harold Z. Wang,Benjamin Currall,Benjamin Currall,Xuefang Zhao,Xuefang Zhao,Jeanselle Dea,Clif Duhn,Carolyn A. Erdman,Michael C. Gilson,Rachita Yadav,Rachita Yadav,Robert E. Handsaker,Robert E. Handsaker,Seva Kashin,Seva Kashin,Lambertus Klei,Jeffrey D. Mandell,Tomasz J. Nowakowski,Yuwen Liu,Sirisha Pochareddy,Louw Smith,Michael F. Walker,Matthew J. Waterman,Xin He,Arnold R. Kriegstein,John L.R. Rubenstein,Nenad Sestan,Steven A. McCarroll,Steven A. McCarroll,Benjamin M. Neale,Benjamin M. Neale,Hilary Coon,A. Jeremy Willsey,Joseph D. Buxbaum,Mark J. Daly,Mark J. Daly,Matthew W. State,Aaron R. Quinlan,Gabor T. Marth,Kathryn Roeder,Bernie Devlin,Michael E. Talkowski,Stephen Sanders +56 more
TL;DR: Analyses of 519 autism spectrum disorder families did not identify association with any categories after correction for 4,123 effective tests, and the work suggests that robust results from WGS studies will require large cohorts and strategies that consider the substantial multiple-testing burden.
Journal ArticleDOI
Meta-analysis identifies five novel loci associated with endometriosis highlighting key genes involved in hormone metabolism
Yadav Sapkota,Yadav Sapkota,Valgerdur Steinthorsdottir,Andrew P. Morris,Andrew P. Morris,Amelie Fassbender,Nilufer Rahmioglu,Immaculata De Vivo,Immaculata De Vivo,Julie E. Buring,Julie E. Buring,Futao Zhang,Todd L. Edwards,Sarah H. Jones,Dorien O,Daniëlle Peterse,Kathryn M. Rexrode,Kathryn M. Rexrode,Paul M. Ridker,Paul M. Ridker,Andrew J. Schork,Andrew J. Schork,Stuart MacGregor,Nicholas G. Martin,Christian M. Becker,Sosuke Adachi,Kosuke Yoshihara,Takayuki Enomoto,Atsushi Takahashi,Yoichiro Kamatani,Koichi Matsuda,Michiaki Kubo,Gudmar Thorleifsson,Reynir Tómas Geirsson,Unnur Thorsteinsdottir,Unnur Thorsteinsdottir,Leanne Wallace,Leanne Wallace,Jian Yang,Digna R. Velez Edwards,Mette Nyegaard,Mette Nyegaard,Siew-Kee Low,Krina T. Zondervan,Krina T. Zondervan,Stacey A. Missmer,Stacey A. Missmer,Thomas D'Hooghe,Thomas D'Hooghe,Grant W. Montgomery,Grant W. Montgomery,Daniel I. Chasman,Daniel I. Chasman,Kari Stefansson,Kari Stefansson,Joyce Y. Tung,Dale R. Nyholt,Dale R. Nyholt +57 more
TL;DR: A meta-analysis of genome-wide association case-control data sets for endometriosis highlights novel variants in or near specific genes with important roles in sex steroid hormone signalling and function, and offers unique opportunities for more targeted functional research efforts.
Journal ArticleDOI
Identification of a shared genetic susceptibility locus for coronary heart disease and periodontitis.
Arne S. Schaefer,Gesa M. Richter,Birte Groessner-Schreiber,Barbara Noack,Michael Nothnagel,Nour-Eddine El Mokhtari,Bruno G. Loos,Søren Jepsen,Stefan Schreiber +8 more
TL;DR: It is demonstrated that CHD and periodontitis are genetically related by at least one susceptibility locus, which is possibly involved in ANRIL activity and independent of diabetes associated risk variants within this region.
Journal ArticleDOI
An atlas of dynamic chromatin landscapes in mouse fetal development
David U. Gorkin,David U. Gorkin,Iros Barozzi,Iros Barozzi,Yuan Zhao,Yuan Zhao,Yanxiao Zhang,Hui Huang,Hui Huang,Ah Young Lee,Bin Li,Joshua Chiou,Andre Wildberg,Bo Ding,Bo Zhang,Mengchi Wang,J. Seth Strattan,Jean M. Davidson,Yunjiang Qiu,Yunjiang Qiu,Veena Afzal,Jennifer A. Akiyama,Ingrid Plajzer-Frick,Catherine S. Novak,Momoe Kato,Tyler H. Garvin,Quan T. Pham,Anne N. Harrington,Brandon J. Mannion,Elizabeth Lee,Yoko Fukuda-Yuzawa,Yupeng He,Yupeng He,Sebastian Preissl,Sebastian Preissl,Sora Chee,Jee Yun Han,Brian A. Williams,Diane Trout,Henry Amrhein,Hongbo Yang,J. Michael Cherry,Wei Wang,Kyle J. Gaulton,Joseph R. Ecker,Yin Shen,Diane E. Dickel,Axel Visel,Axel Visel,Axel Visel,Len A. Pennacchio,Len A. Pennacchio,Len A. Pennacchio,Bing Ren +53 more
TL;DR: Analysis of chromatin state and accessibility in mouse tissues from twelve sites and eight developmental stages provides the most comprehensive view of Chromatin dynamics to date.
References
More filters
Journal ArticleDOI
Controlling the false discovery rate: a practical and powerful approach to multiple testing
Yoav Benjamini,Yosef Hochberg +1 more
TL;DR: In this paper, a different approach to problems of multiple significance testing is presented, which calls for controlling the expected proportion of falsely rejected hypotheses -the false discovery rate, which is equivalent to the FWER when all hypotheses are true but is smaller otherwise.
Journal ArticleDOI
Inference of population structure using multilocus genotype data
TL;DR: Pritch et al. as discussed by the authors proposed a model-based clustering method for using multilocus genotype data to infer population structure and assign individuals to populations, which can be applied to most of the commonly used genetic markers, provided that they are not closely linked.
Book
Statistical methods for rates and proportions
TL;DR: In this paper, the basic theory of Maximum Likelihood Estimation (MLE) is used to detect a difference between two different proportions of a given proportion in a single proportion.
Journal ArticleDOI
Haploview: analysis and visualization of LD and haplotype maps
TL;DR: Haploview is a software package that provides computation of linkage disequilibrium statistics and population haplotype patterns from primary genotype data in a visually appealing and interactive interface.
Journal ArticleDOI
Categorical Data Analysis
TL;DR: In this article, categorical data analysis was used for categorical classification of categorical categorical datasets.Categorical Data Analysis, categorical Data analysis, CDA, CPDA, CDSA