PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses
Shaun Purcell, Benjamin M. Neale, Kathe Todd-Brown, Lori Thomas, Manuel A. R. Ferreira, David Bender, Julian Maller, Pamela Sklar, Paul I.W. de Bakker, Mark J. Daly, Pak C. Sham
TLDR
This work introduces PLINK, an open-source C/C++ WGAS tool set, and describes its five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. It focuses on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies.
Abstract:
Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.
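To make the identity-by-state (IBS) idea in the abstract concrete, here is a minimal sketch of a pairwise IBS similarity between two individuals. It is an illustration only and not PLINK's implementation: the function name `ibs_similarity`, the 0/1/2 allele-count coding, and the use of `None` for missing genotypes are all assumptions made for this example.

```python
from typing import Optional, Sequence


def ibs_similarity(
    a: Sequence[Optional[int]], b: Sequence[Optional[int]]
) -> float:
    """Average proportion of alleles shared identical by state.

    Hypothetical helper: genotypes are assumed to be coded as counts of
    one allele (0, 1, or 2) per marker, with None marking a missing call.
    """
    shared = 0    # total alleles shared IBS across compared markers
    compared = 0  # markers where both individuals have a genotype
    for ga, gb in zip(a, b):
        if ga is None or gb is None:
            continue  # skip markers with a missing call in either individual
        shared += 2 - abs(ga - gb)  # 2, 1, or 0 alleles shared at this marker
        compared += 1
    if compared == 0:
        raise ValueError("no markers with genotypes in both individuals")
    return shared / (2 * compared)


# Three comparable markers share 2, 1, and 0 alleles: 3 of 6 alleles.
print(ibs_similarity([0, 1, 2, None], [0, 2, 0, 1]))
```

A value of 1.0 means the two individuals carry identical genotypes at every compared marker. Values computed over many markers give a genome-wide similarity measure of the kind the abstract describes using to detect population stratification.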
Citations
Journal Article
rMVP: A Memory-efficient, Visualization-enhanced, and Parallel-accelerated tool for Genome-Wide Association Study.
Lilin Yin, Haohao Zhang, Zhenshuang Tang, Xu Jingya, Dong Yin, Zhiwu Zhang, Xiaohui Yuan, Mengjin Zhu, Shuhong Zhao, Xinyun Li, Xiaolei Liu
TL;DR: This work presents a memory-efficient, visualization-enhanced, and parallel-accelerated R package called "rMVP" to address the need for improved GWAS computation.
Journal Article
Analysis and application of European genetic substructure using 300 K SNP information.
Chao Tian, Robert M. Plenge, Michael Ransom, Annette Lee, Pablo Villoslada, Carlo Selmi, Lars Klareskog, Ann E. Pulver, Lihong Qi, Peter K. Gregersen, Michael F. Seldin
TL;DR: Two sets of European substructure ancestry informative markers (ESAIMs) were identified that provide substantial substructure information that can be used for improving error rates in association testing of candidate genes and in replication studies of WGA scans.
Journal Article
Functional variants in the LRRK2 gene confer shared effects on risk for Crohn's disease and Parkinson's disease.
Ken Y. Hui, Heriberto Fernandez-Hernandez, Jianzhong Hu, Adam Schaffner, Nathan Pankratz, Nai Yun Hsu, Ling-Shiang Chuang, Shai Carmi, Nicole Villaverde, Xianting Li, Manuel Rivas, Adam P. Levine, Xiuliang Bao, Philippe R Labrias, Talin Haritunians, Darren Ruane, Kyle Gettler, Ernie Chen, Dalin Li, Elena R. Schiff, Nikolas Pontikos, Nir Barzilai, Steven R. Brant, Susan B. Bressman, Adam S. Cheifetz, Lorraine N. Clark, Mark J. Daly, Robert J. Desnick, Richard H. Duerr, Seymour Katz, Todd Lencz, Richard H. Myers, Harry Ostrer, Laurie J. Ozelius, Haydeh Payami, Yakov Peter, John D. Rioux, Anthony W. Segal, William K. Scott, Mark S. Silverberg, Jeffery M. Vance, Iban Ubarretxena-Belandia, Tatiana Foroud, Gil Atzmon, Itsik Pe'er, Yiannis A. Ioannou, Dermot P.B. McGovern, Zhenyu Yue, Eric E. Schadt, Judy H. Cho, Inga Peter
TL;DR: The presence of shared LRRK2 alleles in CD and PD provides refined insight into disease mechanisms and may have major implications for the treatment of these two seemingly unrelated diseases.
Journal Article
Collaborative Meta-analysis: Associations of 150 Candidate Genes With Osteoporosis and Osteoporotic Fracture
J. Brent Richards, Fotini K. Kavvoura, Fernando Rivadeneira, Unnur Styrkarsdottir, Karol Estrada, Bjarni V. Halldorsson, Yi-Hsiang Hsu, M. Carola Zillikens, Scott Wilson, Benjamin H. Mullin, Najaf Amin, Yurii S. Aulchenko, L. Adrienne Cupples, Panagiotis Deloukas, Serkalem Demissie, Albert Hofman, Augustine Kong, David Karasik, Joyce B. J. van Meurs, Ben A. Oostra, Huibert A. P. Pols, Gunnar Sigurdsson, Unnur Thorsteinsdottir, Nicole Soranzo, Frances M K Williams, Yanhua Zhou, Stuart H. Ralston, Gudmar Thorleifsson, Cornelia M. van Duijn, Douglas P. Kiel, Kari Stefansson, André G. Uitterlinden, John P. A. Ioannidis, Tim D. Spector
TL;DR: This analysis of genome-wide association results from 5 large populations found many common genetic variants that have robust statistical evidence for association with various traits and diseases, including osteoporosis.
Journal Article
An integrated genetic-epigenetic analysis of schizophrenia: evidence for co-localization of genetic associations and differential DNA methylation
Eilis Hannon, Emma Dempster, Joana Viana, Joe Burrage, Adam Smith, Ruby Macdonald, David St Clair, Colette J Mustard, Gerome Breen, Sebastian Therman, Jaakko Kaprio, Timothea Toulopoulou, Hilleke E. Hulshoff Pol, Marc M. Bohlken, René S. Kahn, Igor Nenadic, Christina M. Hultman, Robin M. Murray, David A. Collier, Nick Bass, Hugh Gurling, Andrew McQuillin, Leonard C. Schalkwyk, Jonathan Mill
TL;DR: This paper performed a multi-stage epigenome-wide association study, quantifying genome-wide patterns of DNA methylation in a total of 1714 individuals from three independent sample cohorts.