PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses

doi:10.1086/519795

Shaun Purcell¹,², Benjamin M. Neale²,³, Kathe Todd-Brown¹, Lori Thomas¹, Manuel A. R. Ferreira¹, David Bender¹,², Julian Maller¹,², Pamela Sklar¹,², Paul I.W. de Bakker¹,², Mark J. Daly¹,², Pak C. Sham⁴

Harvard University¹, Massachusetts Institute of Technology², University of London³, University of Hong Kong⁴

01 Sep 2007, American Journal of Human Genetics (Elsevier), Vol. 81, Iss. 3, pp. 559-575

TL;DR: This work introduces PLINK, an open-source C/C++ WGAS tool set, and describes its five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. It focuses on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies.

Abstract: Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.
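To make the notion of identity-by-state (IBS) sharing concrete, here is a minimal Python sketch of the standard pairwise IBS proportion between two individuals. It is an illustration only, not PLINK's implementation: the function name `ibs_proportion`, the 0/1/2 allele-count coding, and the use of `None` for missing genotypes are all assumptions made for this example.

```python
from typing import Optional, Sequence


def ibs_proportion(a: Sequence[Optional[int]], b: Sequence[Optional[int]]) -> float:
    """Proportion of alleles shared IBS over SNPs genotyped in both individuals.

    Genotypes are assumed to be coded as counts of one allele (0, 1, or 2),
    with None marking a missing call. This coding is an assumption of the
    sketch, not a PLINK file format.
    """
    if len(a) != len(b):
        raise ValueError("genotype vectors must have the same length")
    shared = 0  # running total of alleles shared IBS
    typed = 0   # number of SNPs genotyped in both individuals
    for ga, gb in zip(a, b):
        if ga is None or gb is None:
            continue  # skip SNPs missing in either individual
        # Two diploid genotypes share 2, 1, or 0 alleles IBS at a biallelic SNP.
        shared += 2 - abs(ga - gb)
        typed += 1
    if typed == 0:
        raise ValueError("no SNPs genotyped in both individuals")
    return shared / (2 * typed)


ind1 = [0, 1, 2, None, 2]
ind2 = [0, 2, 0, 1, 2]
print(ibs_proportion(ind1, ind2))  # 0.625
```

Here four SNPs are typed in both individuals and they share 2, 1, 0, and 2 alleles respectively, giving 5 of 8 alleles shared. Identity-by-descent estimation, by contrast, requires allele frequencies and a probabilistic model, which this sketch does not attempt.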
