PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses
Shaun Purcell,Shaun Purcell,Benjamin M. Neale,Benjamin M. Neale,Kathe Todd-Brown,Lori Thomas,Manuel A. R. Ferreira,David Bender,David Bender,Julian Maller,Julian Maller,Pamela Sklar,Pamela Sklar,Paul I.W. de Bakker,Paul I.W. de Bakker,Mark J. Daly,Mark J. Daly,Pak C. Sham +17 more
Reads0
Chats0
TLDR
This work introduces PLINK, an open-source C/C++ WGAS tool set, and describes the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation, which focuses on the estimation and use of identity- by-state and identity/descent information in the context of population-based whole-genome studies.Abstract:
Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.read more
Citations
More filters
Journal ArticleDOI
Genome-wide association studies establish that human intelligence is highly heritable and polygenic
Gail Davies,Albert Tenesa,Albert Tenesa,Antony Payton,Jian Yang,Sarah E. Harris,D. C. Liewald,Xiayi Ke,S. Le Hellard,Andrea Christoforou,Michelle Luciano,Kevin A. McGhee,Lorna M. Lopez,Alan J. Gow,Janie Corley,Paul Redmond,Helen C. Fox,Paul Haggarty,Lawrence J. Whalley,Geraldine McNeill,Michael E. Goddard,Thomas Espeseth,Astri J. Lundervold,Ivar Reinvang,Andrew Pickles,Vidar M. Steen,Vidar M. Steen,William E R Ollier,David J. Porteous,Michael A. Horan,John M. Starr,Neil Pendleton,Peter M. Visscher,Peter M. Visscher,Ian J. Deary +34 more
TL;DR: A genome-wide analysis of unrelated adults with data on single nucleotide polymorphisms and detailed phenotypes on cognitive traits unequivocally confirms that a substantial proportion of individual differences in human intelligence is due to genetic variation, and is consistent with many genes of small effects underlying the additive genetic influences on intelligence.
Journal ArticleDOI
PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files.
TL;DR: PopLDdecay, an open source software, for LD decay analysis from VCF files is fast and is able to handle large number of variants from sequencing data and is also storage saving by avoiding exporting pair-wise results of LD measurements.
Journal ArticleDOI
High-density genetic mapping identifies new susceptibility loci for rheumatoid arthritis
Steve Eyre,John Bowes,John Bowes,Dorothée Diogo,Dorothée Diogo,Annette Lee,Anne Barton,Anne Barton,Paul Martin,Paul Martin,Alexandra Zhernakova,Alexandra Zhernakova,Eli A. Stahl,Eli A. Stahl,Sebastien Viatte,Sebastien Viatte,Kate McAllister,Kate McAllister,Christopher I. Amos,Leonid Padyukov,René E. M. Toes,Tom W J Huizinga,Cisca Wijmenga,Gosia Trynka,Lude Franke,Harm-Jan Westra,Lars Alfredsson,Xinli Hu,Cynthia Sandor,Cynthia Sandor,Paul I.W. de Bakker,Sonia Davila,Chiea Chuen Khor,Khai Koon Heng,Robert Andrews,Sarah Edkins,Sarah E. Hunt,Cordelia Langford,Deborah P M Symmons,Deborah P M Symmons,Genomics Study Syndicate,Patrick Concannon,Suna Onengut-Gumuscu,Stephen S. Rich,Panos Deloukas,Miguel A. Gonzalez-Gay,Luis Rodriguez-Rodriguez,Lisbeth Ärlsetig,Javier Martin,Solbritt Rantapää-Dahlqvist,Robert M. Plenge,Robert M. Plenge,Soumya Raychaudhuri,Lars Klareskog,Peter K. Gregersen,Jane Worthington,Jane Worthington +56 more
TL;DR: This study illustrates the advantages of dense SNP mapping analysis to inform subsequent functional investigations and refined the peak of association to a single gene for 19 loci, identified secondary independent effects at 6 loci and identified association to low-frequency variants at 4 loci.
Journal ArticleDOI
HaploGrep 2: mitochondrial haplogroup classification in the era of high-throughput sequencing.
Hansi Weissensteiner,Dominic Pacher,Anita Kloss-Brandstätter,Lukas Forer,Günther Specht,Hans-Jürgen Bandelt,Florian Kronenberg,Antonio Salas,Sebastian Schönherr +8 more
TL;DR: This work presents the completely updated version HaploGrep 2 offering several advanced features, including a generic rule-based system for immediate quality control (QC), which allows detecting artificial recombinants and missing variants as well as annotating rare and phantom mutations.
Journal ArticleDOI
Analysis of polygenic risk score usage and performance in diverse human populations
Laramie E. Duncan,Hanyang Shen,Bizu Gelaye,J. Meijsen,Kerry J. Ressler,Marcus W. Feldman,Roseann E. Peterson,Benjamin W. Domingue +7 more
TL;DR: Findings highlight the need for improved treatment of linkage disequilibrium and variant frequencies when applying polygenic scoring to cohorts of non-European ancestry, and bolster the rationale for large-scale GWAS in diverse human populations.
References
More filters
Journal ArticleDOI
Controlling the false discovery rate: a practical and powerful approach to multiple testing
Yoav Benjamini,Yosef Hochberg +1 more
TL;DR: In this paper, a different approach to problems of multiple significance testing is presented, which calls for controlling the expected proportion of falsely rejected hypotheses -the false discovery rate, which is equivalent to the FWER when all hypotheses are true but is smaller otherwise.
Journal ArticleDOI
Inference of population structure using multilocus genotype data
TL;DR: Pritch et al. as discussed by the authors proposed a model-based clustering method for using multilocus genotype data to infer population structure and assign individuals to populations, which can be applied to most of the commonly used genetic markers, provided that they are not closely linked.
Book
Statistical methods for rates and proportions
TL;DR: In this paper, the basic theory of Maximum Likelihood Estimation (MLE) is used to detect a difference between two different proportions of a given proportion in a single proportion.
Journal ArticleDOI
Haploview: analysis and visualization of LD and haplotype maps
TL;DR: Haploview is a software package that provides computation of linkage disequilibrium statistics and population haplotype patterns from primary genotype data in a visually appealing and interactive interface.
Journal ArticleDOI
Categorical Data Analysis
TL;DR: In this article, categorical data analysis was used for categorical classification of categorical categorical datasets.Categorical Data Analysis, categorical Data analysis, CDA, CPDA, CDSA