scispace - formally typeset
Open AccessJournal ArticleDOI

Genome-wide association analysis by lasso penalized logistic regression

TLDR
The performance of lasso penalized logistic regression in case-control disease gene mapping with a large number of SNPs (single nucleotide polymorphisms) predictors is evaluated and coeliac disease results replicate the previous SNP results and shed light on possible interactions among the SNPs.
Abstract
Motivation: In ordinary regression, imposition of a lasso penalty makes continuous model selection straightforward. Lasso penalized regression is particularly advantageous when the number of predictors far exceeds the number of observations. Method: The present article evaluates the performance of lasso penalized logistic regression in case–control disease gene mapping with a large number of SNPs (single nucleotide polymorphisms) predictors. The strength of the lasso penalty can be tuned to select a predetermined number of the most relevant SNPs and other predictors. For a given value of the tuning constant, the penalized likelihood is quickly maximized by cyclic coordinate ascent. Once the most potent marginal predictors are identified, their two-way and higher order interactions can also be examined by lasso penalized logistic regression. Results: This strategy is tested on both simulated and real data. Our findings on coeliac disease replicate the previous SNP results and shed light on possible interactions among the SNPs. Availability: The software discussed is available in Mendel 9.0 at the UCLA Human Genetics web site. Contact: klange@ucla.edu Supplementary information: Supplementary data are available at Bioinformatics online.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Regularization Paths for Generalized Linear Models via Coordinate Descent

TL;DR: In comparative timings, the new algorithms are considerably faster than competing methods and can handle large problems and can also deal efficiently with sparse features.
BookDOI

Statistical Learning with Sparsity: The Lasso and Generalizations

TL;DR: Statistical Learning with Sparsity: The Lasso and Generalizations presents methods that exploit sparsity to help recover the underlying signal in a set of data and extract useful and reproducible patterns from big datasets.
Journal ArticleDOI

Power and Predictive Accuracy of Polygenic Risk Scores

TL;DR: It is shown that published studies with significant association of polygenic scores have been well powered, whereas those with negative results can be explained by low sample size, and that useful levels of prediction may only be approached when predictors are estimated from very large samples.
Journal ArticleDOI

Analysing biological pathways in genome-wide association studies

TL;DR: The development of pathway-based approaches for GWA studies are reviewed, their practical use and caveats are discussed, and it is suggested that pathway- based approaches may also be useful for future GWA study data sets with sequencing data.
Journal ArticleDOI

Polygenic modeling with bayesian sparse linear mixed models.

TL;DR: This work applies Bayesian sparse linear mixed model (BSLMM) and compares it with other methods for two polygenic modeling applications: estimating the proportion of variance in phenotypes explained (PVE) by available genotypes, and phenotype (or breeding value) prediction, and demonstrates that BSLMM considerably outperforms either of the other two methods.
References
More filters
Journal ArticleDOI

Identification of SNP interactions using logic regression.

Holger Schwender, +1 more
- 01 Jan 2008 - 
TL;DR: This paper shows how logic regression can be employed to identify SNP interactions explanatory for the disease status in a case-control study and proposes 2 measures for quantifying the importance of these interactions for classification.
Journal ArticleDOI

Accommodating Linkage Disequilibrium in Genetic-Association Analyses via Ridge Regression

TL;DR: The results suggest that ridge regression and related techniques have the potential to distinguish causative from noncausative variations in association studies.
Journal ArticleDOI

A Fast Method for Computing High-Significance Disease Association in Large Population-Based Studies

TL;DR: This work presents a faster algorithm for accurately calculating low P values in case-control association studies based on importance sampling and on accounting for the decay in linkage disequilibrium along the chromosome, which significantly increases the problem-size range for which accurate, meaningful association results are attainable.
Journal ArticleDOI

Statistical advances and challenges for analyzing correlated high dimensional SNP data in genomic study for complex diseases

Yulan Liang, +1 more
- 28 Mar 2008 - 
TL;DR: A review of recent statistical advances and challenges for analyzing correlated high dimensional SNP data in genomic association studies for complex diseases and more specific approaches for SNPs data, which include unsupervised haplotype mapping, tag SNP selection, and supervised SNPs selection are presented.
Posted Content

LASSO-Patternsearch algorithm with application to ophthalmology and genomic data

TL;DR: The LASSO-Patternsearch algorithm as discussed by the authors is designed for the case where there is a possibly very large number of candidate patterns but it is believed that only a relatively small number are important.
Related Papers (5)