scispace - formally typeset
Open AccessJournal ArticleDOI

Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data

Jiang Gui, +1 more
- Vol. 21, Iss: 13, pp 3001-3008
Reads0
Chats0
TLDR
Li et al. as discussed by the authors proposed a least-angle regression (LARS) method to select genes that are relevant to patients' survival and to build a predictive model for future prediction, which can be used for identifying important genes that were related to time to death due to cancer and for predicting the survival of future patients.
Abstract
Motivation: An important application of microarray technology is to relate gene expression profiles to various clinical phenotypes of patients. Success has been demonstrated in molecular classification of cancer in which the gene expression data serve as predictors and different types of cancer serve as a categorical outcome variable. However, there has been less research in linking gene expression profiles to the censored survival data such as patients' overall survival time or time to cancer relapse. It would be desirable to have models with good prediction accuracy and parsimony property. Results: We propose to use the L1 penalized estimation for the Cox model to select genes that are relevant to patients' survival and to build a predictive model for future prediction. The computational difficulty associated with the estimation in the high-dimensional and low-sample size settings can be efficiently solved by using the recently developed least-angle regression (LARS) method. Our simulation studies and application to real datasets on predicting survival after chemotherapy for patients with diffuse large B-cell lymphoma demonstrate that the proposed procedure, which we call the LARS--Cox procedure, can be used for identifying important genes that are related to time to death due to cancer and for building a parsimonious model for predicting the survival of future patients. The LARS--Cox regression gives better predictive performance than the L2 penalized regression and a few other dimension-reduction based methods. Conclusions: We conclude that the proposed LARS--Cox procedure can be very useful in identifying genes relevant to survival phenotypes and in building a parsimonious predictive model that can be used for classifying future patients into clinically relevant high- and low-risk groups based on the gene expression profile and survival times of previous patients. Supplementary information: http://dna.ucdavis.edu/~hli/LARSCox-Appendix.pdf Contact: hli@ucdavis.edu

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent

TL;DR: This work introduces a pathwise algorithm for the Cox proportional hazards model, regularized by convex combinations of ℓ1 andℓ2 penalties (elastic net), and employs warm starts to find a solution along a regularization path.
Journal ArticleDOI

L1 Penalized Estimation in the Cox Proportional Hazards Model

TL;DR: A novel algorithm that efficiently computes L1 penalized (lasso) estimates of parameters in high‐dimensional models, based on a combination of gradient ascent optimization with the Newton–Raphson algorithm, which is described for a general likelihood function.
Journal ArticleDOI

Radiomics Signature: A Potential Biomarker for the Prediction of Disease-Free Survival in Early-Stage (I or II) Non-Small Cell Lung Cancer.

TL;DR: Combination of the radiomics signature, traditional staging system, and other clinical-pathologic risk factors performed better for individualized DFS estimation in patients with early-stage NSCLC, which might enable a step forward precise medicine.
Journal ArticleDOI

Regularized gene selection in cancer microarray meta-analysis.

TL;DR: Simulation studies and analyses of multiple pancreatic and liver cancer experiments demonstrate the superior performance of the Meta Threshold Gradient Descent Regularization approach for gene selection in the meta analysis of cancer microarray data.
Journal ArticleDOI

Sparse Bayesian infinite factor models

TL;DR: This work proposes a multiplicative gamma process shrinkage prior on the factor loadings which allows introduction of infinitely many factors, with the loadings increasingly shrunk towards zero as the column index increases, and develops an efficient Gibbs sampler that scales well as data dimensionality increases.
References
More filters
Journal ArticleDOI

Regression Shrinkage and Selection via the Lasso

TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.
Book ChapterDOI

Regression Models and Life-Tables

TL;DR: The analysis of censored failure times is considered in this paper, where the hazard function is taken to be a function of the explanatory variables and unknown regression coefficients multiplied by an arbitrary and unknown function of time.
Journal ArticleDOI

Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.

TL;DR: A generic approach to cancer classification based on gene expression monitoring by DNA microarrays is described and applied to human acute leukemias as a test case and suggests a general strategy for discovering and predicting cancer classes for other types of cancer, independent of previous biological knowledge.
Journal ArticleDOI

Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications

TL;DR: Survival analyses on a subcohort of patients with locally advanced breast cancer uniformly treated in a prospective study showed significantly different outcomes for the patients belonging to the various groups, including a poor prognosis for the basal-like subtype and a significant difference in outcome for the two estrogen receptor-positive groups.
Related Papers (5)