
Showing papers by "Robert Tibshirani published in 2013"


Book
28 Jul 2013
TL;DR: In this book, the authors describe the important ideas in these areas in a common conceptual framework, and the emphasis is on concepts rather than mathematics, with a liberal use of color graphics.
Abstract: During the past decade there has been an explosion in computation and information technology. With it have come vast amounts of data in a variety of fields such as medicine, biology, finance, and marketing. The challenge of understanding these data has led to the development of new tools in the field of statistics, and spawned new areas such as data mining, machine learning, and bioinformatics. Many of these tools have common underpinnings but are often expressed with different terminology. This book describes the important ideas in these areas in a common conceptual framework. While the approach is statistical, the emphasis is on concepts rather than mathematics. Many examples are given, with a liberal use of color graphics. It is a valuable resource for statisticians and anyone interested in data mining in science or industry. The book's coverage is broad, from supervised learning (prediction) to unsupervised learning. The many topics include neural networks, support vector machines, classification trees and boosting---the first comprehensive treatment of this topic in any book. This major new edition features many topics not covered in the original, including graphical models, random forests, ensemble methods, least angle regression and path algorithms for the lasso, non-negative matrix factorization, and spectral clustering. There is also a chapter on methods for ``wide'' data (p bigger than n), including multiple testing and false discovery rates. Trevor Hastie, Robert Tibshirani, and Jerome Friedman are professors of statistics at Stanford University. They are prominent researchers in this area: Hastie and Tibshirani developed generalized additive models and wrote a popular book of that title. Hastie co-developed much of the statistical modeling software and environment in R/S-PLUS and invented principal curves and surfaces. Tibshirani proposed the lasso and is co-author of the very successful An Introduction to the Bootstrap. Friedman is the co-inventor of many data-mining tools including CART, MARS, projection pursuit and gradient boosting.

19,261 citations


BookDOI
01 Jan 2013
TL;DR: An Introduction to Statistical Learning provides an accessible overview of statistical learning, an essential toolset for making sense of the vast and complex data sets that have emerged in science, industry, and other sectors in the past twenty years.
Abstract: An Introduction to Statistical Learning provides an accessible overview of the field of statistical learning, an essential toolset for making sense of the vast and complex data sets that have emerged in fields ranging from biology to finance to marketing to astrophysics in the past twenty years. This book presents some of the most important modeling and prediction techniques, along with relevant applications. Topics include linear regression, classification, resampling methods, shrinkage approaches, tree-based methods, support vector machines, clustering, and more. Color graphics and real-world examples are used to illustrate the methods presented. Since the goal of this textbook is to facilitate the use of these statistical learning techniques by practitioners in science, industry, and other fields, each chapter contains a tutorial on implementing the analyses and methods presented in R, an extremely popular open-source statistical software platform. Two of the authors co-wrote The Elements of Statistical Learning (Hastie, Tibshirani and Friedman, 2nd edition 2009), a popular reference book for statistics and machine learning researchers. An Introduction to Statistical Learning covers many of the same topics, but at a level accessible to a much broader audience. This book is targeted at statisticians and non-statisticians alike who wish to use cutting-edge statistical learning techniques to analyze their data. The text assumes only a previous course in linear regression and no knowledge of matrix algebra.

8,207 citations


Journal ArticleDOI
TL;DR: A regularized model for linear regression with ℓ1 and ℓ2 penalties is introduced, and it is shown to have the desired effect of group-wise and within-group sparsity.
Abstract: For high-dimensional supervised learning problems, often using problem-specific assumptions can lead to greater accuracy. For problems with grouped covariates, which are believed to have sparse effects both at the group level and within groups, we introduce a regularized model for linear regression with ℓ1 and ℓ2 penalties. We discuss the sparsity and other regularization properties of the optimal fit for this model, and show that it has the desired effect of group-wise and within-group sparsity. We propose an algorithm to fit the model via accelerated generalized gradient descent, and extend this model and algorithm to convex loss functions. We also demonstrate the efficacy of our model and the efficiency of our algorithm on simulated data. This article has online supplementary material.
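
To make the penalty concrete, here is a minimal base-R sketch of the sparse group lasso criterion as it reads from this abstract. The sqrt(group-size) weights and the (alpha, lambda) parameterization are assumptions to check against the paper; the CRAN package SGL (by the same authors, I believe) provides an actual fitting routine, whereas this sketch only evaluates the objective.

## Sparse group lasso objective (sketch, under the assumptions stated above):
## (1/(2n))||y - X b||^2 + (1 - alpha)*lambda*sum_g sqrt(p_g)*||b_g||_2 + alpha*lambda*||b||_1
sgl_objective <- function(X, y, beta, groups, lambda, alpha = 0.95) {
  n <- nrow(X)
  loss <- sum((y - X %*% beta)^2) / (2 * n)
  group_pen <- sum(sapply(unique(groups), function(g) {
    bg <- beta[groups == g]
    sqrt(length(bg)) * sqrt(sum(bg^2))            # weighted group-wise l2 norm
  }))
  loss + (1 - alpha) * lambda * group_pen + alpha * lambda * sum(abs(beta))
}

## Toy example: two groups of five predictors; the truth is sparse both across groups
## (group 2 is irrelevant) and within group 1 (only two of its five variables matter).
set.seed(1)
X <- matrix(rnorm(100 * 10), 100, 10)
beta_true <- c(2, -1, 0, 0, 0, rep(0, 5))
y <- X %*% beta_true + rnorm(100)
groups <- rep(1:2, each = 5)
sgl_objective(X, y, beta_true, groups, lambda = 0.1)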

1,233 citations


Journal ArticleDOI
TL;DR: A simple test statistic based on lasso fitted values is proposed, called the covariance test statistic, and it is shown that when the true model is linear, this statistic has an Exp(1) asymptotic distribution under the null hypothesis (the null being that all truly active variables are contained in the current lasso model).
Abstract: In the sparse linear regression setting, we consider testing the significance of the predictor variable that enters the current lasso model, in the sequence of models visited along the lasso solution path. We propose a simple test statistic based on lasso fitted values, called the covariance test statistic, and show that when the true model is linear, this statistic has an $\operatorname {Exp}(1)$ asymptotic distribution under the null hypothesis (the null being that all truly active variables are contained in the current lasso model). Our proof of this result for the special case of the first predictor to enter the model (i.e., testing for a single significant predictor variable against the global null) requires only weak assumptions on the predictor matrix $X$. On the other hand, our proof for a general step in the lasso path places further technical assumptions on $X$ and the generative model, but still allows for the important high-dimensional case $p>n$, and does not necessarily require that the current lasso model achieves perfect recovery of the truly active variables. Of course, for testing the significance of an additional variable between two nested linear models, one typically uses the chi-squared test, comparing the drop in residual sum of squares (RSS) to a $\chi^2_1$ distribution. But when this additional variable is not fixed, and has been chosen adaptively or greedily, this test is no longer appropriate: adaptivity makes the drop in RSS stochastically much larger than $\chi^2_1$ under the null hypothesis. Our analysis explicitly accounts for adaptivity, as it must, since the lasso builds an adaptive sequence of linear models as the tuning parameter $\lambda$ decreases. In this analysis, shrinkage plays a key role: though additional variables are chosen adaptively, the coefficients of lasso active variables are shrunken due to the $\ell_1$ penalty. Therefore, the test statistic (which is based on lasso fitted values) is in a sense balanced by these two opposing properties - adaptivity and shrinkage - and its null distribution is tractable and asymptotically $\operatorname {Exp}(1)$.
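
For reference, the display below is a reconstruction of the covariance test statistic at the k-th knot $\lambda_k$ of the lasso path; the notation ($A$ is the active set just before $\lambda_k$, $\hat{\beta}(\lambda_{k+1})$ the lasso solution at the next knot, $\tilde{\beta}_A(\lambda_{k+1})$ the lasso solution restricted to the variables in $A$) is my reading of the abstract and should be checked against the published paper.

$$T_k = \frac{\big\langle y,\, X\hat{\beta}(\lambda_{k+1})\big\rangle - \big\langle y,\, X_A\tilde{\beta}_A(\lambda_{k+1})\big\rangle}{\sigma^2} \;\xrightarrow{d}\; \operatorname{Exp}(1) \quad \text{under the null.}$$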

520 citations


Journal ArticleDOI
TL;DR: A simple, non-parametric method with resampling to account for the different sequencing depths is introduced, and it is found that the method discovers more consistent patterns than competing methods.
Abstract: We discuss the identification of features that are associated with an outcome in RNA-Sequencing (RNA-Seq) and other sequencing-based comparative genomic experiments. RNA-Seq data takes the form of counts, so models based on the normal distribution are generally unsuitable. The problem is especially challenging because different sequencing experiments may generate quite different total numbers of reads, or 'sequencing depths'. Existing methods for this problem are based on Poisson or negative binomial models: they are useful but can be heavily influenced by 'outliers' in the data. We introduce a simple, non-parametric method with resampling to account for the different sequencing depths. The new method is more robust than parametric methods. It can be applied to data with quantitative, survival, two-class or multiple-class outcomes. We compare our proposed method to Poisson and negative binomial-based methods in simulated and real data sets, and find that our method discovers more consistent patterns than competing methods.
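
As an illustration of the general recipe for a two-class outcome, here is a minimal base-R sketch, assuming the resampling step amounts to down-sampling each library to a common depth and the non-parametric statistic is rank-based. The authors' actual procedure (implemented, I believe, as SAMseq in the samr package) differs in its details.

## Simulated counts: 100 genes x 20 samples, two classes of 10 samples each.
set.seed(1)
counts <- matrix(rnbinom(2000, mu = 20, size = 5), nrow = 100)
group  <- rep(c(1, 2), each = 10)

## Equalize sequencing depths by binomially down-sampling each column to the smallest depth.
depths <- colSums(counts)
target <- min(depths)
downsampled <- sapply(seq_len(ncol(counts)), function(j)
  rbinom(nrow(counts), size = counts[, j], prob = target / depths[j]))

## Rank-based (Wilcoxon) statistic per gene; repeating the down-sampling and averaging
## the statistics would mimic the resampling flavor described in the abstract.
stats <- apply(downsampled, 1, function(g)
  wilcox.test(g[group == 1], g[group == 2], exact = FALSE)$statistic)
head(stats)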

431 citations


Journal ArticleDOI
TL;DR: The Hierarchical Lasso as mentioned in this paper adds a set of convex constraints to the lasso to produce sparse interaction models that honor the hierarchy restriction that an interaction only be included in a model if one or both variables are marginally important.
Abstract: We add a set of convex constraints to the lasso to produce sparse interaction models that honor the hierarchy restriction that an interaction only be included in a model if one or both variables are marginally important. We give a precise characterization of the effect of this hierarchy constraint, prove that hierarchy holds with probability one and derive an unbiased estimate for the degrees of freedom of our estimator. A bound on this estimate reveals the amount of fitting "saved" by the hierarchy constraint. We distinguish between parameter sparsity-the number of nonzero coefficients-and practical sparsity-the number of raw variables one must measure to make a new prediction. Hierarchy focuses on the latter, which is more closely tied to important data collection concerns such as cost, time and effort. We develop an algorithm, available in the R package hierNet, and perform an empirical study of our method.
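
A short usage sketch of the hierNet package named in the abstract. The function and argument names below (hierNet.path, hierNet.cv, the lamhat element of the CV object, the th component holding interaction coefficients) are written from memory and should be verified against the package documentation.

## install.packages("hierNet")   # CRAN package named in the abstract
library(hierNet)

set.seed(1)
x <- matrix(rnorm(200 * 5), 200, 5)
y <- x[, 1] + 2 * x[, 2] + 3 * x[, 1] * x[, 2] + rnorm(200)   # one true interaction, obeying hierarchy

path  <- hierNet.path(x, y)                  # hierarchical lasso fits over a grid of penalties
cvfit <- hierNet.cv(path, x, y)              # cross-validation along the path
fit   <- hierNet(x, y, lam = cvfit$lamhat)   # refit at the selected penalty
fit$th                                       # estimated interaction coefficient matrix (slot name assumed)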

370 citations


Book ChapterDOI
01 Jan 2013
TL;DR: This chapter describes tree-based methods for regression and classification, which involve stratifying or segmenting the predictor space into a number of simple regions.
Abstract: In this chapter, we describe tree-based methods for regression and classification. These involve stratifying or segmenting the predictor space into a number of simple regions. In order to make a prediction for a given observation, we typically use the mean or the mode of the training observations in the region to which it belongs. Since the set of splitting rules used to segment the predictor space can be summarized in a tree, these types of approaches are known as decision tree methods.
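
As a quick illustration of the idea, a classification tree on a built-in data set, using the rpart package that ships with R (the book's own labs use the tree package, as far as I recall).

library(rpart)

## Grow a classification tree: each split stratifies the predictor space into simpler regions.
fit <- rpart(Species ~ ., data = iris, method = "class")
print(fit)                                           # the splitting rules, summarized as a tree
predict(fit, iris[c(1, 51, 101), ], type = "class")  # prediction = modal class of the region an observation falls in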

145 citations


Book ChapterDOI
01 Jan 2013
TL;DR: This chapter discusses the support vector machine (SVM), an approach for classification that was developed in the computer science community in the 1990s and that has grown in popularity since then.
Abstract: In this chapter, we discuss the support vector machine (SVM), an approach for classification that was developed in the computer science community in the 1990s and that has grown in popularity since then. SVMs have been shown to perform well in a variety of settings, and are often considered one of the best “out of the box” classifiers.
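
A minimal example with the e1071 package (the interface the book's R labs use, as far as I recall), fitting a radial-kernel SVM to a toy problem with a non-linear class boundary.

library(e1071)

set.seed(1)
x <- matrix(rnorm(200 * 2), ncol = 2)
y <- factor(ifelse(x[, 1]^2 + x[, 2]^2 > 1.5, "outer", "inner"))   # circular decision boundary
dat <- data.frame(x = x, y = y)

fit <- svm(y ~ ., data = dat, kernel = "radial", cost = 1, gamma = 1)
table(predicted = predict(fit, dat), truth = dat$y)                # training confusion matrix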

84 citations


Book ChapterDOI
01 Jan 2013
TL;DR: In the regression setting, the standard linear model is commonly used to describe the relationship between a response Y and a set of variables, and one typically fits this model using least squares as mentioned in this paper.
Abstract: In the regression setting, the standard linear model $$Y = \beta_0 + \beta_1 X_1 + \cdots + \beta_p X_p + \epsilon \qquad (6.1)$$ is commonly used to describe the relationship between a response Y and a set of variables \(X_{1},X_{2},\ldots,X_{p}\). We have seen in Chapter 3 that one typically fits this model using least squares.
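
For completeness, the baseline fit that the rest of the chapter improves on (via subset selection, ridge regression, the lasso and dimension reduction) is ordinary least squares, for example with lm():

set.seed(1)
n <- 100
X <- matrix(rnorm(n * 3), n, 3)
y <- as.vector(1 + X %*% c(2, 0, -1) + rnorm(n))   # model (6.1) with p = 3

fit <- lm(y ~ X)                                   # least-squares estimates of beta_0, ..., beta_p
coef(fit)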

52 citations


Posted Content
TL;DR: In this article, the authors review several variance estimators and perform a reasonably extensive simulation study in an attempt to compare their finite sample performance, and it would seem from the results that variance estimators with adaptively chosen regularisation parameters perform admirably over a broad range of sparsity and signal strength settings.
Abstract: Variance estimation in the linear model when $p > n$ is a difficult problem. Standard least squares estimation techniques do not apply. Several variance estimators have been proposed in the literature, all with accompanying asymptotic results proving consistency and asymptotic normality under a variety of assumptions. It is found, however, that most of these estimators suffer large biases in finite samples when true underlying signals become less sparse with larger per-element signal strength. One estimator seems to be largely neglected in the literature: a residual sum of squares based estimator using Lasso coefficients with regularisation parameter selected adaptively (via cross-validation). In this paper, we review several variance estimators and perform a reasonably extensive simulation study in an attempt to compare their finite sample performance. It would seem from the results that variance estimators with adaptively chosen regularisation parameters perform admirably over a broad range of sparsity and signal strength settings. Finally, some initial theoretical analyses pertaining to these types of estimators are proposed and developed.
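
A hedged sketch of the estimator singled out in the abstract, as I read it: fit the lasso with the regularisation parameter chosen by cross-validation (glmnet), then estimate the error variance from the residual sum of squares with a degrees-of-freedom correction equal to the number of selected variables. The exact correction used in the paper is an assumption here.

library(glmnet)

set.seed(1)
n <- 100; p <- 200
X <- matrix(rnorm(n * p), n, p)
beta <- c(rep(1, 5), rep(0, p - 5))
y <- as.vector(X %*% beta + rnorm(n))               # true sigma^2 = 1

cvfit <- cv.glmnet(X, y)                            # lambda chosen adaptively by cross-validation
bhat  <- coef(cvfit, s = "lambda.min")[-1]          # drop the intercept
shat  <- sum(bhat != 0)                             # number of selected variables
rss   <- sum((y - predict(cvfit, newx = X, s = "lambda.min"))^2)
sigma2_hat <- rss / (n - shat)                      # RSS-based variance estimate
sigma2_hat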

50 citations


Posted Content
TL;DR: A simple test statistic based on a subsequence of the knots in the graphical lasso path has an exponential asymptotic null distribution, under the null hypothesis that the model contains the true connected components.
Abstract: We consider tests of significance in the setting of the graphical lasso for inverse covariance matrix estimation. We propose a simple test statistic based on a subsequence of the knots in the graphical lasso path. We show that this statistic has an exponential asymptotic null distribution, under the null hypothesis that the model contains the true connected components. Though the null distribution is asymptotic, we show through simulation that it provides a close approximation to the true distribution at reasonable sample sizes. Thus the test provides a simple, tractable test for the significance of new edges as they are introduced into the model. Finally, we show connections between our results and other results for regularized regression, as well as extensions of our results to other correlation-matrix-based methods like single-linkage clustering.

Journal ArticleDOI
TL;DR: An online, open-access, postpublication, peer review system that will increase the accountability of scientists for the quality of their research and the ability of readers to distinguish good from sloppy science is urged.

Journal ArticleDOI
TL;DR: Polymorphonuclear neutrophils play an important role in mediating the innate immune response after severe traumatic injury; however, the cellular proteome response to traumatic condition is still largely unknown.
Abstract: PURPOSE Polymorphonuclear neutrophils (PMNs) play an important role in mediating the innate immune response after severe traumatic injury; however, the cellular proteome response to traumatic condition is still largely unknown.

Journal ArticleDOI
TL;DR: Making good use of time-course information of gene expression improved the performance of classification compared with using gene expression from individual time points only, and this led to a new classification method using time-course gene expression.
Abstract: Classifying patients into different risk groups based on their genomic measurements can help clinicians design appropriate clinical treatment plans. To produce such a classification, gene expression data were collected on a cohort of burn patients, who were monitored across multiple time points. This led us to develop a new classification method using time-course gene expression. Our results showed that making good use of time-course information of gene expression improved the performance of classification compared with using gene expression from individual time points only. Our method is implemented in an R package: time-course prediction analysis using microarray.

Posted Content
20 Sep 2013
TL;DR: In this paper, the authors consider a multiple hypothesis testing setting where the hypotheses are ordered and one is only permitted to reject an initial contiguous block, H_1,\dots,H_k, of hypotheses.
Abstract: We consider a multiple hypothesis testing setting where the hypotheses are ordered and one is only permitted to reject an initial contiguous block, H_1,\dots,H_k, of hypotheses. A rejection rule in this setting amounts to a procedure for choosing the stopping point k. This setting is inspired by the sequential nature of many model selection problems, where choosing a stopping point or a model is equivalent to rejecting all hypotheses up to that point and none thereafter. We propose two new testing procedures, and prove that they control the false discovery rate in the ordered testing setting. We also show how the methods can be applied to model selection using recent results on p-values in sequential model selection settings.
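
A sketch of one stopping rule of this flavor: reject H_1, ..., H_khat where khat is the largest k with -(1/k) * sum_{i<=k} log(1 - p_i) <= alpha (a ForwardStop-type rule). I believe this corresponds to one of the two procedures proposed here, but treat that attribution as an assumption.

## ForwardStop-type rule for ordered p-values p_1, ..., p_m (attribution hedged above).
forward_stop <- function(pvals, alpha = 0.10) {
  running <- cumsum(-log(1 - pvals)) / seq_along(pvals)
  khat <- which(running <= alpha)
  if (length(khat) == 0) 0 else max(khat)
}

## Toy example: the first 5 ordered hypotheses are non-null (tiny p-values), the remaining 20 are null.
set.seed(1)
p <- c(runif(5, 0, 0.001), runif(20))
forward_stop(p, alpha = 0.10)     # number of hypotheses rejected from the start of the ordering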


01 Jan 2013
TL;DR: Statistics is a thriving discipline, more and more an essential part of science, business and societal activities, and the field of machine learning, discussed in this volume by my friend Larry Wasserman, has exploded and brought along with it the computational side of statistical research.
Abstract: When asked to reflect on an anniversary of their field, scientists in most fields would sing the praises of their subject. As a statistician, I will do the same. However, here the praise is justified! Statistics is a thriving discipline, more and more an essential part of science, business and societal activities. Class enrollments are up — it seems that everyone wants to be a statistician — and there are jobs everywhere. The field of machine learning, discussed in this volume by my friend Larry Wasserman, has exploded and brought along with it the computational side of statistical research. Hal Varian, Chief Economist at Google, said “I keep saying that the sexy job in the next 10 years will be statisticians. And I’m not kidding.” Nate Silver, creator of the New York Times political forecasting blog “538”, was constantly in the news and on talk shows in the run-up to the 2012 US election. Using careful statistical modelling, he forecast the election with near 100% accuracy (in contrast to many others). Although his training is in economics, he (proudly?) calls himself a statistician. When meeting people at a party, the label “Statistician” used to kill one’s chances of making a new friend. But no longer! In the midst of all this excitement about the growing importance of statistics, there are fascinating developments within the field itself. Here I will discuss one that has been the focus of my research and that of many other statisticians.

Journal ArticleDOI
TL;DR: The dominant gene signature in patients with chronic GVHD represented compensatory responses that control inflammation and included the interleukin-1 decoy receptor, IL-1 receptor type II, and genes that were profibrotic and associated with the IL-4, IL-6 and IL-10 signaling pathways.

Posted Content
TL;DR: This paper shows that this full model definition of FDR suffers from unintuitive and potentially undesirable behavior in the presence of correlated predictors, and proposes a new false selection error criterion, the False Variable Rate (FVR), that avoids these problems and behaves in a more intuitive manner.
Abstract: There has been recent interest in extending the ideas of False Discovery Rates (FDR) to variable selection in regression settings. Traditionally the FDR in these settings has been defined in terms of the coefficients of the full regression model. Recent papers have struggled with controlling this quantity when the predictors are correlated. This paper shows that this full model definition of FDR suffers from unintuitive and potentially undesirable behavior in the presence of correlated predictors. We propose a new false selection error criterion, the False Variable Rate (FVR), that avoids these problems and behaves in a more intuitive manner. We discuss the behavior of this criterion and how it compares with the traditional FDR, as well as presenting guidelines for determining which is appropriate in a particular setting. Finally, we present a simple estimation procedure for FVR in stepwise variable selection. We analyze the performance of this estimator and draw connections to recent estimators in the literature.


Book ChapterDOI
01 Jan 2013
TL;DR: This chapter relaxes the linearity assumption while still attempting to maintain as much interpretability as possible by examining very simple extensions of linear models like polynomial regression and step functions, as well as more sophisticated approaches such as splines, local regression, and generalized additive models.
Abstract: So far in this book, we have mostly focused on linear models. Linear models are relatively simple to describe and implement, and have advantages over other approaches in terms of interpretation and inference. However, standard linear regression can have significant limitations in terms of predictive power. This is because the linearity assumption is almost always an approximation, and sometimes a poor one. In Chapter 6 we see that we can improve upon least squares using ridge regression, the lasso, principal components regression, and other techniques. In that setting, the improvement is obtained by reducing the complexity of the linear model, and hence the variance of the estimates. But we are still using a linear model, which can only be improved so far! In this chapter we relax the linearity assumption while still attempting to maintain as much interpretability as possible. We do this by examining very simple extensions of linear models like polynomial regression and step functions, as well as more sophisticated approaches such as splines, local regression, and generalized additive models.
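
Two of the simpler extensions named above, on a built-in data set, using only base R and the splines package it ships with: a polynomial fit and a natural cubic spline.

library(splines)

fit_poly   <- lm(mpg ~ poly(hp, 3), data = mtcars)     # degree-3 polynomial regression
fit_spline <- lm(mpg ~ ns(hp, df = 4), data = mtcars)  # natural cubic spline with 4 degrees of freedom

## Compare the two fitted curves over the observed range of horsepower.
grid <- data.frame(hp = seq(min(mtcars$hp), max(mtcars$hp), length.out = 5))
cbind(poly = predict(fit_poly, grid), spline = predict(fit_spline, grid))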

Posted Content
TL;DR: A method for estimating the parameters that compensates for the missing observations is introduced: it first derives an unbiased estimator of the objective function with respect to the missing data and then modifies the criterion to ensure convexity.
Abstract: We investigate methods for penalized regression in the presence of missing observations. This paper introduces a method for estimating the parameters which compensates for the missing observations. We first derive an unbiased estimator of the objective function with respect to the missing data and then modify the criterion to ensure convexity. Finally, we extend our approach to a family of models that embraces the mean imputation method. These approaches are compared to the mean imputation method, one of the simplest methods for dealing with the missing-observations problem, via simulations. We also investigate the problem of making predictions when there are missing values in the test set.

Posted Content
TL;DR: In this paper, a new sparse regression method called the component lasso is proposed, which uses the connected-components structure of the sample covariance matrix to split the problem into smaller ones and then solves the subproblems separately, obtaining a coefficient vector for each one.
Abstract: We propose a new sparse regression method called the component lasso, based on a simple idea. The method uses the connected-components structure of the sample covariance matrix to split the problem into smaller ones. It then solves the subproblems separately, obtaining a coefficient vector for each one. Then, it uses non-negative least squares to recombine the different vectors into a single solution. This step is useful in selecting and reweighting components that are correlated with the response. Simulated and real data examples show that the component lasso can outperform standard regression methods such as the lasso and elastic net, achieving a lower mean squared error as well as better support recovery.
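
A hedged sketch of the three-stage idea as described in the abstract: split the predictors into connected components of the sample covariance (here after thresholding the sample correlations, which is an assumption; the paper's exact rule for forming components may differ), fit a lasso within each component, and recombine the component fits with non-negative least squares. Uses the glmnet and nnls packages from CRAN.

library(glmnet)
library(nnls)

set.seed(1)
n <- 100
z <- rnorm(n)
X1 <- sapply(1:5, function(i) z + rnorm(n))   # correlated block, related to the response
X2 <- matrix(rnorm(n * 5), n, 5)              # independent noise block
X  <- cbind(X1, X2)
y  <- as.vector(X1 %*% c(2, -1, 1, 0, 0) + rnorm(n))

## (1) Connected components of the thresholded sample correlation matrix (threshold is an assumption).
adj <- abs(cor(X)) > 0.3
comp <- rep(NA_integer_, ncol(X)); cur <- 0
for (j in seq_len(ncol(X))) {
  if (is.na(comp[j])) {
    cur <- cur + 1
    frontier <- j
    while (length(frontier) > 0) {            # breadth-first search over the adjacency graph
      comp[frontier] <- cur
      frontier <- which(apply(adj[frontier, , drop = FALSE], 2, any) & is.na(comp))
    }
  }
}

## (2) A lasso fit (fitted values) within each component.
fits <- sapply(seq_len(cur), function(k) {
  Xk <- X[, comp == k, drop = FALSE]
  if (ncol(Xk) < 2) return(as.vector(Xk %*% coef(lm(y ~ Xk - 1))))   # glmnet needs >= 2 columns
  as.vector(predict(cv.glmnet(Xk, y), newx = Xk, s = "lambda.min"))
})

## (3) Recombine the per-component fitted values with non-negative least squares.
nnls(cbind(fits), y)$x                        # non-negative weight given to each component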

Posted Content
TL;DR: In this article, a method based on semidefinite programming is proposed to automatically quantify the bias of missing-value imputation via conditional expectation, and it is shown that the method can give an accurate assessment of the true error in cases where estimates based on sampling uncertainty alone are overly optimistic.
Abstract: In some multivariate problems with missing data, pairs of variables exist that are never observed together. For example, some modern biological tools can produce data of this form. As a result of this structure, the covariance matrix is only partially identifiable, and point estimation requires that identifying assumptions be made. These assumptions can introduce an unknown and potentially large bias into the inference. This paper presents a method based on semidefinite programming for automatically quantifying this potential bias by computing the range of possible equal-likelihood inferred values for convex functions of the covariance matrix. We focus on the bias of missing value imputation via conditional expectation and show that our method can give an accurate assessment of the true error in cases where estimates based on sampling uncertainty alone are overly optimistic.