scispace - formally typeset
G

Guenther Walther

Researcher at Stanford University

Publications -  62
Citations -  7003

Guenther Walther is an academic researcher from Stanford University. The author has contributed to research in topics: Scan statistic & Cluster analysis. The author has an hindex of 24, co-authored 62 publications receiving 5965 citations.

Papers
More filters
Journal ArticleDOI

Estimating the number of clusters in a data set via the gap statistic

TL;DR: In this paper, the authors proposed a method called the "gap statistic" for estimating the number of clusters (groups) in a set of data, which uses the output of any clustering algorithm (e.g. K-means or hierarchical), comparing the change in within-cluster dispersion with that expected under an appropriate reference null distribution.
Journal ArticleDOI

Cluster Validation by Prediction Strength

TL;DR: The key idea is to view clustering as a supervised classification problem, in which the “true” class labels are estimated, and the resulting “prediction strength” measure assesses how many groups can be predicted from the data, and how well.
Journal ArticleDOI

Forward stagewise regression and the monotone lasso

TL;DR: The least angle regression and forward stagewise algorithms for solving penalized least squares regression problems are considered and a condition under which the coefficient paths of the lasso are monotone is studied, and hence the different algorithms coincide.
Journal ArticleDOI

Forward Stagewise Regression and the Monotone Lasso

TL;DR: In this paper, the authors consider the least angle regression and forward stagewise algorithms for solving penalized least squares regression problems, and show that the latter is a monotone version of the lasso.
Posted Content

Adaptive Concentration of Regression Trees, with Application to Random Forests

TL;DR: This approach breaks tree training into a model selection phase, followed by a model fitting phase where the best regression model consistent with these splits is found, and shows that the fitted regression tree concentrates around the optimal predictor with the same splits.