Bias in Estimating the Variance of K-Fold Cross-Validation

doi:10.1007/0-387-24555-3_5

Book Chapter

Yoshua Bengio, +1 more

- pp 75-95

TLDR

The main theorem shows that there exists no universal (valid under all distributions) unbiased estimator of the variance of K-fold cross-validation, based on a single computation of the K-fold cross-validation estimator.

Abstract:

Most machine learning researchers perform quantitative experiments to estimate generalization error and compare the performance of different algorithms (in particular, their proposed algorithm). To draw statistically convincing conclusions, it is important to estimate the uncertainty of such estimates. This paper studies the very commonly used K-fold cross-validation estimator of generalization performance. The main theorem shows that there exists no universal (valid under all distributions) unbiased estimator of the variance of K-fold cross-validation, based on a single computation of the K-fold cross-validation estimator. The analysis that accompanies this result is based on the eigen-decomposition of the covariance matrix of errors, which has only three different eigenvalues corresponding to three degrees of freedom of the matrix and three components of the total variance. This analysis helps to better understand the nature of the problem and how it can make naive estimators (that don't take into account the error correlations due to the overlap between training and test sets) grossly underestimate variance. This is confirmed by numerical experiments in which the three components of the variance are compared when the difficulty of the learning problem and the number of folds are varied.
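
The three-eigenvalue structure mentioned in the abstract can be illustrated numerically. The sketch below is not taken from the chapter: it assumes the covariance matrix of the n individual test errors has one value on the diagonal (called sigma2 here), one value for pairs of errors in the same test fold (omega), and one value for pairs in different folds (gamma). The function names and the numeric values are hypothetical, chosen only for illustration.

```python
import numpy as np

def cv_error_covariance(n, k, sigma2, omega, gamma):
    """Build an n x n covariance matrix of test errors under the ASSUMED
    block structure: sigma2 on the diagonal, omega within a fold,
    gamma across folds. Illustrative only."""
    if n % k != 0:
        raise ValueError("this sketch assumes n is divisible by k")
    m = n // k                                  # examples per fold
    fold = np.repeat(np.arange(k), m)           # fold index of each example
    same_fold = fold[:, None] == fold[None, :]  # True where both errors share a fold
    cov = np.where(same_fold, omega, gamma).astype(float)
    np.fill_diagonal(cov, sigma2)
    return cov

def distinct_eigenvalues(cov, decimals=8):
    """Return the distinct eigenvalues of a symmetric matrix, sorted ascending."""
    return np.unique(np.round(np.linalg.eigvalsh(cov), decimals))

def variance_of_cv_mean(cov):
    """Variance of the average of the n errors: (1/n^2) * sum of all covariances."""
    n = cov.shape[0]
    return cov.sum() / n**2

# Hypothetical values, not results from the chapter.
n, k = 12, 3
sigma2, omega, gamma = 1.0, 0.3, 0.1

cov = cv_error_covariance(n, k, sigma2, omega, gamma)
eig = distinct_eigenvalues(cov)
true_var = variance_of_cv_mean(cov)
naive_var = sigma2 / n   # what one would get by ignoring omega and gamma

print("distinct eigenvalues:", eig)
print("variance of CV mean:", true_var)
print("variance ignoring correlations:", naive_var)
```

With positive omega and gamma, the variance of the cross-validation average in this toy setting is larger than the value obtained by treating the errors as uncorrelated, which is the direction of the underestimation the abstract describes.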

Citations

Sensitivity Analysis of k-Fold Cross Validation in Prediction Error Estimation

Predicting Diabetes Mellitus With Machine Learning Techniques.

Study of Deep Learning Techniques for Side-Channel Analysis and Introduction to ASCAD Database.

Feature Selection With Harmony Search

Deep learning for side-channel analysis and introduction to ASCAD database
