Sparse bayesian learning and the relevance vector machine

doi:10.1162/15324430152748236

Open AccessJournal ArticleDOI

Sparse bayesian learning and the relevance vector machine

Michael E. Tipping

- 01 Sep 2001 -

Journal of Machine Learning Research

- Vol. 1, pp 211-244

Chats0

TLDR

It is demonstrated that by exploiting a probabilistic Bayesian learning framework, the 'relevance vector machine' (RVM) can derive accurate prediction models which typically utilise dramatically fewer basis functions than a comparable SVM while offering a number of additional advantages.

Abstract:

This paper introduces a general Bayesian framework for obtaining sparse solutions to regression and classification tasks utilising models linear in the parameters Although this framework is fully general, we illustrate our approach with a particular specialisation that we denote the 'relevance vector machine' (RVM), a model of identical functional form to the popular and state-of-the-art 'support vector machine' (SVM) We demonstrate that by exploiting a probabilistic Bayesian learning framework, we can derive accurate prediction models which typically utilise dramatically fewer basis functions than a comparable SVM while offering a number of additional advantages These include the benefits of probabilistic predictions, automatic estimation of 'nuisance' parameters, and the facility to utilise arbitrary basis functions (eg non-'Mercer' kernels) We detail the Bayesian framework and associated learning algorithm for the RVM, and give some illustrative examples of its application along with some comparative benchmarks We offer some explanation for the exceptional degree of sparsity obtained, and discuss and demonstrate some of the advantageous features, and potential extensions, of Bayesian relevance learning

Sparse bayesian learning and the relevance vector machine

Citations

Computational detection and location of transcription start sites in mammalian genomic DNA.

Bayesian Inference: An Introduction to Principles and Practice in Machine Learning

Sparse Regularization via Convex Analysis

Machine Learning Methods for Predicting Failures in Hard Drives: A Multiple-Instance Application

Learning to Parse Pictures of People

References

The Nature of Statistical Learning Theory

Statistical learning theory

Neural networks for pattern recognition

Pattern Classification and Scene Analysis.

Pattern classification and scene analysis

Related Papers (5)

Regression Shrinkage and Selection via the Lasso

Pattern Recognition and Machine Learning

Compressed sensing

The Nature of Statistical Learning Theory

Statistical learning theory