scispace - formally typeset
Open AccessJournal ArticleDOI

Sparse bayesian learning and the relevance vector machine

Michael E. Tipping
- 01 Sep 2001 - 
- Vol. 1, pp 211-244
Reads0
Chats0
TLDR
It is demonstrated that by exploiting a probabilistic Bayesian learning framework, the 'relevance vector machine' (RVM) can derive accurate prediction models which typically utilise dramatically fewer basis functions than a comparable SVM while offering a number of additional advantages.
Abstract
This paper introduces a general Bayesian framework for obtaining sparse solutions to regression and classification tasks utilising models linear in the parameters Although this framework is fully general, we illustrate our approach with a particular specialisation that we denote the 'relevance vector machine' (RVM), a model of identical functional form to the popular and state-of-the-art 'support vector machine' (SVM) We demonstrate that by exploiting a probabilistic Bayesian learning framework, we can derive accurate prediction models which typically utilise dramatically fewer basis functions than a comparable SVM while offering a number of additional advantages These include the benefits of probabilistic predictions, automatic estimation of 'nuisance' parameters, and the facility to utilise arbitrary basis functions (eg non-'Mercer' kernels) We detail the Bayesian framework and associated learning algorithm for the RVM, and give some illustrative examples of its application along with some comparative benchmarks We offer some explanation for the exceptional degree of sparsity obtained, and discuss and demonstrate some of the advantageous features, and potential extensions, of Bayesian relevance learning

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Prediction of Drug–Target Interaction Networks from the Integration of Protein Sequences and Drug Chemical Structures

TL;DR: A novel computational approach based on protein sequence, namely PDTPS (Predicting Drug Targets with Protein Sequence) to predict DTI is proposed, which has good prediction performance and is a useful tool and suitable for predicting DTI, as well as other bioinformatics tasks.
Journal ArticleDOI

A Modified Relevance Vector Machine for PEM Fuel-Cell Stack Aging Prediction

TL;DR: In this article, a modified relevance vector machine (RVM) was proposed to predict the performance degradation of PEMFC stack output voltage over time, and the results have demonstrated that the modified RVM can achieve better performance of prediction than SVM, especially in the cases with relatively small training data sets.
Journal ArticleDOI

Kernel-based online machine learning and support vector reduction

TL;DR: It is shown that the concept of span of support vectors can be used to build a classifier that performs reasonably well while satisfying given space and time constraints, thus making it potentially suitable for such online situations.
Proceedings ArticleDOI

Sparse signal recovery in the presence of correlated multiple measurement vectors

TL;DR: This work model sources as AR processes, and then incorporate such information into the framework of sparse Bayesian learning for sparse signal recovery, and proposes algorithms to account for temporal correlation and its impact on performance.
Proceedings ArticleDOI

Multi-task compressive sensing with Dirichlet process priors

TL;DR: This paper proposes a novel multitask compressive sensing framework based on a Bayesian formalism, where a Dirichlet process (DP) prior is employed, yielding a principled means of simultaneously inferring the appropriate sharing mechanisms as well as CS inversion for each task.
References
More filters
Book

The Nature of Statistical Learning Theory

TL;DR: Setting of the learning problem consistency of learning processes bounds on the rate of convergence ofLearning processes controlling the generalization ability of learning process constructing learning algorithms what is important in learning theory?

Statistical learning theory

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.
Book

Neural networks for pattern recognition

TL;DR: This is the first comprehensive treatment of feed-forward neural networks from the perspective of statistical pattern recognition, and is designed as a text, with over 100 exercises, to benefit anyone involved in the fields of neural computation and pattern recognition.
Book

Pattern classification and scene analysis

TL;DR: In this article, a unified, comprehensive and up-to-date treatment of both statistical and descriptive methods for pattern recognition is provided, including Bayesian decision theory, supervised and unsupervised learning, nonparametric techniques, discriminant analysis, clustering, preprosessing of pictorial data, spatial filtering, shape description techniques, perspective transformations, projective invariants, linguistic procedures, and artificial intelligence techniques for scene analysis.