scispace - formally typeset
Open AccessJournal ArticleDOI

Sparse bayesian learning and the relevance vector machine

Michael E. Tipping
- 01 Sep 2001 - 
- Vol. 1, pp 211-244
Reads0
Chats0
TLDR
It is demonstrated that by exploiting a probabilistic Bayesian learning framework, the 'relevance vector machine' (RVM) can derive accurate prediction models which typically utilise dramatically fewer basis functions than a comparable SVM while offering a number of additional advantages.
Abstract
This paper introduces a general Bayesian framework for obtaining sparse solutions to regression and classification tasks utilising models linear in the parameters Although this framework is fully general, we illustrate our approach with a particular specialisation that we denote the 'relevance vector machine' (RVM), a model of identical functional form to the popular and state-of-the-art 'support vector machine' (SVM) We demonstrate that by exploiting a probabilistic Bayesian learning framework, we can derive accurate prediction models which typically utilise dramatically fewer basis functions than a comparable SVM while offering a number of additional advantages These include the benefits of probabilistic predictions, automatic estimation of 'nuisance' parameters, and the facility to utilise arbitrary basis functions (eg non-'Mercer' kernels) We detail the Bayesian framework and associated learning algorithm for the RVM, and give some illustrative examples of its application along with some comparative benchmarks We offer some explanation for the exceptional degree of sparsity obtained, and discuss and demonstrate some of the advantageous features, and potential extensions, of Bayesian relevance learning

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Modelling of diesel engine performance using advanced machine learning methods under scarce and exponential data set

TL;DR: Kernel based ELM with the logarithmic transformation and hybrid inference is far better than basic ELM, LS-SVM, RVM, BPNN and RBFNN, in terms of prediction accuracy and training time.
Journal ArticleDOI

Cognitive Radio for Smart Grid: Theory, Algorithms, and Security

TL;DR: The novel concept of incorporating a cognitive radio network as the communications infrastructure for the smart grid is presented and experimental results obtained by using dimensionality reduction techniques such as principal component analysis (PCA), kernel PCA, and landmark maximum variance unfolding (LMVU) on Wi-Fi signal measurements are presented in a spectrum sensing context.
Posted Content

Car that Knows Before You Do: Anticipating Maneuvers via Learning Temporal Driving Models

TL;DR: This work proposes an Autoregressive Input-Output HMM to model the contextual information alongwith the maneuvers in driving maneuvers and shows that it can anticipate maneuvers 3.5 seconds before they occur with over 80% F1-score in real-time.
Journal ArticleDOI

Fast Inverse-Free Sparse Bayesian Learning via Relaxed Evidence Lower Bound Maximization

TL;DR: By invoking a fundamental property for smooth functions, a relaxed evidence lower bound is obtained (relaxed-ELBO) that is computationally more amiable than the conventional ELBO used by sparse Bayesian learning and leads to a computationally efficient inverse-free sparseBayesian learning algorithm.
Journal ArticleDOI

Integrating relevance vector machines and genetic algorithms for optimization of seed-separating process

TL;DR: A hybrid intelligent approach based on relevance vector machines (RVMs) and genetic algorithms (GAs) for optimal control of parameters of nonlinear manufacturing processes and the experimental results show the effectiveness of the proposed hybrid approach.
References
More filters
Book

The Nature of Statistical Learning Theory

TL;DR: Setting of the learning problem consistency of learning processes bounds on the rate of convergence ofLearning processes controlling the generalization ability of learning process constructing learning algorithms what is important in learning theory?

Statistical learning theory

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.
Book

Neural networks for pattern recognition

TL;DR: This is the first comprehensive treatment of feed-forward neural networks from the perspective of statistical pattern recognition, and is designed as a text, with over 100 exercises, to benefit anyone involved in the fields of neural computation and pattern recognition.
Book

Pattern classification and scene analysis

TL;DR: In this article, a unified, comprehensive and up-to-date treatment of both statistical and descriptive methods for pattern recognition is provided, including Bayesian decision theory, supervised and unsupervised learning, nonparametric techniques, discriminant analysis, clustering, preprosessing of pictorial data, spatial filtering, shape description techniques, perspective transformations, projective invariants, linguistic procedures, and artificial intelligence techniques for scene analysis.