Open Access Journal Article

Multi-Task Learning for Classification with Dirichlet Process Priors

TL;DR
Experimental results on two real-life MTL problems indicate that the proposed algorithms automatically identify subgroups of related tasks whose training data appear to be drawn from similar distributions, and are more accurate than simpler approaches such as single-task learning, pooling of data across all tasks, and simplified approximations to DP.
Abstract
Consider the problem of learning logistic-regression models for multiple classification tasks, where the training data set for each task is not drawn from the same statistical distribution. In such a multi-task learning (MTL) scenario, it is necessary to identify groups of similar tasks that should be learned jointly. Relying on a Dirichlet process (DP) based statistical model to learn the extent of similarity between classification tasks, we develop computationally efficient algorithms for two different forms of the MTL problem. First, we consider a symmetric multi-task learning (SMTL) situation in which classifiers for multiple tasks are learned jointly using a variational Bayesian (VB) algorithm. Second, we consider an asymmetric multi-task learning (AMTL) formulation in which the posterior density function of the SMTL model parameters (from previous tasks) is used as a prior for a new task; this approach has the significant advantage of not requiring storage and use of all previous data from prior tasks. The AMTL formulation is solved with a simple Markov chain Monte Carlo (MCMC) construction. Experimental results on two real-life MTL problems indicate that the proposed algorithms: (a) automatically identify subgroups of related tasks whose training data appear to be drawn from similar distributions; and (b) are more accurate than simpler approaches such as single-task learning, pooling of data across all tasks, and simplified approximations to DP.
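To make the task-clustering idea concrete, the following is a minimal generative sketch in Python, not the authors' algorithm: a truncated stick-breaking construction of the DP draws a small set of shared logistic-regression weight vectors, each task is assigned to one of them, and labeled data are sampled per task. Tasks assigned to the same atom share a classifier. All names, sizes, and the truncation level are illustrative assumptions.

import numpy as np

rng = np.random.default_rng(0)

def stick_breaking(alpha, num_atoms):
    # Truncated stick-breaking weights for a DP with concentration alpha.
    betas = rng.beta(1.0, alpha, size=num_atoms)
    remainders = np.concatenate(([1.0], np.cumprod(1.0 - betas[:-1])))
    return betas * remainders

num_tasks, dim, trunc = 10, 2, 20        # hypothetical sizes
alpha = 1.0                              # DP concentration parameter

weights = stick_breaking(alpha, trunc)
weights /= weights.sum()                 # renormalize after truncation
atoms = rng.normal(size=(trunc, dim))    # shared classifier weight vectors
z = rng.choice(trunc, size=num_tasks, p=weights)   # task-to-atom assignments
task_w = atoms[z]                        # tasks sharing an atom share a classifier

def sample_task(w, n=50):
    # Draw features and labels from one task's logistic-regression model.
    X = rng.normal(size=(n, len(w)))
    p = 1.0 / (1.0 + np.exp(-X @ w))
    return X, (rng.random(n) < p).astype(int)

data = [sample_task(w) for w in task_w]
print("cluster assignment per task:", z)

Inference runs in the opposite direction: given only the per-task training sets, the paper's VB (SMTL) and MCMC (AMTL) procedures recover the task groupings and the shared classifiers; the sketch above only illustrates the assumed generative structure.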



Citations
Book

Machine Learning: A Probabilistic Perspective

TL;DR: This textbook offers a comprehensive and self-contained introduction to the field of machine learning, based on a unified, probabilistic approach, and is suitable for upper-level undergraduates with an introductory-level college math background and beginning graduate students.
Posted Content

An Overview of Multi-Task Learning in Deep Neural Networks

Sebastian Ruder
15 Jun 2017
TL;DR: This article seeks to help ML practitioners apply MTL by shedding light on how MTL works and providing guidelines for choosing appropriate auxiliary tasks, particularly in deep neural networks.
Journal Article

Convex multi-task feature learning

TL;DR: It is proved that the method for learning sparse representations shared across multiple tasks is equivalent to solving a convex optimization problem, for which an iterative algorithm that converges to an optimal solution is given.
Book

Dataset Shift in Machine Learning

TL;DR: This volume offers an overview of current efforts to deal with dataset shift and covariate shift, and places dataset shift in relationship to transfer learning, transduction, local learning, active learning, and semi-supervised learning.
Posted Content

A Survey on Multi-Task Learning

TL;DR: Multi-task learning (MTL), as surveyed in this paper, is a learning paradigm that aims to leverage useful information contained in multiple related tasks to improve the generalization performance of all the tasks.
References
Book

Monte Carlo Statistical Methods

TL;DR: This new edition contains five completely new chapters covering new developments; the first edition (1999) has sold 4300 copies worldwide.
Journal Article

Multitask Learning

TL;DR: Multitask learning (MTL), as presented in this paper, is an approach to inductive transfer that improves generalization by using the domain information contained in the training signals of related tasks as an inductive bias.
Journal Article

A Bayesian Analysis of Some Nonparametric Problems

TL;DR: In this article, a class of prior distributions called Dirichlet process priors is proposed, under which many nonparametric statistical problems can be treated, yielding results comparable to those of classical theory.
Journal Article

Primary, Secondary, and Meta-Analysis of Research

TL;DR: Meta-analysis of research, as discussed in this paper, is an important feature of the research and evaluation enterprise and has been widely used across many fields, especially in education research.
Journal Article

An introduction to variational methods for graphical models

TL;DR: This paper presents a tutorial introduction to the use of variational methods for inference and learning in graphical models (Bayesian networks and Markov random fields), and describes a general framework for generating variational transformations based on convex duality.