Constructing informative priors using transfer learning
Rajat Raina, Andrew Y. Ng, Daphne Koller
pp. 713–720
TL;DR: An algorithm for automatically constructing a multivariate Gaussian prior with a full covariance matrix for a given supervised learning task, which relaxes a commonly used but overly simplistic independence assumption and allows parameters to be dependent.
Abstract:
Many applications of supervised learning require good generalization from limited labeled data. In the Bayesian setting, we can try to achieve this goal by using an informative prior over the parameters, one that encodes useful domain knowledge. Focusing on logistic regression, we present an algorithm for automatically constructing a multivariate Gaussian prior with a full covariance matrix for a given supervised learning task. This prior relaxes a commonly used but overly simplistic independence assumption, and allows parameters to be dependent. The algorithm uses other "similar" learning problems to estimate the covariance of pairs of individual parameters. We then use a semidefinite program to combine these estimates and learn a good prior for the current learning task. We apply our methods to binary text classification, and demonstrate a 20 to 40% test error reduction over a commonly used prior.
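The core of the setup above is MAP estimation for logistic regression under a multivariate Gaussian prior whose full covariance couples the parameters. A minimal sketch, assuming a hand-built covariance matrix in place of the paper's SDP-derived one (the toy data, learning rate, and gradient-ascent fit are all illustrative assumptions, not the authors' implementation):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def map_logistic_regression(X, y, mu, Sigma, lr=0.1, iters=500):
    """Maximize log-likelihood(w) + log N(w; mu, Sigma) by gradient ascent."""
    Sigma_inv = np.linalg.inv(Sigma)
    w = mu.copy()
    for _ in range(iters):
        p = sigmoid(X @ w)
        # Gradient of the Bernoulli log-likelihood plus the Gaussian log-prior.
        grad = X.T @ (y - p) - Sigma_inv @ (w - mu)
        w += lr * grad / len(y)
    return w

# Toy data: two features whose weights the prior believes are positively
# correlated (off-diagonal 0.8), i.e. a dependent-parameter prior.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
true_w = np.array([1.0, 1.0])
y = (sigmoid(X @ true_w) > rng.random(200)).astype(float)

mu = np.zeros(2)
Sigma = np.array([[1.0, 0.8],
                  [0.8, 1.0]])   # illustrative full covariance, not the SDP output
w_hat = map_logistic_regression(X, y, mu, Sigma)
```

With an identity `Sigma` this reduces to ordinary L2-regularized logistic regression; the off-diagonal terms are what let "similar" parameters shrink toward each other rather than independently toward zero.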
Citations
Journal Article
A Survey on Transfer Learning
Sinno Jialin Pan, Qiang Yang
TL;DR: Discusses the relationship between transfer learning and other related machine learning techniques such as domain adaptation, multitask learning, sample selection bias, and covariate shift.
Journal Article
A survey of machine learning for big data processing
TL;DR: A literature survey of the latest advances in machine learning research for big data processing, identifying promising methods from recent studies such as representation learning, deep learning, distributed and parallel learning, transfer learning, active learning, and kernel-based learning.
Proceedings Article
Transfer defect learning
TL;DR: Applies a state-of-the-art transfer learning approach to make feature distributions in source and target projects similar, and proposes a novel transfer defect learning approach, TCA+, by extending TCA.
Proceedings Article
Transferring naive bayes classifiers for text classification
TL;DR: Proposes a novel transfer-learning algorithm for text classification based on an EM-based naive Bayes classifier, and shows that it outperforms traditional supervised and semi-supervised learning algorithms when the distributions of the training and test sets are increasingly different.
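The EM-based naive Bayes transfer scheme summarized above can be sketched as: fit naive Bayes on labeled source-domain text, then run EM over unlabeled target-domain documents to adapt the parameters toward the target distribution. Everything below (Bernoulli features, Laplace smoothing, the toy corpus) is an illustrative assumption in the spirit of that idea, not the authors' code:

```python
import numpy as np

def nb_fit(X, y, n_classes=2, alpha=1.0):
    """Bernoulli naive Bayes with Laplace smoothing."""
    priors, probs = [], []
    for c in range(n_classes):
        Xc = X[y == c]
        priors.append((len(Xc) + alpha) / (len(X) + n_classes * alpha))
        probs.append((Xc.sum(0) + alpha) / (len(Xc) + 2 * alpha))
    return np.log(priors), np.array(probs)

def nb_posterior(X, log_prior, probs):
    """Class posteriors P(c | x) for binary feature vectors."""
    log_lik = X @ np.log(probs).T + (1 - X) @ np.log(1 - probs).T
    joint = log_lik + log_prior
    joint -= joint.max(1, keepdims=True)          # stabilize the softmax
    post = np.exp(joint)
    return post / post.sum(1, keepdims=True)

def transfer_nb(X_src, y_src, X_tgt, n_iter=5):
    """Initialize on labeled source data, then EM on unlabeled target data."""
    log_prior, probs = nb_fit(X_src, y_src)
    for _ in range(n_iter):
        post = nb_posterior(X_tgt, log_prior, probs)          # E-step
        counts = post.sum(0)
        log_prior = np.log((counts + 1) / (len(X_tgt) + 2))   # M-step
        probs = (post.T @ X_tgt + 1) / (counts[:, None] + 2)
    return log_prior, probs

# Toy corpus: feature 0 marks class 0, feature 1 marks class 1.
X_src = np.array([[1., 0.], [1., 0.], [0., 1.], [0., 1.]])
y_src = np.array([0, 0, 1, 1])
X_tgt = np.array([[1., 0.], [0., 1.], [1., 0.], [0., 1.]])
log_prior, probs = transfer_nb(X_src, y_src, X_tgt)
pred = nb_posterior(np.array([[1., 0.], [0., 1.]]), log_prior, probs).argmax(1)
```

The source-trained parameters anchor the class identities, so the EM iterations refine rather than relabel the classes, which is what makes the scheme usable when source and target distributions differ.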
Posted Content
A Convex Formulation for Learning Task Relationships in Multi-Task Learning
Yu Zhang, Dit-Yan Yeung
TL;DR: This paper proposes a regularization formulation for learning the relationships between tasks in multi-task learning, called MTRL, which can also describe negative task correlation and identify outlier tasks based on the same underlying principle.
References
Journal Article
WordNet: a lexical database for English
TL;DR: WordNet is an online lexical database designed for use under program control, providing a more effective combination of traditional lexicographic information and modern computing.
Journal Article
Bootstrap Methods: Another Look at the Jackknife
TL;DR: In this article, the authors discuss the problem of estimating the sampling distribution of a pre-specified random variable R(X, F) on the basis of the observed data x.
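The bootstrap idea summarized above can be sketched in a few lines: approximate the sampling distribution of a statistic R(X, F) by recomputing it on resamples of the observed data drawn with replacement. The choice of statistic (the mean) and the toy data are illustrative assumptions:

```python
import numpy as np

def bootstrap_se(data, statistic, n_boot=2000, seed=0):
    """Bootstrap estimate of the standard error of `statistic`."""
    rng = np.random.default_rng(seed)
    n = len(data)
    reps = np.array([
        statistic(rng.choice(data, size=n, replace=True))  # resample with replacement
        for _ in range(n_boot)
    ])
    return reps.std(ddof=1)

data = np.array([2.1, 3.4, 1.9, 4.0, 2.8, 3.3, 2.5, 3.1])
se = bootstrap_se(data, np.mean)
```

For the mean, the bootstrap standard error should come out close to the analytic s/√n, which is a quick sanity check when trying the sketch on other statistics.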
Proceedings Article
On Spectral Clustering: Analysis and an algorithm
TL;DR: A simple spectral clustering algorithm that can be implemented using a few lines of Matlab is presented, and tools from matrix perturbation theory are used to analyze the algorithm, and give conditions under which it can be expected to do well.
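The "few lines of Matlab" algorithm described above follows a standard recipe: build a Gaussian affinity matrix, normalize it symmetrically, embed each point as a renormalized row of the top-k eigenvectors, then cluster the rows with k-means. A Python sketch of that recipe (the affinity width, farthest-point k-means initialization, and toy blobs are illustrative assumptions):

```python
import numpy as np

def spectral_clustering(X, k, sigma=1.0, n_iter=50):
    # 1. Gaussian affinity matrix A with zeroed diagonal.
    sq = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    A = np.exp(-sq / (2.0 * sigma ** 2))
    np.fill_diagonal(A, 0.0)
    # 2. Symmetric normalization: L = D^{-1/2} A D^{-1/2}.
    d_inv_sqrt = 1.0 / np.sqrt(A.sum(1))
    L = A * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    # 3. Stack the top-k eigenvectors; renormalize each row to unit length.
    _, vecs = np.linalg.eigh(L)        # eigenvalues ascending, so take last k
    V = vecs[:, -k:]
    V = V / np.linalg.norm(V, axis=1, keepdims=True)
    # 4. k-means on the rows: greedy farthest-point init, then Lloyd steps.
    centers = [V[0]]
    for _ in range(k - 1):
        d2 = ((V[:, None, :] - np.array(centers)[None]) ** 2).sum(-1).min(1)
        centers.append(V[d2.argmax()])
    centers = np.array(centers)
    for _ in range(n_iter):
        labels = ((V[:, None, :] - centers[None]) ** 2).sum(-1).argmin(1)
        for j in range(k):
            if (labels == j).any():
                centers[j] = V[labels == j].mean(0)
    return labels

# Two well-separated 2-D blobs; the algorithm should recover them.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0.0, 0.1, (20, 2)),
               rng.normal(5.0, 0.1, (20, 2))])
labels = spectral_clustering(X, 2)
```

The row renormalization in step 3 is the detail the perturbation analysis leans on: for well-separated clusters the embedded rows collapse to k nearly orthogonal points on the unit sphere, which makes the final k-means step easy.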
Book
Spectral Graph Theory
TL;DR: Eigenvalues and the Laplacian of a graph; isoperimetric problems; diameters and eigenvalues; paths, flows, and routing; eigenvalues and quasi-randomness.