Gradient-Based Meta-Learning with Learned Layerwise Metric and Subspace

Open AccessProceedings Article

Gradient-Based Meta-Learning with Learned Layerwise Metric and Subspace

- pp 2927-2936

TLDR

In this article, a task-specific learner of an EMMT-net performs gradient descent with respect to a meta-learned distance metric, which warps the activation space to be more sensitive to task identity.

Abstract:

Gradient-based meta-learning methods leverage gradient descent to learn the commonalities among various tasks. While previous such methods have been successful in meta-learning tasks, they resort to simple gradient descent during meta-testing. Our primary contribution is the {\em MT-net}, which enables the meta-learner to learn on each layer's activation space a subspace that the task-specific learner performs gradient descent on. Additionally, a task-specific learner of an {\em MT-net} performs gradient descent with respect to a meta-learned distance metric, which warps the activation space to be more sensitive to task identity. We demonstrate that the dimension of this learned subspace reflects the complexity of the task-specific learner's adaptation task, and also that our model is less sensitive to the choice of initial learning rates than previous gradient-based meta-learning methods. Our method achieves state-of-the-art or comparable performance on few-shot classification and regression tasks.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Generalizing from a Few Examples: A Survey on Few-shot Learning

Yaqing Wang, +3 more

- 12 Jun 2020 -

ACM Computing Surveys

TL;DR: A thorough survey to fully understand Few-shot Learning (FSL), and categorizes FSL methods from three perspectives: data, which uses prior knowledge to augment the supervised experience; model, which used to reduce the size of the hypothesis space; and algorithm, which using prior knowledgeto alter the search for the best hypothesis in the given hypothesis space.

...read moreread less

Posted Content

Generalizing from a Few Examples: A Survey on Few-Shot Learning

Yaqing Wang, +3 more

- 10 Apr 2019 -

arXiv: Learning

TL;DR: A thorough survey to fully understand Few-Shot Learning (FSL), and categorizes FSL methods from three perspectives: data, which uses prior knowledge to augment the supervised experience; model, which used to reduce the size of the hypothesis space; and algorithm, which using prior knowledgeto alter the search for the best hypothesis in the given hypothesis space.

...read moreread less

Posted Content

Meta-Learning in Neural Networks: A Survey

Timothy M. Hospedales, +3 more

- 11 Apr 2020 -

arXiv: Learning

TL;DR: A new taxonomy is proposed that provides a more comprehensive breakdown of the space of meta-learning methods today, including few-shot learning, reinforcement learning and architecture search, and promising applications and successes.

...read moreread less

Posted Content

Meta-Learning with Latent Embedding Optimization

Andrei Rusu, +6 more

- 16 Jul 2018 -

arXiv: Learning

TL;DR: In this article, a data-dependent latent generative representation of model parameters is learned and a gradient-based meta-learning is performed in a low-dimensional latent space for few-shot learning.

...read moreread less

Proceedings ArticleDOI

Meta-Transfer Learning for Few-Shot Learning

Qianru Sun, +3 more

TL;DR: In this paper, the authors proposed a meta-transfer learning approach to adapt a base-learner to a new task for which only a few labeled samples are available, which learns scaling and shifting functions of DNN weights for each task.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

Proceedings Article

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, +1 more

TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.

...read moreread less

Journal ArticleDOI

A Stochastic Approximation Method

Herbert Robbins, +1 more

- 01 Sep 1951 -

Annals of Mathematical Statistics

TL;DR: In this article, a method for making successive experiments at levels x1, x2, ··· in such a way that xn will tend to θ in probability is presented.

...read moreread less

Collapse

Gradient-Based Meta-Learning with Learned Layerwise Metric and Subspace

Citations

Generalizing from a Few Examples: A Survey on Few-shot Learning

Generalizing from a Few Examples: A Survey on Few-Shot Learning

Meta-Learning in Neural Networks: A Survey

Meta-Learning with Latent Embedding Optimization

Meta-Transfer Learning for Few-Shot Learning

References

Adam: A Method for Stochastic Optimization

ImageNet Classification with Deep Convolutional Neural Networks

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Human-level control through deep reinforcement learning

A Stochastic Approximation Method

Related Papers (5)

Prototypical Networks for Few-shot Learning

Matching networks for one shot learning

Model-agnostic meta-learning for fast adaptation of deep networks

Optimization as a Model for Few-Shot Learning

Learning to Compare: Relation Network for Few-Shot Learning