What Matters For Meta-Learning Vision Regression Tasks?

doi:10.1109/CVPR52688.2022.01436

Proceedings ArticleDOI

What Matters For Meta-Learning Vision Regression Tasks?

Ni Gao, +4 more

- pp 14756-14766

Chats0

TLDR

This paper designs two new types of cross-category level vision regression tasks, namely object discovery and pose estimation of unprecedented complexity in the meta-learning domain for computer vision and proposes the addition of functional contrastive learning (FCL) over the task representations in Conditional Neural Processes (CNPs).

Abstract:

Meta-learning is widely used in few-shot classification and function regression due to its ability to quickly adapt to unseen tasks. However, it has not yet been well explored on regression tasks with high dimensional inputs such as images. This paper makes two main contributions that help understand this barely explored area. First, we design two new types of cross-category level vision regression tasks, namely object discovery and pose estimation of unprecedented complexity in the meta-learning domain for computer vision. To this end, we (i) exhaustively evaluate common meta-learning techniques on these tasks, and (ii) quantitatively analyze the effect of various deep learning techniques commonly used in recent meta-learning algorithms in order to strengthen the generalization capability: data augmentation, domain randomization, task augmentation and meta-regularization. Finally, we (iii) provide some insights and practical recommendations for training meta-learning algorithms on vision regression tasks. Second, we propose the addition of functional contrastive learning (FCL) over the task representations in Conditional Neural Processes (CNPs) and train in an end-to-end fashion. The experimental results show that the results of prior work are misleading as a consequence of a poor choice of the loss function as well as too small meta-training sets. Specifically, we find that CNPs outperform MAML on most tasks without fine-tuning. Furthermore, we observe that naive task augmentation without a tailored design results in underfitting.

What Matters For Meta-Learning Vision Regression Tasks?

Citations

C-Mixup: Improving Generalization in Regression

The Neural Process Family: Survey, Applications and Perspectives

Meta-Learning with Self-Improving Momentum Target

A Meta-Learning Approach for Few-Shot Face Forgery Segmentation and Classification

Better use of experience from other reservoirs for accurate production forecasting by learn-to-learn method

References

ImageNet: A large-scale hierarchical image database

Microsoft COCO: Common Objects in Context

The Pascal Visual Object Classes (VOC) Challenge

Are we ready for autonomous driving? The KITTI vision benchmark suite

A Simple Framework for Contrastive Learning of Visual Representations