scispace - formally typeset
Proceedings ArticleDOI

What Matters For Meta-Learning Vision Regression Tasks?

Reads0
Chats0
TLDR
This paper designs two new types of cross-category level vision regression tasks, namely object discovery and pose estimation of unprecedented complexity in the meta-learning domain for computer vision and proposes the addition of functional contrastive learning (FCL) over the task representations in Conditional Neural Processes (CNPs).
Abstract
Meta-learning is widely used in few-shot classification and function regression due to its ability to quickly adapt to unseen tasks. However, it has not yet been well explored on regression tasks with high dimensional inputs such as images. This paper makes two main contributions that help understand this barely explored area. First, we design two new types of cross-category level vision regression tasks, namely object discovery and pose estimation of unprecedented complexity in the meta-learning domain for computer vision. To this end, we (i) exhaustively evaluate common meta-learning techniques on these tasks, and (ii) quantitatively analyze the effect of various deep learning techniques commonly used in recent meta-learning algorithms in order to strengthen the generalization capability: data augmentation, domain randomization, task augmentation and meta-regularization. Finally, we (iii) provide some insights and practical recommendations for training meta-learning algorithms on vision regression tasks. Second, we propose the addition of functional contrastive learning (FCL) over the task representations in Conditional Neural Processes (CNPs) and train in an end-to-end fashion. The experimental results show that the results of prior work are misleading as a consequence of a poor choice of the loss function as well as too small meta-training sets. Specifically, we find that CNPs outperform MAML on most tasks without fine-tuning. Furthermore, we observe that naive task augmentation without a tailored design results in underfitting.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

C-Mixup: Improving Generalization in Regression

TL;DR: A simple yet powerful algorithm, C-Mixup, to improve generalization on regression tasks by selectively interpolating examples with similar labels that mitigates the effects of domain-associated information and yields domain-invariant representations.
Journal ArticleDOI

The Neural Process Family: Survey, Applications and Perspectives

TL;DR: A comprehensive survey of NPF models is needed to organize and relate their motivation, methodology, and experiments and shed light on their potential to bring several recent advances in other deep learning domains under one umbrella.
Proceedings ArticleDOI

Meta-Learning with Self-Improving Momentum Target

TL;DR: This work proposes a simple yet effective method, coined Self-improving Momentum Target (SiMT), which generates the target model by adapting from the temporal ensemble of the meta-learner, i.e., the momentum network, and demonstrates that SiMT brings a significant performance gain when combined with a wide range of meta-learning methods under various applications.
Journal ArticleDOI

A Meta-Learning Approach for Few-Shot Face Forgery Segmentation and Classification

Yih-Kai Lin, +1 more
- 31 Mar 2023 - 
TL;DR: In this article , a meta-learning approach was adopted to develop a highly adaptive detector for detecting new forging techniques, making it possible to fine-tune the detector with only a few new forged samples.
Journal ArticleDOI

Better use of experience from other reservoirs for accurate production forecasting by learn-to-learn method

TL;DR: Wang et al. as mentioned in this paper proposed a learn-to-learn method to predict future production indices for the target reservoir by only training with very limited samples collected from reservoirs in the same class.
References
More filters
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Book ChapterDOI

Microsoft COCO: Common Objects in Context

TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.
Journal ArticleDOI

The Pascal Visual Object Classes (VOC) Challenge

TL;DR: The state-of-the-art in evaluated methods for both classification and detection are reviewed, whether the methods are statistically different, what they are learning from the images, and what the methods find easy or confuse.
Proceedings ArticleDOI

Are we ready for autonomous driving? The KITTI vision benchmark suite

TL;DR: The autonomous driving platform is used to develop novel challenging benchmarks for the tasks of stereo, optical flow, visual odometry/SLAM and 3D object detection, revealing that methods ranking high on established datasets such as Middlebury perform below average when being moved outside the laboratory to the real world.
Posted Content

A Simple Framework for Contrastive Learning of Visual Representations

TL;DR: It is shown that composition of data augmentations plays a critical role in defining effective predictive tasks, and introducing a learnable nonlinear transformation between the representation and the contrastive loss substantially improves the quality of the learned representations, and contrastive learning benefits from larger batch sizes and more training steps compared to supervised learning.