scispace - formally typeset
Open AccessProceedings ArticleDOI

Unified Deep Supervised Domain Adaptation and Generalization

Reads0
Chats0
TLDR
This work provides a unified framework for addressing the problem of visual supervised domain adaptation and generalization with deep models by reverting to point-wise surrogates of distribution distances and similarities by exploiting the Siamese architecture.
Abstract
This work provides a unified framework for addressing the problem of visual supervised domain adaptation and generalization with deep models. The main idea is to exploit the Siamese architecture to learn an embedding subspace that is discriminative, and where mapped visual domains are semantically aligned and yet maximally separated. The supervised setting becomes attractive especially when only few target data samples need to be labeled. In this scenario, alignment and separation of semantic probability distributions is difficult because of the lack of data. We found that by reverting to point-wise surrogates of distribution distances and similarities provides an effective solution. In addition, the approach has a high “speed” of adaptation, which requires an extremely low number of labeled target training samples, even one per category can be effective. The approach is extended to domain generalization. For both applications the experiments show very promising results.

read more

Citations
More filters
Posted Content

A Domain Generalization Perspective on Listwise Context Modeling

TL;DR: This article proposed Query-Invariant Listwise Context Modeling (QILCM) which eliminates the detrimental influence of inter-query variability by learning \textit{query-invariant} latent representations, such that the ranking system could generalize better to unseen queries.
Posted Content

Self-Guided Adaptation: Progressive Representation Alignment for Domain Adaptive Object Detection

TL;DR: Self-Guided Adaptation (SGA) as mentioned in this paper is proposed to align feature representation and transfer object detection models across domains while considering the instantaneous alignment difficulty, the core of SGA is to calculate "hardness" factors for sample pairs indicating domain distance in a kernel space, the proposed SGA adaptively indicates the importance of samples and assigns them different constrains.
Journal ArticleDOI

Domain Discrepancy Elimination and Mean Face Representation Learning for NIR-VIS Face Recognition

TL;DR: Wang et al. as discussed by the authors proposed a novel Domain discrepancy elimination and Mean face representation learning (DEMR) for NIR-VIS face recognition, which consists of two key components comprising Class-wise Domain Discrepancy Elimination (CDDE) and Cross-modal Mean Face Alignment (CMFA).
Posted Content

Efficient Video Understanding via Layered Multi Frame-Rate Analysis

TL;DR: A dual frame-rate system that brings in the best of both worlds: A modulator stream that executes an expensive models robust to environmental factors at a low frame rate to extract slowly changing features describing the environment, and a prediction stream thatexecute a light-weight model at real-time to extract transient signals that describes particularities of the current frame.
Journal ArticleDOI

Synthetic Depth Transfer for Monocular 3D Object Pose Estimation in the Wild

TL;DR: A deep convolutional neural network is proposed with an RGB-to-Depth Embedding module and a Synthetic-Real Adaptation module to extract RGB and depth features from a single RGB image with the help of synthetic RGB-depth image pairs for object pose estimation.
References
More filters
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.
Journal ArticleDOI

Gradient-based learning applied to document recognition

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.
Journal ArticleDOI

ImageNet Large Scale Visual Recognition Challenge

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.
Journal Article

Visualizing Data using t-SNE

TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.
Journal ArticleDOI

The Pascal Visual Object Classes (VOC) Challenge

TL;DR: The state-of-the-art in evaluated methods for both classification and detection are reviewed, whether the methods are statistically different, what they are learning from the images, and what the methods find easy or confuse.
Related Papers (5)