A Robust Learning Approach to Domain Adaptive Object Detection

doi:10.1109/ICCV.2019.00057

Open AccessProceedings ArticleDOI

A Robust Learning Approach to Domain Adaptive Object Detection

- pp 480-490

TLDR

In this article, the authors propose a robust object detection framework that is resilient to noise in bounding box class labels, locations and size annotations. But the model is trained on the target domain using a set of noisy object bounding boxes that are obtained by a detection model trained only in the source domain.

Abstract:

Domain shift is unavoidable in real-world applications of object detection. For example, in self-driving cars, the target domain consists of unconstrained road environments which cannot all possibly be observed in training data. Similarly, in surveillance applications sufficiently representative training data may be lacking due to privacy regulations. In this paper, we address the domain adaptation problem from the perspective of robust learning and show that the problem may be formulated as training with noisy labels. We propose a robust object detection framework that is resilient to noise in bounding box class labels, locations and size annotations. To adapt to the domain shift, the model is trained on the target domain using a set of noisy object bounding boxes that are obtained by a detection model trained only in the source domain. We evaluate the accuracy of our approach in various source/target domain pairs and demonstrate that the model significantly improves the state-of-the-art on multiple domain adaptation scenarios on the SIM10K, Cityscapes and KITTI datasets.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Cross-domain Object Detection through Coarse-to-Fine Feature Adaptation

Yangtao Zheng, +3 more

TL;DR: Zhang et al. as mentioned in this paper proposed a coarse-to-fine feature adaptation approach to cross-domain object detection, where foreground regions are extracted by adopting the attention mechanism, and aligned according to their marginal distributions via multi-layer adversarial learning in the common feature space.

...read moreread less

Proceedings ArticleDOI

Cross-Domain Detection via Graph-Induced Prototype Alignment

Minghao Xu, +4 more

TL;DR: A Graph-induced Prototype Alignment framework to seek for category-level domain alignment via elaborate prototype representations through graph-based information propagation among region proposals, and in order to alleviate the negative effect of class-imbalance on domain adaptation, a Class-reweighted Contrastive Loss is designed to harmonize the adaptation training process.

...read moreread less

Book ChapterDOI

Domain Adaptive Object Detection via Asymmetric Tri-Way Faster-RCNN

Zhenwei He, +1 more

TL;DR: This work proposes an asymmetric tri-way Faster-RCNN (ATF) for domain adaptive object detection, which has two distinct merits: 1) A ancillary net supervised by source label is deployed to learn anCillary target features and simultaneously preserve the discrimination of source domain, which enhances the structural discrimination of domain alignment.

...read moreread less

Proceedings ArticleDOI

ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection

Jihan Yang, +4 more

TL;DR: ST3D as discussed by the authors proposed a domain adaptive self-training pipeline, named ST3D, for unsupervised domain adaptation on 3D object detection from point clouds by pre-training the 3D detector on the source domain with a proposed random object scaling strategy for mitigating the negative effects of source domain bias.

...read moreread less

Posted Content

Unbiased Mean Teacher for Cross-domain Object Detection

Jinhong Deng, +3 more

- 02 Mar 2020 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: A new Unbiased Mean Teacher (UMT) model for cross-domain object detection is proposed, revealing that there often exists a considerable model bias for the simple mean teacher (MT) model in cross- domain scenarios, and eliminating the model bias with several simple yet highly effective strategies.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

Proceedings ArticleDOI

Histograms of oriented gradients for human detection

Navneet Dalal, +1 more

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.

...read moreread less

Proceedings ArticleDOI

You Only Look Once: Unified, Real-Time Object Detection

Joseph Redmon, +3 more

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

...read moreread less

Proceedings ArticleDOI

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Ross Girshick, +3 more

TL;DR: RCNN as discussed by the authors combines CNNs with bottom-up region proposals to localize and segment objects, and when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost.

...read moreread less

Book ChapterDOI

SSD: Single Shot MultiBox Detector

Wei Liu, +6 more

TL;DR: The approach, named SSD, discretizes the output space of bounding boxes into a set of default boxes over different aspect ratios and scales per feature map location, which makes SSD easy to train and straightforward to integrate into systems that require a detection component.

...read moreread less

Collapse

A Robust Learning Approach to Domain Adaptive Object Detection

Citations

Cross-domain Object Detection through Coarse-to-Fine Feature Adaptation

Cross-Domain Detection via Graph-Induced Prototype Alignment

Domain Adaptive Object Detection via Asymmetric Tri-Way Faster-RCNN

ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection

Unbiased Mean Teacher for Cross-domain Object Detection

References

ImageNet: A large-scale hierarchical image database

Histograms of oriented gradients for human detection

You Only Look Once: Unified, Real-Time Object Detection

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

SSD: Single Shot MultiBox Detector

Related Papers (5)

Deep Residual Learning for Image Recognition

SSD: Single Shot MultiBox Detector

The Cityscapes Dataset for Semantic Urban Scene Understanding

You Only Look Once: Unified, Real-Time Object Detection

Fast R-CNN