scispace - formally typeset
Open AccessProceedings ArticleDOI

A Robust Learning Approach to Domain Adaptive Object Detection

TLDR
In this article, the authors propose a robust object detection framework that is resilient to noise in bounding box class labels, locations and size annotations. But the model is trained on the target domain using a set of noisy object bounding boxes that are obtained by a detection model trained only in the source domain.
Abstract
Domain shift is unavoidable in real-world applications of object detection. For example, in self-driving cars, the target domain consists of unconstrained road environments which cannot all possibly be observed in training data. Similarly, in surveillance applications sufficiently representative training data may be lacking due to privacy regulations. In this paper, we address the domain adaptation problem from the perspective of robust learning and show that the problem may be formulated as training with noisy labels. We propose a robust object detection framework that is resilient to noise in bounding box class labels, locations and size annotations. To adapt to the domain shift, the model is trained on the target domain using a set of noisy object bounding boxes that are obtained by a detection model trained only in the source domain. We evaluate the accuracy of our approach in various source/target domain pairs and demonstrate that the model significantly improves the state-of-the-art on multiple domain adaptation scenarios on the SIM10K, Cityscapes and KITTI datasets.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Cross-domain Object Detection through Coarse-to-Fine Feature Adaptation

TL;DR: Zhang et al. as mentioned in this paper proposed a coarse-to-fine feature adaptation approach to cross-domain object detection, where foreground regions are extracted by adopting the attention mechanism, and aligned according to their marginal distributions via multi-layer adversarial learning in the common feature space.
Proceedings ArticleDOI

Cross-Domain Detection via Graph-Induced Prototype Alignment

TL;DR: A Graph-induced Prototype Alignment framework to seek for category-level domain alignment via elaborate prototype representations through graph-based information propagation among region proposals, and in order to alleviate the negative effect of class-imbalance on domain adaptation, a Class-reweighted Contrastive Loss is designed to harmonize the adaptation training process.
Book ChapterDOI

Domain Adaptive Object Detection via Asymmetric Tri-Way Faster-RCNN

TL;DR: This work proposes an asymmetric tri-way Faster-RCNN (ATF) for domain adaptive object detection, which has two distinct merits: 1) A ancillary net supervised by source label is deployed to learn anCillary target features and simultaneously preserve the discrimination of source domain, which enhances the structural discrimination of domain alignment.
Proceedings ArticleDOI

ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection

TL;DR: ST3D as discussed by the authors proposed a domain adaptive self-training pipeline, named ST3D, for unsupervised domain adaptation on 3D object detection from point clouds by pre-training the 3D detector on the source domain with a proposed random object scaling strategy for mitigating the negative effects of source domain bias.
Posted Content

Unbiased Mean Teacher for Cross-domain Object Detection

TL;DR: A new Unbiased Mean Teacher (UMT) model for cross-domain object detection is proposed, revealing that there often exists a considerable model bias for the simple mean teacher (MT) model in cross- domain scenarios, and eliminating the model bias with several simple yet highly effective strategies.
References
More filters
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Proceedings ArticleDOI

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Proceedings ArticleDOI

You Only Look Once: Unified, Real-Time Object Detection

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.
Proceedings ArticleDOI

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

TL;DR: RCNN as discussed by the authors combines CNNs with bottom-up region proposals to localize and segment objects, and when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost.
Book ChapterDOI

SSD: Single Shot MultiBox Detector

TL;DR: The approach, named SSD, discretizes the output space of bounding boxes into a set of default boxes over different aspect ratios and scales per feature map location, which makes SSD easy to train and straightforward to integrate into systems that require a detection component.
Related Papers (5)