Confidence-and-Refinement Adaptation Model for Cross-Domain Semantic Segmentation
TLDR
A novel multi-level UDA model named the Confidence-and-Refinement Adaptation Model (CRAM) combines a confidence-aware entropy alignment (CEA) module with a style feature alignment (SFA) module, and achieves performance comparable to existing state-of-the-art works with advantages in simplicity and convergence speed.
Abstract
With the rapid development of convolutional neural networks (CNNs), significant progress has been achieved in semantic segmentation. Despite this success, such deep learning approaches require large-scale real-world datasets with pixel-level annotations. Because pixel-level labeling of semantics is extremely laborious, many researchers turn to synthetic data with free annotations. However, due to the clear domain gap, a segmentation model trained on synthetic images tends to perform poorly on real-world datasets. Unsupervised domain adaptation (UDA) for semantic segmentation, which aims at alleviating this domain discrepancy, has recently gained increasing research attention. Existing methods in this scope either simply align features or outputs across the source and target domains, or must deal with complex image processing and post-processing problems. In this work, we propose a novel multi-level UDA model named the Confidence-and-Refinement Adaptation Model (CRAM), which contains a confidence-aware entropy alignment (CEA) module and a style feature alignment (SFA) module. Through CEA, adaptation is performed locally via adversarial learning in the output space, making the segmentation model focus on high-confidence predictions. Furthermore, to enhance model transfer in the shallow feature space, the SFA module is applied to minimize the appearance gap across domains. Experiments on two challenging UDA benchmarks, “GTA5-to-Cityscapes” and “SYNTHIA-to-Cityscapes”, demonstrate the effectiveness of CRAM. We achieve performance comparable to existing state-of-the-art works, with advantages in simplicity and convergence speed.
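The CEA idea rests on per-pixel prediction entropy in the output space: low-entropy pixels are treated as high-confidence. As a rough illustration only (a generic sketch of entropy-based output-space adaptation, not the authors' CEA implementation; the 19-class shape and the 0.9 threshold below are illustrative assumptions), the following NumPy snippet computes a normalized entropy map and a confidence mask:

```python
import numpy as np

def entropy_map(logits):
    """Per-pixel normalized Shannon entropy of softmax predictions.

    logits: (C, H, W) array of raw class scores.
    Returns an (H, W) map in [0, 1]; high values mark uncertain pixels.
    """
    z = logits - logits.max(axis=0, keepdims=True)      # numerical stability
    p = np.exp(z) / np.exp(z).sum(axis=0, keepdims=True)
    ent = -(p * np.log(p + 1e-12)).sum(axis=0)          # Shannon entropy per pixel
    return ent / np.log(logits.shape[0])                # normalize by log C

# A confidence mask keeps only the low-entropy (high-confidence) pixels:
logits = np.random.randn(19, 4, 4)      # 19 classes, as in Cityscapes
conf_mask = entropy_map(logits) < 0.9   # hypothetical confidence threshold
```

In an adversarial setup, such a map (or mask) would weight where the discriminator's alignment signal is applied, so the model attends to confident regions first.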
Citations
Journal Article (DOI)
Threshold-Adaptive Unsupervised Focal Loss for Domain Adaptation of Semantic Segmentation
TL;DR: A two-stage entropy-based UDA method for semantic segmentation: the first stage introduces a threshold-adaptive unsupervised focal loss to regularize predictions in the target domain, and the second stage employs a data augmentation method named cross-domain image mixing (CIM) to bridge semantic knowledge between the two domains.
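The general idea of an unsupervised focal loss can be sketched as follows: pixels whose top softmax probability clears a threshold are treated as pseudo-labels, and a focal factor down-weights pixels the model already finds easy. The fixed `threshold` and `gamma` below are illustrative placeholders, not the paper's adaptive schedule:

```python
import numpy as np

def unsupervised_focal_loss(probs, threshold=0.8, gamma=2.0):
    """Focal-style self-training loss on confident target pixels.

    probs: (C, H, W) softmax outputs on an unlabeled target image.
    Pixels whose max probability exceeds `threshold` act as pseudo-labels;
    the (1 - p)^gamma factor down-weights already-confident pixels.
    """
    p_max = probs.max(axis=0)              # top class probability per pixel
    mask = p_max > threshold               # confident pixels only
    if not mask.any():
        return 0.0
    p = p_max[mask]
    loss = -((1.0 - p) ** gamma) * np.log(p)
    return float(loss.mean())
```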
Journal Article (DOI)
Combining Pixel-Level and Structure-Level Adaptation for Semantic Segmentation
Journal Article (DOI)
Category-Level Adversaries for Outdoor LiDAR Point Clouds Cross-Domain Semantic Segmentation
TL;DR: In this article, a multi-scale domain-conditioned block is proposed to extract critical low-level domain-dependent knowledge and reduce the domain gap caused by distinct LiDAR sampling patterns.
References
Proceedings Article (DOI)
Deep Residual Learning for Image Recognition
TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously; this approach won first place in the ILSVRC 2015 classification task.
Proceedings Article (DOI)
ImageNet: A large-scale hierarchical image database
TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, which is much larger in scale and diversity, and more accurate, than current image datasets.
Proceedings Article (DOI)
Fully convolutional networks for semantic segmentation
TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.
Journal Article (DOI)
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
TL;DR: Quantitative assessments show that SegNet provides good performance with competitive inference time and the most efficient inference memory usage compared to other architectures, including FCN and DeconvNet.
Journal Article (DOI)
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
TL;DR: This work addresses semantic image segmentation with deep learning, proposes atrous spatial pyramid pooling (ASPP) to robustly segment objects at multiple scales, and improves the localization of object boundaries by combining methods from DCNNs and probabilistic graphical models.
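Atrous (dilated) convolution, the building block of ASPP, enlarges the receptive field by sampling the input with gaps, without adding parameters. A minimal 1-D NumPy analogue (illustrative only, not DeepLab's 2-D implementation):

```python
import numpy as np

def dilated_conv1d(x, w, rate):
    """1-D convolution with dilation `rate`: taps of w are spaced
    `rate` samples apart, giving an effective receptive field of
    (len(w) - 1) * rate + 1 without extra weights."""
    k = len(w)
    span = (k - 1) * rate + 1                 # effective receptive field
    out = np.empty(len(x) - span + 1)
    for i in range(len(out)):
        out[i] = sum(w[j] * x[i + j * rate] for j in range(k))
    return out
```

ASPP applies several such convolutions in parallel with different rates and fuses the results, so one layer sees objects at multiple scales.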