Open Access · Posted Content

Pseudo-Representation Labeling Semi-Supervised Learning.

Song-Bo Yang, +1 more · 31 May 2020
TLDR
Pseudo-representation labeling is a simple and flexible framework that uses pseudo-labeling techniques to iteratively label small amounts of unlabeled data and add them to the training set; it outperforms current state-of-the-art semi-supervised learning methods on industrial classification problems such as the WM-811K wafer map and MIT-BIH Arrhythmia datasets.
Abstract
In recent years, semi-supervised learning (SSL) has shown tremendous success in leveraging unlabeled data to improve the performance of deep learning models, significantly reducing the demand for large amounts of labeled data. Many SSL techniques have been proposed and have shown promising performance on well-known datasets such as ImageNet and CIFAR-10. However, some existing techniques (especially those based on data augmentation) are empirically not well suited to industrial applications. Therefore, this work proposes pseudo-representation labeling, a simple and flexible framework that uses pseudo-labeling techniques to iteratively label a small amount of unlabeled data and add it to the training set. In addition, the framework is integrated with self-supervised representation learning, so the classifier benefits from representations learned on both labeled and unlabeled data. The framework is not tied to a specific model structure; it is a general technique for improving existing models. Compared with existing approaches, pseudo-representation labeling is more intuitive and can effectively solve practical problems in the real world. Empirically, it outperforms current state-of-the-art semi-supervised learning methods on industrial classification problems such as the WM-811K wafer map and MIT-BIH Arrhythmia datasets.
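The iterative loop described in the abstract — train, pseudo-label the most confident unlabeled points, add them to the training set, and repeat — can be sketched on a toy nearest-centroid classifier. All function names here are hypothetical illustrations, not the paper's implementation, and the confidence score is a simple distance heuristic chosen for the sketch.

```python
# Toy sketch of iterative pseudo-labeling (hypothetical names, 1-D data).

def train_centroids(data):
    """Fit one centroid per class from (x, y) pairs."""
    sums, counts = {}, {}
    for x, y in data:
        sums[y] = sums.get(y, 0.0) + x
        counts[y] = counts.get(y, 0) + 1
    return {y: sums[y] / counts[y] for y in sums}

def predict_with_confidence(centroids, x):
    """Return (label, confidence); confidence shrinks with distance."""
    label = min(centroids, key=lambda y: abs(x - centroids[y]))
    confidence = 1.0 / (1.0 + abs(x - centroids[label]))
    return label, confidence

def pseudo_label_rounds(labeled, unlabeled, rounds=3, threshold=0.5, per_round=2):
    """Each round: retrain, then move the most confident unlabeled
    points (above a confidence threshold) into the training set."""
    labeled, unlabeled = list(labeled), list(unlabeled)
    for _ in range(rounds):
        centroids = train_centroids(labeled)
        scored = sorted(
            ((x,) + predict_with_confidence(centroids, x) for x in unlabeled),
            key=lambda t: -t[2],
        )
        accepted = [(x, y) for x, y, c in scored[:per_round] if c >= threshold]
        if not accepted:
            break
        labeled.extend(accepted)
        taken = {x for x, _ in accepted}
        unlabeled = [x for x in unlabeled if x not in taken]
    return train_centroids(labeled)

centroids = pseudo_label_rounds(
    labeled=[(0.0, "a"), (10.0, "b")],
    unlabeled=[0.5, 1.0, 9.0, 9.5],
)
```

After a few rounds the centroids have absorbed the pseudo-labeled points, so nearby unseen inputs are classified consistently with the enlarged training set.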


Citations
Proceedings Article

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation

TL;DR: Xia et al. propose a cross-domain adaptive clustering loss that groups features of unlabeled target data into clusters and performs cluster-wise feature alignment across the source and target domains.
References
Posted Content

Colorful Image Colorization

TL;DR: The problem of hallucinating a plausible color version of a grayscale photograph is posed as a classification task, with class rebalancing at training time to increase the diversity of colors in the result.
Journal Article

Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning

TL;DR: Virtual adversarial training (VAT) is a regularization method based on the virtual adversarial loss, a measure of the local smoothness of the conditional label distribution given the input.
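The "local smoothness" idea behind VAT can be illustrated with a simplified sketch: penalize the KL divergence between the model's predictions on an input and on a slightly perturbed input. The full method estimates the worst-case perturbation direction via power iteration; this toy version uses a fixed perturbation and a one-parameter logistic model purely for illustration.

```python
# Simplified illustration of the virtual adversarial (smoothness) loss.
# Assumption: fixed perturbation instead of VAT's power-iteration direction.
import math

def predict(w, x):
    """Toy binary model: p(y=1|x) from a logistic on w*x."""
    p1 = 1.0 / (1.0 + math.exp(-w * x))
    return [1.0 - p1, p1]

def kl(p, q):
    """KL divergence between two discrete distributions."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))

def virtual_adversarial_loss(w, x, eps=0.1):
    """Smoothness penalty: how much p(y|x) changes under a small shift."""
    return kl(predict(w, x), predict(w, x + eps))
```

Near the decision boundary (x close to 0) a small input shift changes the predicted distribution a lot, so the penalty is large; far from the boundary it is nearly zero, which is exactly the smoothness the regularizer encourages.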
Proceedings Article

RandAugment: Practical Automated Data Augmentation with a Reduced Search Space

TL;DR: This work proposes a simplified search space that vastly reduces the computational expense of automated augmentation, and permits the removal of a separate proxy task.
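RandAugment's reduced search space boils down to two global parameters: how many transforms to apply (n) and a shared magnitude (m). A minimal sketch with toy numeric "transforms" standing in for image operations (the op list here is illustrative, not the paper's):

```python
# Toy sketch of RandAugment's two-parameter scheme (n transforms, magnitude m).
import random

def randaugment(x, ops, n=2, m=0.5, rng=random):
    """Apply n transforms chosen uniformly at random, each at magnitude m."""
    for _ in range(n):
        op = rng.choice(ops)
        x = op(x, m)
    return x

# Stand-in "augmentations" on a scalar; real ops would be image transforms.
ops = [lambda x, m: x + m, lambda x, m: x * (1 + m)]
```

Because only n and m remain to be tuned, a grid search replaces the expensive proxy-task optimization of earlier automated-augmentation methods.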
Proceedings Article

A Simple Weight Decay Can Improve Generalization

TL;DR: It is proven that weight decay has two effects in a linear network, and it is shown how to extend these results to networks with hidden layers and non-linear units.
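Weight decay in its simplest form adds a term proportional to the weight itself to each gradient update, shrinking weights toward zero. A minimal sketch of one SGD step (parameter names are illustrative):

```python
def sgd_step_with_weight_decay(w, grad, lr=0.1, decay=0.01):
    """One SGD update: loss gradient plus the weight-decay term decay * w."""
    return w - lr * (grad + decay * w)
```

With a zero loss gradient the weight still shrinks slightly each step, which is the regularizing pull toward smaller weights that the paper analyzes.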
Posted Content

mixup: Beyond Empirical Risk Minimization

TL;DR: Mixup trains a neural network on convex combinations of pairs of examples and their labels, regularizing the network to favor simple linear behavior between training examples; this improves the generalization of state-of-the-art neural network architectures.
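The convex-combination step at the heart of mixup is short enough to sketch directly: draw a mixing weight from a Beta distribution and blend both the inputs and the one-hot labels with it (feature vectors here are plain lists for illustration).

```python
# Sketch of mixup's core step: convex combination of two labeled examples.
import random

def mixup_pair(x1, y1, x2, y2, alpha=0.2, rng=random):
    """Blend two examples and their one-hot labels with lam ~ Beta(alpha, alpha)."""
    lam = rng.betavariate(alpha, alpha)
    x = [lam * a + (1 - lam) * b for a, b in zip(x1, x2)]
    y = [lam * a + (1 - lam) * b for a, b in zip(y1, y2)]
    return x, y
```

The mixed label is a soft target that interpolates the two classes, which is what pushes the network toward linear behavior between training examples.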