Selective Classification for Deep Neural Networks

Open AccessProceedings Article

Selective Classification for Deep Neural Networks

Yonatan Geifman, +1 more

- Vol. 30, pp 4878-4887

Chats0

TLDR

A method to construct a selective classifier given a trained neural network, which allows a user to set a desired risk level and the classifier rejects instances as needed, to grant the desired risk (with high probability).

Abstract:

Selective classification techniques (also known as reject option) have not yet been considered in the context of deep neural networks (DNNs). These techniques can potentially significantly improve DNNs prediction performance by trading-off coverage. In this paper we propose a method to construct a selective classifier given a trained neural network. Our method allows a user to set a desired risk level. At test time, the classifier rejects instances as needed, to grant the desired risk (with high probability). Empirical results over CIFAR and ImageNet convincingly demonstrate the viability of our method, which opens up possibilities to operate DNNs in mission-critical applications. For example, using our method an unprecedented 2% error in top-5 ImageNet classification can be guaranteed with probability 99.9%, with almost 60% test coverage.

Citations

PDF

Open Access

More filters

Posted Content

Leveraging Uncertainty in Deep Learning for Selective Classification

Mehmet Yigit Yildirim, +2 more

- 23 May 2019 -

arXiv: Learning

TL;DR: This study proposes a mixed-integer programming framework for classification with reject option (also known as selective classification), that investigates and combines model uncertainty and predictive mean to identify optimal classification and rejection regions.

...read moreread less

Proceedings ArticleDOI

Risk-Controlled Selective Prediction for Regression Deep Neural Network Models

Wenming Jiang, +2 more

TL;DR: This paper considered the selective regression problem from a risk-coverage point of view, and proposed a method to construct a selective regression model given a trained regression DNN model and a desired regression error risk, and utilized blending variance to quantify uncertainty in regression NNs.

...read moreread less

Proceedings ArticleDOI

Knowing the No-match: Entity Alignment with Dangling Cases

Zequn Sun, +2 more

TL;DR: In this article, a multi-task learning framework for both entity alignment and dangling entity detection is proposed, which can opt to abstain from predicting alignment for the detected dangling entities.

...read moreread less

Posted Content

$\mathcal{G}$-Distillation: Reducing Overconfident Errors on Novel Samples

Zhizhong Li, +1 more

TL;DR: A simple solution that reduces overconfident errors of samples from an unknown novel distribution without increasing evaluation time is proposed: train an ensemble of classifiers and then distill into a single model using both labeled and unlabeled examples.

...read moreread less

Posted Content

Consistent Accelerated Inference via Confident Adaptive Transformers

Tal Schuster, +3 more

- 18 Apr 2021 -

arXiv: Computation and Language

TL;DR: In this paper, the authors develop a novel approach for confidently accelerating inference in the large and expensive multilayer Transformers that are now ubiquitous in natural language processing (NLP) that can increase efficiency but can come with unpredictable performance costs.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

...read moreread less

Journal ArticleDOI

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, +11 more

- 01 Dec 2015 -

International Journal of Computer Vision

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.

...read moreread less

Proceedings ArticleDOI

Very deep convolutional neural network based image classification using small training sample size

Shuying Liu, +1 more

TL;DR: In this article, a modified VGG-16 network was used to fit CIFAR-10 without severe overfitting and achieved 8.45% error rate on the dataset.

...read moreread less

Journal ArticleDOI

An optimum character recognition system using decision functions

C. K. Chow

- 01 Dec 1957 -

Ire Transactions on Electronic Computers

TL;DR: The character recognition problem, usually resulting from characters being corrupted by printing deterioration and/or inherent noise of the devices, is considered from the viewpoint of statistical decision theory and the optimum recogition is obtained.

...read moreread less

Selective Classification for Deep Neural Networks

Citations

Leveraging Uncertainty in Deep Learning for Selective Classification

Risk-Controlled Selective Prediction for Regression Deep Neural Network Models

Knowing the No-match: Entity Alignment with Dangling Cases

$\mathcal{G}$-Distillation: Reducing Overconfident Errors on Novel Samples

Consistent Accelerated Inference via Confident Adaptive Transformers

References

Deep Residual Learning for Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

ImageNet Large Scale Visual Recognition Challenge

Very deep convolutional neural network based image classification using small training sample size

An optimum character recognition system using decision functions

Related Papers (5)

Dropout as a Bayesian approximation: representing model uncertainty in deep learning

Deep Residual Learning for Image Recognition

Learning Multiple Layers of Features from Tiny Images

On calibration of modern neural networks

Explaining and Harnessing Adversarial Examples