Learning in an Uncertain World: Representing Ambiguity Through Multiple Hypotheses

doi:10.1109/ICCV.2017.388

Open Access, Proceedings Article

Christian Rupprecht, +4 more

- pp 3611-3620

TLDR

This work proposes a framework for reformulating existing single-prediction models as multiple hypothesis prediction (MHP) models, together with an associated meta loss and optimization procedure to train them. MHP models are found to outperform their single-hypothesis counterparts in all cases and to expose valuable insights into the variability of predictions.

Abstract:

Many prediction tasks contain uncertainty. In some cases, uncertainty is inherent in the task itself. In future prediction, for example, many distinct outcomes are equally valid. In other cases, uncertainty arises from the way data is labeled. For example, in object detection, many objects of interest often go unlabeled, and in human pose estimation, occluded joints are often labeled with ambiguous values. In this work we focus on a principled approach for handling such scenarios. In particular, we propose a framework for reformulating existing single-prediction models as multiple hypothesis prediction (MHP) models and an associated meta loss and optimization procedure to train them. To demonstrate our approach, we consider four diverse applications: human pose estimation, future prediction, image classification and segmentation. We find that MHP models outperform their single-hypothesis counterparts in all cases, and that MHP models simultaneously expose valuable insights into the variability of predictions.
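
As a rough illustration of what a meta loss over multiple hypotheses can look like, the sketch below computes a relaxed winner-takes-all loss in Python with NumPy. The abstract does not spell out the loss, so everything here is an assumption made for illustration: the function name `mhp_meta_loss`, the squared-error base loss, and the `epsilon` relaxation that gives the best hypothesis weight 1 - epsilon and spreads epsilon evenly over the rest. It should not be read as the authors' implementation.

```python
import numpy as np


def mhp_meta_loss(predictions, target, epsilon=0.05):
    """Relaxed winner-takes-all loss over M hypotheses (illustrative sketch).

    predictions: array-like of shape (M, D), one row per hypothesis.
    target:      array-like of shape (D,), the single available label.
    epsilon:     hypothetical relaxation weight shared by non-best hypotheses.

    Returns (loss, index_of_best_hypothesis).
    """
    predictions = np.asarray(predictions, dtype=float)
    target = np.asarray(target, dtype=float)
    num_hyp = predictions.shape[0]

    # Base loss per hypothesis: squared Euclidean error (an assumed choice).
    per_hyp = np.sum((predictions - target) ** 2, axis=1)

    # The hypothesis closest to the label is the "winner".
    best = int(np.argmin(per_hyp))

    if num_hyp == 1:
        # With one hypothesis this reduces to the ordinary single-prediction loss.
        weights = np.ones(1)
    else:
        # The winner gets most of the weight; the others share epsilon.
        weights = np.full(num_hyp, epsilon / (num_hyp - 1))
        weights[best] = 1.0 - epsilon

    return float(np.dot(weights, per_hyp)), best


if __name__ == "__main__":
    hypotheses = [[0.0, 0.0], [1.0, 1.0], [3.0, 3.0]]
    loss, best = mhp_meta_loss(hypotheses, [1.0, 1.0], epsilon=0.1)
    print(loss, best)
```

Because the closest hypothesis dominates the loss, the remaining hypotheses are only weakly pulled toward any one label, which leaves them free to cover other plausible outcomes.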

Citations

Journal Article

Transmission-Guided Bayesian Generative Model for Smoke Segmentation

Siyuan Yan, +2 more

- 28 Jun 2022 -

Proceedings of the ... AAAI Conference o...

TL;DR: This paper proposes a transmission-guided local coherence loss that guides the network to learn pairwise relationships based on pixel distance and the transmission feature for smoke segmentation.

Posted Content

Goal-Directed Occupancy Prediction for Lane-Following Actors

Poornima Kaniarasu, +2 more

- 06 Sep 2020 -

arXiv: Signal Processing

TL;DR: This work proposes a method that leverages the mapped road topology to reason over possible goals and predict the future spatial occupancy of dynamic road actors. The predicted occupancy remains consistent with the mapped lane geometry and naturally captures multi-modality based on the local scene context.

Proceedings Article

Orientation Estimation of Abdominal Ultrasound Images with Multi-Hypotheses Networks

Timo Horstmann, +3 more

TL;DR: This work trains neural networks not only to predict the absolute orientation of ultrasound frames but also to produce a confidence for each prediction, which makes it possible to select only the most confident frames in a clip.

Posted Content

Interpretable Social Anchors for Human Trajectory Forecasting in Crowds

Parth Kothari, +2 more

- 07 May 2021 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper leverages discrete choice models to learn interpretable rule-based intents, and then uses the expressiveness of neural networks to model scene-specific residuals.

Journal Article

HuManiFlow: Ancestor-Conditioned Normalising Flows on SO(3) Manifolds for Human Pose and Shape Distribution Estimation

Akash Sengupta, +2 more

- 11 May 2023 -

arXiv.org

TL;DR: HuManiFlow uses the human kinematic tree to factorise full body pose into ancestor-conditioned per-body-part pose distributions in an autoregressive manner.

References

Proceedings Article

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

TL;DR: This work proposes a residual learning framework to ease the training of networks that are substantially deeper than those used previously; an ensemble of these residual nets won 1st place on the ILSVRC 2015 classification task.

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: This work trains a large, deep convolutional neural network for ImageNet classification. The network consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: This paper investigates the effect of convolutional network depth on accuracy in the large-scale image recognition setting and shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

Journal Article

Gradient-based learning applied to document recognition

Yann LeCun, +6 more

TL;DR: This article shows that gradient-based learning can synthesize a complex decision surface that classifies high-dimensional patterns such as handwritten characters, and it proposes graph transformer networks (GTNs) so that multi-module recognition systems can be trained globally with gradient-based methods.

Journal Article

Dropout: a simple way to prevent neural networks from overfitting

Nitish Srivastava, +4 more

- 01 Jan 2014 -

Journal of Machine Learning Research

TL;DR: It is shown that dropout improves the performance of neural networks on supervised learning tasks in vision, speech recognition, document classification and computational biology, obtaining state-of-the-art results on many benchmark data sets.

Related Papers (5)

Adam: A Method for Stochastic Optimization

U-Net: Convolutional Networks for Biomedical Image Segmentation

Deep Residual Learning for Image Recognition

Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles

Dropout as a Bayesian approximation: representing model uncertainty in deep learning