scispace - formally typeset
Open AccessProceedings ArticleDOI

3D Ball Localization From A Single Calibrated Image

TLDR
In this article , a small neural network trained on image patches around candidates generated by a conventional ball detector is used to predict the confidence of having a ball in the image patch, and through its confidence output, the model improves the detection rate by filtering the candidates produced by the detector.
Abstract
Ball 3D localization in team sports has various applications including automatic offside detection in soccer, or shot release localization in basketball. Today, this task is either resolved by using expensive multi-views setups, or by restricting the analysis to ballistic trajectories. In this work, we propose to address the task on a single image from a calibrated monocular camera by estimating ball diameter in pixels and use the knowledge of real ball diameter in meters. This approach is suitable for any game situation where the ball is (even partly) visible. To achieve this, we use a small neural network trained on image patches around candidates generated by a conventional ball detector. Besides predicting ball diameter, our network outputs the confidence of having a ball in the image patch. Validations on 3 basketball datasets reveals that our model gives remarkable predictions on ball 3D localization. In addition, through its confidence output, our model improves the detection rate by filtering the candidates produced by the detector. The contributions of this work are (i) the first model to address 3D ball localization on a single image, (ii) an effective method for ball 3D annotation from single calibrated images, (iii) a high quality 3D ball evaluation dataset annotated from a single viewpoint. In addition, the code to reproduce this research will be made freely available at https://github.com/gabriel-vanzandycke/deepsport

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

DeepSportradar-v1: Computer Vision Dataset for Sports Understanding with High Quality Annotations

TL;DR: DeepSportradar-v1, a suite of computer vision tasks, datasets and benchmarks for automated sport understanding, is introduced to close the gap between academic research and real world settings.
References
More filters
Journal ArticleDOI

Gradient-based learning applied to document recognition

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.
Journal ArticleDOI

Use of the Hough transformation to detect lines and curves in pictures

TL;DR: It is pointed out that the use of angle-radius rather than slope-intercept parameters simplifies the computation further, and how the method can be used for more general curve fitting.
Journal ArticleDOI

Robust Estimation of a Location Parameter

TL;DR: In this article, a new approach toward a theory of robust estimation is presented, which treats in detail the asymptotic theory of estimating a location parameter for contaminated normal distributions, and exhibits estimators that are asyptotically most robust (in a sense to be specified) among all translation invariant estimators.
Book ChapterDOI

ICNet for Real-Time Semantic Segmentation on High-Resolution Images

TL;DR: ICNet as discussed by the authors proposes an image cascade network (ICNet) that incorporates multi-resolution branches under proper label guidance to reduce a large portion of computation for pixel-wise label inference, which yields real-time inference on a single GPU card.
Proceedings ArticleDOI

A Semi-automatic System for Ground Truth Generation of Soccer Video Sequences

TL;DR: This paper proposes a semi-automatic system that generates an initial ground truth estimation, and then provides a user-friendly interface to manually validate or correct the track results.