ICDAR 2015 competition on Robust Reading

doi:10.1109/ICDAR.2015.7333942

Proceedings ArticleDOI

ICDAR 2015 competition on Robust Reading

- pp 1156-1160

TLDR

A new Challenge 4 on Incidental Scene Text has been added to the Challenges on Born-Digital Images, Focused Scene Images and Video Text and tasks assessing End-to-End system performance have been introduced to all Challenges.

Abstract:

Results of the ICDAR 2015 Robust Reading Competition are presented. A new Challenge 4 on Incidental Scene Text has been added to the Challenges on Born-Digital Images, Focused Scene Images and Video Text. Challenge 4 is run on a newly acquired dataset of 1,670 images evaluating Text Localisation, Word Recognition and End-to-End pipelines. In addition, the dataset for Challenge 3 on Video Text has been substantially updated with more video sequences and more accurate ground truth data. Finally, tasks assessing End-to-End system performance have been introduced to all Challenges. The competition took place in the first quarter of 2015, and received a total of 44 submissions. Only the tasks newly introduced in 2015 are reported on. The datasets, the ground truth specification and the evaluation protocols are presented together with the results and a brief summary of the participating methods.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

DOTA: A Large-Scale Dataset for Object Detection in Aerial Images

Gui-Song Xia, +8 more

TL;DR: The Dataset for Object Detection in Aerial Images (DOTA) as discussed by the authors is a large-scale dataset of aerial images collected from different sensors and platforms and contains objects exhibiting a wide variety of scales, orientations, and shapes.

...read moreread less

Proceedings ArticleDOI

EAST: An Efficient and Accurate Scene Text Detector

Xinyu Zhou, +6 more

TL;DR: This work proposes a simple yet powerful pipeline that yields fast and accurate text detection in natural scenes, and significantly outperforms state-of-the-art methods in terms of both accuracy and efficiency.

...read moreread less

Proceedings ArticleDOI

Synthetic Data for Text Localisation in Natural Images

Ankush Gupta, +2 more

TL;DR: In this article, a Fully-Convolutional Regression Network (FCRN) was proposed to perform text detection and bounding-box regression at all locations and multiple scales in an image.

...read moreread less

Journal ArticleDOI

Arbitrary-Oriented Scene Text Detection via Rotation Proposals

Jianqi Ma, +6 more

- 23 Mar 2018 -

IEEE Transactions on Multimedia

TL;DR: The Rotation Region Proposal Networks are designed to generate inclined proposals with text orientation angle information that are adapted for bounding box regression to make the proposals more accurately fit into the text region in terms of the orientation.

...read moreread less

Book ChapterDOI

Detecting Text in Natural Image with Connectionist Text Proposal Network

Zhi Tian, +6 more

TL;DR: The Connectionist Text Proposal Network (CTPN) as mentioned in this paper detects a text line in a sequence of fine-scale text proposals directly in convolutional feature maps, and develops a vertical anchor mechanism that jointly predicts location and text/non-text score of each fixed-width proposal.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

The Pascal Visual Object Classes Challenge: A Retrospective

Mark Everingham, +5 more

- 01 Jan 2015 -

International Journal of Computer Vision

TL;DR: A review of the Pascal Visual Object Classes challenge from 2008-2012 and an appraisal of the aspects of the challenge that worked well, and those that could be improved in future challenges.

...read moreread less

Journal ArticleDOI

Evaluating multiple object tracking performance: the CLEAR MOT metrics

Keni Bernardin, +1 more

- 01 Feb 2008 -

Eurasip Journal on Image and Video Proce...

TL;DR: This work introduces two intuitive and general metrics to allow for objective comparison of tracker characteristics, focusing on their precision in estimating object locations, their accuracy in recognizing object configurations and their ability to consistently label objects over time.

...read moreread less

Book ChapterDOI

Exploiting the circulant structure of tracking-by-detection with kernels

João F. Henriques, +3 more

TL;DR: Using the well-established theory of Circulant matrices, this work provides a link to Fourier analysis that opens up the possibility of extremely fast learning and detection with the Fast Fourier Transform, which can be done in the dual space of kernel machines as fast as with linear classifiers.

...read moreread less

Proceedings ArticleDOI

ICDAR 2013 Robust Reading Competition

Dimosthenis Karatzas, +9 more

TL;DR: The datasets and ground truth specification are described, the performance evaluation protocols used are details, and the final results are presented along with a brief summary of the participating methods.

...read moreread less

Proceedings ArticleDOI

End-to-end scene text recognition

Kai Wang, +2 more

TL;DR: While scene text recognition has generally been treated with highly domain-specific methods, the results demonstrate the suitability of applying generic computer vision methods.

...read moreread less

Collapse

An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition

Baoguang Shi, +2 more

- 01 Nov 2017 -

IEEE Transactions on Pattern Analysis an...

ICDAR 2015 competition on Robust Reading

Citations

DOTA: A Large-Scale Dataset for Object Detection in Aerial Images

EAST: An Efficient and Accurate Scene Text Detector

Synthetic Data for Text Localisation in Natural Images

Arbitrary-Oriented Scene Text Detection via Rotation Proposals

Detecting Text in Natural Image with Connectionist Text Proposal Network

References

The Pascal Visual Object Classes Challenge: A Retrospective

Evaluating multiple object tracking performance: the CLEAR MOT metrics

Exploiting the circulant structure of tracking-by-detection with kernels

ICDAR 2013 Robust Reading Competition

End-to-end scene text recognition

Related Papers (5)

ICDAR 2013 Robust Reading Competition

Deep Residual Learning for Image Recognition

Synthetic Data for Text Localisation in Natural Images

EAST: An Efficient and Accurate Scene Text Detector

An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition