Conference

International Conference on Document Analysis and Recognition

About: International Conference on Document Analysis and Recognition is an academic conference. The conference publishes majorly in the area(s): Handwriting recognition & Feature extraction. Over the lifetime, 3952 publications have been published by the conference receiving 97550 citations.

...read moreread less

Topics: Handwriting recognition, Feature extraction, Optical character recognition, Image segmentation, Intelligent character recognition ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Random decision forests

[...]

Tin Kam Ho¹•Institutions (1)

Bell Labs¹

14 Aug 1995

TL;DR: In this article, the authors proposed a method to construct tree-based classifiers whose capacity can be arbitrarily expanded for increases in accuracy for both training and unseen data, which can be monotonically improved by building multiple trees in different subspaces of the feature space.

...read moreread less

Abstract: Decision trees are attractive classifiers due to their high execution speed. But trees derived with traditional methods often cannot be grown to arbitrary complexity for possible loss of generalization accuracy on unseen data. The limitation on complexity usually means suboptimal accuracy on training data. Following the principles of stochastic modeling, we propose a method to construct tree-based classifiers whose capacity can be arbitrarily expanded for increases in accuracy for both training and unseen data. The essence of the method is to build multiple trees in randomly selected subspaces of the feature space. Trees in, different subspaces generalize their classification in complementary ways, and their combined classification can be monotonically improved. The validity of the method is demonstrated through experiments on the recognition of handwritten digits.

...read moreread less

2,957 citations

Proceedings Article•DOI•

Best practices for convolutional neural networks applied to visual document analysis

[...]

Patrice Y. Simard¹, David W. Steinkraus¹, John Platt¹•Institutions (1)

Microsoft¹

03 Aug 2003

TL;DR: A set of concrete bestpractices that document analysis researchers can use to get good results with neural networks, including a simple "do-it-yourself" implementation of convolution with a flexible architecture suitable for many visual document problems.

...read moreread less

Abstract: Neural networks are a powerful technology forclassification of visual inputs arising from documents.However, there is a confusing plethora of different neuralnetwork methods that are used in the literature and inindustry. This paper describes a set of concrete bestpractices that document analysis researchers can use toget good results with neural networks. The mostimportant practice is getting a training set as large aspossible: we expand the training set by adding a newform of distorted data. The next most important practiceis that convolutional neural networks are better suited forvisual document tasks than fully connected networks. Wepropose that a simple "do-it-yourself" implementation ofconvolution with a flexible architecture is suitable formany visual document problems. This simpleconvolutional neural network does not require complexmethods, such as momentum, weight decay, structure-dependentlearning rates, averaging layers, tangent prop,or even finely-tuning the architecture. The end result is avery simple yet general architecture which can yieldstate-of-the-art performance for document analysis. Weillustrate our claims on the MNIST set of English digitimages.

...read moreread less

2,783 citations

Proceedings Article•DOI•

An Overview of the Tesseract OCR Engine

[...]

Ray Smith¹•Institutions (1)

Google¹

23 Sep 2007

TL;DR: The Tesseract OCR engine, as was the HP Research Prototype in the UNLV Fourth Annual Test of OCR Accuracy, is described in a comprehensive overview.

...read moreread less

Abstract: The Tesseract OCR engine, as was the HP Research Prototype in the UNLV Fourth Annual Test of OCR Accuracy, is described in a comprehensive overview. Emphasis is placed on aspects that are novel or at least unusual in an OCR engine, including in particular the line finding, features/classification methods, and the adaptive classifier.

...read moreread less

1,530 citations

Proceedings Article•DOI•

ICDAR 2015 competition on Robust Reading

[...]

Dimosthenis Karatzas¹, Lluis Gomez-Bigorda¹, Anguelos Nicolaou¹, Suman K. Ghosh¹, Andrew D. Bagdanov¹, Masakazu Iwamura², Jiri Matas³, Lukas Neumann³, Vijay Chandrasekhar⁴, Shijian Lu⁴, Faisal Shafait⁵, Seiichi Uchida⁶, Ernest Valveny¹ - Show less +9 more•Institutions (6)

Autonomous University of Barcelona¹, Osaka Prefecture University², Czech Technical University in Prague³, Institute for Infocomm Research Singapore⁴, National University of Science and Technology⁵, Kyushu University⁶

23 Aug 2015

TL;DR: A new Challenge 4 on Incidental Scene Text has been added to the Challenges on Born-Digital Images, Focused Scene Images and Video Text and tasks assessing End-to-End system performance have been introduced to all Challenges.

...read moreread less

Abstract: Results of the ICDAR 2015 Robust Reading Competition are presented. A new Challenge 4 on Incidental Scene Text has been added to the Challenges on Born-Digital Images, Focused Scene Images and Video Text. Challenge 4 is run on a newly acquired dataset of 1,670 images evaluating Text Localisation, Word Recognition and End-to-End pipelines. In addition, the dataset for Challenge 3 on Video Text has been substantially updated with more video sequences and more accurate ground truth data. Finally, tasks assessing End-to-End system performance have been introduced to all Challenges. The competition took place in the first quarter of 2015, and received a total of 44 submissions. Only the tasks newly introduced in 2015 are reported on. The datasets, the ground truth specification and the evaluation protocols are presented together with the results and a brief summary of the participating methods.

...read moreread less

1,224 citations

Proceedings Article•DOI•

ICDAR 2013 Robust Reading Competition

[...]

Dimosthenis Karatzas¹, Faisal Shafait², Seiichi Uchida³, Masakazu Iwamura⁴, Lluís Gómez i Bigorda¹, Sergi Robles Mestre¹, Joan Mas¹, David Fernandez Mota¹, Jon Almazan¹, Lluís-Pere de las Heras¹ - Show less +6 more•Institutions (4)

Autonomous University of Barcelona¹, University of Western Australia², Kyushu University³, Osaka Prefecture University⁴

25 Aug 2013

TL;DR: The datasets and ground truth specification are described, the performance evaluation protocols used are details, and the final results are presented along with a brief summary of the participating methods.

...read moreread less

Abstract: This report presents the final results of the ICDAR 2013 Robust Reading Competition. The competition is structured in three Challenges addressing text extraction in different application domains, namely born-digital images, real scene images and real-scene videos. The Challenges are organised around specific tasks covering text localisation, text segmentation and word recognition. The competition took place in the first quarter of 2013, and received a total of 42 submissions over the different tasks offered. This report describes the datasets and ground truth specification, details the performance evaluation protocols used and presents the final results along with a brief summary of the participating methods.

...read moreread less

1,191 citations

Collapse

Performance

Metrics

3,952

Papers

97,550

Citations

No. of papers from the Conference in previous years
Year	Papers
2021	266
2019	330
2017	314
2015	254
2013	287
2011	298