Word Spotting in Cluttered Environment

doi:10.1007/978-981-32-9291-8_14

Citations

Journal Article

PUNet: Novel and efficient deep neural network architecture for handwritten documents word spotting

Omar Boudraa, Dominique Michelucci, Walid-Khaled Hidouci

01 Jan 2022-Pattern Recognition Letters

TL;DR: This paper proposes a robust deep-learning framework for exploring ancient documents that applies transfer learning from the U-Net network and uses the Pyramidal Histogram of Characters (PHOC) representation.

4 citations

References

Journal Article

A threshold selection method from gray level histograms

Nobuyuki Otsu

01 Jan 1979-IEEE Transactions on Systems, Man, and Cybernetics

37,017 citations

Proceedings Article

Using dynamic time warping to find patterns in time series

Donald J. Berndt¹, James Clifford¹

New York University¹

31 Jul 1994

TL;DR: Preliminary experiments with a dynamic programming approach to pattern detection in databases, based on the dynamic time warping technique used in the speech recognition field, are described.

Abstract: Knowledge discovery in databases presents many interesting challenges within the context of providing computer tools for exploring large data archives. Electronic data repositories are growing quickly and contain data from commercial, scientific, and other domains. Much of this data is inherently temporal, such as stock prices or NASA telemetry data. Detecting patterns in such data streams or time series is an important knowledge discovery task. This paper describes some preliminary experiments with a dynamic programming approach to the problem. The pattern detection algorithm is based on the dynamic time warping technique used in the speech recognition field.

3,229 citations

Proceedings Article

Word image matching using dynamic time warping

Toni M. Rath¹, R. Manmatha¹

University of Massachusetts Amherst¹

18 Jun 2003

TL;DR: This work presents an algorithm for matching handwritten words in noisy historical documents that performs better and is faster than competing matching techniques and presents experimental results on two different data sets from the George Washington collection.

Abstract: Libraries and other institutions are interested in providing access to scanned versions of their large collections of handwritten historical manuscripts on electronic media. Convenient access to a collection requires an index, which is manually created at great labor and expense. Since current handwriting recognizers do not perform well on historical documents, a technique called word spotting has been developed: clusters with occurrences of the same word in a collection are established using image matching. By annotating "interesting" clusters, an index can be built automatically. We present an algorithm for matching handwritten words in noisy historical documents. The segmented word images are preprocessed to create sets of 1-dimensional features, which are then compared using dynamic time warping. We present experimental results on two different data sets from the George Washington collection. Our experiments show that this algorithm performs better and is faster than competing matching techniques.
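
The matching step described in this abstract, comparing 1-dimensional feature sequences with dynamic time warping, can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a single scalar feature per image column and a squared-difference local cost, and it omits normalization and any warping-path constraint. The function name `dtw_distance` and the sample sequences are hypothetical.

```python
from math import inf


def dtw_distance(a, b):
    """Cumulative dynamic time warping cost between two 1-D sequences."""
    n, m = len(a), len(b)
    # cost[i][j] holds the minimal cumulative cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Assumed local cost: squared difference of the two samples.
            d = (a[i - 1] - b[j - 1]) ** 2
            # Extend the cheapest of the three allowed predecessor alignments.
            cost[i][j] = d + min(cost[i - 1][j],      # a advances, b repeats
                                 cost[i][j - 1],      # b advances, a repeats
                                 cost[i - 1][j - 1])  # both advance
    return cost[n][m]


# Hypothetical column profiles standing in for word-image features.
query = [0, 2, 5, 2, 0]
stretched = [0, 2, 2, 5, 5, 2, 0]   # same shape, written "wider"
other = [5, 0, 5, 0, 5]             # different shape

print(dtw_distance(query, stretched))
print(dtw_distance(query, other))
```

Because warping lets one sample align with several, the stretched profile matches the query at no cost, while the differently shaped profile does not. That tolerance to uneven horizontal stretching is what makes this kind of alignment a natural fit for handwriting.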

626 citations

Journal Article

Word spotting for historical documents

Toni M. Rath¹, R. Manmatha¹

University of Massachusetts Amherst¹

04 Apr 2007-International Journal on Document Analysis and Recognition

TL;DR: It is shown in a subset of the George Washington collection that such a word spotting technique can outperform a Hidden Markov Model word-based recognition technique in terms of word error rates.

Abstract: Searching and indexing historical handwritten collections are a very challenging problem. We describe an approach called word spotting which involves grouping word images into clusters of similar words by using image matching to find similarity. By annotating “interesting” clusters, an index that links words to the locations where they occur can be built automatically. Image similarities computed using a number of different techniques including dynamic time warping are compared. The word similarities are then used for clustering using both K-means and agglomerative clustering techniques. It is shown in a subset of the George Washington collection that such a word spotting technique can outperform a Hidden Markov Model word-based recognition technique in terms of word error rates.

368 citations

Journal Article

Lexicon-free handwritten word spotting using character HMMs

Andreas Fischer¹, Andreas Keller¹, Volkmar Frinken¹, Horst Bunke¹

University of Bern¹

01 May 2012-Pattern Recognition Letters

TL;DR: For a multi-writer scenario on the IAM off-line database as well as for two single writer scenarios on historical data sets, it is shown that the proposed learning-based system outperforms a standard template matching method.

293 citations
