scispace - formally typeset
Journal ArticleDOI

Consensus-based clustering for document image segmentation

Reads0
Chats0
TLDR
A consensus-based clustering approach for document image segmentation that is used iteratively with a classifier to label each primitive block and shows that the dependency of classification performance on the training data is significantly reduced.
Abstract
Segmentation of a document image plays an important role in automatic document processing. In this paper, we propose a consensus-based clustering approach for document image segmentation. In this method, the foreground regions of a document image are grouped into a set of primitive blocks, and a set of features is extracted from them. Similarities among the blocks are computed on each feature using a hypothesis test-based similarity measure. Based on the consensus of these similarities, clustering is performed on the primitive blocks. This clustering approach is used iteratively with a classifier to label each primitive block. Experimental results show the effectiveness of the proposed method. It is further shown in the experimental results that the dependency of classification performance on the training data is significantly reduced.

read more

Citations
More filters
Proceedings ArticleDOI

Document Image Page Segmentation and Character Recognition as Semantic Segmentation

TL;DR: This work uses fully supervised Deep CNN semantic segmentation to separate content layers from historical document images containing diverse content types, including handwriting, machine print, form lines, and stamps, using CNNs for semantic pixel labeling.
Proceedings ArticleDOI

ICDAR2019 Competition on Recognition of Early Indian Printed Documents – REID2019

TL;DR: Different evaluation metrics were used to gain an in-sight into the algorithms, including new character accuracy metrics to better reflect the difficult circumstances presented by the documents.
Book ChapterDOI

Classification Methods in Image Analysis with a Special Focus on Medical Analytics

TL;DR: The design and application of classification methods for image analysis and processing are described in multiple contexts where the image analysis plays a very important role, including security and biometrics, aerospace and satellite monitoring, document analysis, natural language understanding, and information retrieval.
Journal ArticleDOI

Bin Ratio-Based Histogram Distances and their Application to Image Classification

TL;DR: The report will summarize the advance classification approaches that are used to improve accuracy levels of SVM, DTC, Artificial Neural Network, Expert System Classifier and ANN that maximize the accuracy level.
References
More filters
Journal ArticleDOI

Random Forests

TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
Journal ArticleDOI

LIBSVM: A library for support vector machines

TL;DR: Issues such as solving SVM optimization problems theoretical convergence multiclass classification probability estimates and parameter selection are discussed in detail.
Book

Introduction to Algorithms

TL;DR: The updated new edition of the classic Introduction to Algorithms is intended primarily for use in undergraduate or graduate courses in algorithms or data structures and presents a rich variety of algorithms and covers them in considerable depth while making their design and analysis accessible to all levels of readers.
Proceedings Article

A density-based algorithm for discovering clusters a density-based algorithm for discovering clusters in large spatial databases with noise

TL;DR: In this paper, a density-based notion of clusters is proposed to discover clusters of arbitrary shape, which can be used for class identification in large spatial databases and is shown to be more efficient than the well-known algorithm CLAR-ANS.
Related Papers (5)