iDocChip: A Configurable Hardware Architecture for Historical Document Image Processing
Menbere Kina Tekleyohannes,Vladimir Rybalkin,Muhammad Mohsin Ghaffar,Javier Alejandro Varela,Norbert Wehn,Andreas Dengel +5 more
TLDR
In this paper, the authors proposed a low power energy-efficient accelerator with real-time capabilities called iDocChip, which is a configurable hybrid hardware-software programmable system-on-chip (SoC) based anyOCR for digitizing historical documents.Abstract:
In recent years, $$\hbox {optical character recognition (OCR)}$$
systems have been used to digitally preserve historical archives. To transcribe historical archives into a machine-readable form, first, the documents are scanned, then an $$\hbox {OCR}$$
is applied. In order to digitize documents without the need to remove them from where they are archived, it is valuable to have a portable device that combines scanning and $$\hbox {OCR}$$
capabilities. Nowadays, there exist many commercial and open-source document digitization techniques, which are optimized for contemporary documents. However, they fail to give sufficient text recognition accuracy for transcribing historical documents due to the severe quality degradation of such documents. On the contrary, the anyOCR system, which is designed to mainly digitize historical documents, provides high accuracy. However, this comes at a cost of high computational complexity resulting in long runtime and high power consumption. To tackle these challenges, we propose a low power energy-efficient accelerator with real-time capabilities called iDocChip, which is a configurable hybrid hardware-software programmable $$\hbox {System-on-Chip (SoC)}$$
based on anyOCR for digitizing historical documents. In this paper, we focus on one of the most crucial processing steps in the anyOCR system: Text and Image Segmentation, which makes use of a multi-resolution morphology-based algorithm. Moreover, an optimized $$\hbox {FPGA}$$
-based hybrid architecture of this anyOCR step along with its optimized software implementations are presented. We demonstrate our results on multiple embedded and general-purpose platforms with respect to runtime and power consumption. The resulting hardware accelerator outperforms the existing anyOCR by 6.2
$$\times$$
, while achieving 207
$$\times$$
higher energy-efficiency and maintaining its high accuracy.read more
Citations
More filters
Journal ArticleDOI
High-Performance Matrix Eigenvalue Decomposition Using the Parallel Jacobi Algorithm on FPGA
Journal ArticleDOI
Adaptive Threshold-Based Database Preparation Method for Handwritten Image Classification
TL;DR: In this paper , an adaptive thresholding-based database preparation method is proposed for handwritten character classification and recognition system, which is based on the similarity score (SS) of an existing HCI images in the respective class.
Journal ArticleDOI
iDocChip: A Configurable Hardware Accelerator for an End-to-End Historical Document Image Processing.
Menbere Kina Tekleyohannes,Vladimir Rybalkin,Muhammad Mohsin Ghaffar,Javier Alejandro Varela,Norbert Wehn,Andreas Dengel +5 more
TL;DR: In this article, a configurable hardware-software programmable SoC called iDocChip that makes use of anyOCR techniques to achieve high accuracy was designed and implemented for portable devices that combine scanning and OCR capabilities.
References
More filters
Journal ArticleDOI
Angel-Eye: A Complete Design Flow for Mapping CNN Onto Embedded FPGA
Kaiyuan Guo,Lingzhi Sui,Jiantao Qiu,Jincheng Yu,Wang Junbin,Song Yao,Song Han,Yu Wang,Huazhong Yang +8 more
TL;DR: This paper proposes Angel-Eye, a programmable and flexible CNN accelerator architecture, together with data quantization strategy and compilation tool, which achieves similar performance and delivers up to better energy efficiency than peer FPGA implementation on the same platform.
Posted Content
Recurrent Neural Networks Hardware Implementation on FPGA
TL;DR: In this article, a hardware implementation of Long Short Term Memory (LSTM) recurrent network on the programmable logic Zynq 7020 FPGA from Xilinx is presented.
Journal ArticleDOI
Text/non-text image classification in the wild with convolutional neural networks
TL;DR: A novel convolutional neural network variant, called multi-scale spatial partition network (MSP-Net), which classifies images that contain text or not, by predicting text existence in all image blocks, which are spatial partitions at multiple scales on an input image.
Journal ArticleDOI
A comprehensive survey of mostly textual document segmentation algorithms since 2008
TL;DR: This survey highlights the variety of the approaches that have been proposed for document image segmentation since 2008 and provides a clear typology of documents and of document images segmentation algorithms.
Proceedings ArticleDOI
Document image segmentation using discriminative learning over connected components
TL;DR: This work trains a self-tunable multi-layer perceptron (MLP) classifier for distinguishing between text and non-text connected components using shape and context information as a feature vector to introduce connected component based classification.