scispace - formally typeset
Open AccessProceedings ArticleDOI

MIDV-2019: challenges of the modern mobile-based document OCR

Konstantin B. Bulatov, +2 more
- Vol. 11433, pp 717-722
TLDR
In this article, the authors presented a new dataset, the MIDV-2019 dataset, containing video clips shot with modern high-resolution mobile cameras, with strong projective distortions and with low lighting conditions.
Abstract
Recognition of identity documents using mobile devices has become a topic of a wide range of computer vision research. The portfolio of methods and algorithms for solving such tasks as face detection, document detection and rectification, text field recognition, and other, is growing, and the scarcity of datasets has become an important issue. One of the openly accessible datasets for evaluating such methods is MIDV-500, containing video clips of 50 identity document types in various conditions. However, the variability of capturing conditions in MIDV-500 did not address some of the key issues, mainly significant projective distortions and different lighting conditions. In this paper we present a MIDV-2019 dataset, containing video clips shot with modern high-resolution mobile cameras, with strong projective distortions and with low lighting conditions. The description of the added data is presented, and experimental baselines for text field recognition in different conditions.

read more

Citations
More filters
Proceedings ArticleDOI

Houghencoder: Neural Network Architecture for Document Image Semantic Segmentation

TL;DR: HoughEncoder outperforms UNet which shows state-of-the-art results in many semantic image segmentation tasks even while it has a one hundred times fewer parameters.
Journal ArticleDOI

Text recognition for Vietnamese identity card based on deep features network

TL;DR: This paper investigates to develop a method for Vietnamese identity card recognition based on deep features network that achieves an accuracy of more than 96.7% and 89.8% on character level and word level, respectively.
Posted Content

MIDV-2020: A Comprehensive Benchmark Dataset for Identity Document Analysis

TL;DR: The MIDV-2020 dataset as discussed by the authors contains 1000 video clips, 2000 scanned images, and 1000 photos of 1000 unique mock identity documents, each with unique text field values and unique artificially generated faces, with rich annotation.
Book ChapterDOI

Fast End-to-End Deep Learning Identity Document Detection, Classification and Cropping

TL;DR: In this article, a modular approach using a fully multi-stage deep learning based approach is proposed to detect, classifying, and aligning captured documents onto their reference model, which allows to accurately classify the document and estimate its quadrilateral (localization).
Journal ArticleDOI

MIDV-2020: a comprehensive benchmark dataset for identity document analysis

- 01 Apr 2022 - 
TL;DR: The MIDV-2020 dataset as mentioned in this paper contains 1000 video clips, 2000 scanned images and 1000 photos of 1000 unique mock identity documents, each with unique text field values and unique artificially generated faces, with rich annotation.
Related Papers (4)