scispace - formally typeset
Proceedings ArticleDOI

Fine-grained classification of identity document types with only one example

TLDR
Different techniques for recognizing types of partly very similar identity documents using state-of-the-art visual recognition approaches including feature representations based on recent achievements with convolutional neural networks are developed and evaluated.
Abstract
In this paper, we tackle the task of recognizing types of partly very similar identity documents using state-of-the-art visual recognition approaches. Given a scanned document, the goal is to identify the country of issue, the type of document, and its version. Whereas recognizing the individual parts of a document with known standardized layout can be done reliably, identifying the type of a document and therefore also its layout is a challenging problem due to the large variety of documents. In our paper, we develop and evaluate different techniques for this application including feature representations based on recent achievements with convolutional neural networks. On a dataset with 74 different classes and using only one training image per class, our best approach achieves a mean class-wise accuracy of 97.7%.

read more

Citations
More filters
Journal ArticleDOI

MIDV-500: A Dataset for Identity Documents Analysis and Recognition on Mobile Devices in Video Stream

TL;DR: This paper presents a Mobile Identity Document Video dataset (MIDV-500) consisting of 500 video clips for 50 different identity document types with ground truth which allows to perform research in a wide scope of document analysis problems.
Proceedings ArticleDOI

Analysis of Convolutional Neural Networks for Document Image Classification

TL;DR: In this article, the authors conducted a large empirical study to find what aspects of CNNs most affect performance on document images and showed that CNNs trained on RVL-CDIP learn region-specific layout features.
Proceedings ArticleDOI

Complex Document Classification and Localization Application on Identity Document Images

TL;DR: This paper addresses the classification of documents composed of few textual information and complex background (such as identity documents) and shows that training images are not necessary and only one reference image is enough to create a document model.
Proceedings ArticleDOI

Fast Method of ID Documents Location and Type Identification for Mobile and Server Application

TL;DR: Results show that using lines and quadrangles increase the location accuracy, and the proposed algorithm surpasses previously published works in classification precision and computational performance.
Book ChapterDOI

Machine Learning Techniques for Identity Document Verification in Uncontrolled Environments: A Case Study

TL;DR: A machine-learning based pipeline to process pictures of documents in distributed enrollment to services such as banking that relies on various analysis modules and visual features for verification of document type and legitimacy is presented.
References
More filters
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Posted Content

Caffe: Convolutional Architecture for Fast Feature Embedding

TL;DR: Caffe as discussed by the authors is a BSD-licensed C++ library with Python and MATLAB bindings for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.
Journal ArticleDOI

Object Detection with Discriminatively Trained Part-Based Models

TL;DR: An object detection system based on mixtures of multiscale deformable part models that is able to represent highly variable object classes and achieves state-of-the-art results in the PASCAL object detection challenges is described.
Posted Content

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition

TL;DR: DeCAF, an open-source implementation of deep convolutional activation features, along with all associated network parameters, are released to enable vision researchers to be able to conduct experimentation with deep representations across a range of visual concept learning paradigms.
Related Papers (5)