Fine-grained classification of identity document types with only one example

doi:10.1109/MVA.2015.7153149

Proceedings ArticleDOI

Fine-grained classification of identity document types with only one example

- pp 126-129

TLDR

Different techniques for recognizing types of partly very similar identity documents using state-of-the-art visual recognition approaches including feature representations based on recent achievements with convolutional neural networks are developed and evaluated.

Abstract:

In this paper, we tackle the task of recognizing types of partly very similar identity documents using state-of-the-art visual recognition approaches. Given a scanned document, the goal is to identify the country of issue, the type of document, and its version. Whereas recognizing the individual parts of a document with known standardized layout can be done reliably, identifying the type of a document and therefore also its layout is a challenging problem due to the large variety of documents. In our paper, we develop and evaluate different techniques for this application including feature representations based on recent achievements with convolutional neural networks. On a dataset with 74 different classes and using only one training image per class, our best approach achieves a mean class-wise accuracy of 97.7%.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

MIDV-500: A Dataset for Identity Documents Analysis and Recognition on Mobile Devices in Video Stream

Vladimir V. Arlazarov, +3 more

- 16 Jul 2018 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper presents a Mobile Identity Document Video dataset (MIDV-500) consisting of 500 video clips for 50 different identity document types with ground truth which allows to perform research in a wide scope of document analysis problems.

...read moreread less

Proceedings ArticleDOI

Analysis of Convolutional Neural Networks for Document Image Classification

Chris Tensmeyer, +1 more

TL;DR: In this article, the authors conducted a large empirical study to find what aspects of CNNs most affect performance on document images and showed that CNNs trained on RVL-CDIP learn region-specific layout features.

...read moreread less

Proceedings ArticleDOI

Complex Document Classification and Localization Application on Identity Document Images

Ahmad Montaser Awal, +3 more

TL;DR: This paper addresses the classification of documents composed of few textual information and complex background (such as identity documents) and shows that training images are not necessary and only one reference image is enough to create a document model.

...read moreread less

Proceedings ArticleDOI

Fast Method of ID Documents Location and Type Identification for Mobile and Server Application

Natalya Skoryukina, +2 more

TL;DR: Results show that using lines and quadrangles increase the location accuracy, and the proposed algorithm surpasses previously published works in classification precision and computational performance.

...read moreread less

Book ChapterDOI

Machine Learning Techniques for Identity Document Verification in Uncontrolled Environments: A Case Study

Alejandra Castelblanco, +5 more

TL;DR: A machine-learning based pipeline to process pictures of documents in distributed enrollment to services such as banking that relies on various analysis modules and visual features for verification of document type and legitimacy is presented.

...read moreread less

References

PDF

Open Access

More filters

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

Posted Content

Caffe: Convolutional Architecture for Fast Feature Embedding

Yangqing Jia, +7 more

- 20 Jun 2014 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: Caffe as discussed by the authors is a BSD-licensed C++ library with Python and MATLAB bindings for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.

...read moreread less

Journal ArticleDOI

Object Detection with Discriminatively Trained Part-Based Models

Pedro F. Felzenszwalb, +3 more

- 01 Sep 2010 -

IEEE Transactions on Pattern Analysis an...

TL;DR: An object detection system based on mixtures of multiscale deformable part models that is able to represent highly variable object classes and achieves state-of-the-art results in the PASCAL object detection challenges is described.

...read moreread less

Posted Content

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition

Jeff Donahue, +6 more

- 06 Oct 2013 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: DeCAF, an open-source implementation of deep convolutional activation features, along with all associated network parameters, are released to enable vision researchers to be able to conduct experimentation with deep representations across a range of visual concept learning paradigms.

...read moreread less

arXiv: Computer Vision and Pattern Recog...

PHOCNet: A Deep Convolutional Neural Network for Word Spotting in Handwritten Documents

Sebastian Sudholt, +1 more

Fine-grained classification of identity document types with only one example

Citations

MIDV-500: A Dataset for Identity Documents Analysis and Recognition on Mobile Devices in Video Stream

Analysis of Convolutional Neural Networks for Document Image Classification

Complex Document Classification and Localization Application on Identity Document Images

Fast Method of ID Documents Location and Type Identification for Mobile and Server Application

Machine Learning Techniques for Identity Document Verification in Uncontrolled Environments: A Case Study

References

ImageNet Classification with Deep Convolutional Neural Networks

ImageNet: A large-scale hierarchical image database

Caffe: Convolutional Architecture for Fast Feature Embedding

Object Detection with Discriminatively Trained Part-Based Models

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition

Related Papers (5)

Complex Document Classification and Localization Application on Identity Document Images

Use case visual Bag-of-Words techniques for camera based identity document classification

Convolutional Neural Networks for Document Image Classification

Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval

PHOCNet: A Deep Convolutional Neural Network for Word Spotting in Handwritten Documents