Scale and Rotation Corrected CNNs (SRC-CNNs) for Scale and Rotation Invariant Character Recognition
18 Dec 2018
TL;DR: It is demonstrated how basic PCA-based rotation and scale invariant image recognition can be integrated into a CNN to achieve better rotation and scale invariance in classification.
Abstract: The last decade has witnessed rapid growth in the popularity of Convolutional Neural Networks (CNNs) for detecting and classifying objects. The self-trainable nature of CNNs makes them a strong candidate as both a classifier and a feature extractor. However, many existing CNN architectures fail to recognize texts or objects under input rotation and scaling. This paper introduces an elegant approach, the 'Scale and Rotation Corrected CNN (SRC-CNN)', for scale and rotation invariant text recognition, exploiting the concept of the principal component of characters. Prior to training and testing with a baseline CNN, SRC-CNN maps each character image to a reference orientation and scale, both derived from the character image itself. SRC-CNN is capable of recognizing characters in a document even when they differ greatly in orientation and scale, and it does not require any training samples that are scaled or rotated. The performance of the proposed approach is validated on character datasets such as MNIST, MNIST_rot_12k, and English alphabets, and compared with state-of-the-art rotation invariant classification networks. SRC-CNN is a generalized approach and can be extended to rotation and scale invariant classification of many other datasets, choosing any appropriate baseline CNN. We demonstrate this generality on the Fashion-MNIST dataset, where SRC-CNN performs well in rotation and scale invariant classification of objects too. This paper demonstrates how basic PCA-based rotation and scale invariant image recognition can be integrated into a CNN to achieve better rotation and scale invariance in classification.
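To make the pre-correction step concrete, below is a minimal sketch of PCA-based orientation and scale normalization in the spirit the abstract describes. All function names, thresholds, and the choice of canonical pose are illustrative assumptions, not the paper's exact procedure; in particular, PCA leaves a 180° orientation ambiguity that a complete system must resolve.

```python
import numpy as np
from scipy import ndimage

def pca_correct(img, fg_thresh=0.5, target_extent=20.0):
    """Rotate and scale a grayscale character image to a canonical pose."""
    ys, xs = np.nonzero(img > fg_thresh)           # foreground pixel coordinates
    coords = np.stack([xs, ys], axis=1).astype(float)
    coords -= coords.mean(axis=0)                  # centre the point cloud

    # The principal component of the pixel cloud gives the dominant axis.
    cov = np.cov(coords, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)         # eigenvalues in ascending order
    principal = eigvecs[:, -1]                     # axis of largest variance
    angle = np.degrees(np.arctan2(principal[1], principal[0]))

    # Rotate so the principal axis becomes vertical (the reference orientation);
    # the exact sign convention depends on the image coordinate frame.
    rotated = ndimage.rotate(img, 90.0 - angle, reshape=True, order=1)

    # Scale so the spread along the principal axis matches a fixed extent.
    spread = 2.0 * np.sqrt(eigvals[-1])            # ~one std-dev extent each way
    return ndimage.zoom(rotated, target_extent / max(spread, 1e-6), order=1)
```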
Citations
TL;DR: This paper presents a deep image restoration model that restores adversarial examples so that the target model classifies them correctly again, and shows experimentally that it outperforms rival defense methods.
Abstract: Deep learning and computer vision are fast-growing fields in the modern world of information technology. Deep learning algorithms and computer vision have achieved great success in applications such as image classification, speech recognition, self-driving vehicles, and disease diagnostics. Despite this success, these learning algorithms face severe threats from adversarial attacks. Adversarial examples are inputs, such as images in computer vision, that are intentionally perturbed by small, humanly imperceptible changes, yet are misclassified by a model with high probability, severely affecting its performance. We present a deep image restoration model that restores adversarial examples so that the target model classifies them correctly again. We show that this defense against adversarial attacks, based on a deep image restoration model, is simple and state-of-the-art, supporting the claim with strong experimental evidence. We use the MNIST and CIFAR10 datasets for experiments and analysis, and compare our method with other state-of-the-art defenses, showing that our results are better than those of rival methods.
4 citations
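A minimal sketch of the restore-then-classify pipeline the abstract outlines is shown below; both networks are untrained stand-ins, not the authors' architectures, and serve only to show where the restoration model sits in front of the unchanged target classifier.

```python
import torch
import torch.nn as nn

restorer = nn.Sequential(                  # stand-in restoration network
    nn.Conv2d(1, 32, 3, padding=1), nn.ReLU(),
    nn.Conv2d(32, 32, 3, padding=1), nn.ReLU(),
    nn.Conv2d(32, 1, 3, padding=1),
)
classifier = nn.Sequential(                # stand-in target model
    nn.Flatten(), nn.Linear(28 * 28, 10),
)

def defended_predict(x):
    """Restore a (possibly adversarial) input, then classify it."""
    with torch.no_grad():
        restored = restorer(x)             # intended to strip the perturbation
        return classifier(restored).argmax(dim=1)

preds = defended_predict(torch.rand(4, 1, 28, 28))   # batch of MNIST-sized inputs
```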
TL;DR: In this paper, the watershed algorithm is combined with an encoder-decoder convolutional neural network, trained in a sliding-window setup to predict the probability of cell centers (markers) and cell borders, for marker-driven segmentation of corneal endothelial cells.
Abstract: Quantitative information about the morphometry of corneal endothelium cells is vital for assessing corneal pathologies. Nevertheless, everyday clinical routine is dominated by qualitative assessment based on visual inspection of microscopy images. Although several systems exist for automatic segmentation of corneal endothelial cells, they exhibit certain limitations, the main one being sensitivity to low contrast and uneven illumination, which results in over-segmentation. Consequently, segmentation results often require manual editing of missing or false cell edges. This paper therefore further investigates the problem of corneal endothelium cell segmentation. A fully automatic pipeline is proposed that combines the watershed algorithm for marker-driven segmentation of corneal endothelial cells with an encoder-decoder convolutional neural network, trained in a sliding-window setup, that predicts the probability of cell centers (markers) and cell borders. The predicted markers are used for watershed segmentation of the edge probability maps output by the neural network. The pipeline's performance is analyzed on a heterogeneous dataset comprising four publicly available corneal endothelium image datasets, and three convolutional neural network models (U-Net, SegNet, and W-Net) incorporated into the pipeline are examined. The results are compared to a state-of-the-art competitor and are promising: regardless of the convolutional model used, the pipeline notably outperforms the competitor, achieving 97.72% cell detection accuracy against the competitor's 87.38%. The advantage of the introduced method is also apparent for cell size, the DICE coefficient, and the Modified Hausdorff distance.
1 citation
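The marker-driven watershed step can be sketched as follows, assuming a CNN (not shown) has already produced per-pixel center and border probability maps; the threshold value is illustrative, not the paper's.

```python
from scipy import ndimage
from skimage.segmentation import watershed

def segment_cells(center_prob, border_prob, center_thresh=0.7):
    """Watershed segmentation from CNN-predicted center/border maps."""
    # Connected blobs in the thresholded center map become the markers.
    markers, _ = ndimage.label(center_prob > center_thresh)
    # Flood the border-probability landscape outward from the markers;
    # watershed lines settle where the border probability is highest.
    return watershed(border_prob, markers)
```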
References
Proceedings Article
TL;DR: This work introduces a new learnable module, the Spatial Transformer, which explicitly allows the spatial manipulation of data within the network, and can be inserted into existing convolutional architectures, giving neural networks the ability to actively spatially transform feature maps.
Abstract: Convolutional Neural Networks define an exceptionally powerful class of models, but are still limited by the lack of ability to be spatially invariant to the input data in a computationally and parameter efficient manner. In this work we introduce a new learnable module, the Spatial Transformer, which explicitly allows the spatial manipulation of data within the network. This differentiable module can be inserted into existing convolutional architectures, giving neural networks the ability to actively spatially transform feature maps, conditional on the feature map itself, without any extra training supervision or modification to the optimisation process. We show that the use of spatial transformers results in models which learn invariance to translation, scale, rotation and more generic warping, resulting in state-of-the-art performance on several benchmarks, and for a number of classes of transformations.
4,869 citations
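A minimal sketch of a spatial transformer layer in PyTorch is shown below. The localisation network is a deliberately small stand-in (the paper's localisation networks are task-specific), but the affine_grid/grid_sample pattern is the standard differentiable sampler this kind of module relies on.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SpatialTransformer(nn.Module):
    def __init__(self, channels, height, width):
        super().__init__()
        self.loc = nn.Sequential(          # localisation network (stand-in)
            nn.Flatten(),
            nn.Linear(channels * height * width, 32), nn.ReLU(),
            nn.Linear(32, 6),              # the 6 affine parameters theta (2x3)
        )
        # Initialise to the identity transform so training starts stable.
        self.loc[-1].weight.data.zero_()
        self.loc[-1].bias.data.copy_(
            torch.tensor([1, 0, 0, 0, 1, 0], dtype=torch.float))

    def forward(self, x):
        theta = self.loc(x).view(-1, 2, 3)
        grid = F.affine_grid(theta, x.size(), align_corners=False)
        return F.grid_sample(x, grid, align_corners=False)

stn = SpatialTransformer(1, 28, 28)
warped = stn(torch.rand(4, 1, 28, 28))     # same shape, spatially transformed
```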
Posted Content
TL;DR: Fashion-MNIST is intended to serve as a direct drop-in replacement for the original MNIST dataset for benchmarking machine learning algorithms, as it shares the same image size, data format and the structure of training and testing splits.
Abstract: We present Fashion-MNIST, a new dataset comprising 28x28 grayscale images of 70,000 fashion products from 10 categories, with 7,000 images per category. The training set has 60,000 images and the test set has 10,000 images. Fashion-MNIST is intended to serve as a direct drop-in replacement for the original MNIST dataset for benchmarking machine learning algorithms, as it shares the same image size, data format, and structure of training and testing splits. The dataset is freely available at this https URL
3,707 citations
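Because Fashion-MNIST mirrors MNIST's image size, format, and splits, swapping it in is typically a one-line change. A usage sketch with torchvision (the download path is arbitrary):

```python
from torchvision import datasets, transforms

to_tensor = transforms.ToTensor()
# Previously: datasets.MNIST("data/", train=True, download=True, transform=to_tensor)
train = datasets.FashionMNIST("data/", train=True, download=True, transform=to_tensor)
test = datasets.FashionMNIST("data/", train=False, download=True, transform=to_tensor)

img, label = train[0]
print(img.shape, label)    # torch.Size([1, 28, 28]) and a class index in 0..9
```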
"Scale and Rotation Corrected CNNs (..." refers methods in this paper
[...]
[...]
[...]
Posted Content
TL;DR: Harmonic Networks as mentioned in this paper replace regular CNN filters with circular harmonics, returning a maximal response and orientation for every receptive field patch, which can encode complicated rotational invariants.
Abstract: Translating or rotating an input image should not affect the results of many computer vision tasks. Convolutional neural networks (CNNs) are already translation equivariant: input image translations produce proportionate feature map translations. This is not the case for rotations. Global rotation equivariance is typically sought through data augmentation, but patch-wise equivariance is more difficult. We present Harmonic Networks, or H-Nets, a CNN exhibiting equivariance to patch-wise translation and 360° rotation. We achieve this by replacing regular CNN filters with circular harmonics, returning a maximal response and orientation for every receptive field patch.
H-Nets use a rich, parameter-efficient and low computational complexity representation, and we show that deep feature maps within the network encode complicated rotational invariants. We demonstrate that our layers are general enough to be used in conjunction with the latest architectures and techniques, such as deep supervision and batch normalization. We also achieve state-of-the-art classification on rotated-MNIST, and competitive results on other benchmark challenges.
292 citations
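The core constraint can be sketched as follows: a filter is restricted to a circular harmonic, i.e. a radial profile R(r) modulated by a fixed angular phase e^{imφ} of rotation order m, so that rotating the input only shifts the phase of the response. The radial profile below is random noise standing in for the learned parameters; this is an illustration of the filter construction, not the paper's implementation.

```python
import numpy as np

def circular_harmonic(size=9, m=1, n_rings=4, seed=0):
    """Complex filter R(r) * exp(i*m*phi) sampled on a size x size grid."""
    rng = np.random.default_rng(seed)
    radial = rng.standard_normal(n_rings)          # stand-in for the learned R(r)
    c = (size - 1) / 2.0
    y, x = np.mgrid[0:size, 0:size] - c
    r = np.sqrt(x**2 + y**2)
    phi = np.arctan2(y, x)
    # Interpolate the ring profile at each pixel's radius.
    ring_pos = r / r.max() * (n_rings - 1)
    R = np.interp(ring_pos, np.arange(n_rings), radial)
    return R * np.exp(1j * m * phi)                # angular phase of order m

w = circular_harmonic()
print(w.shape, w.dtype)                            # (9, 9) complex128
```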
Proceedings Article
TL;DR: This work introduces four operations which can be inserted into neural network models as layers, and which can be combined to make these models partially equivariant to rotations while enabling parameter sharing across different orientations.
Abstract: Many classes of images exhibit rotational symmetry. Convolutional neural networks are sometimes trained using data augmentation to exploit this, but they are still required to learn the rotation equivariance properties from the data. Encoding these properties into the network architecture, as we are already used to doing for translation equivariance by using convolutional layers, could result in a more efficient use of the parameter budget by relieving the model from learning them. We introduce four operations which can be inserted into neural network models as layers, and which can be combined to make these models partially equivariant to rotations. They also enable parameter sharing across different orientations. We evaluate the effect of these architectural modifications on three datasets which exhibit rotational symmetry and demonstrate improved performance with smaller models.
235 citations
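Two of the four operations (cyclic slicing and cyclic pooling) can be sketched as follows: the input is expanded into its four 90° rotations, all copies share one convolutional stack, and the orientation dimension is pooled away at the end. The convolutional stack here is a placeholder, not the paper's architecture.

```python
import torch
import torch.nn as nn

conv = nn.Sequential(nn.Conv2d(1, 8, 3, padding=1), nn.ReLU())   # placeholder stack

def cyclic_slice(x):
    """Stack the four 90-degree rotations of x along the batch dimension."""
    return torch.cat([torch.rot90(x, k, dims=(2, 3)) for k in range(4)], dim=0)

def cyclic_pool(features, batch_size):
    """Pool over the four orientation copies (mean pooling here)."""
    return features.view(4, batch_size, *features.shape[1:]).mean(dim=0)

x = torch.rand(2, 1, 28, 28)
pooled = cyclic_pool(conv(cyclic_slice(x)), batch_size=2)        # (2, 8, 28, 28)
```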
"Scale and Rotation Corrected CNNs (..." refers methods in this paper
[...]
[...]
TL;DR: RotEqNet as discussed by the authors is a convolutional neural network (CNN) architecture encoding rotation equivariance, invariance and covariance; treating rotation explicitly instead of as just another variation leads to a reduction in the size of the required model.
Abstract: In many computer vision tasks, we expect a particular behavior of the output with respect to rotations of the input image. If this relationship is explicitly encoded, instead of treated as just another variation, the complexity of the problem decreases, leading to a reduction in the size of the required model. In this paper, we propose the Rotation Equivariant Vector Field Networks (RotEqNet), a Convolutional Neural Network (CNN) architecture encoding rotation equivariance, invariance and covariance. Each convolutional filter is applied at multiple orientations and returns a vector field representing the magnitude and angle of the highest scoring orientation at every spatial location. We develop a modified convolution operator relying on this representation to obtain deep architectures. We test RotEqNet on several problems requiring different responses with respect to the inputs' rotation: image classification, biomedical image segmentation, orientation estimation and patch matching. In all cases, we show that RotEqNet offers extremely compact models in terms of number of parameters and provides results in line with those of networks orders of magnitude larger.
177 citations
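The rotating-filter idea can be sketched as follows: one kernel is applied at several orientations, and each spatial location keeps the magnitude and angle of the best-responding orientation, forming a vector field. For simplicity this sketch rotates the kernel in 90° steps via rot90, whereas RotEqNet interpolates finer angles.

```python
import math
import torch
import torch.nn.functional as F

def oriented_conv(x, kernel):
    """Return (magnitude, angle) of the best response over 4 orientations."""
    responses = torch.stack(
        [F.conv2d(x, torch.rot90(kernel, k, dims=(2, 3)), padding=1)
         for k in range(4)], dim=0)                # (4, N, C_out, H, W)
    magnitude, best = responses.max(dim=0)         # strongest response wins
    angle = best.float() * (math.pi / 2)           # its orientation in radians
    return magnitude, angle

x = torch.rand(1, 1, 28, 28)
kernel = torch.rand(4, 1, 3, 3)                    # 4 output channels
mag, ang = oriented_conv(x, kernel)                # each (1, 4, 28, 28)
```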