scispace - formally typeset
Proceedings ArticleDOI

Scale and Rotation Corrected CNNs (SRC-CNNs) for Scale and Rotation Invariant Character Recognition: SRC-CNN for Scale and Rotation Invariant Character Recognition

TLDR
It is demonstrated how the basic PCA based rotation and scale invariant image recognition can be integrated to CNN for achieving better rotational and scale invariances in classification.
Abstract
Last decade has witnessed rapid growth for the popularity of Convolutional Neural Networks (CNNs), in detecting and classifying objects. The self trainable nature of CNNs makes them the strongest candidate as a classifier and a feature extractor. However, many of the existing CNN architectures fail recognizing texts or objects under input rotation and scaling. This paper introduces an elegant approach, 'Scale and Rotation Corrected CNN (SRC-CNN)' for scale and rotation invariant text recognition, exploiting the concept of principal component of characters. Prior to training and testing with baseline CNN, 'SRC-CNN' maps each character image to a reference orientation and scale, which is again derived from the character image itself. SRC-CNN is capable of recognizing characters in a document, even though they differ in orientation and scale greatly. The proposed method does not demand any training with samples which are scaled or rotated. The performance of proposed approach is validated on different character data sets like MNIST, MNIST_rot_12k and English alphabets and compared with state of the art rotation invariant classification networks. SRC-CNN is a generalized approach and can be extended for rotation and scale invariant classification of many other datasets as well, choosing any appropriate baseline CNN. Here we have demonstrated the generality of the proposed SRC-CNN on MNIST Fashion data set and found to perform well in rotation and scale invariant classification of objects as well. This paper demonstrates how the basic PCA based rotation and scale invariant image recognition can be integrated to CNN for achieving better rotational and scale invariances in classification.

read more

Citations
More filters
Journal ArticleDOI

CNN-watershed: A watershed transform with predicted markers for corneal endothelium image segmentation

TL;DR: In this paper, the watershed algorithm was used for marker-driven segmentation of corneal endothelial cells and an encoder-decoder convolutional neural network trained in a sliding window set up to predict the probability of cell centers (markers) and cell borders.
Journal ArticleDOI

Deep Image Restoration Model: A Defense Method Against Adversarial Attacks

TL;DR: This paper presents a deep image restoration model that restores adversarial examples so that the target model is classified correctly again and proves that its results are better than other rival methods.
References
More filters
Proceedings Article

Spatial transformer networks

TL;DR: This work introduces a new learnable module, the Spatial Transformer, which explicitly allows the spatial manipulation of data within the network, and can be inserted into existing convolutional architectures, giving neural networks the ability to actively spatially transform feature maps.
Posted Content

Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms

TL;DR: Fashion-MNIST is intended to serve as a direct drop-in replacement for the original MNIST dataset for benchmarking machine learning algorithms, as it shares the same image size, data format and the structure of training and testing splits.
Posted Content

Harmonic Networks: Deep Translation and Rotation Equivariance

TL;DR: Harmonic Networks as mentioned in this paper replace regular CNN filters with circular harmonics, returning a maximal response and orientation for every receptive field patch, which can encode complicated rotational invariants.
Proceedings ArticleDOI

Oriented Response Networks

TL;DR: Active rotating filters (ARFs) as mentioned in this paper can be used to produce within-class rotation-invariant deep features while maintaining inter-class discrimination for classification tasks, which can also be used for image and object orientation estimation.
Proceedings Article

Exploiting cyclic symmetry in convolutional neural networks

TL;DR: This work introduces four operations which can be inserted into neural network models as layers, andWhich can be combined to make these models partially equivariant to rotations, and which enable parameter sharing across different orientations.
Related Papers (5)