scispace - formally typeset
Proceedings ArticleDOI

Multi-View and Multi-Scale Fine-Grained Vehicle Classification with Channel Convolution Feature Fusion

TLDR
In this article, a fine-grained vehicle classification using a multi-branch convolutional neural network (CNN) with multiple views of the vehicle was proposed to solve the vehicle's make classification task.
Abstract
A computer vision solution applied to an automatic toll collection (ATC) with a subscription/membership is proposed in this paper. In this application, a unique identifier (ID) is related to a concrete vehicle and a membership. A camera system is put in place to verify that for each transaction the vehicle and the ID correspond with the actual membership data. The visual system extracts different vehicle characteristics including license plate number, make, model, color, number of axles, etc. The system then compares the extracted characteristics with those found in the membership. We focus on solving the vehicle's make classification task. We propose a fine-grained vehicle classification that exploits the multi-camera composition of the system by feeding a multi-branch convolutional neural network (CNN) with multiple views of the vehicle. Each branch of the network uses a cascade approach to localize the vehicle and its most salient regions, as well as extracting multi-scale features per view. The extracted features are late fused using a convolutional approach and used to classify the vehicle's make. Our network learns to extract discriminant features from different views and regions of interest and to fuse them in the best possible way to improve classification performance. The presented evaluations show that the proposed multi-view network architecture significantly improves the vehicle's make classification performance when compared to single view approaches.

read more

References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Proceedings ArticleDOI

Densely Connected Convolutional Networks

TL;DR: DenseNet as mentioned in this paper proposes to connect each layer to every other layer in a feed-forward fashion, which can alleviate the vanishing gradient problem, strengthen feature propagation, encourage feature reuse, and substantially reduce the number of parameters.
Proceedings ArticleDOI

Rethinking the Inception Architecture for Computer Vision

TL;DR: In this article, the authors explore ways to scale up networks in ways that aim at utilizing the added computation as efficiently as possible by suitably factorized convolutions and aggressive regularization.
Related Papers (5)