scispace - formally typeset
Proceedings ArticleDOI

Efficient video object classifier using locality-enhanced support vector machines

Reads0
Chats0
TLDR
This paper aims to merge multi-scale based selective encoding/classification technique and locality-enhanced support vector machine (SVM) to enable use of SVM in video applications with significantly reduced computational requirements.
Abstract
In multimedia applications such as MPEG-4, an efficient model is required to encode and classify video objects such as human, car and building. Recently, support vector machine (SVM) has been shown to be a good classifier; however, its large computational requirement prohibited its use in real time video processing applications. In this paper, a model is proposed that enables use of SVM in video applications. This paper aims to merge multi-scale based selective encoding/classification technique and locality-enhanced support vector machine (SVM). The proposed model allows selected image scales (of interest) to be encoded and classified more accurately by complex classifier such as SVM, whilst other image scales of less significance to be encoded and classified by simpler encoder/classifier. Image scales of interest are readily selected from multi-scale image processing paradigm. SVM is used to encode visual object information of significant image scale only; hence its use is efficient. Experiment with MPEG-4 video object encoding and classification shows that the performance of the proposed model is comparable with other models, however with significantly reduced computational requirements.

read more

Citations
More filters
Proceedings ArticleDOI

Expression-invariant face recognition system using subspace model analysis

TL;DR: The experimental results indicate that face recognition may be robust to facial changes and applicable and its potential applicability is indicated.
Proceedings ArticleDOI

Expression-invariant face recognition for small class problem

TL;DR: Results indicate that face recognition may be robust to facial expression changes, and used the FLD subspace analysis method showed to achieve better classification performances.
References
More filters
Journal ArticleDOI

Learning representations by back-propagating errors

TL;DR: Back-propagation repeatedly adjusts the weights of the connections in the network so as to minimize a measure of the difference between the actual output vector of the net and the desired output vector, which helps to represent important features of the task domain.
Journal ArticleDOI

The perceptron: a probabilistic model for information storage and organization in the brain.

TL;DR: This article will be concerned primarily with the second and third questions, which are still subject to a vast amount of speculation, and where the few relevant facts currently supplied by neurophysiology have not yet been integrated into an acceptable theory.
Book

The perception: a probabilistic model for information storage and organization in the brain

F. Rosenblatt
TL;DR: The second and third questions are still subject to a vast amount of speculation, and where the few relevant facts currently supplied by neurophysiology have not yet been integrated into an acceptable theory as mentioned in this paper.
Journal ArticleDOI

Backpropagation through time: what it does and how to do it

TL;DR: This paper first reviews basic backpropagation, a simple method which is now being widely used in areas like pattern recognition and fault diagnosis, and describes further extensions of this method, to deal with systems other than neural networks, systems involving simultaneous equations or true recurrent networks, and other practical issues which arise with this method.
Journal ArticleDOI

A general regression neural network

TL;DR: The general regression neural network (GRNN) is a one-pass learning algorithm with a highly parallel structure that provides smooth transitions from one observed value to another.
Related Papers (5)