scispace - formally typeset
Search or ask a question
Topic

Histogram of oriented gradients

About: Histogram of oriented gradients is a research topic. Over the lifetime, 2037 publications have been published within this topic receiving 55881 citations. The topic is also known as: HOG.


Papers
More filters
Proceedings ArticleDOI
02 Apr 2019
TL;DR: An automatic system of face expression recognition which is able to recognize all eight basic facial expressions which are normal, happy, angry, contempt, surprise, sad, fear and disgust is presented.
Abstract: Facial Expression Recognition (FER) has been an active topic of papers that were researched during 1990s till now, according to its importance, FER has achieved an extremely role in image processing area. FER typically performed in three stages include, face detection, feature extraction and classification. This paper presents an automatic system of face expression recognition which is able to recognize all eight basic facial expressions which are (normal, happy, angry, contempt, surprise, sad, fear and disgust) while many FER systems were proposed for recognizing only some of face expressions. For validating the method, the Extended Cohn-Kanade (CK+) dataset is used. The presented method uses Viola-Jones algorithm for face detection. Histogram of Oriented Gradients (HOG) is used as a descriptor for feature extraction from the images of expressive faces. Principal Component Analysis (PCA) applied to reduce dimensionality of the Features, to obtaining the most significant features. Finally, the presented method used three different classifiers which are Support Vector Machine (SVM), K-Nearest Neighbor (KNN) and Multilayer Perceptron Neural Network (MLPNN) for classifying the facial expressions and the results of them are compared. The experimental results show that the presented method provides the recognition rate with 93.53% when using SVM classifier, 82.97% when using MLP classifier and 79.97% when using KNN classifier which refers that the presented method provides better results while using SVM as a classifier.

51 citations

Proceedings ArticleDOI
23 Mar 2004
TL;DR: A shape representation method, the mountain-climbing sequence (MCS), that is invariant to translation, rotation, and scale problems and shows a superior matching ratio even in the presence of a modest level of deformation.
Abstract: Content-based image retrieval (CBIR) work includes feature selection, object representation, and matching. If a shape is used as feature, edge detection might be the first step to extract that feature. Invariance to translation, rotation, and scale is required by a good shape representation. Sustaining deformation contour matching is an important issue at the matching process. An efficient and robust shape-based image retrieval system is proposed. We use the Prompt edge detection method [H.J. Lin et al., (2001)] to detect edge points, which is compared with the Sobel edge detection method. We also introduce a shape representation method, the mountain-climbing sequence (MCS), that is invariant to translation, rotation, and scale problems. The results of our proposed method show a superior matching ratio even in the presence of a modest level of deformation.

50 citations

Proceedings ArticleDOI
28 Dec 2015
TL;DR: This work proposes to use the Subband Power Distribution (SPD) as a feature to capture the occurrences of these events by computing the histogram of amplitude values in each frequency band of a spectrogram image by using the so-called Sinkhorn kernel.
Abstract: Acoustic scene classification is a difficult problem mostly due to the high density of events concurrently occurring in audio scenes. In order to capture the occurrences of these events we propose to use the Subband Power Distribution (SPD) as a feature. We extract it by computing the histogram of amplitude values in each frequency band of a spectrogram image. The SPD allows us to model the density of events in each frequency band. Our method is evaluated on a large acoustic scene dataset using support vector machines. We outperform the previous methods when using the SPD in conjunction with the histogram of gradients. To reach further improvement, we also consider the use of an approximation of the earth mover's distance kernel to compare histograms in a more suitable way. Using the so-called Sinkhorn kernel improves the results on most of the feature configurations. Best performances reach a 92.8% F1 score.

50 citations

Proceedings ArticleDOI
01 Oct 2015
TL;DR: A new method of micro-movement detection by applying Histogram of Oriented Gradients as a feature descriptor on the authors' in-house high-speed video dataset of spontaneous micro facial movements is proposed.
Abstract: Detecting micro-facial movements in a video sequence is the first step in realising a system that can pick out rapid movements automatically as a person is being recorded. This paper proposes a new method of micro-movement detection by applying Histogram of Oriented Gradients as a feature descriptor on our in-house high-speed video dataset of spontaneous micro facial movements. Firstly the algorithm aligns and crops faces for each video using automatic facial point detection and affine transformation. Then a de-noising algorithm is applied to each video before splitting them into blocks where the Histogram of Oriented Gradient features are calculated for each frame in every video block. The Chi-Squared distance measure is then used to calculate dissimilarity in the spatial appearance between frames at a set interval. The final feature vector is calculated after normalisation of the raw distance values and peak detection is applied to 'spot' micro-facial movements. An individualised baseline threshold is used to determine the value a peak must exceed to be classed as a movement. The result is compared with a benchmark algorithm - feature difference analysis techniques for micro-facial movements using Local Binary Patterns. Results indicate the proposed method achieves higher Recall of 0.8429 and F1-measure of 0.7672.

50 citations

Proceedings ArticleDOI
03 Dec 2010
TL;DR: It is observed that SIFT and CHoG outperform MPEG-7 image signatures greatly in terms of feature-level Receiver Operating Characteristic performance and image-level matching and demonstrate such gains while being comparable with MPEG- 7 image signatures in bit-rate.
Abstract: We evaluate the performance of MPEG-7 image signatures, Compressed Histogram of Gradients descriptor (CHoG) and Scale Invariant Feature Transform (SIFT) descriptors for mobile visual search applications. We observe that SIFT and CHoG outperform MPEG-7 image signatures greatly in terms of feature-level Receiver Operating Characteristic (ROC) performance and image-level matching. Moreover, CHoG descriptors demonstrate such gains while being comparable with MPEG-7 image signatures in bit-rate.

50 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
89% related
Convolutional neural network
74.7K papers, 2M citations
87% related
Deep learning
79.8K papers, 2.1M citations
87% related
Image segmentation
79.6K papers, 1.8M citations
87% related
Feature (computer vision)
128.2K papers, 1.7M citations
86% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202356
2022181
2021116
2020189
2019179
2018240