scispace - formally typeset
Search or ask a question
Topic

Histogram of oriented gradients

About: Histogram of oriented gradients is a research topic. Over the lifetime, 2037 publications have been published within this topic receiving 55881 citations. The topic is also known as: HOG.


Papers
More filters
Journal ArticleDOI
TL;DR: A real time system to detect alphabets by recognizing the lip pattern based on texture and shape and modelled and tested for real time performance with a video of 10 frames per second.

16 citations

Journal ArticleDOI
TL;DR: A bibliography of nearly 1200 references related to computer vision and image analysis, arranged by subject matter is presented, covering topics including architectures; computational techniques; feature detection and segmentation; image analysis; and motion.
Abstract: This paper presents a bibliography of nearly 1200 references related to computer vision and image analysis, arranged by subject matter. The topics covered include architectures; computational techniques; feature detection and segmentation; image analysis; two-dimensional shape; pattern; color and texture; matching and stereo; three-dimensional recovery and analysis; three-dimensional shape; and motion. A few references are also given on related topics, such as geometry, graphics, image input/output and coding, image processing, optical processing, visual perception, neural nets, pattern recognition and artificial intelligence, as well as on applications.

16 citations

Journal ArticleDOI
TL;DR: A novel and robust approach to control auxiliary tasks in vehicles using hand gestures using Grassmann graph embedding discriminant analysis framework and results show that the proposed model outperforms and is comparable with the state-of-the-art methods.
Abstract: In this study, the authors propose a novel and robust approach to control auxiliary tasks in vehicles using hand gestures. First, they create a three-dimensional video volume by appending one frame to other that captures the motion history of frames. Then, they extract features using histogram of oriented gradients on each video volume. These features are represented in the form of subspaces on Grassmann manifold. To improve the recognition accuracy, they map the data from one manifold to another manifold with the help of a Grassmann kernel. Grassmann graph embedding discriminant analysis framework is used to classify the gestures. They perform experiments on two datasets: LISA and Cambridge Hand Gesture in three different testing methods such as 1/3-subject, 2/3-subject and cross-subject. Experimental results show that their proposed model outperforms and is comparable with the state-of-the-art methods.

16 citations

Proceedings ArticleDOI
13 Apr 2018
TL;DR: A multi-scale pedestrian detection is designed based on histogram of oriented gradients (HOG) and implemented on a field programmable gate array (FPGA) and the results show that the system costs 94,374 logic elements, which is about 82% of total logic elements s of Terasic DE2-115 development board.
Abstract: Pedestrian detection is needed for many vison applications including surveillance, Advanced Driver Assistance Systems (ADAS), Intelligent Transport System (ITS), drone, robotics, etc. There are different sizes of pedestrian or human in an image due to the different distances from the camera and different object's height. To detect all of the objects with different sizes, a multi-scale detector is needed. In this study, a multi-scale pedestrian detection is designed based on histogram of oriented gradients (HOG) and implement the method on a field programmable gate array (FPGA). The processing includes three stages: the input color image is converted to a gray one and then down sampled the gray image by 2 and by 4, respectively. different window sizes are used to extract the features of HOG from three size gray images. Final, linear support vector machine (SVM) is used to classify the extracted features for different window sizes. The experimental results show that the system costs 94,374 logic elements, which is about 82% of total logic elements s of Terasic DE2-115 development board. The system detection accuracy is about 97% on average and the processing speed can achieve 60 fps for 640×480 resolution.

16 citations

Proceedings ArticleDOI
04 Sep 2019
TL;DR: An intelligent vision system embedded on a smartphone and deployed in the wild to detect and recognize British Visual Language (BSL) signs automatically and shown an accuracy of over 99% with an average processing time of 170ms, thus appropriate for real-time visual signing.
Abstract: Developing assistive, cost-effective, non-invasive technologies to aid communication of people with hearing impairments is of prime importance in our society, in order to widen accessibility and inclusiveness. For this purpose, we have developed an intelligent vision system embedded on a smartphone and deployed in the wild. In particular, it integrates both computer vision methods involving Histogram of Oriented Gradients (HOG) and machine learning techniques such as multi-class Support Vector Machine (SVM) to detect and recognize British Visual Language (BSL) signs automatically. Our system was successfully tested on a real-world dataset containing 13,066 samples and shown an accuracy of over 99% with an average processing time of 170ms, thus appropriate for real-time visual signing.

16 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
89% related
Convolutional neural network
74.7K papers, 2M citations
87% related
Deep learning
79.8K papers, 2.1M citations
87% related
Image segmentation
79.6K papers, 1.8M citations
87% related
Feature (computer vision)
128.2K papers, 1.7M citations
86% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202356
2022181
2021116
2020189
2019179
2018240