scispace - formally typeset
Search or ask a question
Topic

Histogram of oriented gradients

About: Histogram of oriented gradients is a research topic. Over the lifetime, 2037 publications have been published within this topic receiving 55881 citations. The topic is also known as: HOG.


Papers
More filters
Proceedings ArticleDOI
01 Dec 2012
TL;DR: By incorporating these texture features into the proposed probabilistic models, this work could systematically infer the optimal hypothesis of parking statuses while dealing with occlusion effect, shadow effect, perspective distortion, and fluctuation of lighting condition in both day time and night time.
Abstract: We proposed a surface-based vacant parking space detection system. Unlike many car-oriented or space-oriented methods, the proposed system is parking-lot-oriented. In the system, we treat the whole parking lot as a structure consisting of plentiful surfaces. A surface-based hierarchical framework is then proposed to integrate the 3-D scene information with the patch-based image observation for the inference of vacant space. To be robust, the feature vector of each image patch is extracted based on the Histogram of Oriented Gradients (HOG) approach. By incorporating these texture features into the proposed probabilistic models, we could systematically infer the optimal hypothesis of parking statuses while dealing with occlusion effect, shadow effect, perspective distortion, and fluctuation of lighting condition in both day time and night time.

18 citations

Journal ArticleDOI
20 Jan 2016-Sensors
TL;DR: This paper presents a vision-based people detection system for improving safety in heavy machines composed of a monocular fisheye camera and a LiDAR, and presents a framework for harnessing theLiDAR modality in order to enhance the detection algorithm for different camera positions.
Abstract: This paper presents a vision-based people detection system for improving safety in heavy machines. We propose a perception system composed of a monocular fisheye camera and a LiDAR. Fisheye cameras have the advantage of a wide field-of-view, but the strong distortions that they create must be handled at the detection stage. Since people detection in fisheye images has not been well studied, we focus on investigating and quantifying the impact that strong radial distortions have on the appearance of people, and we propose approaches for handling this specificity, adapted from state-of-the-art people detection approaches. These adaptive approaches nevertheless have the drawback of high computational cost and complexity. Consequently, we also present a framework for harnessing the LiDAR modality in order to enhance the detection algorithm for different camera positions. A sequential LiDAR-based fusion architecture is used, which addresses directly the problem of reducing false detections and computational cost in an exclusively vision-based system. A heavy machine dataset was built, and different experiments were carried out to evaluate the performance of the system. The results are promising, in terms of both processing speed and performance.

18 citations

Patent
31 Dec 2013
TL;DR: In this paper, a method and system for recognizing behavior is disclosed, which includes: capturing at least one video stream of data on one or more subjects, extracting body skeleton data from the at least 1 video stream and computing feature extractions on the extracted body skeletons data to generate a plurality of 3D delta units for each frame of the extracted skeleton data.
Abstract: A method and system for recognizing behavior is disclosed, the method includes: capturing at least one video stream of data on one or more subjects; extracting body skeleton data from the at least one video stream of data; computing feature extractions on the extracted body skeleton data to generate a plurality of 3 dimensional delta units for each frame of the extracted body skeleton data; generating a plurality of histogram sequences for each frame by projecting the plurality of 3 dimensional delta units for each frame to a spherical coordinate system having a plurality of spherical bins; generating an energy map for each of the plurality of histogram sequences by mapping the plurality of spherical bins versus time; applying a Histogram of Oriented Gradients (HOG) algorithm on the plurality of energy maps to generate a single column vector; and classifying the single column vector as a behavior and/or emotion.

18 citations

Book ChapterDOI
01 Jan 2017
TL;DR: A holistic word recognition method is presented for the recognition of handwritten city names in Bangla script, which has achieved 90.65% accuracy on 10,000 samples comprising of 20 most popular city names of West Bengal, a state of India.
Abstract: In recent times, holistic word recognition has achieved enormous attention from the researchers due to its segmentation-free approach. In the present work, a holistic word recognition method is presented for the recognition of handwritten city names in Bangla script. At first, each word image is hypothetically segmented into equal number of grids. Then gradient-based features, inspired by Histogram of Oriented Gradients (HOG) feature descriptor, are extracted from each of the grids. For the selection of suitable classifier, five well-known classifiers are compared in terms of their recognition accuracies and finally the classifier Sequential Minimal Optimization (SMO) is chosen. The system has achieved 90.65% accuracy on 10,000 samples comprising of 20 most popular city names of West Bengal, a state of India.

18 citations

Book ChapterDOI
05 Oct 2015
TL;DR: This work introduces a new image dataset along with ground truth diagnosis for evaluating image-based cervical disease classification algorithms, and runs seven classic machine-learning algorithms to differentiate images of high-risk patient visits from those of low- risk patient visits.
Abstract: Cervical cancer is one of the most common types of cancer in women worldwide. Most deaths of cervical cancer occur in less developed areas of the world. In this work, we introduce a new image dataset along with ground truth diagnosis for evaluating image-based cervical disease classification algorithms. We collect a large number of cervigram images from a database provided by the US National Cancer Institute. From these images, we extract three types of complementary image features, including Pyramid histogram in L*A*B* color space PLAB, Pyramid Histogram of Oriented Gradients PHOG, and Pyramid histogram of Local Binary Patterns PLBP. PLAB captures color information, PHOG encodes edges and gradient information, and PLBP extracts texture information. Using these features, we run seven classic machine-learning algorithms to differentiate images of high-risk patient visits from those of low-risk patient visits. Extensive experiments are conducted on both balanced and imbalanced subsets of the data to compare the seven classifiers. These results can serve as a baseline for future research in cervical dysplasia classification using images. The image-based classifiers also outperform results of several other screening tests on the same datasets.

18 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
89% related
Convolutional neural network
74.7K papers, 2M citations
87% related
Deep learning
79.8K papers, 2.1M citations
87% related
Image segmentation
79.6K papers, 1.8M citations
87% related
Feature (computer vision)
128.2K papers, 1.7M citations
86% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202356
2022181
2021116
2020189
2019179
2018240