scispace - formally typeset
Journal ArticleDOI

Rotation-Invariant HOG Descriptors Using Fourier Analysis in Polar and Spherical Coordinates

Reads0
Chats0
TLDR
This paper presents a method to build rotation-invariant HOG descriptors using Fourier analysis in polar/spherical coordinates, which are closely related to the irreducible representation of the 2D/3D rotation groups.
Abstract
The histogram of oriented gradients (HOG) is widely used for image description and proves to be very effective. In many vision problems, rotation-invariant analysis is necessary or preferred. Popular solutions are mainly based on pose normalization or learning, neglecting some intrinsic properties of rotations. This paper presents a method to build rotation-invariant HOG descriptors using Fourier analysis in polar/spherical coordinates, which are closely related to the irreducible representation of the 2D/3D rotation groups. This is achieved by considering a gradient histogram as a continuous angular signal which can be well represented by the Fourier basis (2D) or spherical harmonics (3D). As rotation-invariance is established in an analytical way, we can avoid discretization artifacts and create a continuous mapping from the image to the feature space. In the experiments, we first show that our method outperforms the state-of-the-art in a public dataset for a car detection task in aerial images. We further use the Princeton Shape Benchmark and the SHREC 2009 Generic Shape Benchmark to demonstrate the high performance of our method for similarity measures of 3D shapes. Finally, we show an application on microscopic volumetric data.

read more

Citations
More filters
Journal ArticleDOI

Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection

TL;DR: This work builds up the existing state-of-the-art object detection systems and proposes a simple but effective method to train rotation-invariant and Fisher discriminative CNN models to further boost object detection performance.
Journal ArticleDOI

Applications of machine learning in animal behaviour studies

TL;DR: This review aims to introduce animal behaviourists unfamiliar with machine learning (ML) to the promise of these techniques for the analysis of complex behavioural data and illustrate key ML approaches by developing data analytical pipelines for three different case studies that exemplify the types of behavioural and ecological questions ML can address.
Proceedings ArticleDOI

Oriented Response Networks

TL;DR: Active rotating filters (ARFs) as mentioned in this paper can be used to produce within-class rotation-invariant deep features while maintaining inter-class discrimination for classification tasks, which can also be used for image and object orientation estimation.
Proceedings ArticleDOI

Effective semantic pixel labelling with convolutional networks and Conditional Random Fields

TL;DR: An effective semantic pixel labelling using CNN features, hand-crafted features and Conditional Random Fields (CRFs) is proposed and applied to the ISPRS 2D semantic labelling challenge dataset with competitive classification accuracy.
Journal ArticleDOI

Drones count wildlife more accurately and precisely than humans

TL;DR: The effectiveness of management decision-making is often dependent on the accuracy and timeliness of the relevant ecological data upon which decisions are based, meaning that improvements to data collection methods may herald improved ecological outcomes from management actions as mentioned in this paper.
References
More filters
Journal ArticleDOI

Random Forests

TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Journal ArticleDOI

Gradient-based learning applied to document recognition

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.
Journal ArticleDOI

LIBSVM: A library for support vector machines

TL;DR: Issues such as solving SVM optimization problems theoretical convergence multiclass classification probability estimates and parameter selection are discussed in detail.
Journal ArticleDOI

Support-Vector Networks

TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support- vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.
Related Papers (5)