Head Pose Estimation in Computer Vision: A Survey

doi:10.1109/TPAMI.2008.106

Journal ArticleDOI

Head Pose Estimation in Computer Vision: A Survey

Erik Murphy-Chutorian, +1 more

- 01 Apr 2009 -

IEEE Transactions on Pattern Analysis an...

- Vol. 31, Iss: 4, pp 607-626

Chats0

TLDR

This paper discusses the inherent difficulties in head pose estimation and presents an organized survey describing the evolution of the field, comparing systems by focusing on their ability to estimate coarse and fine head pose and highlighting approaches well suited for unconstrained environments.

Abstract:

The capacity to estimate the head pose of another person is a common human ability that presents a unique challenge for computer vision systems. Compared to face detection and recognition, which have been the primary foci of face-related vision research, identity-invariant head pose estimation has fewer rigorously evaluated systems or generic solutions. In this paper, we discuss the inherent difficulties in head pose estimation and present an organized survey describing the evolution of the field. Our discussion focuses on the advantages and disadvantages of each approach and spans 90 of the most innovative and characteristic papers that have been published on this topic. We compare these systems by focusing on their ability to estimate coarse and fine head pose, highlighting approaches that are well suited for unconstrained environments.

Citations

PDF

Open Access

More filters

Book

Computer Vision: Algorithms and Applications

Richard Szeliski

TL;DR: Computer Vision: Algorithms and Applications explores the variety of techniques commonly used to analyze and interpret images and takes a scientific approach to basic vision problems, formulating physical models of the imaging process before inverting them to produce descriptions of a scene.

...read moreread less

Proceedings ArticleDOI

Face detection, pose estimation, and landmark localization in the wild

Xiangxin Zhu, +1 more

TL;DR: It is shown that tree-structured models are surprisingly effective at capturing global elastic deformation, while being easy to optimize unlike dense graph structures, in real-world, cluttered images.

...read moreread less

Journal ArticleDOI

HyperFace: A Deep Multi-Task Learning Framework for Face Detection, Landmark Localization, Pose Estimation, and Gender Recognition

Rajeev Ranjan, +2 more

- 01 Jan 2019 -

IEEE Transactions on Pattern Analysis an...

TL;DR: HyperFace as discussed by the authors combines face detection, landmarks localization, pose estimation and gender recognition using deep convolutional neural networks (CNNs) and achieves significant improvement in performance by fusing intermediate layers of a deep CNN using a separate CNN followed by a multi-task learning algorithm that operates on the fused features.

...read moreread less

Proceedings ArticleDOI

OpenFace: An open source facial behavior analysis toolkit

Tadas Baltrusaitis, +2 more

TL;DR: OpenFace is the first open source tool capable of facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation and allows for easy integration with other applications and devices through a lightweight messaging system.

...read moreread less

Proceedings ArticleDOI

Annotated Facial Landmarks in the Wild: A large-scale, real-world database for facial landmark localization

Martin Köstinger, +3 more

TL;DR: AFLW provides a large-scale collection of images gathered from Flickr, exhibiting a large variety in face appearance as well as general imaging and environmental conditions, and is well suited to train and test algorithms for multi-view face detection, facial landmark localization and face pose estimation.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004 -

International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

...read moreread less

Journal ArticleDOI

Gradient-based learning applied to document recognition

Yann LeCun, +6 more

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.

...read moreread less

Book

Neural networks for pattern recognition

Christopher M. Bishop

TL;DR: This is the first comprehensive treatment of feed-forward neural networks from the perspective of statistical pattern recognition, and is designed as a text, with over 100 exercises, to benefit anyone involved in the fields of neural computation and pattern recognition.

...read moreread less

Proceedings ArticleDOI

Rapid object detection using a boosted cascade of simple features

Paul A. Viola, +1 more

TL;DR: A machine learning approach for visual object detection which is capable of processing images extremely rapidly and achieving high detection rates and the introduction of a new image representation called the "integral image" which allows the features used by the detector to be computed very quickly.

...read moreread less

Book

Adaptive Filter Theory

Simon Haykin

TL;DR: In this paper, the authors propose a recursive least square adaptive filter (RLF) based on the Kalman filter, which is used as the unifying base for RLS Filters.

...read moreread less

Collapse

Head Pose Estimation in Computer Vision: A Survey

Citations

Computer Vision: Algorithms and Applications

Face detection, pose estimation, and landmark localization in the wild

HyperFace: A Deep Multi-Task Learning Framework for Face Detection, Landmark Localization, Pose Estimation, and Gender Recognition

OpenFace: An open source facial behavior analysis toolkit

Annotated Facial Landmarks in the Wild: A large-scale, real-world database for facial landmark localization

References

Distinctive Image Features from Scale-Invariant Keypoints

Gradient-based learning applied to document recognition

Neural networks for pattern recognition

Rapid object detection using a boosted cascade of simple features

Adaptive Filter Theory

Related Papers (5)

Robust Real-Time Face Detection

Active appearance models

Rapid object detection using a boosted cascade of simple features

Face detection, pose estimation, and landmark localization in the wild

Histograms of oriented gradients for human detection