
Showing papers in "Image and Vision Computing in 2014"


Journal ArticleDOI
TL;DR: Because posed and un-posed (aka "spontaneous") facial expressions differ along several dimensions including complexity and timing, well-annotated video of un-posed facial behavior is needed.

523 citations


Journal ArticleDOI
TL;DR: The problem of person re-identification is explored and open issues and challenges of the problem are highlighted with a discussion on potential directions for further research.

422 citations


Journal ArticleDOI
TL;DR: Avoiding the use of complicated pre-processing steps such as accurate face and body part segmentation or image normalization, this paper proposes a novel face/person image representation, called gBiCov, which can properly handle background and illumination variations.

269 citations


Journal ArticleDOI
TL;DR: The co-occurrence between face and body helps to handle large variations, such as heavy occlusions, and further boosts face detection performance; a hierarchical part-based structural model is proposed to explicitly capture it.

203 citations


Journal ArticleDOI
TL;DR: The finding that automatic facial expression analysis was consistent with manual coding and revealed the same pattern of findings suggests that automatic facial expression analysis may be ready to relieve the burden of manual coding in behavioral and clinical science.

199 citations


Journal ArticleDOI
TL;DR: A detailed review of recent advances in visual speech decoding, focusing on the important questions asked by researchers, summarizing the recent studies that attempt to answer them, and providing details of audio-visual speech databases.

169 citations


Journal ArticleDOI
TL;DR: FIRME (Face and Iris Recognition for Mobile Engagement) is described as a biometric application based on multimodal recognition of face and iris, designed to be embedded in mobile devices and optimized to be low-demanding and computation-light.

159 citations


Journal ArticleDOI
TL;DR: The analysis shows that for matching frontal faces in still images, algorithms are consistently superior to humans, and for video and difficult still face pairs, humans are superior.

120 citations


Journal ArticleDOI
TL;DR: This paper evaluates spatiotemporal interest point (STIP) based features for depth-based action recognition, and investigates a fusion of the best STIP features with the prevalent skeleton features, presenting a complementary use of STIP features for action recognition on 3D data.

117 citations
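
The entry above reduces to a feature-level fusion recipe: build a per-sequence descriptor from STIP features, concatenate it with the skeleton features, and train a classifier. Below is a minimal, hypothetical sketch of that recipe (the array shapes, the bag-of-words histograms and the linear SVM are illustrative assumptions, not the paper's exact pipeline).

    import numpy as np
    from sklearn.svm import LinearSVC
    from sklearn.preprocessing import StandardScaler
    from sklearn.pipeline import make_pipeline

    def fuse_features(stip_histograms, skeleton_features):
        # Feature-level fusion: concatenate the two descriptors per sequence.
        return np.hstack([stip_histograms, skeleton_features])

    # X_stip: (n_sequences, d_stip) bag-of-words histograms over STIP descriptors
    # X_skel: (n_sequences, d_skel) skeleton joint features; y: action labels
    X_stip = np.random.rand(100, 500)
    X_skel = np.random.rand(100, 60)
    y = np.random.randint(0, 10, size=100)

    clf = make_pipeline(StandardScaler(), LinearSVC())
    clf.fit(fuse_features(X_stip, X_skel), y)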


Journal ArticleDOI
TL;DR: A number of nonverbal behavior descriptors that can be automatically estimated from audiovisual signals are proposed; these could be used to support healthcare providers with quantified and objective observations that could ultimately improve clinical assessment.

110 citations


Journal ArticleDOI
TL;DR: The proposed discriminative dictionary learning with low-rank regularization (D2L2R2) approach is evaluated on four face and digit image datasets against existing representative dictionary learning and classification algorithms, and the comparison demonstrates its superiority.
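
As a rough illustration of what a "dictionary learning with low-rank regularization" objective looks like, the sketch below evaluates a generic cost combining reconstruction error, an l1 sparsity term on the codes and a nuclear-norm (low-rank) penalty. Placing the low-rank penalty on the coefficient matrix is an assumption made here for brevity; the paper's D2L2R2 formulation (e.g. class-wise sub-dictionaries) may place it differently.

    import numpy as np

    def dictionary_lowrank_objective(X, D, A, lam_sparse=0.1, lam_lowrank=0.1):
        # Reconstruction error of the samples under dictionary D and codes A.
        recon = 0.5 * np.linalg.norm(X - D @ A, 'fro') ** 2
        # l1 sparsity on the codes.
        sparsity = lam_sparse * np.abs(A).sum()
        # Nuclear norm (sum of singular values) as the low-rank surrogate.
        nuclear = lam_lowrank * np.linalg.svd(A, compute_uv=False).sum()
        return recon + sparsity + nuclear

    # X: (d, n) samples, D: (d, k) dictionary, A: (k, n) sparse codes
    X = np.random.rand(64, 200)
    D = np.random.rand(64, 40)
    A = np.random.rand(40, 200)
    print(dictionary_lowrank_objective(X, D, A))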

Journal ArticleDOI
TL;DR: It is found that the canonical correlation analysis (CCA) based methods can derive an extremely low dimensionality in estimating age, gender and ethnicity.
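
A minimal sketch of the CCA idea referred to above: project image features into a very low-dimensional subspace that is maximally correlated with a joint encoding of age, gender and ethnicity labels. The feature matrix, the label encoding and the use of scikit-learn's CCA are illustrative assumptions, not the paper's setup.

    import numpy as np
    from sklearn.cross_decomposition import CCA

    # X: image features (n_samples, d); Y: joint label encoding per sample.
    X = np.random.rand(300, 200)
    Y = np.column_stack([np.random.randint(18, 70, 300),   # age
                         np.random.randint(0, 2, 300),     # gender
                         np.random.randint(0, 4, 300)]).astype(float)

    cca = CCA(n_components=3)        # extremely low-dimensional projection
    cca.fit(X, Y)
    X_low = cca.transform(X)         # (n_samples, 3) correlated subspace
    print(X_low.shape)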

Journal ArticleDOI
TL;DR: A novel method is proposed that performs accurate gaze estimation without restricting the user's head motion by decomposing the original free-head motion problem into subproblems, including an initial fixed head pose problem and subsequent compensations to correct the initial estimation biases.

Journal ArticleDOI
Qingqing Yang, Pan Ji, Dongxiao Li, Shao-Jun Yao, Ming Zhang
TL;DR: A novel stereo matching algorithm is presented that ranks 10th among about 152 algorithms on the Middlebury stereo evaluation benchmark, and takes first place among all local methods.

Journal ArticleDOI
TL;DR: This work extends the idea of detecting facial expressions through 'concept frames' to 'concept segments', argues through extensive experiments that algorithms such as MIL are needed to reap the benefits of such a representation, and demonstrates that MS-MIL yields a significant improvement on another spontaneous facial expression dataset, the FEEDTUM dataset.

Journal ArticleDOI
TL;DR: An iterative optimization algorithm is proposed that obtains the model parameters that best reproduce ToF measurements, recovering the depth of the scene without distortion; it accurately corrects the multipath distortion, obtaining depth maps that are very close to ground truth data.

Journal ArticleDOI
TL;DR: This paper proposes a direct approach that takes into account the image as a whole, and considers a similarity measure, the mutual information, which allows the method to deal with different image modalities (real and synthetic).
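
The similarity measure mentioned above, mutual information, can be estimated directly from the joint intensity histogram of the two images; the minimal sketch below does exactly that (the bin count and the synthetic test images are arbitrary choices, not the paper's settings).

    import numpy as np

    def mutual_information(img_a, img_b, bins=32):
        # Joint intensity histogram of two equally-sized grayscale images.
        joint, _, _ = np.histogram2d(img_a.ravel(), img_b.ravel(), bins=bins)
        pxy = joint / joint.sum()
        px = pxy.sum(axis=1, keepdims=True)
        py = pxy.sum(axis=0, keepdims=True)
        nz = pxy > 0                          # avoid log(0)
        return float(np.sum(pxy[nz] * np.log(pxy[nz] / (px @ py)[nz])))

    a = np.random.rand(120, 160)
    b = np.clip(a + 0.05 * np.random.randn(120, 160), 0, 1)
    print(mutual_information(a, b))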

Journal ArticleDOI
TL;DR: Transformed Grassmannian robust adaptive subspace tracking algorithm (t-GRASTA) as mentioned in this paper iteratively performs incremental gradient descent constrained to the Grassmann manifold of subspaces to estimate three components of a decomposition of a collection of images: a low-rank subspace, a sparse part of occlusions and foreground objects, and a transformation such as rotation or translation of the image.

Journal ArticleDOI
TL;DR: Comparison of the two smile detection algorithms showed that improved smile detection helps correctly classify responses recorded in challenging lighting conditions and those in which the expressions were subtle; temporal discriminative approaches to classification performed most strongly, showing that temporal information about an individual's response is important.

Journal ArticleDOI
TL;DR: A new constraint that uses depth data from a commodity RGBD camera (Kinect) is introduced into AAM fitting, significantly reducing 3D tracking errors; the paper also describes how to initialize the 3D morphable face model used in the tracking algorithm by computing the user's face shape parameters from a batch of tracked frames.

Journal ArticleDOI
TL;DR: This paper proposes a new method to extract a gait feature directly from a raw gait video using Space-Time Interest Points (STIPs), to enhance SVM-based gait classification.
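
A hypothetical sketch of the general recipe the entry describes: quantize STIP descriptors extracted from each gait video into a bag-of-words histogram and feed the histograms to an SVM. The descriptor dimensionality, codebook size and kernel are assumptions; the STIP extraction itself is presumed to come from an existing detector.

    import numpy as np
    from sklearn.cluster import KMeans
    from sklearn.svm import SVC

    def bow_histogram(descriptors, codebook):
        # Quantize one video's STIP descriptors and return a normalized histogram.
        words = codebook.predict(descriptors)
        hist = np.bincount(words, minlength=codebook.n_clusters).astype(float)
        return hist / max(hist.sum(), 1.0)

    # Hypothetical data: each video contributes a set of 162-D STIP descriptors.
    videos = [np.random.rand(np.random.randint(50, 200), 162) for _ in range(40)]
    labels = np.random.randint(0, 5, size=40)      # gait-class / subject labels

    codebook = KMeans(n_clusters=64, n_init=4, random_state=0).fit(np.vstack(videos))
    X = np.array([bow_histogram(v, codebook) for v in videos])

    clf = SVC(kernel='rbf').fit(X, labels)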

Journal ArticleDOI
TL;DR: A mathematical framework for obtaining the rotation, scaling and translation invariants of these two types of radial shifted Legendre moments is provided and the superiority of the proposed methods in terms of image reconstruction capability and invariant recognition accuracy under both noisy and noise-free conditions is shown.
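
For context on how such invariants are usually obtained, the sketch below shows the standard translation and scale normalization of ordinary geometric central moments: the centroid removes translation, division by a power of mu_00 removes scale, and the printed combination eta_20 + eta_02 happens to be rotation-invariant as well. The paper's construction for radial shifted Legendre moments differs in detail; this only illustrates the underlying idea.

    import numpy as np

    def normalized_central_moment(img, p, q):
        # Central moment about the centroid, normalized by mu_00^(1+(p+q)/2)
        # so that the result is invariant to translation and scaling.
        y, x = np.mgrid[0:img.shape[0], 0:img.shape[1]].astype(float)
        m00 = img.sum()
        xc, yc = (x * img).sum() / m00, (y * img).sum() / m00
        mu_pq = (((x - xc) ** p) * ((y - yc) ** q) * img).sum()
        return mu_pq / m00 ** (1 + (p + q) / 2.0)

    img = np.random.rand(64, 64)
    # eta_20 + eta_02: additionally rotation-invariant (the first Hu moment).
    print(normalized_central_moment(img, 2, 0) + normalized_central_moment(img, 0, 2))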

Journal ArticleDOI
TL;DR: An unsupervised distance learning approach for improving the effectiveness of image retrieval tasks is presented, based on a Reciprocal kNN Graph algorithm that considers the relationships among ranked lists in the context of a k-reciprocal neighborhood.
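
The k-reciprocal neighborhood the entry refers to is simple to state: images i and j are k-reciprocal neighbors when each appears in the other's top-k ranked list. The sketch below computes that relation from a distance matrix; the paper's Reciprocal kNN Graph algorithm then goes further and propagates these relationships across ranked lists, which is not reproduced here.

    import numpy as np

    def k_reciprocal_pairs(dist, k=5):
        # R[i, j] is True iff j is in i's k nearest neighbors and vice versa.
        n = dist.shape[0]
        d = dist.copy()
        np.fill_diagonal(d, np.inf)                  # exclude self-matches
        knn = np.argsort(d, axis=1)[:, :k]           # top-k ranked lists
        in_knn = np.zeros((n, n), dtype=bool)
        rows = np.repeat(np.arange(n), k)
        in_knn[rows, knn.ravel()] = True
        return in_knn & in_knn.T                     # reciprocal relation

    dist = np.random.rand(50, 50); dist = (dist + dist.T) / 2
    print(k_reciprocal_pairs(dist, k=5).sum(), "reciprocal pairs (counted in both directions)")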

Journal ArticleDOI
TL;DR: A novel solution is proposed for the problem of segmenting macro- and micro-expression frames (or retrieving the expression intervals) in video sequences, which is a prior step for many expression recognition algorithms.

Journal ArticleDOI
TL;DR: A new computational phonetic modeling framework for sign language (SL) recognition is introduced; it is based on dynamic-static statistical subunits and provides sequentiality in an unsupervised manner, without prior linguistic information.

Journal ArticleDOI
TL;DR: A novel tracking method tailored to dense crowds is proposed; it provides an alternative and complementary approach to methods that require modeling of crowd flow and, by minimally relying on previous frames, is less likely to fail in the case of dynamic crowd flows and anomalies.

Journal ArticleDOI
TL;DR: A novel sparse feature selection framework for web image annotation, namely sparse Feature Selection based on Graph Laplacian (FSLG), is proposed, which applies the l2,1/2-matrix norm in the sparse feature selection algorithm to select the most sparse and discriminative features.
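
A sketch of the row-sparsity regularizer named above: the l2,1/2 matrix quasi-norm combines the l2 norms of the rows of the projection matrix with power 1/2, which pushes whole rows (i.e., features) to zero. The definition used here, (sum_i ||w_i||_2^(1/2))^2, is one common convention; the paper may use an equivalent variant.

    import numpy as np

    def l2_half_norm(W):
        # Rows of W correspond to features; rows with small l2 norm are driven
        # to zero by this penalty, which deselects the corresponding features.
        row_norms = np.linalg.norm(W, axis=1)
        return np.sqrt(row_norms).sum() ** 2

    W = np.vstack([np.zeros((8, 4)), np.random.rand(4, 4)])   # 8 deselected features
    print(l2_half_norm(W))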

Journal ArticleDOI
TL;DR: Novel work on autonomously identifying Safe Landing Zones (SLZs), which can be utilised upon occurrence of a safety-critical event, is presented; results based on colour aerial imagery captured during manned flight demonstrate the practical potential of the methods discussed.

Journal ArticleDOI
TL;DR: It is shown that inter-session variability modelling using Gaussian mixture models provides a consistently robust system for face, speaker and bi-modal authentication and that multi-algorithm fusion provides a consistent performance improvement for face and speaker authentication.
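
As a minimal illustration of the multi-algorithm / bi-modal fusion step mentioned above, the sketch below z-normalizes each system's scores and takes a weighted sum. Equal weights, the normalization and the synthetic scores are all assumptions; the paper's fusion scheme and the GMM-based scoring that precedes it are not reproduced here.

    import numpy as np

    def fuse_scores(score_lists, weights=None):
        # Score-level fusion: z-normalize each system's scores, then weighted sum.
        score_lists = [np.asarray(s, dtype=float) for s in score_lists]
        weights = weights or [1.0 / len(score_lists)] * len(score_lists)
        fused = np.zeros_like(score_lists[0])
        for w, s in zip(weights, score_lists):
            fused += w * (s - s.mean()) / (s.std() + 1e-9)
        return fused

    face_scores = np.random.randn(100) + 0.5      # hypothetical face scores
    speaker_scores = np.random.randn(100) + 0.3   # hypothetical speaker scores
    print(fuse_scores([face_scores, speaker_scores])[:5])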

Journal ArticleDOI
TL;DR: An effective and efficient approach is presented that combines boosted local texture and shape features extracted from 3D face models, in contrast to existing methods that depend only on either 2D texture or 3D shape of faces, to improve results in gender and ethnicity classification.
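
A hypothetical sketch of the combination scheme the entry describes: concatenate local texture and shape descriptors extracted from the 3D face data and train a boosted classifier on the joint representation. The descriptors, their dimensions and the use of AdaBoost here are placeholders, not the paper's exact boosting setup.

    import numpy as np
    from sklearn.ensemble import AdaBoostClassifier

    # Hypothetical per-face descriptors: local texture (e.g. LBP-like histograms)
    # and shape features derived from the 3D mesh.
    texture = np.random.rand(200, 120)
    shape = np.random.rand(200, 80)
    gender = np.random.randint(0, 2, size=200)

    X = np.hstack([texture, shape])               # combine both cues
    clf = AdaBoostClassifier(n_estimators=200).fit(X, gender)
    print(clf.score(X, gender))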