Showing papers in "Image and Vision Computing in 2018"

Open Access

Journal Article•DOI•

Speeded up detection of squared fiducial markers

Francisco J. Romero-Ramirez¹, Rafael Muñoz-Salinas¹, Rafael Medina-Carnicer¹•Institutions (1)

01 Aug 2018-Image and Vision Computing

TL;DR: This paper proposes a multi-scale strategy for speeding up marker detection in video sequences by wisely selecting the most appropriate scale for detection, identification and corner estimation.

488 citations

Journal Article•DOI•

Beyond one-hot encoding: Lower dimensional target embedding

Pau Rodríguez¹, Miguel Ángel Bautista², Jordi Gonzàlez¹, Sergio Escalera¹,³•Institutions (3)

Autonomous University of Barcelona¹, Heidelberg University², University of Barcelona³

01 Jul 2018-Image and Vision Computing

TL;DR: This work proposes a normalized eigenrepresentation of the class manifold that encodes the targets with minimal information loss, improving the accuracy of random projections encoding while enjoying the same convergence rates.

210 citations

Journal Article•DOI•

A comprehensive review of fruit and vegetable classification techniques

Khurram Hameed¹, Douglas Chai¹, Alexander Rassau¹•Institutions (1)

Edith Cowan University¹

01 Dec 2018-Image and Vision Computing

TL;DR: A critical comparison of different state-of-the-art computer vision methods proposed by researchers for classifying fruits and vegetables is presented.

120 citations

Journal Article•DOI•

Reading car license plates using deep neural networks

Hui Li¹, Peng Wang¹, Mingyu You², Chunhua Shen¹•Institutions (2)

University of Adelaide¹, Tongji University²

01 Apr 2018-Image and Vision Computing

TL;DR: By exploring contextual information and avoiding errors caused by segmentation, this method performs better than conventional methods and achieves state-of-the-art recognition accuracy.

103 citations

Journal Article•DOI•

Survey on automatic lip-reading in the era of deep learning

Adriana Fernandez-Lopez¹, Federico M. Sukno¹•Institutions (1)

Pompeu Fabra University¹

01 Oct 2018-Image and Vision Computing

TL;DR: It is found that DL architectures perform similarly to traditional ones for simpler tasks but report significant improvements in more complex tasks, such as word or sentence recognition, with up to 40% improvement in word recognition rates.

84 citations

Journal Article•DOI•

On the generalization of color texture-based face anti-spoofing

Zinelabidine Boulkenafet¹, Jukka Komulainen¹, Abdenour Hadid¹,²•Institutions (2)

University of Oulu¹, Northwestern Polytechnical University²

01 Sep 2018-Image and Vision Computing

TL;DR: A face PAD solution of attack-specific countermeasures based solely on color texture analysis is proposed and investigated to see how well it generalizes under display and print attacks in different conditions.

61 citations

Journal Article•DOI•

Enhancing Convolutional Neural Networks for Face Recognition with Occlusion Maps and Batch Triplet Loss

Daniel Saez-Trigueros¹, Li Meng¹, Margaret Hartnett•Institutions (1)

University of Hertfordshire¹

01 Nov 2018-Image and Vision Computing

TL;DR: A simple method is proposed to find out which parts of the human face matter most for achieving a high recognition rate, and to use that information during training to force a convolutional neural network to learn discriminative features from all face regions more equally, including those that typical approaches tend to pay less attention to.

54 citations

Journal Article•DOI•

Image annotation: Then and now

P. K. Bhagat¹, Prakash Choudhary¹•Institutions (1)

National Institute of Technology, Manipur¹

01 Dec 2018-Image and Vision Computing

TL;DR: This paper discusses the predominant approaches to automatic image annotation (AIA), their constraints, and ways to deal with them, and presents performance evaluation measures along with relevant and influential image annotation databases.

45 citations

Journal Article•DOI•

Long-term path prediction in urban scenarios using circular distributions

Pasquale Coscia¹, Francesco Castaldo¹, Francesco Palmieri¹, Alexandre Alahi², Silvio Savarese³, Lamberto Ballan⁴•Institutions (4)

Seconda Università degli Studi di Napoli¹, École Polytechnique Fédérale de Lausanne², Stanford University³, University of Padua⁴

01 Jan 2018-Image and Vision Computing

TL;DR: This work focuses on a typical urban human scene, where it aims at predicting an agent's behavior using a stochastic model that fuses the various factors contributing to human motion in different contexts and provides a statistically smooth prediction towards the most likely areas.

41 citations

Journal Article•DOI•

Modeling of facial aging and kinship: A survey

Markos Georgopoulos¹, Yannis Panagakis¹,², Maja Pantic¹•Institutions (2)

Imperial College London¹, Middlesex University²

01 Dec 2018-Image and Vision Computing

TL;DR: In this paper, the authors provide an up-to-date, complete list of available annotated datasets and an in-depth analysis of geometric, hand-crafted, and learned facial representations that are used for facial aging and kinship characterization.

32 citations

Journal Article•DOI•

Multi-view 3D face reconstruction with deep recurrent neural networks

Pengfei Dou¹, Ioannis A. Kakadiaris¹•Institutions (1)

University of Houston¹

01 Dec 2018-Image and Vision Computing

TL;DR: This work proposes a method, Deep Recurrent 3D FAce Reconstruction (DRFAR), to solve the task of multi-view 3D face reconstruction using a subspace representation of the 3D facial shape and a deep recurrent neural network that consists of both a deep convolutional neural network (DCNN) and a recurrent neural network (RNN).

Journal Article•DOI•

Learning Deep Similarity Models with Focus Ranking for Fabric Image Retrieval

Daiguo Deng¹, Ruomei Wang¹, Hefeng Wu², Huayong He¹, Qi Li³, Xiaonan Luo⁴•Institutions (4)

Sun Yat-sen University¹, Guangdong University of Foreign Studies², Western Kentucky University³, Guilin University of Electronic Technology⁴

01 Feb 2018-Image and Vision Computing

TL;DR: This paper proposes a novel embedding method termed focus ranking that can be easily unified into a CNN for jointly learning image representations and metrics in the context of fine-grained fabric image retrieval and shows the superiority of the proposed model over existing metric embedding models.

Journal Article•DOI•

Deep and Low-level Feature based Attribute Learning for Person Re-identification

Yiqiang Chen, Stefan Duffner, Andrei Stoian, Jean-Yves Dufour, Atilla Baskurt

12 Sep 2018-Image and Vision Computing

TL;DR: A CNN-based pedestrian attribute-assisted person re-identification framework that performs the attribute learning by a part-specific CNN to model attribute patterns related to different body parts and fuse them with low-level robust Local Maximal Occurrence features to address the problem of the large variation of visual appearance and location of attributes.

Journal Article•DOI•

Benchmark database for fine-grained image classification of benthic macroinvertebrates

Jenni Raitoharju¹, Ekaterina Riabchenko¹, Iftikhar Ahmad¹, Alexandros Iosifidis¹, Moncef Gabbouj¹, Serkan Kiranyaz², Ville Tirronen³, Johanna Ärje⁴, Salme Kärkkäinen⁴, Kristian Meissner⁵•Institutions (5)

Tampere University of Technology¹, Qatar University², Information Technology University³, University of Jyväskylä⁴, Finnish Environment Institute⁵

01 Oct 2018-Image and Vision Computing

TL;DR: A benchmark database is introduced for evaluating the ability of automatic visual classification methods to distinguish visually similar categories of aquatic macroinvertebrate taxa, along with the classification results of convolutional neural networks, which are widely used for deep learning tasks on large databases.

Journal Article•DOI•

Template adaptation for face verification and identification

Nate Crosswhite, Jeffrey Byrne, Chris Stauffer, Omkar M. Parkhi¹, Qiong Cao¹, Andrew Zisserman¹•Institutions (1)

University of Oxford¹

01 Nov 2018-Image and Vision Computing

TL;DR: A surprising result is shown, that perhaps the simplest method of template adaptation, combining deep convolutional network features with template specific linear SVMs, outperforms the state-of-the-art by a wide margin.

Journal Article•DOI•

Hair detection, segmentation, and hairstyle classification in the wild

Umar Riaz Muhammad¹,², Michele Svanera¹,³, Riccardo Leonardi¹, Sergio Benini¹•Institutions (3)

University of Brescia¹, Queen Mary University of London², University of Glasgow³

01 Mar 2018-Image and Vision Computing

TL;DR: This work tackles the problem of hair analysis from unconstrained view by relying only on textures, without a-priori information on head shape and location, nor using body-part classifiers, and achieves segmentation accuracy superior to known state-of-the-art.

Journal Article•DOI•

Unobtrusive and pervasive video-based eye-gaze tracking

Stefania Cristina¹, Kenneth P. Camilleri¹•Institutions (1)

University of Malta¹

01 Jun 2018-Image and Vision Computing

TL;DR: This critical review focuses on emerging passive and unobtrusive video-based eye-gaze tracking methods in recent literature, with the aim to identify different research avenues that are being followed in response to the challenges of pervasive eye-gaze tracking.

Journal Article•DOI•

Negative results in computer vision: A perspective

Ali Borji¹•Institutions (1)

University of Central Florida¹

01 Jan 2018-Image and Vision Computing

TL;DR: What makes negative results important, how they should be disseminated and incentivized, and what lessons can be learned from cognitive vision research in this regard are addressed.

Journal Article•DOI•

Distances evolution analysis for online and off-line human object interaction recognition

Meng Meng¹, Hassen Drira¹,², Jacques Boonaert•Institutions (2)

North Carolina Central University¹, University of Lille²

01 Feb 2018-Image and Vision Computing

TL;DR: The experiments demonstrate that the proposed spatio-temporal modeling of human-object interaction videos for online and off-line recognition is effective and discriminative for human-object interaction classification.

Journal Article•DOI•

Joint gender classification and age estimation by nearly orthogonalizing their semantic spaces

Qing Tian¹,², Songcan Chen²•Institutions (2)

Nanjing University of Information Science and Technology¹, Nanjing University of Aeronautics and Astronautics²

01 Jan 2018-Image and Vision Computing

TL;DR: This paper proposes a general learning framework for jointly estimating human gender and age by formulating the semantic relationships between the two tasks as a form of near-orthogonality regularization and incorporating it into the objective of the joint learning framework.

Journal Article•DOI•

Gait recognition in the wild using shadow silhouettes

Tanmay Tulsidas Verlekar¹, Luís Ducla Soares², Paulo Lobato Correia¹•Institutions (2)

Instituto Superior Técnico¹, ISCTE – University Institute of Lisbon²

01 Aug 2018-Image and Vision Computing

TL;DR: The main factors affecting the gait features that can be acquired from a 2D video sequence are discussed, and a taxonomy is proposed to classify them across four dimensions; the results highlight the advantages of using rectified shadow silhouettes over body silhouettes under certain conditions.

Journal Article•DOI•

Context awareness in biometric systems and methods: State of the art and future scenarios

Michele Nappi¹, Stefano Ricciardi², Massimo Tistarelli³•Institutions (3)

University of Salerno¹, University of Molise², University of Sassari³

01 Aug 2018-Image and Vision Computing

TL;DR: An overall vision of the main contributions available so far in the field of context-aware biometric systems and methods is provided, along with a comparison of their features, aims and performances.

Journal Article•DOI•

Multi-view dynamic facial action unit detection

Andrés Romero¹, Juan León¹, Pablo Arbeláez¹•Institutions (1)

University of Los Andes¹

26 Sep 2018-Image and Vision Computing

TL;DR: In this article, a multi-view dynamic facial action unit detection approach is proposed to detect the presence or absence of a specific action unit in a still image of a human face.

Journal Article•DOI•

Marker-based non-overlapping camera calibration methods with additional support camera views

Fangda Zhao¹, Toru Tamaki¹, Takio Kurita¹, Bisser Raytchev¹, Kazufumi Kaneda¹•Institutions (1)

Hiroshima University¹

01 Feb 2018-Image and Vision Computing

TL;DR: Simple methods to calibrate non-overlapping cameras using markers on the cameras are proposed, which work stably and use fewer images.

Journal Article•DOI•

The L0-regularized discrete variational level set method for image segmentation

Yang Liu¹, Chuanjiang He¹, Yongfei Wu², Zemin Ren³•Institutions (3)

Chongqing University¹, Taiyuan University of Technology², Chongqing University of Science and Technology³

01 Jul 2018-Image and Vision Computing

TL;DR: A ternary variational level set model for image segmentation is proposed, involving an L0 gradient regularizer and an L0 function regularizer in a discrete framework following the Chan-Vese model; it performs well on images with severe noise, outliers or low contrast.

Journal Article•DOI•

Kinematic Spline Curves: A temporal invariant descriptor for fast action recognition

Enjie Ghorbel¹, Rémi Boutteau, Jacques Boonaert¹, Xavier Savatier, Stéphane Lecoeuche¹•Institutions (1)

University of Lille¹

01 Sep 2018-Image and Vision Computing

TL;DR: A novel human action descriptor based on skeleton data provided by RGB-D cameras for fast action recognition is proposed, built by interpolating the kinematics of skeleton joints using a cubic spline algorithm.

Journal Article•DOI•

The challenge of simultaneous object detection and pose estimation: A comparative study

Daniel Oñoro-Rubio¹, Roberto J. López-Sastre¹, Carolina Redondo-Cabrera¹, P. Gil-Jimenez¹•Institutions (1)

University of Alcalá¹

01 Nov 2018-Image and Vision Computing

TL;DR: This work proposes three novel deep learning architectures that perform joint detection and pose estimation, in which the two tasks gradually decouple, and investigates whether the pose estimation problem should be solved as a classification or a regression problem, which is still an open question.

Journal Article•DOI•

Hybrid eye center localization using cascaded regression and hand-crafted model fitting

Alex Levinshtein, Edmund Phung, Parham Aarabi¹•Institutions (1)

University of Toronto¹

01 Mar 2018-Image and Vision Computing

TL;DR: This work proposes a new cascaded regressor for eye center detection that achieves state-of-the-art performance on the BioID, GI4E, and the TalkingFace datasets and improves the robustness of localization by using both advanced features and powerful regression machinery.

Journal Article•DOI•

Minimum barrier superpixel segmentation

Yinlin Hu¹, Yunsong Li¹, Rui Song¹,², Peng Rao², Yangli Wang¹•Institutions (2)

Xidian University¹, Chinese Academy of Sciences²

01 Feb 2018-Image and Vision Computing

TL;DR: A new compact-aware minimum barrier distance for superpixel segmentation (MBS), and a propagation scheme for the cluster centers between adjacent levels on a hierarchical architecture are introduced.

Journal Article•DOI•

Recognition of action dynamics in fencing using multimodal cues

Filip Malawski¹, Bogdan Kwolek¹•Institutions (1)

AGH University of Science and Technology¹

01 Jul 2018-Image and Vision Computing

TL;DR: This work proposes informative motion descriptors based on accelerometric data, skeleton joints features and depth maps, and demonstrates their potential to model the motion dynamics, and shows that fusing data from multiple modalities permits better recognition accuracy.
