Showing papers in "Computer Vision and Image Understanding in 2019"

PDF

Open Access

Journal Article•DOI•

Pros and Cons of GAN Evaluation Measures

[...]

01 Feb 2019-Computer Vision and Image Understanding

TL;DR: More recently, this article reviewed and critically discussed more than 24 quantitative and 5 qualitative measures for evaluating generative models with a particular emphasis on GAN-derived models and also provided a set of 7 desiderata followed by an evaluation of whether a given measure or a family of measures is compatible with them.

...read moreread less

505 citations

Journal Article•DOI•

A survey on deep learning based face recognition

[...]

Guodong Guo¹, Guodong Guo², Na Zhang²•Institutions (2)

North University of China¹, West Virginia University²

01 Dec 2019-Computer Vision and Image Understanding

TL;DR: Major deep learning concepts pertinent to face image analysis and face recognition are reviewed, and a concise overview of studies on specific face recognition problems is provided, such as handling variations in pose, age, illumination, expression, and heterogeneous face matching.

...read moreread less

312 citations

Journal Article•DOI•

Anabranch network for camouflaged object segmentation

[...]

Trung-Nghia Le¹, Tam V. Nguyen², Zhongliang Nie², Minh-Triet Tran, Akihiro Sugimoto³ - Show less +1 more•Institutions (3)

Graduate University for Advanced Studies¹, University of Dayton², National Institute of Informatics³

01 Jul 2019-Computer Vision and Image Understanding

TL;DR: This paper proposes a general end-to-end network, called the Anabranch Network, that leverages both classification and segmentation tasks and possesses the second branch for classification to predict the probability of containing camouflaged object(s) in an image.

...read moreread less

200 citations

Journal Article•DOI•

Getting to know low-light images with the Exclusively Dark dataset

[...]

Yuen Peng Loh¹, Chee Seng Chan¹•Institutions (1)

Information Technology University¹

01 Jan 2019-Computer Vision and Image Understanding

TL;DR: The Exclusively Dark dataset as discussed by the authors consists of low-light images captured in visible light only, with image and object level annotations, and the effects of lowlight reach far deeper into the features than can be solved by simple illumination invariance.

...read moreread less

193 citations

Journal Article•DOI•

Multitask learning for large-scale semantic change detection

[...]

Rodrigo Caye Daudt¹, Rodrigo Caye Daudt², Bertrand Le Saux¹, Alexandre Boulch¹, Yann Gousseau² - Show less +1 more•Institutions (2)

Université Paris-Saclay¹, Télécom ParisTech²

01 Oct 2019-Computer Vision and Image Understanding

TL;DR: This paper presents the first large scale very high resolution semantic change detection dataset, which enables the usage of deep supervised learning methods for semantic changes detection with very highresolution images, and presents a network architecture that performs change detection and land cover mapping simultaneously.

...read moreread less

142 citations

Journal Article•DOI•

A survey of advances in vision-based vehicle re-identification

[...]

Sultan Daud Khan, Habib Ullah

01 May 2019-Computer Vision and Image Understanding

TL;DR: The detail analysis of different V-reID methods in terms of mean average precision (mAP) and cumulative matching curve (CMC) provide objective insight into the strengths and weaknesses of these methods.

...read moreread less

116 citations

Journal Article•DOI•

Siamese graph convolutional network for content based remote sensing image retrieval

[...]

Ushasi Chaudhuri¹, Biplab Banerjee¹, Avik Bhattacharya¹•Institutions (1)

Indian Institute of Technology Bombay¹

01 Jul 2019-Computer Vision and Image Understanding

TL;DR: This paper proposes the SGCN architecture for assessing the similarity between a pair of graphs which can be trained with the contrastive loss function and implements the proposed embeddings for the task of CBIR for RS data on the popular UC-Merced dataset and the PatternNet dataset where improved performance can be observed.

...read moreread less

104 citations

Journal Article•DOI•

Automotive radar and camera fusion using Generative Adversarial Networks

[...]

Vladimir Lekic¹, Zdenka Babic²•Institutions (2)

Mercedes-Benz¹, University of Banja Luka²

01 Jul 2019-Computer Vision and Image Understanding

TL;DR: A proposed fully-unsupervised machine learning algorithm converts the radar sensor data to artificial, camera-like, environmental images that are more consistent, accurate, and useful information than that provided solely by the radar or the camera.

...read moreread less

55 citations

Journal Article•DOI•

Analyzing human–human interactions: A survey

[...]

Alexandros Stergiou¹, Ronald Poppe¹•Institutions (1)

Utrecht University¹

01 Nov 2019-Computer Vision and Image Understanding

TL;DR: In this paper, a survey provides a summary of these challenges and datasets to address them, followed by an in-depth discussion of relevant vision-based recognition and detection methods, focusing on recent, promising work based on deep learning and convolutional neural networks (CNNs).

...read moreread less

53 citations

Journal Article•DOI•

ASSD: Attentive single shot multibox detector

[...]

Jingru Yi¹, Pengxiang Wu¹, Dimitris N. Metaxas¹•Institutions (1)

Rutgers University¹

01 Dec 2019-Computer Vision and Image Understanding

TL;DR: Zhang et al. as discussed by the authors proposed a new deep neural network for object detection, which builds feature relations in the spatial space of the feature map, and learns to highlight useful regions on the feature maps while suppressing the irrelevant information.

...read moreread less

49 citations

Journal Article•DOI•

Heterogeneous hand gesture recognition using 3D dynamic skeletal data

[...]

Quentin De Smedt¹, Hazem Wannous¹, Jean-Philippe Vandeborre¹•Institutions (1)

university of lille¹

01 Apr 2019-Computer Vision and Image Understanding

TL;DR: This work uses the natural structure of the hand topology – called later hand skeletal data – to extract effective hand kinematic descriptors from the gesture sequence and introduces a prior gesture detection phase achieved using a binary classifier before the final gesture recognition.

...read moreread less

Journal Article•DOI•

Single image rain removal via a deep decomposition–composition network

[...]

Siyuan Li¹, Wenqi Ren², Jiawan Zhang¹, Jinke Yu³, Xiaojie Guo¹ - Show less +1 more•Institutions (3)

Tianjin University¹, Chinese Academy of Sciences², Dalian University of Technology³

01 Sep 2019-Computer Vision and Image Understanding

TL;DR: Guo et al. as discussed by the authors designed a multi-task leaning architecture in an end-to-end manner to reduce the mapping range from input to output and boost the performance, where a decomposition net is built to split rain images into clean background and rain layers.

...read moreread less

Journal Article•DOI•

Registration-free Face-SSD: Single shot analysis of smiles, facial attributes, and affect in the wild

[...]

Youngkyoon Jang¹, Hatice Gunes², Ioannis Patras³•Institutions (3)

University of Bristol¹, University of Cambridge², Queen Mary University of London³

01 May 2019-Computer Vision and Image Understanding

TL;DR: Face-SSD is the first network to perform face analysis without relying on pre-processing such as face detection and registration in advance and achieves real-time performance even when detecting multiple faces and recognising multiple classes in a given image.

...read moreread less

Journal Article•DOI•

Faster training of Mask R-CNN by focusing on instance boundaries

[...]

Roland S. Zimmermann¹, Julien Niklas Siems²•Institutions (2)

University of Göttingen¹, University of Freiburg²

01 Nov 2019-Computer Vision and Image Understanding

TL;DR: An auxiliary task to Mask R-CNN, an instance segmentation network, is presented, which leads to faster training of the mask head, and a new prediction head is added, the Edge Agreement Head, which is inspired by the way human annotators perform instance segmentations.

...read moreread less

Journal Article•DOI•

Face alignment using a 3D deeply-initialized ensemble of regression trees

[...]

Roberto Valle¹, José Miguel Buenaposada², Antonio Valdés³, Luis Baumela¹•Institutions (3)

Technical University of Madrid¹, King Juan Carlos University², Complutense University of Madrid³

01 Dec 2019-Computer Vision and Image Understanding

TL;DR: 3DDE as discussed by the authors is a robust and efficient face alignment algorithm based on a coarse-to-fine cascade of ensembles of regression trees, which is initialized by robustly fitting a 3D face model to the probability maps produced by a CNN.

...read moreread less

Journal Article•DOI•

Generalizing semi-supervised generative adversarial networks to regression using feature contrasting

[...]

Greg Olmschenk¹, Zhigang Zhu¹, Hao Tang¹•Institutions (1)

City University of New York¹

01 Sep 2019-Computer Vision and Image Understanding

TL;DR: In this article, a semi-supervised regression GAN is proposed for regression problems and applied to age estimation, driving steering angle prediction, and crowd counting from a single image.

...read moreread less

Journal Article•DOI•

L2 Divergence for robust colour transfer

[...]

Mairéad Grogan¹, Rozenn Dahyot¹•Institutions (1)

Trinity College, Dublin¹

01 Apr 2019-Computer Vision and Image Understanding

TL;DR: The proposed alternative framework where the cost function used for inferring a parametric transfer function is defined as the robust L 2 divergence between two probability density functions outperforms many recent algorithms as measured quantitatively with standard quality metrics, and qualitatively using perceptual studies.

...read moreread less

Journal Article•DOI•

Cross-view image synthesis using geometry-guided conditional GANs

[...]

Krishna Regmi¹, Ali Borji•Institutions (1)

University of Central Florida¹

01 Oct 2019-Computer Vision and Image Understanding

TL;DR: In this paper, the authors address the problem of generating images across two drastically different views, namely ground (street) and aerial (overhead) views, and resort to homography as a guide to map the images between the views based on the common field of view to preserve the details in the input image.

...read moreread less

Journal Article•DOI•

Joint person re-identification and camera network topology inference in multiple cameras

[...]

Yeong-Jun Cho¹, Su-A Kim², Jae-Han Park², Kyuewang Lee², Kuk-Jin Yoon¹ - Show less +1 more•Institutions (2)

KAIST¹, Gwangju Institute of Science and Technology²

01 Mar 2019-Computer Vision and Image Understanding

TL;DR: Li et al. as mentioned in this paper proposed a unified framework which jointly solves both person re-identification and camera network topology inference problems with minimal prior knowledge about the environments, which can be applied to online person Re-ID in large-scale multi-camera networks.

...read moreread less

Journal Article•DOI•

Distance transform regression for spatially-aware deep semantic segmentation

[...]

Nicolas Audebert¹, Alexandre Boulch¹, Bertrand Le Saux¹, Sébastien Lefèvre•Institutions (1)

Université Paris-Saclay¹

01 Dec 2019-Computer Vision and Image Understanding

TL;DR: This work introduces a new semantic segmentation regularization based on the regression of a distance transform, which requires almost no modification of the network structure and adds a very low overhead to the training process.

...read moreread less

Journal Article•DOI•

A label noise tolerant random forest for the classification of remote sensing data based on outdated maps for training

[...]

Alina E. Maas¹, Franz Rottensteiner¹, Christian Heipke¹•Institutions (1)

Leibniz University of Hanover¹

01 Nov 2019-Computer Vision and Image Understanding

TL;DR: This paper suggests an adaptation of the random forest classifier by integrating a model for label noise based on the idea that a training sample should not be assigned to one class only, but to all classes, each with a certain probability.

...read moreread less

Journal Article•DOI•

Dynamic topology and relevance learning SOM-based algorithm for image clustering tasks

[...]

Heitor R. Medeiros¹, Felipe D. B. de Oliveira¹, Hansenclever F. Bassani¹, Aluizio F. R. Araújo¹•Institutions (1)

Federal University of Pernambuco¹

01 Feb 2019-Computer Vision and Image Understanding

TL;DR: This paper utilizes a variant of Self-organizing Map to cluster images in two different scenarios: disjoint and non-disjoint sets, and compares the state-of-the-art image clustering algorithms with a SOM-based subspace clustering method that identifies automatically the relevant features in the high-dimensional image representations.

...read moreread less

Journal Article•DOI•

Video synopsis: A survey

[...]

Kemal Batuhan Baskurt, Refik Samet

01 Apr 2019-Computer Vision and Image Understanding

TL;DR: This study is the first review of published video synopsis approaches and provides a comprehensive analysis of state-of-the-art approaches to achieve efficient video browsing and retrieval for surveillance cameras.

...read moreread less

Journal Article•DOI•

Geometry in active learning for binary and multi-class image segmentation

[...]

Ksenia Konyushkova¹, Raphael Sznitman², Pascal Fua¹•Institutions (2)

École Polytechnique Fédérale de Lausanne¹, University of Bern²

01 May 2019-Computer Vision and Image Understanding

TL;DR: This approach combines geometric smoothness priors in the image space with more traditional uncertainty measures to estimate which pixels or voxels are the most informative, and thus should to be annotated next, for multi-class settings and introduces two novel criteria for uncertainty.

...read moreread less

Journal Article•DOI•

DRAU: Dual Recurrent Attention Units for Visual Question Answering

[...]

Ahmed Osman¹, Wojciech Samek¹•Institutions (1)

Heinrich Hertz Institute¹

01 Aug 2019-Computer Vision and Image Understanding

TL;DR: This paper proposes a recurrent attention mechanism for VQA which utilizes dual (textual and visual) Recurrent Attention Units (RAUs) and shows the effect of all possible combinations of recurrent and convolutional dual attention.

...read moreread less

Journal Article•DOI•

Informative sample generation using class aware generative adversarial networks for classification of chest Xrays

[...]

Behzad Bozorgtabar¹, Dwarikanath Mahapatra², Hendrik von Teng, Alexander Pöllinger, Lukas Ebner, Jean-Philippe Thiran¹, Jean-Philippe Thiran³, Mauricio Reyes⁴ - Show less +4 more•Institutions (4)

École Polytechnique Fédérale de Lausanne¹, IBM², University of Lausanne³, University of Bern⁴

01 Jul 2019-Computer Vision and Image Understanding

TL;DR: In this article, an active learning framework is proposed to select most informative samples for training a robust deep learning system using a Bayesian neural network, which is then used within a novel class aware generative adversarial network (CAGAN) to generate realistic chest xray images for data augmentation by transferring characteristics from one class label to another.

...read moreread less

Journal Article•DOI•

Domain invariant hierarchical embedding for grocery products recognition

[...]

Alessio Tonioni¹, Luigi Di Stefano¹•Institutions (1)

University of Bologna¹

01 May 2019-Computer Vision and Image Understanding

TL;DR: In this article, an end-to-end architecture comprising a GAN to address the domain shift at training time and a deep CNN trained on the samples generated by the GAN was proposed to learn an embedding of product images.

...read moreread less

Journal Article•DOI•

Learn to synthesize and synthesize to learn

[...]

Behzad Bozorgtabar¹, Mohammad Saeed Rad¹, Hazim Kemal Ekenel², Jean-Philippe Thiran¹, Jean-Philippe Thiran³ - Show less +1 more•Institutions (3)

École Polytechnique Fédérale de Lausanne¹, Istanbul Technical University², University of Lausanne³

01 Aug 2019-Computer Vision and Image Understanding

TL;DR: Compared to existing models, synthetic face images generated by the proposed attribute guided face image generation method present a good photorealistic quality on several face datasets and can be used for synthetic data augmentation, and improve the performance of the classifier used for facial expression recognition.

...read moreread less

Journal Article•DOI•

Deep 3D morphable model refinement via progressive growing of conditional Generative Adversarial Networks

[...]

Leonardo Galteri¹, Claudio Ferrari¹, Giuseppe Lisanti², Stefano Berretti¹, Alberto Del Bimbo¹ - Show less +1 more•Institutions (2)

University of Florence¹, University of Bologna²

01 Aug 2019-Computer Vision and Image Understanding

TL;DR: This work proposes an approach based on a Conditional Generative Adversarial Network (CGAN) for refining the coarse reconstruction provided by a 3DMM, represented as a three channels image, where the pixel intensities represent the depth, curvature and elevation values of the 3D vertices.

...read moreread less

Journal Article•DOI•

Attentive matching network for few-shot learning

[...]

Sijie Mai¹, Haifeng Hu¹, Jia Xu¹•Institutions (1)

Sun Yat-sen University¹

01 Oct 2019-Computer Vision and Image Understanding

TL;DR: This paper presents an effective framework named Attentive Matching Network (AMN) to address few-shot learning problem, and proposes a feature-level attention mechanism to help similarity function pay more emphasis on the features that better reflect the inter-class differences.

...read moreread less