
Asian Conference on Computer Vision 

About: The Asian Conference on Computer Vision is an academic conference. It publishes mainly in the areas of computer science and features (computer vision). Over its lifetime, the conference has published 2,847 papers, which have received 51,985 citations.


Papers
Book Chapter · DOI
01 Nov 2014
TL;DR: This work proposes A+, an improved variant of Anchored Neighborhood Regression (ANR) that combines the best qualities of ANR and Simple Functions (SF): it builds on ANR's features and anchored regressors, but instead of learning the regressors on the dictionary it learns them from the full training material, similar to SF.
Abstract: We address the problem of image upscaling in the form of single image super-resolution based on a dictionary of low- and high-resolution exemplars. Two recently proposed methods, Anchored Neighborhood Regression (ANR) and Simple Functions (SF), provide state-of-the-art quality performance. Moreover, ANR is among the fastest known super-resolution methods. ANR learns sparse dictionaries and regressors anchored to the dictionary atoms. SF relies on clusters and corresponding learned functions. We propose A+, an improved variant of ANR, which combines the best qualities of ANR and SF. A+ builds on the features and anchored regressors from ANR but instead of learning the regressors on the dictionary it uses the full training material, similar to SF. We validate our method on standard images and compare with state-of-the-art methods. We obtain improved quality (i.e. 0.2–0.7 dB PSNR better than ANR) and excellent time complexity, rendering A+ the most efficient dictionary-based super-resolution method to date.

1,418 citations
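
The anchored-regression idea summarized above can be illustrated with a minimal inference-time sketch. This is not the authors' code: the function name `upscale_patch` and the array layouts are hypothetical, and the offline step that trains one regressor per anchor from the full training pool (the key difference from ANR) is assumed to have already happened.

```python
import numpy as np

def upscale_patch(lr_feature, anchors, regressors):
    """Anchored regression step: pick the dictionary atom most correlated
    with the low-resolution patch feature and apply that atom's precomputed
    linear regressor to predict the high-resolution patch.

    lr_feature : (d,) normalized low-resolution patch feature
    anchors    : (K, d) array of unit-norm anchor atoms
    regressors : (K, d_hr, d) array of per-anchor linear maps
    """
    # Nearest anchor by correlation (atoms and feature are unit-normalized).
    k = int(np.argmax(anchors @ lr_feature))
    # Linear map from LR feature to HR patch, precomputed offline; in A+ it
    # is learned over the anchor's neighborhood in the full training pool
    # rather than over dictionary atoms only.
    return regressors[k] @ lr_feature
```

At test time the per-patch work is one nearest-anchor lookup and one matrix-vector product, which is consistent with the efficiency claim in the abstract.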

Book Chapter · DOI
05 Nov 2012
TL;DR: A framework for automatic modeling, detection, and tracking of 3D objects with a Kinect; it shows how to build the templates automatically from 3D models and how to estimate the 6-degrees-of-freedom pose accurately and in real time.
Abstract: We propose a framework for automatic modeling, detection, and tracking of 3D objects with a Kinect. The detection part is mainly based on the recent template-based LINEMOD approach [1] for object detection. We show how to build the templates automatically from 3D models, and how to estimate the 6 degrees-of-freedom pose accurately and in real-time. The pose estimation and the color information allow us to check the detection hypotheses and improve the correct detection rate by 13% with respect to the original LINEMOD. These improvements make our framework suitable for object manipulation in robotics applications. Moreover, we propose a new dataset made of 15 registered, 1100+ frame video sequences of 15 various objects for the evaluation of future competing methods.

1,114 citations
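
The hypothesize-and-verify pipeline described above (template matching, a color-based check of the hypotheses, then 6-DoF pose estimation) can be sketched roughly as follows. All names and callables here are hypothetical stand-ins, not the authors' API; the templates are assumed to have been rendered offline from the 3D model.

```python
def detect_objects(match_template, color_check, estimate_pose, templates,
                   score_thresh=0.8):
    """Match each precomputed template against the scene, discard weak
    matches and matches whose color statistics disagree with the model,
    then estimate a 6-DoF pose for the surviving hypotheses.

    match_template : callable(template) -> (score, location)
    color_check    : callable(template, location) -> bool
    estimate_pose  : callable(template, location) -> pose
    templates      : templates rendered offline from the 3D model
    """
    detections = []
    for template in templates:
        score, location = match_template(template)
        if score < score_thresh:
            continue                          # weak template match, discard
        if not color_check(template, location):
            continue                          # color hypothesis check failed
        detections.append(estimate_pose(template, location))
    return detections
```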

Book Chapter · DOI
02 Dec 2018
TL;DR: This paper uses a conditional generative adversarial network (GAN) for anomaly detection in a one-class, semi-supervised learning paradigm: an encoder-decoder-encoder sub-network maps the input image to a lower-dimensional vector, which is then used to reconstruct the generated output image.
Abstract: Anomaly detection is a classical problem in computer vision, namely the determination of the normal from the abnormal when datasets are highly biased towards one class (normal) due to the insufficient sample size of the other class (abnormal). While this can be addressed as a supervised learning problem, a significantly more challenging problem is that of detecting the unknown/unseen anomaly case that takes us instead into the space of a one-class, semi-supervised learning paradigm. We introduce such a novel anomaly detection model, by using a conditional generative adversarial network that jointly learns the generation of high-dimensional image space and the inference of latent space. Employing encoder-decoder-encoder sub-networks in the generator network enables the model to map the input image to a lower dimension vector, which is then used to reconstruct the generated output image. The use of the additional encoder network maps this generated image to its latent representation. Minimizing the distance between these images and the latent vectors during training aids in learning the data distribution for the normal samples. As a result, a larger distance metric from this learned data distribution at inference time is indicative of an outlier from that distribution—an anomaly. Experimentation over several benchmark datasets, from varying domains, shows the model efficacy and superiority over previous state-of-the-art approaches.

857 citations
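
The encoder-decoder-encoder scoring idea from the abstract above can be written in a few lines. This is only a sketch of the inference step: the function and argument names are hypothetical, and the three sub-networks are assumed to be already-trained callables.

```python
import numpy as np

def anomaly_score(x, encoder, decoder, second_encoder):
    """Score an image by the distance between the latent code of the input
    and the latent code of its reconstruction. Since training only sees
    normal samples, a large distance suggests an anomaly.

    x              : input image as a flat numpy array
    encoder        : callable mapping image -> latent vector z
    decoder        : callable mapping latent vector -> reconstructed image
    second_encoder : callable mapping reconstructed image -> latent z_hat
    """
    z = encoder(x)                  # latent code of the real input
    x_hat = decoder(z)              # generated reconstruction
    z_hat = second_encoder(x_hat)   # latent code of the reconstruction
    # L1 distance in latent space; thresholding it separates normal from anomalous.
    return float(np.abs(z - z_hat).sum())
```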

Book Chapter · DOI
08 Nov 2010
TL;DR: A novel approach to binocular stereo for fast matching of high-resolution images that builds a prior on the disparities by forming a triangulation over a set of robustly matched support points, reducing the matching ambiguities of the remaining points.
Abstract: In this paper we propose a novel approach to binocular stereo for fast matching of high-resolution images. Our approach builds a prior on the disparities by forming a triangulation on a set of support points which can be robustly matched, reducing the matching ambiguities of the remaining points. This allows for efficient exploitation of the disparity search space, yielding accurate dense reconstruction without the need for global optimization. Moreover, our method automatically determines the disparity range and can be easily parallelized. We demonstrate the effectiveness of our approach on the large-scale Middlebury benchmark, and show that state-of-the-art performance can be achieved with significant speedups. Computing the left and right disparity maps for a one Megapixel image pair takes about one second on a single CPU core.

818 citations
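
A minimal sketch of the disparity-prior construction described above, assuming the support points have already been robustly matched. It uses SciPy's Delaunay triangulation and piecewise-linear interpolation; the function name and array layouts are illustrative, not the authors' implementation.

```python
import numpy as np
from scipy.spatial import Delaunay
from scipy.interpolate import LinearNDInterpolator

def disparity_prior(support_xy, support_disp, height, width):
    """Triangulate robustly matched support points and interpolate their
    disparities into a piecewise-planar prior over the whole image.

    support_xy   : (N, 2) array of (x, y) support point coordinates
    support_disp : (N,) disparities of the support points
    """
    tri = Delaunay(support_xy)                        # mesh over support points
    interp = LinearNDInterpolator(tri, support_disp)  # planar value per triangle
    xs, ys = np.meshgrid(np.arange(width), np.arange(height))
    prior = interp(np.column_stack([xs.ravel(), ys.ravel()]))
    return prior.reshape(height, width)               # NaN outside the convex hull
```

Per-pixel matching can then be restricted to a narrow band around this prior, which is what removes the need for global optimization while keeping the search efficient.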

Book Chapter · DOI
08 Nov 2010
TL;DR: This paper proposes a new method, Cosine Similarity Metric Learning (CSML), for learning a distance metric for face verification; it achieves the highest accuracy reported in the literature on the Labeled Faces in the Wild (LFW) dataset.
Abstract: Face verification is the task of deciding by analyzing face images, whether a person is who he/she claims to be. This is very challenging due to image variations in lighting, pose, facial expression, and age. The task boils down to computing the distance between two face vectors. As such, appropriate distance metrics are essential for face verification accuracy. In this paper we propose a new method, named the Cosine Similarity Metric Learning (CSML) for learning a distance metric for facial verification. The use of cosine similarity in our method leads to an effective learning algorithm which can improve the generalization ability of any given metric. Our method is tested on the state-of-the-art dataset, the Labeled Faces in the Wild (LFW), and has achieved the highest accuracy in the literature.

626 citations
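
The quantity CSML optimizes, cosine similarity between face descriptors after a learned linear transform, is easy to write down. A minimal sketch, assuming the transform matrix A has already been learned; the helper names and the threshold value are hypothetical.

```python
import numpy as np

def cosine_similarity(x, y, A):
    """Cosine similarity between two face descriptors after the learned
    linear transform A (the quantity optimized by CSML)."""
    ax, ay = A @ x, A @ y
    return float(ax @ ay / (np.linalg.norm(ax) * np.linalg.norm(ay) + 1e-12))

def same_person(x, y, A, threshold=0.5):
    """Verification decision: accept the pair as the same identity when the
    transformed cosine similarity exceeds a threshold tuned on training pairs."""
    return cosine_similarity(x, y, A) >= threshold
```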

Performance
Metrics
No. of papers from the Conference in previous years
Year    Papers
2023    13
2022    282
2021    1
2020    266
2019    2
2018    308