
Showing papers on "Feature (computer vision)" published in 2012


Posted Content
TL;DR: The authors randomly omit half of the feature detectors on each training case to prevent complex co-adaptations in which a feature detector is only helpful in the context of several other specific feature detectors.
Abstract: When a large feedforward neural network is trained on a small training set, it typically performs poorly on held-out test data. This "overfitting" is greatly reduced by randomly omitting half of the feature detectors on each training case. This prevents complex co-adaptations in which a feature detector is only helpful in the context of several other specific feature detectors. Instead, each neuron learns to detect a feature that is generally helpful for producing the correct answer given the combinatorially large variety of internal contexts in which it must operate. Random "dropout" gives big improvements on many benchmark tasks and sets new records for speech and object recognition.
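For illustration, a minimal NumPy sketch of the dropout idea described above, using the now-common "inverted" variant that rescales activations at training time (the paper instead halves the outgoing weights at test time); the function and variable names are illustrative only:

```python
import numpy as np

def dropout_forward(activations, drop_prob=0.5, train=True, rng=None):
    """Inverted dropout: randomly zero units during training and rescale so the
    expected activation matches the no-dropout forward pass used at test time."""
    if not train or drop_prob == 0.0:
        return activations
    rng = np.random.default_rng() if rng is None else rng
    mask = rng.random(activations.shape) >= drop_prob   # keep each unit with prob 1 - drop_prob
    return activations * mask / (1.0 - drop_prob)

# Toy usage: drop roughly half of the hidden units of a batch of activations.
h = np.random.randn(4, 8)
h_train = dropout_forward(h, drop_prob=0.5, train=True)
h_test = dropout_forward(h, drop_prob=0.5, train=False)   # unchanged at test time
```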

6,899 citations


Proceedings ArticleDOI
01 Sep 2012
TL;DR: The neighbor embedding SR algorithm so designed is shown to give good visual results, comparable to other state-of-the-art methods, while presenting an appreciable reduction of the computational time.
Abstract: This paper describes a single-image super-resolution (SR) algorithm based on nonnegative neighbor embedding. It belongs to the family of single-image example-based SR algorithms, since it uses a dictionary of low resolution (LR) and high resolution (HR) trained patch pairs to infer the unknown HR details. Each LR feature vector in the input image is expressed as the weighted combination of its K nearest neighbors in the dictionary; the corresponding HR feature vector is reconstructed under the assumption that the local LR embedding is preserved. Three key aspects are introduced in order to build a low-complexity competitive algorithm: (i) a compact but efficient representation of the patches (feature representation) (ii) an accurate estimation of the patches by their nearest neighbors (weight computation) (iii) a compact and already built (therefore external) dictionary, which allows a one-step upscaling. The neighbor embedding SR algorithm so designed is shown to give good visual results, comparable to other state-of-the-art methods, while presenting an appreciable reduction of the computational time.
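A toy sketch of the neighbor-embedding step described above: find the K nearest LR dictionary atoms, compute nonnegative combination weights (here via SciPy's NNLS solver), and transfer the weights to the corresponding HR atoms. The random arrays stand in for a trained LR/HR patch dictionary and are purely illustrative:

```python
import numpy as np
from scipy.optimize import nnls

def reconstruct_hr_patch(lr_feature, lr_dict, hr_dict, k=12):
    """Express an LR feature as a nonnegative combination of its K nearest
    dictionary neighbors and reuse the weights to assemble the HR patch."""
    dists = np.linalg.norm(lr_dict - lr_feature, axis=1)
    idx = np.argsort(dists)[:k]
    # Nonnegative least squares: lr_feature ~= lr_dict[idx].T @ w with w >= 0.
    w, _ = nnls(lr_dict[idx].T, lr_feature)
    # Assume the local LR embedding is preserved in the HR space.
    return hr_dict[idx].T @ w

# Toy usage with random data standing in for trained LR/HR patch pairs.
rng = np.random.default_rng(0)
lr_dict = rng.random((1000, 16))    # 1000 LR patch features
hr_dict = rng.random((1000, 81))    # corresponding HR patches (flattened 9x9)
hr_patch = reconstruct_hr_patch(lr_dict[0] + 0.01 * rng.random(16), lr_dict, hr_dict)
```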

2,059 citations


Proceedings ArticleDOI
16 Jun 2012
TL;DR: A conceptually clear and intuitive algorithm for contrast-based saliency estimation that outperforms all state-of-the-art approaches and can be formulated in a unified way using high-dimensional Gaussian filters.
Abstract: Saliency estimation has become a valuable tool in image processing. Yet, existing approaches exhibit considerable variation in methodology, and it is often difficult to attribute improvements in result quality to specific algorithm properties. In this paper we reconsider some of the design choices of previous methods and propose a conceptually clear and intuitive algorithm for contrast-based saliency estimation. Our algorithm consists of four basic steps. First, our method decomposes a given image into compact, perceptually homogeneous elements that abstract unnecessary detail. Based on this abstraction we compute two measures of contrast that rate the uniqueness and the spatial distribution of these elements. From the element contrast we then derive a saliency measure that produces a pixel-accurate saliency map which uniformly covers the objects of interest and consistently separates fore- and background. We show that the complete contrast and saliency estimation can be formulated in a unified way using high-dimensional Gaussian filters. This contributes to the conceptual simplicity of our method and lends itself to a highly efficient implementation with linear complexity. In a detailed experimental evaluation we analyze the contribution of each individual feature and show that our method outperforms all state-of-the-art approaches.
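A rough scikit-image sketch of the element-based contrast idea: SLIC superpixels stand in for the paper's abstraction step, and only a color-uniqueness measure (spatially weighted contrast between elements) is computed; the paper additionally uses a spatial-distribution measure and an efficient high-dimensional Gaussian-filtering formulation:

```python
import numpy as np
from skimage import color, data, segmentation

def element_uniqueness_saliency(rgb, n_segments=300):
    """Per-pixel saliency from the color uniqueness of perceptually
    homogeneous elements (superpixels), weighted by spatial proximity."""
    lab = color.rgb2lab(rgb)
    labels = segmentation.slic(rgb, n_segments=n_segments, compactness=10, start_label=0)
    n = labels.max() + 1
    # Mean Lab color and mean (normalized) position of every element.
    cols = np.array([lab[labels == i].mean(axis=0) for i in range(n)])
    ys, xs = np.mgrid[:rgb.shape[0], :rgb.shape[1]]
    pos = np.array([[ys[labels == i].mean(), xs[labels == i].mean()] for i in range(n)])
    pos = pos / max(rgb.shape[:2])
    # Uniqueness: color contrast to all other elements, Gaussian-weighted by distance.
    col_d = np.linalg.norm(cols[:, None] - cols[None], axis=-1) ** 2
    pos_d = np.linalg.norm(pos[:, None] - pos[None], axis=-1) ** 2
    w = np.exp(-pos_d / (2 * 0.25 ** 2))
    uniqueness = (w * col_d).sum(axis=1) / w.sum(axis=1)
    uniqueness = (uniqueness - uniqueness.min()) / (uniqueness.max() - uniqueness.min() + 1e-12)
    return uniqueness[labels]    # per-pixel saliency map

saliency = element_uniqueness_saliency(data.astronaut())
```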

1,711 citations


Journal ArticleDOI
TL;DR: "Radiomics" refers to the extraction and analysis of large amounts of advanced quantitative imaging features with high throughput from medical images obtained with computed tomography, positron emission tomography or magnetic resonance imaging, leading to a very large potential subject pool.

1,608 citations


01 Jan 2012
TL;DR: In this paper, the authors examine two factors that differentiate successful from unsuccessful brand extensions, product feature similarity and brand concept consistency, and find that consumers take into account not only the product-level feature similarity between the new product and the products already associated with the brand, but also the consistency between the brand's concept and that of the extension.
Abstract: This article examines two factors that differentiate between successful and unsuccessful brand extensions: product feature similarity and brand concept consistency. The results reveal that, in identifying brand extensions, consumers take into account not only information about the product-level feature similarity between the new product and the products already associated with the brand, but also the concept consistency between the brand concept and the extension. For both function-oriented and prestige-oriented brand names, the most favorable reactions occur when brand extensions are made with high brand concept consistency and high product feature similarity. In addition, the relative impact of these two factors differs to some extent, depending on the nature of the brand-name concept. When a brand's concept is consistent with those of its extension products, the prestige brand seems to have greater extendibility to products with low feature similarity than the functional brand does. Copyright 1991 by the University of Chicago.

1,173 citations


Journal Article
TL;DR: Overall it is concluded that the JMI criterion provides the best tradeoff in terms of accuracy, stability, and flexibility with small data samples.
Abstract: We present a unifying framework for information theoretic feature selection, bringing almost two decades of research on heuristic filter criteria under a single theoretical interpretation. This is in response to the question: "what are the implicit statistical assumptions of feature selection criteria based on mutual information?". To answer this, we adopt a different strategy than is usual in the feature selection literature--instead of trying to define a criterion, we derive one, directly from a clearly specified objective function: the conditional likelihood of the training labels. While many hand-designed heuristic criteria try to optimize a definition of feature 'relevancy' and 'redundancy', our approach leads to a probabilistic framework which naturally incorporates these concepts. As a result we can unify the numerous criteria published over the last two decades, and show them to be low-order approximations to the exact (but intractable) optimisation problem. The primary contribution is to show that common heuristics for information based feature selection (including Markov Blanket algorithms as a special case) are approximate iterative maximisers of the conditional likelihood. A large empirical study provides strong evidence to favour certain classes of criteria, in particular those that balance the relative size of the relevancy/redundancy terms. Overall we conclude that the JMI criterion (Yang and Moody, 1999; Meyer et al., 2008) provides the best tradeoff in terms of accuracy, stability, and flexibility with small data samples.
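A toy sketch of greedy selection with the JMI criterion favored by the study, assuming discrete (or pre-binned) feature values so that mutual information can be estimated by simple counting; scikit-learn's mutual_info_score is used as the estimator:

```python
import numpy as np
from sklearn.metrics import mutual_info_score

def jmi_select(X, y, n_features=10):
    """Greedy forward selection with the JMI score: each candidate feature j is
    scored by the sum over already-selected k of I((X_j, X_k); y)."""
    n = X.shape[1]
    relevance = [mutual_info_score(X[:, j], y) for j in range(n)]
    selected = [int(np.argmax(relevance))]          # start from the most relevant feature
    while len(selected) < n_features:
        best_j, best_score = None, -np.inf
        for j in range(n):
            if j in selected:
                continue
            score = sum(
                mutual_info_score(X[:, j] * (X[:, k].max() + 1) + X[:, k], y)  # encode the pair (X_j, X_k)
                for k in selected)
            if score > best_score:
                best_j, best_score = j, score
        selected.append(best_j)
    return selected

# Toy usage with pre-binned features.
rng = np.random.default_rng(0)
X = rng.integers(0, 4, size=(500, 20))
y = (X[:, 0] + X[:, 3] > 3).astype(int)
print(jmi_select(X, y, n_features=5))
```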

1,058 citations


Book ChapterDOI
07 Oct 2012
TL;DR: An improvement to the Active Shape Model is proposed that allows for greater independence among the facial components and improves on the appearance fitting step by introducing a Viterbi optimization process that operates along the facial contours.
Abstract: We address the problem of interactive facial feature localization from a single image. Our goal is to obtain an accurate segmentation of facial features on high-resolution images under a variety of pose, expression, and lighting conditions. Although there has been significant work in facial feature localization, we are addressing a new application area, namely to facilitate intelligent high-quality editing of portraits, that brings requirements not met by existing methods. We propose an improvement to the Active Shape Model that allows for greater independence among the facial components and improves on the appearance fitting step by introducing a Viterbi optimization process that operates along the facial contours. Despite the improvements, we do not expect perfect results in all cases. We therefore introduce an interaction model whereby a user can efficiently guide the algorithm towards a precise solution. We introduce the Helen Facial Feature Dataset consisting of annotated portrait images gathered from Flickr that are more diverse and challenging than currently existing datasets. We present experiments that compare our automatic method to published results, and also a quantitative evaluation of the effectiveness of our interactive method.

973 citations


Journal ArticleDOI
TL;DR: This work introduces explicit feature maps for the additive class of kernels, such as the intersection, Hellinger's, and χ2 kernels, commonly used in computer vision, and enables their use in large scale problems.
Abstract: Large scale nonlinear support vector machines (SVMs) can be approximated by linear ones using a suitable feature map. The linear SVMs are in general much faster to learn and evaluate (test) than the original nonlinear SVMs. This work introduces explicit feature maps for the additive class of kernels, such as the intersection, Hellinger's, and χ2 kernels, commonly used in computer vision, and enables their use in large scale problems. In particular, we: 1) provide explicit feature maps for all additive homogeneous kernels along with closed form expression for all common kernels; 2) derive corresponding approximate finite-dimensional feature maps based on a spectral analysis; and 3) quantify the error of the approximation, showing that the error is independent of the data dimension and decays exponentially fast with the approximation order for selected kernels such as χ2. We demonstrate that the approximations have indistinguishable performance from the full kernels yet greatly reduce the train/test times of SVMs. We also compare with two other approximation methods: Nystrom's approximation of Perronnin et al. [1], which is data dependent, and the explicit map of Maji and Berg [2] for the intersection kernel, which, as in the case of our approximations, is data independent. The approximations are evaluated on a number of standard data sets, including Caltech-101 [3], Daimler-Chrysler pedestrians [4], and INRIA pedestrians [5].
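scikit-learn's AdditiveChi2Sampler implements a sampled explicit feature map of this kind for the additive chi-squared kernel. A short usage sketch, with the digits dataset standing in for a vision benchmark and a linear SVM on the mapped features replacing the exact kernel SVM:

```python
from sklearn.datasets import load_digits
from sklearn.kernel_approximation import AdditiveChi2Sampler
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

X, y = load_digits(return_X_y=True)
X = X / 16.0                      # chi^2 feature maps expect nonnegative inputs

# Approximate chi^2 feature map, followed by a fast linear SVM in the mapped space.
clf = make_pipeline(AdditiveChi2Sampler(sample_steps=2),
                    LinearSVC(C=1.0, max_iter=10000))
print(cross_val_score(clf, X, y, cv=5).mean())
```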

804 citations


Posted Content
TL;DR: This paper proposes to accelerate the computation of the l2,1-norm regularized regression model by reformulating it as two equivalent smooth convex optimization problems, which are then solved via Nesterov's method, an optimal first-order black-box method for smooth convex optimization.
Abstract: The problem of joint feature selection across a group of related tasks has applications in many areas including biomedical informatics and computer vision. We consider the l2,1-norm regularized regression model for joint feature selection from multiple tasks, which can be derived in the probabilistic framework by assuming a suitable prior from the exponential family. One appealing feature of the l2,1-norm regularization is that it encourages multiple predictors to share similar sparsity patterns. However, the resulting optimization problem is challenging to solve due to the non-smoothness of the l2,1-norm regularization. In this paper, we propose to accelerate the computation by reformulating it as two equivalent smooth convex optimization problems which are then solved via Nesterov's method, an optimal first-order black-box method for smooth convex optimization. A key building block in solving the reformulations is the Euclidean projection. We show that the Euclidean projection for the first reformulation can be analytically computed, while the Euclidean projection for the second one can be computed in linear time. Empirical evaluations on several data sets verify the efficiency of the proposed algorithms.
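For intuition about the row sparsity induced by the l2,1 penalty, here is a plain proximal-gradient sketch (not the paper's Nesterov-accelerated reformulation): the proximal step is a row-wise Euclidean shrinkage, so whole feature rows are zeroed jointly across tasks. Names and the toy data are illustrative only:

```python
import numpy as np

def prox_l21(W, tau):
    """Proximal operator of tau * ||W||_{2,1}: shrink each row of W toward zero
    in Euclidean norm; rows with norm below tau become exactly zero."""
    norms = np.linalg.norm(W, axis=1, keepdims=True)
    return W * np.maximum(0.0, 1.0 - tau / np.maximum(norms, 1e-12))

def l21_multitask_regression(X, Y, lam=0.1, n_iter=500):
    """Proximal gradient for min_W 0.5 * ||X W - Y||_F^2 + lam * ||W||_{2,1}."""
    W = np.zeros((X.shape[1], Y.shape[1]))
    step = 1.0 / (np.linalg.norm(X, 2) ** 2)        # 1 / Lipschitz constant of the gradient
    for _ in range(n_iter):
        grad = X.T @ (X @ W - Y)
        W = prox_l21(W - step * grad, step * lam)
    return W

# Toy multi-task data: only the first 5 features are relevant, shared across 3 tasks.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 50))
W_true = np.zeros((50, 3))
W_true[:5] = rng.standard_normal((5, 3))
Y = X @ W_true + 0.01 * rng.standard_normal((200, 3))
W_hat = l21_multitask_regression(X, Y, lam=5.0)
print(np.nonzero(np.linalg.norm(W_hat, axis=1) > 1e-6)[0])   # indices of selected features
```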

630 citations


Journal ArticleDOI
TL;DR: This paper created a challenging real-world copy-move dataset and a software framework for systematic image manipulation, and examined the 15 most prominent feature sets, finding that the keypoint-based features SIFT and SURF, as well as the block-based DCT, DWT, KPCA, PCA, and Zernike features, perform very well.
Abstract: A copy-move forgery is created by copying and pasting content within the same image, and potentially postprocessing it. In recent years, the detection of copy-move forgeries has become one of the most actively researched topics in blind image forensics. A considerable number of different algorithms have been proposed focusing on different types of postprocessed copies. In this paper, we aim to answer which copy-move forgery detection algorithms and processing steps (e.g., matching, filtering, outlier detection, affine transformation estimation) perform best in various postprocessing scenarios. The focus of our analysis is to evaluate the performance of previously proposed feature sets. We achieve this by casting existing algorithms in a common pipeline. In this paper, we examined the 15 most prominent feature sets. We analyzed the detection performance on a per-image basis and on a per-pixel basis. We created a challenging real-world copy-move dataset, and a software framework for systematic image manipulation. Experiments show that the keypoint-based features SIFT and SURF, as well as the block-based DCT, DWT, KPCA, PCA, and Zernike features, perform very well. These feature sets exhibit the best robustness against various noise sources and downsampling, while reliably identifying the copied regions.
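As a toy illustration of one of the evaluated families (a block-based DCT feature set), not of the paper's full pipeline: overlapping blocks are described by their low-frequency DCT coefficients, sorted lexicographically, and near-identical blocks that are far enough apart are flagged. All parameters below are ad hoc:

```python
import numpy as np
from scipy.fftpack import dctn

def block_dct_copy_move(gray, block=16, n_coeffs=9, min_offset=24, thresh=1e-3):
    """Flag pairs of distant image blocks whose low-frequency DCT features match."""
    h, w = gray.shape
    feats, coords = [], []
    for y in range(0, h - block + 1, 2):            # stride 2 keeps the toy example fast
        for x in range(0, w - block + 1, 2):
            d = dctn(gray[y:y + block, x:x + block], norm="ortho")
            feats.append(d[:3, :3].ravel()[:n_coeffs])
            coords.append((y, x))
    feats, coords = np.array(feats), np.array(coords)
    order = np.lexsort(feats.T[::-1])               # lexicographic sort of the feature rows
    pairs = []
    for a, b in zip(order[:-1], order[1:]):
        if np.sum((feats[a] - feats[b]) ** 2) < thresh and \
           np.linalg.norm(coords[a] - coords[b]) > min_offset:
            pairs.append((tuple(coords[a]), tuple(coords[b])))
    return pairs

# Toy usage: paste a copied region into a random image and detect it.
rng = np.random.default_rng(0)
img = rng.random((128, 128))
img[64:96, 64:96] = img[16:48, 16:48]
print(len(block_dct_copy_move(img)))                # > 0: duplicated blocks were found
```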

623 citations


Book ChapterDOI
07 Oct 2012
TL;DR: This paper proposes to learn a metric from pairs of samples from different cameras, so that even less sophisticated features describing color and texture information are sufficient for finally getting state-of-the-art classification results.
Abstract: Matching persons across non-overlapping cameras is a rather challenging task. Thus, successful methods often build on complex feature representations or sophisticated learners. A recent trend to tackle this problem is to use metric learning to find a suitable space for matching samples from different cameras. However, most of these approaches ignore the transition from one camera to the other. In this paper, we propose to learn a metric from pairs of samples from different cameras. In this way, even less sophisticated features describing color and texture information are sufficient for finally getting state-of-the-art classification results. Moreover, once the metric has been learned, only linear projections are necessary at search time, where a simple nearest neighbor classification is performed. The approach is demonstrated on three publicly available datasets of different complexity, where it can be seen that state-of-the-art results can be obtained at much lower computational costs.
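A hedged sketch of the general idea of learning a Mahalanobis metric from matching and non-matching cross-camera pairs: it uses a KISSME-style statistic (difference of inverse pair covariances) rather than the paper's exact formulation, and all names and toy data are hypothetical:

```python
import numpy as np

def metric_from_pairs(X_a, X_b, matches, seed=0):
    """Learn a Mahalanobis matrix M from difference vectors of matching and
    (randomly sampled) non-matching pairs taken from cameras a and b."""
    rng = np.random.default_rng(seed)
    d_pos = np.array([X_a[i] - X_b[j] for i, j in matches])
    d_neg = np.array([X_a[i] - X_b[rng.integers(len(X_b))] for i, _ in matches])  # may rarely hit a true match
    M = np.linalg.pinv(d_pos.T @ d_pos / len(d_pos)) - np.linalg.pinv(d_neg.T @ d_neg / len(d_neg))
    w, V = np.linalg.eigh(M)                         # project back onto the PSD cone
    return V @ np.diag(np.clip(w, 0.0, None)) @ V.T

def metric_distance(M, x, y):
    d = x - y
    return float(d @ M @ d)

# Toy usage: camera b sees the same people as camera a, with a small perturbation.
rng = np.random.default_rng(1)
X_a = rng.standard_normal((100, 20))
X_b = X_a + 0.1 * rng.standard_normal((100, 20))
M = metric_from_pairs(X_a, X_b, matches=[(i, i) for i in range(100)])
print(metric_distance(M, X_a[0], X_b[0]) < metric_distance(M, X_a[0], X_b[1]))  # typically True
```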

Journal ArticleDOI
TL;DR: This paper analyzes key aspects of the various AIA methods, including both feature extraction and semantic learning methods, and provides a comprehensive survey of automatic image annotation.

Proceedings Article
22 Jul 2012
TL;DR: A new unsupervised learning algorithm, namely Nonnegative Discriminative Feature Selection (NDFS), which exploits the discriminative information and feature correlation simultaneously to select a better feature subset.
Abstract: In this paper, a new unsupervised learning algorithm, namely Nonnegative Discriminative Feature Selection (NDFS), is proposed. To exploit the discriminative information in unsupervised scenarios, we perform spectral clustering to learn the cluster labels of the input samples, during which the feature selection is performed simultaneously. The joint learning of the cluster labels and feature selection matrix enables NDFS to select the most discriminative features. To learn more accurate cluster labels, a nonnegative constraint is explicitly imposed on the class indicators. To reduce redundant or even noisy features, an l2,1-norm minimization constraint is added to the objective function, which guarantees that the feature selection matrix is sparse in rows. Our algorithm exploits the discriminative information and feature correlation simultaneously to select a better feature subset. A simple yet efficient iterative algorithm is designed to optimize the proposed objective function. Experimental results on different real-world datasets demonstrate the encouraging performance of our algorithm over state-of-the-art methods.

Proceedings ArticleDOI
16 Jun 2012
TL;DR: In the authors' experiments, it is demonstrated that conditional regression forests outperform regression forests for facial feature detection, and close-to-human accuracy is achieved while processing images in real time.
Abstract: Although facial feature detection from 2D images is a well-studied field, there is a lack of real-time methods that estimate feature points even on low quality images. Here we propose conditional regression forests for this task. While regression forests learn the relations between facial image patches and the location of feature points from the entire set of faces, conditional regression forests learn the relations conditioned on global face properties. In our experiments, we use the head pose as a global property and demonstrate that conditional regression forests outperform regression forests for facial feature detection. We have evaluated the method on the challenging Labeled Faces in the Wild [20] database, where close-to-human accuracy is achieved while processing images in real-time.

Journal ArticleDOI
16 Jun 2012
TL;DR: This paper proposes a new neighborhood repulsed metric learning (NRML) method for kinship verification, as well as a multiview NRML (MNRML) method that seeks a common distance metric to make better use of multiple feature descriptors and further improve the verification performance.
Abstract: Kinship verification from facial images is an interesting and challenging problem in computer vision, and there have been very limited attempts to tackle this problem in the literature. In this paper, we propose a new neighborhood repulsed metric learning (NRML) method for kinship verification. Motivated by the fact that interclass samples (without a kinship relation) with higher similarity usually lie in a neighborhood and are more easily misclassified than those with lower similarity, we aim to learn a distance metric under which the intraclass samples (with a kinship relation) are pulled as close as possible and interclass samples lying in a neighborhood are repulsed and pushed away as far as possible, simultaneously, such that more discriminative information can be exploited for verification. To make better use of multiple feature descriptors to extract complementary information, we further propose a multiview NRML (MNRML) method to seek a common distance metric to perform multiple feature fusion and improve the kinship verification performance. Experimental results are presented to demonstrate the efficacy of our proposed methods. Finally, we also test human ability in kinship verification from facial images, and our experimental results show that the performance of our methods is comparable to that of human observers.

Journal ArticleDOI
TL;DR: The results show that the proposed method can effectively extract the signal features needed to diagnose the occurrence of early faults in rotating machinery.

Book
09 Oct 2012
TL;DR: This book is an essential guide to the implementation of image processing and computer vision techniques, with tutorial introductions and sample code in Matlab, and contains extensive new material on Haar wavelets, Viola-Jones, bilateral filtering, SURF, PCA-SIFT, moving object detection and tracking.
Abstract: This book is an essential guide to the implementation of image processing and computer vision techniques, with tutorial introductions and sample code in Matlab. Algorithms are presented and fully explained to enable complete understanding of the methods and techniques demonstrated. As one reviewer noted, "The main strength of the proposed book is the exemplar code of the algorithms." Fully updated with the latest developments in feature extraction, including expanded tutorials and new techniques, this new edition contains extensive new material on Haar wavelets, Viola-Jones, bilateral filtering, SURF, PCA-SIFT, moving object detection and tracking, development of symmetry operators, LBP texture analysis, Adaboost, and a new appendix on color models. Coverage of distance measures, feature detectors, wavelets, level sets and texture tutorials has been extended.
* Named a 2012 Notable Computer Book for Computing Methodologies by Computing Reviews
* Essential reading for engineers and students working in this cutting-edge field
* Ideal module text and background reference for courses in image processing and computer vision
* The only currently available text to concentrate on feature extraction with working implementation and worked through derivation

Journal ArticleDOI
TL;DR: The patch alignment framework is introduced to linearly combine multiple features in the optimal way and obtain a unified low-dimensional representation of these multiple features for subsequent classification in hyperspectral remote sensing image classification.
Abstract: In hyperspectral remote sensing image classification, multiple features, e.g., spectral, texture, and shape features, are employed to represent pixels from different perspectives. It has been widely acknowledged that properly combining multiple features always results in good classification performance. In this paper, we introduce the patch alignment framework to linearly combine multiple features in the optimal way and obtain a unified low-dimensional representation of these multiple features for subsequent classification. Each feature has its particular contribution to the unified representation determined by simultaneously optimizing the weights in the objective function. This scheme considers the specific statistical properties of each feature to achieve a physically meaningful unified low-dimensional representation of multiple features. Experiments on the classification of the hyperspectral digital imagery collection experiment and reflective optics system imaging spectrometer hyperspectral data sets suggest that this scheme is effective.

Journal ArticleDOI
TL;DR: A class of linear transformation techniques based on blockwise transformation of MFLE is studied; these transformations effectively decorrelate the filter bank log energies and also capture speech information in an efficient manner.

Journal Article
TL;DR: This work introduces a framework for feature selection based on dependence maximization between the selected features and the labels of an estimation problem, using the Hilbert-Schmidt Independence Criterion, and shows that a number of existing feature selectors are special cases of this framework.
Abstract: We introduce a framework for feature selection based on dependence maximization between the selected features and the labels of an estimation problem, using the Hilbert-Schmidt Independence Criterion. The key idea is that good features should be highly dependent on the labels. Our approach leads to a greedy procedure for feature selection. We show that a number of existing feature selectors are special cases of this framework. Experiments on both artificial and real-world data show that our feature selector works well in practice.
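A small self-contained sketch of the idea: greedily add the feature whose inclusion maximizes a (biased) empirical HSIC estimate between the selected features and the labels. The RBF/label kernels and the forward-selection strategy are simplifications of the framework, chosen only to keep the example short:

```python
import numpy as np

def _center(K):
    n = K.shape[0]
    H = np.eye(n) - np.ones((n, n)) / n
    return H @ K @ H

def hsic(K, L):
    """Biased empirical HSIC estimate between two kernel matrices."""
    n = K.shape[0]
    return np.trace(_center(K) @ _center(L)) / (n - 1) ** 2

def rbf_kernel(X, gamma=1.0):
    sq = np.sum(X ** 2, axis=1)
    return np.exp(-gamma * (sq[:, None] + sq[None, :] - 2 * X @ X.T))

def hsic_forward_select(X, y, n_features=5, gamma=1.0):
    """Greedy forward selection: good features should be highly dependent on the labels."""
    L = (y[:, None] == y[None, :]).astype(float)     # simple label kernel
    selected = []
    while len(selected) < n_features:
        best_j, best_score = None, -np.inf
        for j in range(X.shape[1]):
            if j in selected:
                continue
            score = hsic(rbf_kernel(X[:, selected + [j]], gamma), L)
            if score > best_score:
                best_j, best_score = j, score
        selected.append(best_j)
    return selected

# Toy usage: only features 0 and 7 carry label information.
rng = np.random.default_rng(0)
X = rng.standard_normal((150, 12))
y = (X[:, 0] + X[:, 7] > 0).astype(int)
print(hsic_forward_select(X, y, n_features=3))
```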

Journal ArticleDOI
TL;DR: It is shown how to generalize locality-sensitive hashing to accommodate arbitrary kernel functions, making it possible to preserve the algorithm's sublinear time similarity search guarantees for a wide class of useful similarity functions.
Abstract: Fast retrieval methods are critical for many large-scale and data-driven vision applications. Recent work has explored ways to embed high-dimensional features or complex distance functions into a low-dimensional Hamming space where items can be efficiently searched. However, existing methods do not apply for high-dimensional kernelized data when the underlying feature embedding for the kernel is unknown. We show how to generalize locality-sensitive hashing to accommodate arbitrary kernel functions, making it possible to preserve the algorithm's sublinear time similarity search guarantees for a wide class of useful similarity functions. Since a number of successful image-based kernels have unknown or incomputable embeddings, this is especially valuable for image retrieval tasks. We validate our technique on several data sets, and show that it enables accurate and fast performance for several vision problems, including example-based object classification, local feature matching, and content-based retrieval.

Journal ArticleDOI
TL;DR: A forensic tool able to discriminate between original and forged regions in an image captured by a digital camera is presented, based on a new feature measuring the presence of demosaicking artifacts at a local level and a new statistical model that allows deriving the tampering probability of each 2 × 2 image block without requiring a priori knowledge of the position of the forged region.
Abstract: In this paper, a forensic tool able to discriminate between original and forged regions in an image captured by a digital camera is presented. We make the assumption that the image is acquired using a Color Filter Array, and that tampering removes the artifacts due to the demosaicking algorithm. The proposed method is based on a new feature measuring the presence of demosaicking artifacts at a local level, and on a new statistical model that allows deriving the tampering probability of each 2 × 2 image block without requiring a priori knowledge of the position of the forged region. Experimental results on different cameras equipped with different demosaicking algorithms demonstrate both the validity of the theoretical model and the effectiveness of our scheme.

Proceedings ArticleDOI
01 Jan 2012
TL;DR: This work proposes a novel method for re-identification that learns a selection and weighting of mid-level semantic attributes to describe people, an attribute-centric, parts-based feature representation that differs from and complements existing low-level features that rely purely on bottom-up statistics for feature selection.
Abstract: Visually identifying a target individual reliably in a crowded environment observed by a distributed camera network is critical to a variety of tasks in managing business information, border control, and crime prevention. Automatic re-identification of a human candidate from public space CCTV video is challenging due to spatiotemporal visual feature variations and strong visual similarity between different people, compounded by low-resolution and poor quality video data. In this work, we propose a novel method for re-identification that learns a selection and weighting of mid-level semantic attributes to describe people. Specifically, the model learns an attribute-centric, parts-based feature representation. This differs from and complements existing low-level features for re-identification that rely purely on bottom-up statistics for feature selection, which are limited in reliably discriminating and identifying the visual appearances of target people appearing in different camera views under certain degrees of occlusion due to crowdedness. Our experiments demonstrate the effectiveness of our approach compared to existing feature representations when applied to benchmarking datasets.

Proceedings Article
03 Dec 2012
TL;DR: This paper incorporates deep learning into the congealing alignment framework, and modify the learning algorithm for the restricted Boltzmann machine by incorporating a group sparsity penalty, leading to a topographic organization of the learned filters and improving subsequent alignment results.
Abstract: Unsupervised joint alignment of images has been demonstrated to improve performance on recognition tasks such as face verification. Such alignment reduces undesired variability due to factors such as pose, while only requiring weak supervision in the form of poorly aligned examples. However, prior work on unsupervised alignment of complex, real-world images has required the careful selection of feature representation based on hand-crafted image descriptors, in order to achieve an appropriate, smooth optimization landscape. In this paper, we instead propose a novel combination of unsupervised joint alignment with unsupervised feature learning. Specifically, we incorporate deep learning into the congealing alignment framework. Through deep learning, we obtain features that can represent the image at differing resolutions based on network depth, and that are tuned to the statistics of the specific data being aligned. In addition, we modify the learning algorithm for the restricted Boltzmann machine by incorporating a group sparsity penalty, leading to a topographic organization of the learned filters and improving subsequent alignment results. We apply our method to the Labeled Faces in the Wild database (LFW). Using the aligned images produced by our proposed unsupervised algorithm, we achieve higher accuracy in face verification compared to prior work in both unsupervised and supervised alignment. We also match the accuracy for the best available commercial method.

Journal ArticleDOI
01 Jul 2012
TL;DR: A targeted feature transform based on Gabor filters is developed for this system for 3D object retrieval from sketched feature lines, and it is shown objectively that this transform is better suited than other approaches from the literature developed for similar tasks.
Abstract: We develop a system for 3D object retrieval based on sketched feature lines as input. For objective evaluation, we collect a large number of query sketches from human users that are related to an existing database of objects. The sketches turn out to be generally quite abstract, with large local and global deviations from the original shape. Based on this observation, we decide to use a bag-of-features approach over computer-generated line drawings of the objects. We develop a targeted feature transform based on Gabor filters for this system. We can show objectively that this transform is better suited than other approaches from the literature developed for similar tasks. Moreover, we demonstrate how to optimize the parameters of our approach, as well as of other approaches, based on the gathered sketches. In the resulting comparison, our approach is significantly better than any other system described so far.
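To illustrate only the Gabor filter-bank ingredient (the paper's targeted transform for line drawings is more involved), a simple scikit-image sketch that builds a descriptor from mean filter-response energies over a few frequencies and orientations:

```python
import numpy as np
from skimage import color, data, filters

def gabor_energy_descriptor(gray, frequencies=(0.1, 0.2, 0.3), n_orientations=6):
    """Mean Gabor response energy per (frequency, orientation) pair."""
    feats = []
    for f in frequencies:
        for k in range(n_orientations):
            theta = k * np.pi / n_orientations
            real, imag = filters.gabor(gray, frequency=f, theta=theta)
            feats.append(np.mean(real ** 2 + imag ** 2))
    return np.array(feats)

gray = color.rgb2gray(data.astronaut())              # stand-in for a rendered line drawing
print(gabor_energy_descriptor(gray).shape)           # (18,)
```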

Journal ArticleDOI
TL;DR: A survey and a comparative evaluation of recent techniques for moving cast shadow detection indicate that all shadow detection approaches make different contributions and all have individual strengths and weaknesses.

Book ChapterDOI
07 Oct 2012
TL;DR: This study shows that certain features play a more important role than others under different circumstances, and proposes a novel unsupervised approach for learning a bottom-up feature importance, so that features extracted from different individuals are weighted adaptively, driven by their unique and inherent appearance attributes.
Abstract: State-of-the-art person re-identification methods seek robust person matching through combining various feature types. Often, these features are implicitly assigned with a single vector of global weights, which are assumed to be universally good for all individuals, independent to their different appearances. In this study, we show that certain features play more important role than others under different circumstances. Consequently, we propose a novel unsupervised approach for learning a bottom-up feature importance, so features extracted from different individuals are weighted adaptively driven by their unique and inherent appearance attributes. Extensive experiments on two public datasets demonstrate that attribute-sensitive feature importance facilitates more accurate person matching when it is fused together with global weights obtained using existing methods.

Journal ArticleDOI
TL;DR: The proposed image retargeting algorithm effectively preserves the visually important regions for images, efficiently removes the less crucial regions, and therefore significantly outperforms the relevant state-of-the-art algorithms, as demonstrated with the in-depth analysis in the extensive experiments.
Abstract: Saliency detection plays important roles in many image processing applications, such as regions of interest extraction and image resizing. Existing saliency detection models are built in the uncompressed domain. Since most images over the Internet are typically stored in the compressed domain, such as joint photographic experts group (JPEG), we propose a novel saliency detection model in the compressed domain in this paper. The intensity, color, and texture features of the image are extracted from discrete cosine transform (DCT) coefficients in the JPEG bit-stream. The saliency value of each DCT block is obtained based on the Hausdorff distance calculation and feature map fusion. Based on the proposed saliency detection model, we further design an adaptive image retargeting algorithm in the compressed domain. The proposed image retargeting algorithm utilizes a multioperator operation comprising block-based seam carving and image scaling to resize images. A new definition of texture homogeneity is given to determine the number of block-based seams to remove. Thanks to the accurate saliency information derived directly in the compressed domain, the proposed image retargeting algorithm effectively preserves the visually important regions for images, efficiently removes the less crucial regions, and therefore significantly outperforms the relevant state-of-the-art algorithms, as demonstrated with the in-depth analysis in the extensive experiments.

Posted Content
TL;DR: A new learning method for heterogeneous domain adaptation (HDA), in which the data from the source domain and the target domain are represented by heterogeneous features with different dimensions, and it is demonstrated that HFA outperforms the existing HDA methods.
Abstract: We propose a new learning method for heterogeneous domain adaptation (HDA), in which the data from the source domain and the target domain are represented by heterogeneous features with different dimensions. Using two different projection matrices, we first transform the data from two domains into a common subspace in order to measure the similarity between the data from two domains. We then propose two new feature mapping functions to augment the transformed data with their original features and zeros. The existing learning methods (e.g., SVM and SVR) can be readily incorporated with our newly proposed augmented feature representations to effectively utilize the data from both domains for HDA. Using the hinge loss function in SVM as an example, we introduce the detailed objective function in our method called Heterogeneous Feature Augmentation (HFA) for a linear case and also describe its kernelization in order to efficiently cope with the data with very high dimensions. Moreover, we also develop an alternating optimization algorithm to effectively solve the nontrivial optimization problem in our HFA method. Comprehensive experiments on two benchmark datasets clearly demonstrate that HFA outperforms the existing HDA methods.
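A hypothetical sketch of the augmented feature maps described above: source samples become [P x_s, x_s, 0] and target samples [Q x_t, 0, x_t], so both domains live in a common (d_c + d_s + d_t)-dimensional space where a standard SVM can be trained. In HFA the projections P and Q are learned jointly with the classifier; here they are simply given random values to show the shapes:

```python
import numpy as np

def hfa_augment(Xs, Xt, P, Q):
    """Augment source/target features with a common projected part, the original
    features, and zero padding, following the feature maps sketched above."""
    ns, d_s = Xs.shape
    nt, d_t = Xt.shape
    Xs_aug = np.hstack([Xs @ P.T, Xs, np.zeros((ns, d_t))])   # [P x_s, x_s, 0]
    Xt_aug = np.hstack([Xt @ Q.T, np.zeros((nt, d_s)), Xt])   # [Q x_t, 0, x_t]
    return Xs_aug, Xt_aug

# Toy shapes: 300-dim source features, 100-dim target features, 50-dim common subspace.
rng = np.random.default_rng(0)
Xs, Xt = rng.standard_normal((20, 300)), rng.standard_normal((8, 100))
P, Q = rng.standard_normal((50, 300)), rng.standard_normal((50, 100))
Xs_aug, Xt_aug = hfa_augment(Xs, Xt, P, Q)
print(Xs_aug.shape, Xt_aug.shape)                    # (20, 450) (8, 450)
```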