scispace - formally typeset
Proceedings ArticleDOI

Learning Coupled Feature Spaces for Cross-Modal Matching

Kaiye Wang, +4 more
- pp 2088-2095
Reads0
Chats0
TLDR
A novel coupled linear regression framework to deal with the measure of relevance and coupled feature selection in cross-modal data matching, and an iterative algorithm based on half-quadratic minimization to solve the proposed regularized linear regression problem.
Abstract
Cross-modal matching has recently drawn much attention due to the widespread existence of multimodal data. It aims to match data from different modalities, and generally involves two basic problems: the measure of relevance and coupled feature selection. Most previous works mainly focus on solving the first problem. In this paper, we propose a novel coupled linear regression framework to deal with both problems. Our method learns two projection matrices to map multimodal data into a common feature space, in which cross-modal data matching can be performed. And in the learning procedure, the ell_21-norm penalties are imposed on the two projection matrices separately, which leads to select relevant and discriminative features from coupled feature spaces simultaneously. A trace norm is further imposed on the projected data as a low-rank constraint, which enhances the relevance of different modal data with connections. We also present an iterative algorithm based on half-quadratic minimization to solve the proposed regularized linear regression problem. The experimental results on two challenging cross-modal datasets demonstrate that the proposed method outperforms the state-of-the-art approaches.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Visual Domain Adaptation: A survey of recent advances

TL;DR: A survey of domain adaptation methods for visual recognition discusses the merits and drawbacks of existing domain adaptation approaches and identifies promising avenues for research in this rapidly evolving field.
Proceedings ArticleDOI

Adversarial Cross-Modal Retrieval

TL;DR: Comprehensive experimental results show that the proposed ACMR method is superior in learning effective subspace representation and that it significantly outperforms the state-of-the-art cross-modal retrieval methods.
Journal ArticleDOI

Multi-View Discriminant Analysis

TL;DR: This work proposes a Multi-view Discriminant Analysis (MvDA) approach, which seeks for a single discriminant common space for multiple views in a non-pairwise manner by jointly learning multiple view-specific linear transforms.
Journal ArticleDOI

Transductive Multi-View Zero-Shot Learning

TL;DR: A novel heterogeneous multi-view hypergraph label propagation method is formulated for zero-shot learning in the transductive embedding space that rectifies the projection shift between the auxiliary and target domains, exploits the complementarity of multiple semantic representations, and significantly outperforms existing methods for both zero- shot and N-shot recognition.
Journal ArticleDOI

Learning Compact Binary Face Descriptor for Face Recognition

TL;DR: A compact binary face descriptor (CBFD) feature learning method for face representation and recognition that reduces the modality gap of heterogeneous faces at the feature level to make the method applicable to heterogeneous face recognition.
References
More filters
Journal ArticleDOI

Canonical Correlation Analysis: An Overview with Application to Learning Methods

TL;DR: A general method using kernel canonical correlation analysis to learn a semantic representation to web images and their associated text and compares orthogonalization approaches against a standard cross-representation retrieval technique known as the generalized vector space model is presented.
Proceedings ArticleDOI

A new approach to cross-modal multimedia retrieval

TL;DR: It is shown that accounting for cross-modal correlations and semantic abstraction both improve retrieval accuracy and are shown to outperform state-of-the-art image retrieval systems on a unimodal retrieval task.
Book ChapterDOI

Overview and recent advances in partial least squares

TL;DR: Partial Least Squares (PLS) as mentioned in this paper is a wide class of methods for modeling relations between sets of observed variables by means of latent variables, which comprises of regression and classification tasks as well as dimension reduction techniques and modeling tools.
Journal ArticleDOI

Separating Style and Content with Bilinear Models

TL;DR: A general framework for learning to solve two-factor tasks using bilinear models, which provide sufficiently expressive representations of factor interactions but can nonetheless be fit to data using efficient algorithms based on the singular value decomposition and expectation-maximization are presented.
Proceedings ArticleDOI

Generalized Multiview Analysis: A discriminative latent space

TL;DR: GMA solves a joint, relaxed QCQP over different feature spaces to obtain a single (non)linear subspace and is a supervised extension of Canonical Correlational Analysis (CCA), which is useful for cross-view classification and retrieval.
Related Papers (5)