Hash-Based Support Vector Machines Approximation for Large Scale Prediction

doi:10.5244/C.26.86

Open AccessProceedings ArticleDOI

Hash-Based Support Vector Machines Approximation for Large Scale Prediction

- pp 1-11

TLDR

This paper addresses the problem of speeding-up the prediction phase of linear Support Vector Machines via Locality Sensitive Hashing by building efficient hash based classifiers that are applied in a first stage in order to approximate the exact results and filter the hypothesis space.

Abstract:

How-to train effective classifiers on huge amount of multimedia data is clearly a major challenge that is attracting more and more research works across several communities. Less efforts however are spent on the counterpart scalability issue: how to apply big trained models efficiently on huge non annotated media collections ? In this paper, we address the problem of speeding-up the prediction phase of linear Support Vector Machines via Locality Sensitive Hashing. We propose building efficient hash based classifiers that are applied in a first stage in order to approximate the exact results and filter the hypothesis space. Experiments performed with millions of one-against-one classifiers show that the proposed hash-based classifier can be more than two orders of magnitude faster than the exact classifier with minor losses in quality.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

CROification: Accurate Kernel Classification with the Efficiency of Sparse Linear SVM

Mehran Kafai, +1 more

- 01 Jan 2019 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A new kernel is introduced, the CRO (Concomitant Rank Order) kernel that approximates the Gaussian kernel on the unit sphere and a randomized feature map is introduced that produces sparse, high-dimensional feature vectors whose inner product asymptotically equals theCRO kernel.

...read moreread less

Journal ArticleDOI

Towards large-scale multimedia retrieval enriched by knowledge about human interpretation

Kimiaki Shirahama, +1 more

- 01 Jan 2016 -

Multimedia Tools and Applications

TL;DR: This paper defends the importance of human-machine cooperation which incorporates the above knowledge into LSMR, and defines its three future directions (cognition-based, ontology-based and adaptive learning) depending on types of knowledge, and suggest to explore each direction by considering its relation to the others.

...read moreread less

Journal ArticleDOI

Scalable Mobile Visual Classification by Kernel Preserving Projection Over High-Dimensional Features

Yu-Chuan Su, +4 more

- 06 May 2014 -

IEEE Transactions on Multimedia

TL;DR: This work proposes an unsupervised linear dimension reduction algorithm, kernel preserving projection (KPP), which approximates the kernel matrix of high dimensional features with low dimensional linear embedding and proves that the proposed method outperforms existing dimension reduction methods.

...read moreread less

Book ChapterDOI

Large-Scale R-CNN with Classifier Adaptive Quantization

Ryota Hinami, +1 more

TL;DR: This paper presents a novel quantization method designed forlinear classification wherein the quantization error is re-defined for linear classification and approximates the error as the empirical error with pre-defined multiple exemplar classifiers and captures the variance and common attributes of object category classifiers effectively.

...read moreread less

Journal ArticleDOI

A Systematic Review on Minwise Hashing Algorithms

Jingjing Tang, +1 more

- 26 Oct 2016 -

Annals of Data Science

TL;DR: The purpose of this paper is to review minwise hashing algorithms in detail and provide an insightful understanding of current developments and their limitations, major opportunities and challenges, extensions and variants as well as potential important research directions have been pointed out.

...read moreread less

References

PDF

Open Access

More filters

Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

Journal ArticleDOI

Bagging predictors

Leo Breiman

TL;DR: Tests on real and simulated data sets using classification and regression trees and subset selection in linear regression show that bagging can give substantial gains in accuracy.

...read moreread less

Journal Article

LIBLINEAR: A Library for Large Linear Classification

Rong-En Fan, +4 more

- 01 Jun 2008 -

Journal of Machine Learning Research

TL;DR: LIBLINEAR is an open source library for large-scale linear classification that supports logistic regression and linear support vector machines and provides easy-to-use command-line tools and library calls for users and developers.

...read moreread less

Proceedings ArticleDOI

Similarity estimation techniques from rounding algorithms

Moses Charikar

TL;DR: It is shown that rounding algorithms for LPs and SDPs used in the context of approximation algorithms can be viewed as locality sensitive hashing schemes for several interesting collections of objects.

...read moreread less

Proceedings ArticleDOI

Training linear SVMs in linear time

Thorsten Joachims

TL;DR: A Cutting Plane Algorithm for training linear SVMs that provably has training time 0(s,n) for classification problems and o(sn log (n)) for ordinal regression problems and several orders of magnitude faster than decomposition methods like svm light for large datasets.

...read moreread less

ACM Transactions on Intelligent Systems ...

Hash-Based Support Vector Machines Approximation for Large Scale Prediction

Citations

CROification: Accurate Kernel Classification with the Efficiency of Sparse Linear SVM

Towards large-scale multimedia retrieval enriched by knowledge about human interpretation

Scalable Mobile Visual Classification by Kernel Preserving Projection Over High-Dimensional Features

Large-Scale R-CNN with Classifier Adaptive Quantization

A Systematic Review on Minwise Hashing Algorithms

References

ImageNet: A large-scale hierarchical image database

Bagging predictors

LIBLINEAR: A Library for Large Linear Classification

Similarity estimation techniques from rounding algorithms

Training linear SVMs in linear time

Related Papers (5)

Streaming similarity search over one billion tweets using parallel locality-sensitive hashing

Online hashing

Accelerated large scale optimization by concomitant hashing

Hashing for Distributed Data

Fast Distributed kNN Graph Construction Using Auto-tuned Locality-sensitive Hashing