Open Access · Journal Article · DOI

The Bitwise Hashing Trick for Personalized Search

TLDR
In this article, the use of feature bit vectors built with the hashing trick is introduced for improving relevance in personalized search and other personalization applications. Using a single bit per dimension instead of a floating-point value results in an order of magnitude decrease in data structure size while preserving or even improving quality.
Abstract
Many real-world problems require fast and efficient lexical comparison of large numbers of short text strings. Search personalization is one such domain. We introduce the use of feature bit vectors built with the hashing trick for improving relevance in personalized search and other personalization applications. We present results of several lexical hashing and comparison methods. These methods are applied to a user's historical behavior and are used to predict future behavior. Using a single bit per dimension instead of a floating-point value results in an order of magnitude decrease in data structure size, while preserving or even improving quality. We use real data to simulate a search personalization task. A simple method for combining bit vectors demonstrates an order of magnitude improvement in compute time on the task with only a small decrease in accuracy.
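
As a rough illustration of the approach described in the abstract, the sketch below hashes short text strings into fixed-width feature bit vectors (one bit per hashed dimension rather than a float), OR-combines a user's historical strings into a single profile vector, and scores candidate strings against it with a bitwise similarity. The vector width, tokenization, hash function, and Jaccard-style score are illustrative assumptions, not the paper's exact method.

import hashlib

NUM_BITS = 1024  # assumed vector width; the paper's dimensionality may differ


def hash_to_bits(text, num_bits=NUM_BITS):
    """Map a short text string to a feature bit vector via the hashing trick.

    Each token is hashed to a dimension index and that bit is set to 1,
    instead of accumulating a floating-point weight per dimension.
    """
    bits = 0
    for token in text.lower().split():
        idx = int(hashlib.md5(token.encode("utf-8")).hexdigest(), 16) % num_bits
        bits |= 1 << idx
    return bits


def combine(bit_vectors):
    """Combine a user's historical bit vectors into one profile with bitwise OR."""
    profile = 0
    for bv in bit_vectors:
        profile |= bv
    return profile


def jaccard(a, b):
    """Bitwise Jaccard similarity: |a AND b| / |a OR b|."""
    union = bin(a | b).count("1")
    return bin(a & b).count("1") / union if union else 0.0


# Example: score candidate strings against a profile built from past queries.
history = ["cheap flights to boston", "boston hotel deals"]
profile = combine(hash_to_bits(q) for q in history)
for candidate in ["boston flight schedule", "gardening tips"]:
    print(candidate, round(jaccard(profile, hash_to_bits(candidate)), 3))

Using arbitrary-precision Python integers as bit vectors keeps the example compact; a production system would use fixed-width bit arrays and hardware popcount, which is what makes bitwise comparison an order of magnitude cheaper than floating-point vector comparison.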


Citations
Proceedings Article · DOI

An Efficient and Accurate Detection of Fake News Using Capsule Transient Auto Encoder

TL;DR: Adaptive Capsule Transient Auto Encoder (ACTAE), as discussed by the authors, combines a capsule auto encoder classifier with an adaptive transient search optimization algorithm.
References
Book

Mining of Massive Datasets

Din J. Wasem
TL;DR: Determining relevant data is key to delivering value from massive amounts of data, and big data is defined less by volume, which is a constantly moving target, than by its ever-increasing variety, velocity, variability and complexity.
Posted Content

Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations

TL;DR: A binary matrix multiplication GPU kernel is presented with which the MNIST QNN runs 7 times faster than with an unoptimized GPU kernel, without any loss in classification accuracy.
Proceedings Article · DOI

Feature hashing for large scale multitask learning

TL;DR: In this article, the authors provide exponential tail bounds for feature hashing, show that the interaction between random subspaces is negligible with high probability, and demonstrate the feasibility of the approach with experimental results for a new use case.
Proceedings Article · DOI

Personalizing search via automated analysis of interests and activities

TL;DR: This research suggests that rich representations of the user and the corpus are important for personalization, but that it is possible to approximate these representations and provide efficient client-side algorithms for personalizing search.
Journal Article

Quantized neural networks: training neural networks with low precision weights and activations

TL;DR: In this paper, a method is introduced to train quantized neural networks (QNNs) with extremely low precision (e.g., 1-bit) weights and activations at run-time.