Topic

Feature hashing

About: Feature hashing is a research topic. Over its lifetime, 993 publications have appeared within this topic, receiving 51,462 citations.

Papers

Proceedings Article

Variational Deep Semantic Hashing for Text Documents

Suthee Chaidaroon¹, Yi Fang¹

Santa Clara University¹

07 Aug 2017

TL;DR: This paper proposes a series of novel deep document generative models for text hashing; the models can be interpreted as encoder-decoder deep neural networks and are thus capable of learning complex nonlinear distributed representations of the original documents.

Abstract: As the amount of textual data has been rapidly increasing over the past decade, efficient similarity search methods have become a crucial component of large-scale information retrieval systems. A popular strategy is to represent original data samples by compact binary codes through hashing. A spectrum of machine learning methods have been utilized, but they often lack expressiveness and flexibility in modeling to learn effective representations. Recent advances in deep learning across a wide range of applications have demonstrated its capability to learn robust and powerful feature representations for complex data. In particular, deep generative models naturally combine the expressiveness of probabilistic generative models with the high capacity of deep neural networks, which makes them well suited to text modeling. However, little work has leveraged the recent progress in deep learning for text hashing. In this paper, we propose a series of novel deep document generative models for text hashing. The first proposed model is unsupervised while the second one is supervised by utilizing document labels/tags for hashing. The third model further considers document-specific factors that affect the generation of words. The probabilistic generative formulation of the proposed models provides a principled framework for model extension, uncertainty estimation, simulation, and interpretability. Based on variational inference and reparameterization, the proposed models can be interpreted as encoder-decoder deep neural networks and thus they are capable of learning complex nonlinear distributed representations of the original documents. We conduct a comprehensive set of experiments on four public testbeds. The experimental results demonstrate the effectiveness of the proposed supervised learning models for text hashing.

77 citations
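
The abstract above mentions representing data samples by compact binary codes and searching by similarity. As a rough illustration of that general idea only, the sketch below uses sign random projections, a simple data-independent baseline, and ranks items by Hamming distance. It is not the variational model proposed in the paper; the function names (`make_hyperplanes`, `binary_code`, `hamming`, `rank_by_hamming`) and all parameters are hypothetical.

```python
import random


def make_hyperplanes(num_bits, dim, seed=0):
    """Draw num_bits random Gaussian hyperplanes in dim dimensions (assumed setup)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_bits)]


def binary_code(vector, hyperplanes):
    """Pack one bit per hyperplane: 1 if the vector lies on its non-negative side."""
    code = 0
    for plane in hyperplanes:
        dot = sum(w * x for w, x in zip(plane, vector))
        code = (code << 1) | (1 if dot >= 0 else 0)
    return code


def hamming(a, b):
    """Number of differing bits between two integer-packed codes."""
    return bin(a ^ b).count("1")


def rank_by_hamming(query_code, codes):
    """Return item indices sorted from nearest to farthest in Hamming distance."""
    return sorted(range(len(codes)), key=lambda i: hamming(query_code, codes[i]))
```

Vectors pointing in similar directions tend to share most bits, so a short integer code can stand in for the full vector during a first-pass search; learned hashing methods such as the one above aim to produce better codes than random projections do.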

Proceedings Article

Non-metric locality-sensitive hashing

Yadong Mu¹, Shuicheng Yan¹

National University of Singapore¹

11 Jul 2010

TL;DR: This paper utilizes the norm-keeping property of p-stable functions to ensure that the collision probability of two data points reflects their non-metric distance in the original feature space, and investigates various concrete examples to validate the proposed algorithm.

Abstract: Non-metric distances are often more reasonable than metric ones in terms of consistency with human perceptions. However, existing locality-sensitive hashing (LSH) algorithms can only support data measured with metrics. In this paper we propose a novel locality-sensitive hashing algorithm targeting such non-metric data. Data in the original feature space are embedded into an implicit reproducing kernel Kreĭn space and then hashed to obtain binary bits. Here we utilize the norm-keeping property of p-stable functions to ensure that the collision probability of two data points reflects their non-metric distance in the original feature space. We investigate various concrete examples to validate the proposed algorithm. Extensive empirical evaluations illustrate its effectiveness in terms of accuracy and retrieval speedup.

76 citations
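
For readers unfamiliar with the p-stable idea the abstract above builds on, here is a minimal sketch of the standard metric-case scheme for Euclidean distance: project onto a random Gaussian direction, add a random offset, and cut the line into buckets of a fixed width. It does not implement the paper's Kreĭn-space embedding for non-metric data; `make_hash`, `collision_rate`, and the bucket width are assumptions chosen for illustration.

```python
import math
import random


def make_hash(dim, width, seed):
    """Build one hash h(x) = floor((a . x + b) / width).

    a has i.i.d. standard Gaussian entries (the Gaussian is 2-stable),
    and b is a random offset drawn uniformly from [0, width].
    """
    rng = random.Random(seed)
    a = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    b = rng.uniform(0.0, width)

    def h(x):
        projection = sum(ai * xi for ai, xi in zip(a, x))
        return math.floor((projection + b) / width)

    return h


def collision_rate(x, y, dim, width, trials=2000):
    """Estimate how often x and y fall in the same bucket over many random hashes."""
    hits = 0
    for seed in range(trials):
        h = make_hash(dim, width, seed)
        hits += h(x) == h(y)
    return hits / trials
```

Calling `collision_rate` on a close pair and on a distant pair should show the close pair colliding more often, which is the property that lets the collision probability stand in for distance.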

Journal Article

Lexicographical framework for image hashing with implementation based on DCT and NMF

Zhenjun Tang¹, Shuozhong Wang¹, Xinpeng Zhang¹, Weimin Wei¹, Yan Zhao¹

Shanghai University¹

01 Apr 2011, Multimedia Tools and Applications

TL;DR: Under the proposed framework, a hashing scheme using discrete cosine transform (DCT) and non-negative matrix factorization (NMF) is implemented, and experimental results show that the proposed scheme is resistant to normal content-preserving manipulations, and has a very low collision probability.

Abstract: An image hash is a content-based compact representation of an image for applications such as image copy detection, digital watermarking, and image authentication. This paper proposes a lexicographical-structured framework to generate image hashes. The system consists of two parts: dictionary construction and maintenance, and hash generation. The dictionary is a large collection of feature vectors called words, representing characteristics of various image blocks. It is composed of a number of sub-dictionaries, and each sub-dictionary contains many features, the number of which grows as the number of training images increases. The dictionary is used to provide basic building blocks, namely, the words, to form the hash. In the hash generation, blocks of the input image are represented by features associated with the sub-dictionaries. This is achieved by using a similarity metric to find the most similar feature among the selective features of each sub-dictionary. The corresponding features are combined to produce an intermediate hash. The final hash is obtained by encoding the intermediate hash. Under the proposed framework, we have implemented a hashing scheme using discrete cosine transform (DCT) and non-negative matrix factorization (NMF). Experimental results show that the proposed scheme is resistant to normal content-preserving manipulations, and has a very low collision probability.

76 citations

Journal Article

Adaptive hash retrieval with kernel based similarity

Xiao Bai¹, Cheng Yan¹, Haichuan Yang¹, Lu Bai², Jun Zhou³, Edwin R. Hancock⁴

Beihang University¹, Central University of Finance and Economics², Griffith University³, University of York⁴

01 Mar 2018, Pattern Recognition

TL;DR: A novel adaptive similarity measure which is consistent with k-nearest neighbor search is presented, and it is proved that it leads to a valid kernel if the original similarity function is a kernel function.

76 citations

Journal Article

Supervised deep hashing for scalable face image retrieval

Jinhui Tang¹, Zechao Li¹, Xiang Zhu¹

Nanjing University of Science and Technology¹

01 Mar 2018, Pattern Recognition

TL;DR: This work proposes a novel supervised hashing method for scalable face image retrieval, i.e., Deep Hashing based on Classification and Quantization errors (DHCQ), by simultaneously learning feature representations of images, hash codes and classifiers.

76 citations

Performance

Metrics

Papers: 1,120

Citations: 57,460

No. of papers in the topic in previous years
Year	Papers
2023	33
2022	89
2021	11
2020	16
2019	16
2018	38