Proceedings ArticleDOI

TC-Net for iSBIR: Triplet Classification Network for Instance-level Sketch Based Image Retrieval

TL;DR: A Triplet Classification Network (TC-Net) for iSBIR is presented, composed of two major components: a triplet Siamese network and an auxiliary classification loss, which overcomes the limitations of previous works.
Abstract
Sketch has been employed as an effective communication tool to express the abstract and intuitive meaning of objects. While content-based sketch recognition has been studied for several decades, the instance-level Sketch Based Image Retrieval (iSBIR) task has attracted significant research attention recently. In many previous iSBIR works, e.g., TripletSN and DSSA, edge maps were employed as intermediate representations in bridging the cross-domain discrepancy between photos and sketches. However, it is nontrivial to efficiently train and effectively use the edge maps in an iSBIR system. In particular, we find that such an edge map based iSBIR system has several major limitations. First, the system has to be pre-trained on a significant amount of edge maps, either from large-scale sketch datasets, e.g., TU-Berlin, or converted from other large-scale image datasets, e.g., the ImageNet-1K dataset. Second, the performance of such an iSBIR system is very sensitive to the quality of the edge maps. Third, empirically, the multi-cropping strategy is essential to the performance of previous iSBIR systems. To address these limitations, this paper advocates an end-to-end iSBIR system that does not use edge maps. Specifically, we present a Triplet Classification Network (TC-Net) for iSBIR composed of two major components: a triplet Siamese network and an auxiliary classification loss. Our TC-Net overcomes the limitations of previous works. Extensive experiments on several datasets validate the efficacy of the proposed network and system.
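The two components of the abstract's training objective can be sketched in plain Python. The snippet below is a minimal, framework-free illustration, not the authors' implementation: embeddings are plain lists standing in for CNN features, and the margin of 0.3 and the equal weighting of the two terms are assumptions for illustration only.

```python
import math

def l2(a, b):
    # Squared Euclidean distance between two embedding vectors.
    return sum((x - y) ** 2 for x, y in zip(a, b))

def triplet_loss(anchor, positive, negative, margin=0.3):
    # Hinge-style triplet loss: pull the sketch (anchor) toward its
    # matching photo (positive) and away from a non-matching photo.
    return max(0.0, l2(anchor, positive) - l2(anchor, negative) + margin)

def cross_entropy(logits, label):
    # Auxiliary classification loss: numerically stable softmax
    # cross-entropy over logits from a shared classifier head.
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]

def tc_net_loss(anchor, positive, negative, logits, label,
                margin=0.3, weight=1.0):
    # Total objective: triplet ranking term plus a weighted auxiliary
    # classification term (the weighting scheme here is an assumption).
    return (triplet_loss(anchor, positive, negative, margin)
            + weight * cross_entropy(logits, label))
```

With a well-separated triplet the ranking term vanishes and only the classification term contributes, which is the mechanism by which the auxiliary loss keeps supplying gradient signal after the embedding margins are satisfied.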


Citations
Proceedings ArticleDOI

Sketch-BERT: Learning Sketch Bidirectional Encoder Representation From Transformers by Self-Supervised Learning of Sketch Gestalt

TL;DR: This work presents Sketch-BERT, a model that learns a Sketch Bidirectional Encoder Representation from Transformers by generalizing BERT to the sketch domain, with novel components and pre-training algorithms including newly designed sketch embedding networks and self-supervised learning of sketch gestalt.
Proceedings ArticleDOI

Deep Structural Contour Detection

TL;DR: This work proposes a novel yet very effective loss function for contour detection, capable of penalizing the contour-structure dissimilarity between each prediction and its ground truth, and introduces a novel convolutional encoder-decoder network.
Posted Content

Deep Learning for Free-Hand Sketch: A Survey and A Toolbox

TL;DR: A comprehensive survey of the deep learning techniques oriented at free-hand sketch data, and the applications that they enable.
Journal ArticleDOI

Deep Learning for Free-Hand Sketch: A Survey

TL;DR: This paper presents a comprehensive survey of the deep learning techniques oriented at free-hand sketch data and the applications they enable, highlighting the essential differences between sketch data and other data modalities, e.g., natural photos.
Journal ArticleDOI

AE-Net: Fine-grained sketch-based image retrieval via attention-enhanced network

TL;DR: Zhang et al. as discussed by the authors investigated the task of Fine-grained Sketch-based Image Retrieval (FG-SBIR), which uses hand-drawn sketches as input queries to retrieve the relevant images at the fine-grained instance level.
References
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: A deep convolutional neural network achieving state-of-the-art performance is presented, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Journal ArticleDOI

A Computational Approach to Edge Detection

TL;DR: There is a natural uncertainty principle between detection and localization performance, which are the two main goals, and with this principle a single operator shape is derived which is optimal at any scale.
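Since edge maps are the intermediate representation that TC-Net sets out to avoid, it is worth seeing what producing one involves. The sketch below implements only the gradient-estimation step that Canny-style detectors build on, using 3x3 Sobel kernels; a full Canny pipeline additionally applies Gaussian smoothing, non-maximum suppression, and hysteresis thresholding. Images are plain nested lists, an assumption made to keep the example dependency-free.

```python
def sobel_magnitude(img):
    # Approximate the image gradient magnitude with 3x3 Sobel kernels.
    # Border pixels are left at zero for simplicity.
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    kx = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]   # horizontal gradient
    ky = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]   # vertical gradient
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = sum(kx[j][i] * img[y + j - 1][x + i - 1]
                     for j in range(3) for i in range(3))
            gy = sum(ky[j][i] * img[y + j - 1][x + i - 1]
                     for j in range(3) for i in range(3))
            out[y][x] = (gx * gx + gy * gy) ** 0.5
    return out
```

The sensitivity the abstract mentions is visible even here: the response at a step edge depends directly on pixel contrast, so thresholds tuned for one photo distribution transfer poorly to another.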
Proceedings ArticleDOI

Densely Connected Convolutional Networks

TL;DR: DenseNet as mentioned in this paper proposes to connect each layer to every other layer in a feed-forward fashion, which can alleviate the vanishing gradient problem, strengthen feature propagation, encourage feature reuse, and substantially reduce the number of parameters.
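The dense connectivity pattern summarized above can be illustrated without any deep learning framework. In this hypothetical sketch, features are flat lists and each "layer" is a callable that maps the concatenation of all preceding features to its new output channels; the names and list-based representation are assumptions for illustration.

```python
def dense_block(x, layers):
    # x: the block's input feature list.
    # layers: callables mapping the concatenation of all features
    #         produced so far to a new list of output features.
    features = [x]
    for layer in layers:
        concatenated = [v for f in features for v in f]  # channel concat
        features.append(layer(concatenated))
    # The block's output concatenates the input and every layer's output,
    # which is how DenseNet encourages feature reuse.
    return [v for f in features for v in f]
```

Because every layer sees all earlier features directly, gradients also flow straight back to the input, which is the mechanism behind the alleviated vanishing-gradient problem the summary mentions.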