首页 | 本学科首页   官方微博 | 高级检索  
     


Effective and efficient indexing in cross-modal hashing-based datasets
Affiliation:1. Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology, No. 43, Section 4, Keelung Road, Taipei 10672, Taiwan, ROC;2. Department of Computer Science and Information Engineering, National Chiayi University, No. 300, Xuefu Rd., Chiayi 60004, Taiwan, ROC;1. Machine Intelligence Unit, Indian Statistical Institute, Kolkata, India;2. Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India;1. E-ID Department, TUBITAK BILGEM, Kocaeli, 41470, Turkey;2. Faculty of Engineering and Natural Sciences, Sabanci University, Istanbul, 34956, Turkey
Abstract:To overcome the barrier of storage and computation, the hashing technique has been widely used for nearest neighbor search in multimedia retrieval applications recently. Particularly, cross-modal retrieval that searches across different modalities becomes an active but challenging problem. Although numerous of cross-modal hashing algorithms are proposed to yield compact binary codes, exhaustive search is impractical for large-scale datasets, and Hamming distance computation suffers inaccurate results. In this paper, we propose a novel search method that utilizes a probability-based index scheme over binary hash codes in cross-modal retrieval. The proposed indexing scheme employs a few binary bits from the hash code as the index code. We construct an inverted index table based on the index codes, and train a neural network for ranking and indexing to improve the retrieval accuracy. Experiments are performed on two benchmark datasets for retrieval across image and text modalities, where hash codes are generated and compared with several state-of-the-art cross-modal hashing methods. Results show the proposed method effectively boosts the performance on search accuracy, computation cost, and memory consumption in these datasets and hashing methods. The source code is available on https://github.com/msarawut/HCI.
Keywords:Binary embedding  Cross-modal retrieval  Inverted indexing  Learning to rank  Nearest neighbor search
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号