首页 | 本学科首页   官方微博 | 高级检索  
     

在线哈希算法研究综述
引用本文:郭一村,陈华辉. 在线哈希算法研究综述[J]. 计算机应用, 2021, 41(4): 1106-1112. DOI: 10.11772/j.issn.1001-9081.2020071047
作者姓名:郭一村  陈华辉
作者单位:宁波大学 信息科学与工程学院, 浙江 宁波 315000
基金项目:国家自然科学基金资助项目
摘    要:在当前大规模数据检索任务中,学习型哈希方法能够学习紧凑的二进制编码,在节省存储空间的同时能快速地计算海明空间内的相似度,因此近似最近邻检索常使用哈希的方式来完善快速最近邻检索机制.对于目前大多数哈希方法都采用离线学习模型进行批处理训练,在大规模流数据的环境下无法适应可能出现的数据变化而使得检索效率降低的问题,提出在线哈...

关 键 词:在线学习  学习型哈希  无监督学习  监督学习  最近邻检索
收稿时间:2020-07-21
修稿时间:2020-10-19

Survey on online hashing algorithm
GUO Yicun,CHEN Huahui. Survey on online hashing algorithm[J]. Journal of Computer Applications, 2021, 41(4): 1106-1112. DOI: 10.11772/j.issn.1001-9081.2020071047
Authors:GUO Yicun  CHEN Huahui
Affiliation:Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo Zhejiang 315000, China
Abstract:In the current large-scale data retrieval tasks, learning to hash methods can learn compact binary codes, which saves storage space and can quickly calculate the similarity in Hamming space. Therefore, for approximate nearest neighbor search, hashing methods are often used to improve the mechanism of fast nearest neighbor search. In most current hashing methods, the offline learning models are used for batch training, which cannot adapt to possible data changes appeared in the environment of large-scale streaming data, resulting in reduction of retrieval efficiency. Therefore, the adaptive hash functions were proposed and learnt in online hashing methods, which realize the continuous learning in the process of inputting data and make the methods can be applied to similarity retrieval in real-time. Firstly, the basic principles of learning to hash and the inherent requirements to realize online hashing were explained. Secondly, the different learning methods of online hashing were introduced from the perspectives such as the reading method, learning mode, and model update method of streaming data under online conditions. Thirdly, the online learning algorithms were further divided into six categories, that is, categories based on passive-aggressive algorithms, matrix factorization technology, unsupervised clustering, similarity supervision, mutual information measurement, codebook supervision respectively. And the advantages, disadvantages and characteristics of these algorithms were analyzed. Finally, the development directions of online hashing were summarized and discussed.
Keywords:online learning  learning to hash  unsupervised learning  supervised learning  nearest neighbor search  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号