首页 | 本学科首页   官方微博 | 高级检索  
     

基于查询性能预测的鲁棒检索排序研究
引用本文:薛源海,俞晓明,刘 悦,关 峰,程学旗.基于查询性能预测的鲁棒检索排序研究[J].中文信息学报,2016,30(5):169-175.
作者姓名:薛源海  俞晓明  刘 悦  关 峰  程学旗
作者单位:1. 中国科学院网络数据科学与技术重点实验室,北京 100190;
2. 中国科学院 计算技术研究所,北京 100190;
3. 中国科学院大学,北京 100190
基金项目:国家自然科学基金(61232010,61173008);国家“863”高技术研究发展计划(2012AA011003,2013AA01A213);国家“973”重点基础研究发展规划(2012CB316303,2013CB329602);国家科技部“十一五”科技计划(2012BAH39B02,2012BAH46B04)
摘    要:信息检索技术致力于从海量的信息资源中为用户获取所需的信息。相较于传统的简单模型,近些年来的大量研究工作在提升了检索结果平均质量的同时,往往忽略了鲁棒性的问题,即造成了很多查询的性能下降,导致用户满意度的显著下降。本文提出了一种基于排序学习的查询性能预测方法,针对每一个查询,对多种模型得到的检索结果列表进行预测,将其中预测性能最优的检索结果列表展示给用户。在LETOR的三个标准数据集OHSUMED、MQ2008和MSLR-WEB10K上的一系列对比实验表明,在以经典的BM25模型作为基准的情况下,与当前最好的检索模型之一LambdaMART相比,该方法在提升了检索结果平均质量的同时,显著地减少了性能下降的查询的数量,具备较好的鲁棒性。


关 键 词:查询性能预测  排序学习  鲁棒检索排序
  

Robust Ranking via Query Performance Prediction
XUE Yuanhai,YU Xiaoming,LIU Yue,GUAN Feng,CHENG Xueqi.Robust Ranking via Query Performance Prediction[J].Journal of Chinese Information Processing,2016,30(5):169-175.
Authors:XUE Yuanhai  YU Xiaoming  LIU Yue  GUAN Feng  CHENG Xueqi
Affiliation:1. Key Laboratory of Network Data Science and Technology, Chinese Academy of Sciences, Beijing 100190,China;
2. Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190,China;
3. University of Chinese Academy of Sciences, Beijing 100190,China
Abstract:The main purpose of information retrieval technology is satisfying users information needs by using massive amounts of information recource. Recent years, many techniques increase average effectiveness relative to traditional simple model while they often ignore the robustness issue. Users satisfaction will be significantly hurt because of degraded results of many queries. A query performance prediction method based on learning to rank is proposed to obtain robust ranking results. For each query, the performance of multiple ranking results generated by different models are predicted and the best one is shown to the user. A series of experiments are conducted on three standard LETOR benchmark datasets which are OHSUMED, MQ2008 and MSLR-WEB10K. The results show that, compared to one of the state-of-the-art models named LambdaMART, the ranking results obtained this way significantly reduced the number of queries whose performance are hurt with respect to BM25 model while improving the nearly same degree of everage effectiveness.
Keywords:query performance prediction  learning to rank  robust ranking  
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号