首页 | 本学科首页   官方微博 | 高级检索  
     


Probabilistic data fusion on a large document collection
Authors:David Lillis  Fergus Toolan  Rem Collier  John Dunnion
Affiliation:(1) School of Computer Science and Informatics, University College Dublin, Dublin 4, Ireland;(2) Faculty of Computing Science, Griffith College Dublin, Dublin 8, Ireland
Abstract:Data fusion is the process of combining the output of a number of Information Retrieval (IR) algorithms into a single result set, to achieve greater retrieval performance. ProbFuse is a data fusion algorithm that uses the history of the underlying IR algorithms to estimate the probability that subsequent result sets include relevant documents in particular positions. It has been shown to out-perform CombMNZ, the standard data fusion algorithm against which to compare performance, in a number of previous experiments. This paper builds upon this previous work and applies probFuse to the much larger Web Track document collection from the 2004 Text REtreival Conference. The performance of probFuse is compared against that of CombMNZ using a number of evaluation measures and is shown to achieve substantial performance improvements.
Keywords:Data fusion  Information retrieval            ProbFuse
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号