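一种基于查询特性的查询结果缓存与预取方法 (A Query Result Caching and Prefetching Approach Based on Query Characteristics)
Cite this article: MA Hongyuan, WANG Bin. 一种基于查询特性的查询结果缓存与预取方法[J]. 中文信息学报 (Journal of Chinese Information Processing), 2011, 25(5): 37-44.
Authors: MA Hongyuan, WANG Bin
Affiliations: 1. Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190; 2. Graduate University of the Chinese Academy of Sciences, Beijing 100049
Funding: National Natural Science Foundation of China (60873166); National 973 Program (2007CB311103); National 863 Program (2006AA010105); Key Project of Science and Technology Research of the Ministry of Education (109028)
Abstract (translated from the Chinese): To address query result caching and prefetching in search engines, this paper proposes a caching and prefetching method based on query characteristics. The method comprises a query result page-number prediction model, which guides prefetching, and a caching and prefetching algorithm framework, and it aims to improve search engine system performance. An analysis of user query logs collected over a period of time from a well-known Chinese commercial search engine shows that the number of result pages users browse is markedly uneven across queries. The page-number prediction model is designed around this property and is used for prefetching and partitioned caching. Experiments on two months of large-scale real user query logs from the same search engine show that, compared with traditional methods, the method improves the cache hit rate by 3.5% to 8.45%.

Keywords (translated from the Chinese): search engine; performance optimization; query results; caching; prefetching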

A Query Result Caching and Prefetching Approach Based on Query Characteristics
MA Hongyuan, WANG Bin. A Query Result Caching and Prefetching Approach Based on Query Characteristics[J]. Journal of Chinese Information Processing, 2011, 25(5): 37-44.
Authors: MA Hongyuan, WANG Bin
Affiliation: 1. Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; 2. Graduate University of the Chinese Academy of Sciences, Beijing 100049, China
Abstract: Caching and prefetching query results is an effective way to improve the performance of Web search engines. We analyze query logs from a well-known Chinese Web search engine and describe the characteristics of its queries. Based on these characteristics, we propose a query result caching and prefetching approach that comprises predictive models of the query result page number and a caching and prefetching algorithm framework. We evaluate the approach on two months of large-scale real query logs, comparing it with traditional methods and theoretical upper bounds. Experimental results show that the approach achieves a 3.5% to 8.45% increase in cache hit ratio over all requests compared with state-of-the-art methods.
Keywords: search engine; performance optimization; query results; caching; prefetching
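
The abstract names a page-number prediction model that guides prefetching, but it gives no details of the model or of the cache layout. As a rough illustration of the general idea only, the Python sketch below pairs a plain LRU result cache with a very simple depth predictor. Everything in it is an assumption made for illustration and not the authors' method: the class names `PageDepthPredictor` and `PrefetchingResultCache`, the `fake_backend` function, the "deepest page seen so far" heuristic, and the LRU eviction policy. The partitioned caching mentioned in the abstract is not modeled.

```python
from collections import OrderedDict


class PageDepthPredictor:
    """Hypothetical predictor: remembers the deepest result page seen per query."""

    def __init__(self, default_depth=1):
        self.default_depth = default_depth
        self.max_page = {}

    def observe(self, query, page):
        # Record that some user requested this result page for this query.
        self.max_page[query] = max(self.max_page.get(query, 0), page)

    def predict(self, query):
        # Predicted number of pages a user will browse for this query.
        return max(self.max_page.get(query, 0), self.default_depth)


class PrefetchingResultCache:
    """LRU cache keyed by (query, page) that prefetches up to the predicted depth."""

    def __init__(self, capacity, backend, predictor):
        self.capacity = capacity
        self.backend = backend        # callable: (query, page) -> result page
        self.predictor = predictor
        self.entries = OrderedDict()  # least recently used entry comes first
        self.hits = 0
        self.misses = 0

    def _put(self, key, value):
        self.entries[key] = value
        self.entries.move_to_end(key)
        while len(self.entries) > self.capacity:
            self.entries.popitem(last=False)  # evict the least recently used

    def get(self, query, page):
        key = (query, page)
        # Read the prediction first, then record the current request.
        depth = self.predictor.predict(query)
        self.predictor.observe(query, page)
        if key in self.entries:
            self.hits += 1
            self.entries.move_to_end(key)
            return self.entries[key]
        self.misses += 1
        result = self.backend(query, page)
        # On a miss, also fetch the following pages up to the predicted depth.
        for nxt in range(page + 1, depth + 1):
            if (query, nxt) not in self.entries:
                self._put((query, nxt), self.backend(query, nxt))
        self._put(key, result)  # stored last, so it is the most recently used
        return result


def fake_backend(query, page):
    # Stand-in for the real search backend (assumption for this sketch).
    return f"{query}#p{page}"


predictor = PageDepthPredictor()
predictor.observe("news", 3)  # pretend past logs show users reach page 3
cache = PrefetchingResultCache(capacity=8, backend=fake_backend, predictor=predictor)

cache.get("news", 1)     # miss; pages 2 and 3 are prefetched
cache.get("news", 2)     # hit, served from the prefetched entry
cache.get("news", 3)     # hit, served from the prefetched entry
cache.get("weather", 1)  # miss; no history, so nothing is prefetched
cache.get("weather", 2)  # miss
print(cache.hits, cache.misses)
```

In this toy replay, the query with a known browsing depth is served from cache after its first page, while the query with no history misses on every page. This shows why a depth prediction that differs per query could matter when browsing depth is uneven across queries.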
This article is indexed by 万方数据 (Wanfang Data) and other databases.