首页 | 本学科首页   官方微博 | 高级检索  
     

基于对等网络的全文信息检索
引用本文:程学旗,吕建明,周昭涛. 基于对等网络的全文信息检索[J]. 计算机研究与发展, 2004, 41(12): 2148-2155
作者姓名:程学旗  吕建明  周昭涛
作者单位:中国科学院计算技术研究所软件研究室,北京,100080;中国科学院计算技术研究所软件研究室,北京,100080;中国科学院计算技术研究所软件研究室,北京,100080
基金项目:国防预研基金项目(51415070304ZK1101)
摘    要:基于P2P方式的信息检索系统相对集中式信息检索系统具有成本低、可扩展性好、容错性强等优点,可充分挖掘网络边缘资源,并可提供个性化的信息服务.然而如何在纯P2P环境下实现全文检索并定位目标资源是困难的.当前,采用广播查询的非结构化P2P(如Gnutella)和采用分布式Hash表方式的结构化P2P(如CAN)都不能直接实现全文检索.针对这个问题,提出了基于质心法的结构化P2P全文检索方法,并建立模拟程序,对检索的性能与效果做了初步的验证.实验结果表明了该方法的有效性.

关 键 词:对等网络  全文信息检索  质心法  路由

P2P Full Text Information Retrieval Based on Centroid Method
CHENG Xue-Qi,U Jian-Ming,ZHOU Zhao-Tao. P2P Full Text Information Retrieval Based on Centroid Method[J]. Journal of Computer Research and Development, 2004, 41(12): 2148-2155
Authors:CHENG Xue-Qi  U Jian-Ming  ZHOU Zhao-Tao
Abstract:Instead of a centralized information system, a peer-to-peer(P2P) full text information retrieval system is more scalable, cost effective and fault tolerant.It can cover the information at the edge of network and is more suitable for personalized resources services.However, P2P full text search is a very challenging problem, and the traditional broadcast ways are quite ineffective.Unstructured P2P information sharing systems (such as Gnutella, KaZaA) and structured P2P system can not support direct full text information retrieval.In this paper, a P2P full text information retrieval system is presented based on centroid method.A simulation program is created and the performance of the system is tested.Experimental results show that this is a steady system with high recall, good load balance and low resource usage.
Keywords:peer to peer  full text information retrieval  centroid method  routing  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号