首页 | 本学科首页   官方微博 | 高级检索  
     

面向Web电子产品信息分布式检索系统的设计与实现
引用本文:张渊源,张琴燕,蒋关富.面向Web电子产品信息分布式检索系统的设计与实现[J].计算机应用,2013,33(4):1026-1030.
作者姓名:张渊源  张琴燕  蒋关富
作者单位:1. 浙江中医药大学 信息技术学院,杭州 310053 2. 浙江大学 计算中心,杭州 310058 3. 浙江大学 计算机科学与技术学院,杭州 315100
基金项目:浙江教育厅科研项目,浙江省自然科学基金资助项目,国家科技支撑计划项目
摘    要:为了从这些海量信息中获取“有用的、满足用户需求的信息”,提出一个基于Hadoop和Lucene技术的分布式检索系统架构处理Web电子产品信息检索。利用Hadoop的Map和Reduce实现分布式索引文件的存储,通过Lucene检索技术实现索引文件的访问,从而提高信息检索的效率。并且针对Lucene_Hadoop架构存在粗粒度检索问题,提出了一种细粒度检索方法,减少了系统建立索引的时间。实验表明基于Hadoop和Lucene的分布式检索系统在Web电子产品信息中具有较高的检索性能。

关 键 词:分布式检索系统    Web电子产品信息    Hadoop    Lucene    细粒度检索
收稿时间:2012-09-06
修稿时间:2012-10-28

Design and implementation of distributed retrieval system for electronic products information
ZHUANG Yuanyuan , ZHANG Qinyan , JIANG Guanfu.Design and implementation of distributed retrieval system for electronic products information[J].journal of Computer Applications,2013,33(4):1026-1030.
Authors:ZHUANG Yuanyuan  ZHANG Qinyan  JIANG Guanfu
Affiliation:1. College of Information Technology, Zhejiang Chinese Medical University, Hangzhou Zhejiang 310053, China
2. Computer Center, Zhejiang University, Hangzhou Zhejiang 310058, China
3. School of Computer Science and Technology, Zhejiang University, Hangzhou Zhejiang 315100, China
Abstract:In order to obtain the useful information that can satisfy the user requirements, this paper proposed a distributed information retrieval system based on Hadoop and Lucene handling the Web electronic products information retrieval. In order to improve the retrieval efficiency, using the Map and Reduce method of Hadoop technology implemented the storage of distributed index files and using Lucene technology implemented the file access of distributed index files. At the same time, it also proposed an improved method at fine grain retrieval level, which reduced the index building time. The experiment demonstrates that our distributed information retrieval system has a good retrieval performance for Web electronic products information.
Keywords:distributed information retrieval system                                                                                                                        Web electronic products information                                                                                                                        Hadoop                                                                                                                        Lucene                                                                                                                        fine grain retrieval
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号