首页 | 本学科首页   官方微博 | 高级检索  
     

基于数据挖掘的面向话题搜索引擎研究
引用本文:陈勇,张佳骥,吴立德,刘海娟.基于数据挖掘的面向话题搜索引擎研究[J].无线电通信技术,2011,37(5):38-40.
作者姓名:陈勇  张佳骥  吴立德  刘海娟
作者单位:1. 中国电子科技集团公司第五十四研究所,河北石家庄,050081
2. 复旦大学,上海,200433
摘    要:为了解决面向话题的搜索问题,提出一种新的面向话题的检索技术。首先分析了面向话题的搜索技术所面临的问题,然后基于数据挖掘技术提出了解决方案。利用数据挖掘技术抽取文本的多层次语义特征,形成对文本的多精度表示,抽取的特征不仅包括单个词特征也包括多词特征。建立了一个示例检索系统,实验表明利用多层次文本特征能够很好地实现面向话题的文本检索。

关 键 词:信息检索  数据挖掘  文本分析

Research on Topic Oriented Search Engine Based on Data Mining Technology
CHEN Yong,ZHANG Jia-ji,WU Li-de,LIU Hai-juan.Research on Topic Oriented Search Engine Based on Data Mining Technology[J].Radio Communications Technology,2011,37(5):38-40.
Authors:CHEN Yong  ZHANG Jia-ji  WU Li-de  LIU Hai-juan
Affiliation:CHEN Yong1,ZHANG Jia-ji1,WU Li-de2,LIU Hai-juan1(1.The 54th Research Institute of CETC,Shijiazhuang Hebei 050081,China,2.FuDan University,Shanghai 200433,China)
Abstract:A novel topic-oriented text retrieval approach is proposed in this paper. In this approach,data mining techniques are used to extract multi-level semantic features from texts, generating multi-precision representation on text. Features extracted from text include both single word features and multi-word features. With this approach, more significant feature in text can be discovered and used. Extracted features are closed to the essence of texts. Experiments show that multi-level features can be used to create a topic-oriented text retrieval system.
Keywords:information retrieval  data mining  text analysis
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号