首页 | 本学科首页   官方微博 | 高级检索  
     

智能文本搜索新技术
引用本文:王占一,徐蔚然,郭军.智能文本搜索新技术[J].智能系统学报,2012,7(1):40-49.
作者姓名:王占一  徐蔚然  郭军
作者单位:北京邮电大学模式识别与智能系统实验室,北京100876;北京邮电大学信息与通信工程学院,北京100876
基金项目:国家自然科学基金资助项目(60905017);高等学校学科创新引智计划项目(B08004)
摘    要:面对当今互联网上海量的信息,以及搜索信息准确、高效、个性化等需求,提出了一套包括信息检索、信息抽取和信息过滤在内的智能文本搜索新技术.首先举荐了与信息检索新技术相关的企业检索、实体检索、博客检索、相关反馈子任务.然后介绍了与信息抽取技术相关的实体关联和实体填充子任务,以及与信息过滤技术相关的垃圾邮件过滤子任务.这些关键技术融合在一起,在多个著名的国际评测中得到应用,如美国主办的文本检索会议评测和文本分析会议评测,并且在互联网舆情、短信舆情和校园网对象搜索引擎等实际系统中得到了检验.

关 键 词:智能文本搜索  文本检索  文本分析

New technologies of intelligent text search
WANG Zhanyi , XU Weiran , GUO Jun.New technologies of intelligent text search[J].CAAL Transactions on Intelligent Systems,2012,7(1):40-49.
Authors:WANG Zhanyi  XU Weiran  GUO Jun
Affiliation:1,2(1.Pattern Recognition and Intelligent System(PRIS) Laboratory,Beijing University of Posts and Telecommunications,Beijing 100876,China;2.School of Information and Communication Engineering,Beijing University of Posts and Telecommunications,Beijing 100876,China)
Abstract:To adapt to the massive amount of information on the internet and the need for accuracy,efficiency,and individualization,a set of technologies of intelligent text search including information retrieval,extraction,and filtering were proposed.First,new technologies of information retrieval were illustrated including the subtasks of enterprise retrieval,entity retrieval,blog retrieval,and relevance feedback.Second,the subtask of entity linking and slot filling related to information extraction was introduced.Finally,the subtask of spam e-mail filtering related to information filtering was described.These technologies were converged for application in many well-known international evaluations.These include the text retrieval conference(TREC) and text analysis conference(TAC) sponsored in the USA,and these technologies of intelligent text search were proven in practical applications such as public opinions on the Internet,short message opinions,and the campus object search engine(COSE).
Keywords:intelligent text search  text retrieval  text analysis
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号