首页 | 本学科首页   官方微博 | 高级检索  
     

基于场景信息融合的中文姓名识别方法研究
引用本文:张腾飞,王晓磊,王保云.基于场景信息融合的中文姓名识别方法研究[J].计算机工程与应用,2009,45(34):147-151.
作者姓名:张腾飞  王晓磊  王保云
作者单位:南京邮电大学自动化学院,南京,210046
基金项目:国家自然科学基金,南京邮电大学引进人才科研基金 
摘    要:为克服传统的先分词再识别方法的缺点,提出了一种基于场景信息融合的姓名识别方法。该方法结合中文姓名的特点,综合考虑上下文信息、词本身信息、词典信息和姓名自身信息等场景资源对中文名实体的影响,将它们作为姓名识别的依据,同时引入了证据理论,通过场景资源信息的融合,最终识别出人名。通过对互联网上随机抽取的大规模真实语料的开放测试表明,该方法可以取得较高的召回率并同时保证较高的准确率。

关 键 词:姓名识别  场景信息融合  自动分词  证据理论
收稿时间:2009-9-17
修稿时间:2009-11-16  

Research of Chinese name identification method based on scene information fusion
ZHANG Teng-fei,WANG Xiao-lei,WANG Bao-yun.Research of Chinese name identification method based on scene information fusion[J].Computer Engineering and Applications,2009,45(34):147-151.
Authors:ZHANG Teng-fei  WANG Xiao-lei  WANG Bao-yun
Affiliation:College of Automation,Nanjing University of Posts and Telecommunications,Nanjing 210046,China
Abstract:To overcome the defects of traditional name identification algorithms with automatic segmentation at first,a name identification method based on scene information fusion is presented.Combining the characteristics of Chinese names,the scene information,such as the context,word,dictionary,names,is used as the basis of name identification.And then,the evidence theory is introduced,and the names are identified by scene information fusion.The open tests on real data sets randomly selected from the internet show that it is an effective method to improve the result of the identification with high recall rate and accuracy rate are guaranteed.
Keywords:name identification  scene information fusion  automatic segmentation  evidence theory
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号