
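Automatic Recognition of Chinese Organization Name Based on Cascaded Conditional Random Fields
Cite this article: ZHOU Jun-sheng, DAI Xin-yu, YIN Cun-yan, CHEN Jia-jun. Automatic Recognition of Chinese Organization Name Based on Cascaded Conditional Random Fields[J]. Acta Electronica Sinica, 2006, 34(5): 804-809.
Authors: 周俊生 (ZHOU Jun-sheng), 戴新宇 (DAI Xin-yu), 尹存燕 (YIN Cun-yan), 陈家骏 (CHEN Jia-jun)
Affiliations: 1. State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, Jiangsu 210093; 2. Department of Computer Science, Nanjing Normal University, Nanjing, Jiangsu 210097
Funding: National Key Science and Technology Project (国家科技攻关项目); research project of the Jiangsu Provincial Department of Construction
Abstract: Automatic recognition of Chinese organization names is a relatively difficult problem in natural language processing. This paper proposes a new algorithm for automatic recognition of Chinese organization names based on a cascaded conditional random fields model. The low-level conditional random field model recognizes simple named entities such as person names and location names. Its results are passed to the high-level model, where they provide decision support for the organization-name conditional random field model in recognizing complex organization names. Effective feature templates and an automatic feature selection algorithm are designed for the organization-name conditional random field model. In an open test on a large-scale real-world corpus, recall reaches 90.05% and precision reaches 88.12%, outperforming other Chinese organization name recognition algorithms.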

Keywords: named entity; Chinese organization name recognition; conditional random fields
Article ID: 0372-2112(2006)05-0804-06
Received: 2005-04-08
Revised: 2005-12-12

Automatic Recognition of Chinese Organization Name Based on Cascaded Conditional Random Fields
ZHOU Jun-sheng, DAI Xin-yu, YIN Cun-yan, CHEN Jia-jun. Automatic Recognition of Chinese Organization Name Based on Cascaded Conditional Random Fields[J]. Acta Electronica Sinica, 2006, 34(5): 804-809.
Authors:ZHOU Jun-sheng  DAI Xin-yu  YIN Cun-yan  CHEN Jia-jun
Affiliation: 1. State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, Jiangsu 210093, China; 2. Department of Computer Science, Nanjing Normal University, Nanjing, Jiangsu 210097, China
Abstract: Automatic recognition of Chinese organization names is a difficult problem in NLP. This paper presents a new algorithm for Chinese organization name recognition based on cascaded conditional random fields. In the proposed algorithm, person names and location names are first recognized by the low-level model. The results are then passed to the high-level model, where they support its decisions in recognizing complex organization names. We evaluate the algorithm experimentally on a large-scale corpus. In an open test, recall reaches 90.05% and precision reaches 88.12%. The evaluation results show that the algorithm based on cascaded conditional random fields significantly outperforms previous methods.
Keywords:named entity  Chinese organization name recognition  conditional random fields
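
Illustrative sketch (not from the paper): the abstract describes a cascade in which the low-level model's person and location tags are passed to the high-level organization-name model. The Python below shows only that data flow. Everything in it is an assumption made for illustration: the toy gazetteers stand in for a trained low-level CRF, the input is assumed to be already word-segmented, and the function names and feature keys are hypothetical rather than the authors' feature templates.

```python
from typing import Dict, List

# Hypothetical toy gazetteers; they stand in for a trained low-level CRF.
PERSON_NAMES = {"周恩来"}
LOCATION_NAMES = {"南京", "北京"}


def low_level_tags(tokens: List[str]) -> List[str]:
    """Stand-in for the low-level model: tag each token PER, LOC, or O."""
    tags = []
    for tok in tokens:
        if tok in PERSON_NAMES:
            tags.append("PER")
        elif tok in LOCATION_NAMES:
            tags.append("LOC")
        else:
            tags.append("O")
    return tags


def high_level_features(tokens: List[str], low_tags: List[str], i: int) -> Dict[str, str]:
    """Features for token i that a high-level organization-name CRF could consume.

    Each feature set combines the surface token with the low-level tag,
    for the current position and its immediate neighbours.
    """
    feats = {"tok": tokens[i], "low": low_tags[i]}
    if i > 0:
        feats["tok-1"] = tokens[i - 1]
        feats["low-1"] = low_tags[i - 1]
    else:
        feats["BOS"] = "1"  # beginning of sentence
    if i < len(tokens) - 1:
        feats["tok+1"] = tokens[i + 1]
        feats["low+1"] = low_tags[i + 1]
    else:
        feats["EOS"] = "1"  # end of sentence
    return feats


def cascade_features(tokens: List[str]) -> List[Dict[str, str]]:
    """Run the low-level stage, then build the high-level feature sequence."""
    low = low_level_tags(tokens)
    return [high_level_features(tokens, low, i) for i in range(len(tokens))]


# Example: a segmented organization name beginning with a location name.
tokens = ["南京", "大学", "计算机", "系"]
features = cascade_features(tokens)
```

In a real system, the per-token feature dictionaries would be fed to a sequence labeler that tags organization-name boundaries; the point of the sketch is only that the high-level stage sees the low-level tags as input features.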
This article is indexed in CNKI, VIP (维普), Wanfang Data, and other databases.