首页 | 本学科首页   官方微博 | 高级检索  
     

中文组织机构名称与简称的识别
引用本文:沈嘉懿,李芳,徐飞玉.中文组织机构名称与简称的识别[J].中文信息学报,2007,21(6):17-21.
作者姓名:沈嘉懿  李芳  徐飞玉
作者单位:1. 上海交通大学 计算机系 上海200240; 2. 德国人工智能研究中心 语言技术实验室
基金项目:中德语言技术联合实验室进行项目
摘    要:本文提出了一种基于规则识别中文组织机构全称和简称的方法。全称的识别首先借助机构后缀词库获得其右边界,然后通过规则匹配并借助贝叶斯概率模型加以决策获得其左边界。简称的识别是在全称的基础上应用其对应的简称规则实现的。在开放性测试中,该方法的总体查全率为85.19%,查准率为83.03%,F Measure为84.10%;简称的查全率为67.18%,查准率为74.14%。目前该方法已应用于中文关系的抽取系统。

关 键 词:计算机应用  中文信息处理  组织机构名称识别  组织机构简称识别  规则匹配  贝叶斯概率模型  
文章编号:1003-0077(2007)06-0017-05
收稿时间:2006-09-14
修稿时间:2007-05-22

Recognition of Chinese Organization Names and Abbreviations
Hans Uszkoreit,SHEN Jia-yi,LI Fang,XU Fei-yu,Hans Uszkoreit.Recognition of Chinese Organization Names and Abbreviations[J].Journal of Chinese Information Processing,2007,21(6):17-21.
Authors:Hans Uszkoreit  SHEN Jia-yi  LI Fang  XU Fei-yu  Hans Uszkoreit
Affiliation:1. Department of Computer Science and Technologyp; Shanghai JiaoTong University, Shanghai 200240, China;
2. German Research Center for Artificial Intelligence
Abstract:This paper proposes a method for recognizing Chinese organization names and their abbreviations based on rules.The right boundary of an organization name is identified with the help of the organization suffix lexicon.The left boundary is recognized by the optimum rules based on Bayesian probability model.After idendifying an organization name,we can get candidate abbreviations based on abbreviation rules accordingly.In open test,the recall is 85.19%,the precision is 83.03%,the F Measure is 84.10% for name recognition,and the recall is 67.18%,the precision is 74.14% for abbreviation recognition.This method has been applied in the Chinese relation identification system.
Keywords:computer application  Chinese information processing  recognition of Chinese organization names  recognition of Chinese organization abbreviations  rule matching  bayesian probability model
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号