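A Method for Extracting Translation-Equivalent Pairs of Named Entities (一种命名实体翻译等价对的抽取方法)
Cite as: Chen Huaixing (陈怀兴), Yin Cunyan (尹存燕), Chen Jiajun (陈家骏). A Method for Extracting Translation-Equivalent Pairs of Named Entities [J]. Journal of Chinese Information Processing (中文信息学报), 2008, 22(4): 55-60
Authors: Chen Huaixing (陈怀兴), Yin Cunyan (尹存燕), Chen Jiajun (陈家骏)
Affiliation: State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, Jiangsu 210093
Funding: National High Technology Research and Development Program of China (863 Program); National Natural Science Foundation of China; Natural Science Foundation of Jiangsu Province
Abstract: Translation-equivalent pairs of named entities are of great importance in multilingual processing. Over the past few years, methods such as bilingual dictionary lookup and transliteration models have been proposed in turn. Another highly valuable approach is to extract named entity translation-equivalent pairs automatically from a parallel corpus; existing methods require the texts of both languages in the bilingual corpus to be annotated with named entities in advance. This paper proposes a method that requires named entity annotation only for the source language of the corpus, with no annotation of the target language, and then uses the word alignment produced by a trained HMM to extract the translation-equivalent pairs. In the experiments, Chinese is the source language and English is the target language. The results show that the method obtains named entity translation-equivalent pairs with relatively high accuracy even when the alignment model is only partially accurate.

Keywords: artificial intelligence; machine translation; named entity; translation-equivalent pair; HMM; alignment model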

An Approach to Extract Named Entity Translingual Equivalence
CHEN Huai-xing, YIN Cun-yan, CHEN Jia-jun. An Approach to Extract Named Entity Translingual Equivalence [J]. Journal of Chinese Information Processing, 2008, 22(4): 55-60
Authors:CHEN Huai-xing  YIN Cun-yan  CHEN Jia-jun
Affiliation:State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, Jiangsu 210093, China
Abstract:Identifying translingual equivalents of named entities is essential to multilingual natural language processing. Several approaches to named entity translation, such as bilingual dictionary lookup, word/sub-word translation, and transliteration, have been explored in past years. Another promising approach is to extract translingual equivalents of named entities automatically from a parallel corpus, which usually requires the named entities to be annotated, manually or automatically, in both languages. In this paper, we propose a new approach that extracts named entity equivalents from a parallel corpus using only source-language annotation and the result of HMM word alignment. The experiment is carried out on a Chinese-English parallel corpus, with Chinese as the source language and English as the target language. The results show that the new approach yields named entity pairs with relatively high precision, even when the word alignment is only partially correct.
Keywords:artificial intelligence  machine translation  named entity  translingual equivalence  HMM  alignment model
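
The abstract does not spell out the extraction procedure, so the following is only a minimal sketch of one common way to use word alignment for this task: each annotated source-language entity span is projected onto the smallest contiguous target span that covers all of its aligned target words. The function names (`project_span`, `extract_pairs`), the half-open span convention, the min-max projection heuristic, and the toy sentence pair are all assumptions made for illustration, not the authors' method or data. In practice the alignment links would come from a trained HMM aligner rather than being written by hand.

```python
from typing import List, Optional, Set, Tuple

Span = Tuple[int, int]   # half-open [start, end) token indices
Link = Tuple[int, int]   # (source token index, target token index)


def project_span(span: Span, links: Set[Link]) -> Optional[Span]:
    """Project a source span onto the target side through alignment links.

    Returns the smallest contiguous target span covering every target token
    aligned to a source token inside `span`, or None if nothing is aligned.
    """
    start, end = span
    targets = sorted(t for s, t in links if start <= s < end)
    if not targets:
        return None
    return targets[0], targets[-1] + 1


def extract_pairs(
    src_tokens: List[str],
    tgt_tokens: List[str],
    entity_spans: List[Span],
    links: Set[Link],
) -> List[Tuple[str, str]]:
    """Pair each annotated source entity with its projected target phrase."""
    pairs = []
    for span in entity_spans:
        tgt_span = project_span(span, links)
        if tgt_span is None:
            continue  # unaligned entity: no candidate pair is produced
        src_text = "".join(src_tokens[span[0]:span[1]])        # Chinese: no spaces
        tgt_text = " ".join(tgt_tokens[tgt_span[0]:tgt_span[1]])
        pairs.append((src_text, tgt_text))
    return pairs


if __name__ == "__main__":
    # Toy example (made up for illustration; not from the paper's corpus).
    src = ["江泽民", "主席", "访问", "美国"]
    tgt = ["President", "Jiang", "Zemin", "visited", "the", "United", "States"]
    entities = [(0, 1), (3, 4)]  # source-side named entity annotation only
    alignment = {(0, 1), (0, 2), (1, 0), (2, 3), (3, 5), (3, 6)}
    for pair in extract_pairs(src, tgt, entities, alignment):
        print(pair)
```

Only the source side carries entity annotation here; the target phrase is recovered entirely from the alignment links. The sketch makes no attempt to repair wrong or missing links, so it does not by itself reproduce the robustness to partially correct alignment that the abstract reports.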
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.