首页 | 本学科首页   官方微博 | 高级检索  
     

基于多重线性回归模型的翻译等价对获取
引用本文:张春祥,赵铁军,李生.基于多重线性回归模型的翻译等价对获取[J].计算机工程与应用,2006,42(4):1-3,26.
作者姓名:张春祥  赵铁军  李生
作者单位:哈尔滨工业大学计算机科学与技术学院,哈尔滨,150001
基金项目:国家科技攻关项目;中国科学院资助项目
摘    要:翻译等价对在词典编纂、机器翻译和跨语言信息检索中有着广泛的应用。文章从双语句对的译文等价树中抽取翻译等价对。使用译文直译率、短语对齐概率和目标语-源语言短语长度差异等特征对自动获取的等价对进行评价。提出了一种基于多重线性回归模型的等价对评价方法,并结合N-Best策略对候选翻译等价对进行过滤。实验结果表明:在开放测试中,基于多重线性回归模型的等价对评价及过滤方法其性能要优于其它方法。

关 键 词:翻译等价对  多重线性回归模型  N-Best策略
文章编号:1002-8331-(2006)04-0001-03
收稿时间:2005-06
修稿时间:2005-06

Acquisition of Translation Equivalences Based on Multiple Linear Regression
Zhang Chunxiang,Zhao Tiejun,Li Sheng.Acquisition of Translation Equivalences Based on Multiple Linear Regression[J].Computer Engineering and Applications,2006,42(4):1-3,26.
Authors:Zhang Chunxiang  Zhao Tiejun  Li Sheng
Affiliation:School of Computer Science and Technology,Harbin Institute of Technology,Harbin 150001
Abstract:Translation equivalence is very useful for bilingual lexicography,machine translation system and cross-lingual information retrieval.In this paper,translation equivalences are extracted from translation corresponding trees of bilingual sentence pairs.Translation literality,phrase alignment probability,and length difference from target language phrase to source language phrase are employed to score for ex.tracted equivalences.An evaluation method based on multiple linear regression is proposed.This new approach is employed to filter equivalences combined with N-Best strategy.Experimental results show that the new method does better than other approaches on evaluation and filtering.
Keywords:translation equivalence  multiple linear regression  N-Best strategy
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号