首页 | 本学科首页   官方微博 | 高级检索  
     

统计机器翻译中翻译规则抽取
引用本文:刘颖,姜巍. 统计机器翻译中翻译规则抽取[J]. 计算机工程与应用, 2012, 48(32): 98-101,146
作者姓名:刘颖  姜巍
作者单位:清华大学中文系,北京,100084
基金项目:国家自然科学基金(No.61171114).
摘    要:对齐短语是决定统计机器翻译系统质量的核心模块。提出基于短语结构树的层次短语模型,这是利用串-树模型的思想对层次短语模型的扩展。基于短语结构树的层次短语模型是在双语对齐短语的基础之上结合英语短语结构树抽取翻译规则,并利用启发式策略获得翻译规则的扩展句法标记。采用翻译规则的统计机器翻译系统在不同数据集上具有稳定的翻译结果,在训练集和测试集的平均BlEU评分高于短语模型和层次短语模型的BLEU评分。

关 键 词:统计机器翻译  翻译规则  抽取  过滤  BLEU评分

Translation rules extraction for statistical machine translation
LIU Ying , JIANG Wei. Translation rules extraction for statistical machine translation[J]. Computer Engineering and Applications, 2012, 48(32): 98-101,146
Authors:LIU Ying    JIANG Wei
Affiliation:Department of Chinese Language and Literature, Tsinghua University, Beijing 100084, China
Abstract:Aligned phrases are important for Statistical Machine Translation (SMT). Hierarchical phrase model based on phrase tree is provided, which integrates the advantages of string-tree model and hierarchical phrase model. Translation rules are extracted according to aligned phrases and English phrase trees, and heuristic strategies are proposed for determining corresponding syntax labels of new translation rules. Translation quality of SMT using translation rules is better than those of phrase model and hierarchical phrase model. The BLEU score of the model is higher than phrase model and hierarchical phrase model.
Keywords:statistical machine translation  translation rules  extraction  filtration  BLEU
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号