基于最大熵短语重排序模型的特征抽取算法改进 An Improving Feature Extraction Algorithm for Maximum Entropy Based Phrase Reordering Model期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于最大熵短语重排序模型的特征抽取算法改进

引用本文：	孙萌,姚建民,吕雅娟,姜文斌,刘群.基于最大熵短语重排序模型的特征抽取算法改进[J].中文信息学报,2011,25(2):78-83.

作者姓名：	孙萌姚建民吕雅娟姜文斌刘群

作者单位：	1. 中国科学院计算技术研究所智能信息处理重点实验室, 北京 100190; 2. 苏州大学计算机科学与技术学院, 江苏苏州 215006

基金项目：	国家自然科学基金资助项目

摘要：	该文针对统计机器翻译中基于最大熵短语重排序模型特征抽取算法,提出一种改进算法。该算法能够抽取出更多准确的短语重排序信息,特别是逆序短语的特征信息,解决了原算法中最大熵训练时特征数据不平衡的问题,提高了翻译中短语重排序的准确率。以NIST MT 05 作为汉语到英语翻译的测试集,实验结果表明改进后的系统BLEU值比原系统提高0.65%。
关键词：	最大熵特征抽取统计机器翻译重排序模型
An Improving Feature Extraction Algorithm for Maximum Entropy Based Phrase Reordering Model

SUN Meng,YAO Jianmin,LV Yajuan,JIANG Wenbin,LIU Qun.An Improving Feature Extraction Algorithm for Maximum Entropy Based Phrase Reordering Model[J].Journal of Chinese Information Processing,2011,25(2):78-83.

Authors:	SUN Meng YAO Jianmin LV Yajuan JIANG Wenbin LIU Qun

Affiliation:	1. Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; 2. School of Computer Science & Technology, Soochow University, Suzhou, Jiangsu 215006, China

Abstract:	This paper presents an improved feature extraction algorithm for maximum entropy based phrase reordering model. The algorithm can extract more accurate feature information of phrase reordering, particularly the feature of inverted phrases. It solves the problem of uneven distribution of feature information and increases the rate of correct translation. We use BLEU as a metric on Chinese-to-English translation, and the proposed algorithm obtains a relative improvement of 0.65% over baseline system. Key wordsmaximum entropy; feature extraction; statistic machine translation; reordering model

Keywords:	maximum entropy feature extraction statistic machine translation reordering model
本文献已被万方数据等数据库收录！
	点击此处可从《中文信息学报》浏览原始摘要信息
	点击此处可从《中文信息学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏