Improving neural sentence alignment with word translation期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Improving neural sentence alignment with word translation

Authors:	Ying DING Junhui LI Zhengxian GONG Guodong ZHOU

Affiliation:	School of Computer Science and Technology, Soochow University, Suzhou 215006, China

Abstract:	Sentence alignment is a basic task in natural language processing which aims to extract high-quality parallel sentences automatically. Motivated by the observation that aligned sentence pairs contain a larger number of aligned words than unaligned ones, we treat word translation as one of the most useful external knowledge. In this paper, we show how to explicitly integrate word translation into neural sentence alignment. Specifically, this paper proposes three cross-lingual encoders to incorporate word translation: 1) Mixed Encoder that learns words and their translation annotation vectors over sequences where words and their translations are mixed alternatively; 2) Factored Encoder that views word translations as features and encodes words and their translations by concatenating their embeddings; and 3) Gated Encoder that uses gate mechanism to selectively control the amount of word translations moving forward. Experimentation on NIST MT and Opensubtitles Chinese-English datasets on both non-monotonicity and monotonicity scenarios demonstrates that all the proposed encoders significantly improve sentence alignment performance.

Keywords:	sentence alignment word translation mixed encoder factored encoder gated encoder
本文献已被维普等数据库收录！
	点击此处可从《Frontiers of Computer Science》浏览原始摘要信息
	点击此处可从《Frontiers of Computer Science》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏