首页 | 本学科首页   官方微博 | 高级检索  
     

基于汉英双语语料库的翻译等价单位自动获取研究
引用本文:常宝宝.基于汉英双语语料库的翻译等价单位自动获取研究[J].术语标准化与信息技术,2002,19(2):24-29.
作者姓名:常宝宝
作者单位:北京大学
摘    要:双语语料库在机器翻译或机器辅助翻译研究中的重要作用已经越来越多地得到研究人员的认可。本文探讨了如何利用汉英双语语料进行汉英翻译等价单位的抽取,提出了基于词语关联度进行多词组合单位的识别方法,并利用假设-检验的方法,在汉英双语语料库中抽取翻译等价单位。本文还对不同的关联度量方法进行了对比,并提出利用范畴假设改进抽取算法的效率。

关 键 词:英语  汉语  双语语料库  翻译等价单位  自动抽取

Extraction of Translation Equivalent Pairs from Chinese - English Parallel Corpus
CHANG Baobao.Extraction of Translation Equivalent Pairs from Chinese - English Parallel Corpus[J].Terminology Standardization & Information Technology,2002,19(2):24-29.
Authors:CHANG Baobao
Abstract:More and more researchers have recognized the potential value of the parallel corpus in the research on Machine Translation and Machine Aided Transl ation. This paper examines how the translation equivalent pairs could be extract ed from parallel corpus. An iterative algorithm based on degree of word associat ion is proposed to identify the multiword units for Chinese and English. Then a hypothesis-testing approach is used to extract the Chinese-English Translation Equivalent Pairs. We also made comparison between different statistical associa tion measurement and proposed to use categorical hypothesis to improve the perfo rmance of extraction.
Keywords:bilingual corpus  translation equivalent pair  automatic extraction of TEPs  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号