首页 | 本学科首页   官方微博 | 高级检索  
     

基于规则和统计相结合的西里尔蒙古文到传统蒙古文转换方法
引用本文:飞 龙,高光来,王洪伟,路 敏.基于规则和统计相结合的西里尔蒙古文到传统蒙古文转换方法[J].中文信息学报,2017,31(3):156-162.
作者姓名:飞 龙  高光来  王洪伟  路 敏
作者单位:内蒙古大学 计算机学院,内蒙古 呼和浩特 010021
基金项目:国家自然科学基金(61563040);内蒙古自然科学基金(2016D06);内蒙古大学高层次人才引进科研项目资助
摘    要:西里尔蒙古文与传统蒙古文分别是蒙古国与中国使用的蒙古文,西里尔蒙古文到传统蒙古文的转换工作不仅给两国同胞的交流带来更多的便利,而且对蒙古族的科学、文化和教育发展具有重要意义。本文结合规则与统计模型的优点,研究了西里尔蒙古文到传统蒙古文的转换方法。本文首先采用基于规则的方法对西里尔蒙古文集内词进行转换,其次对集外词的转换采用了基于联合序列模型的方法,并采用N-gram语言模型解决了一个西里尔蒙古文单词对应多个传统蒙古文单词的问题。实验结果表明,该系统单词转换错误率低至4.12%,基本达到了实用要求。

关 键 词:西里尔蒙古文  传统蒙古文  转换  规则  联合序列模型  

Combining of Rules and Statistics for Cyrillic Mongolian to Traditional Mongolian Conversion
BAO Feilong,GAO Guanglai,WANG Hongwei,LU min.Combining of Rules and Statistics for Cyrillic Mongolian to Traditional Mongolian Conversion[J].Journal of Chinese Information Processing,2017,31(3):156-162.
Authors:BAO Feilong  GAO Guanglai  WANG Hongwei  LU min
Affiliation:College of Computer Science, Inner Mongolia University, Hohhot,Inner Mongolia 010021, China
Abstract:Cyrillic Mongolian and Traditional Mongolian are used in Mongolia and China, respectively. Cyrillic Mongolian to Traditional Mongolian conversion not only will bring more convenience to exchanges between the two countries, but also has great significance for scientific, cultural and educational development of Mongolian. This paper proposes a highly efficient Cyrillic Mongolian to Traditional Mongolian conversion method. It adopts the rule-based approach to convert the words in the vocabulary, and the statistical model to convert the out-of-vocabulary words. A large part of Cyrillic Mongolian words correspond more than one candidates in Traditional Mongolian, which is solved by the N-gram language model. Experimental results show that the word error rate is as low as 4.12%, meeting the practical requirement.
Keywords:Cyrillic Mongolian  Traditional Mongolian  conversion  rules  joint sequence model  
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号