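Online Handwritten "Tibetan-Sanskrit" Sample Generation Based on Component Combination
Cite this article: WANG Weilan, LU Xiaobao, CAI Zhengqi, SHEN Wentao, FU Ji, CAIKE Zhaxi. Online Handwritten "Tibetan-Sanskrit" Sample Generation Based on Component Combination[J]. Journal of Chinese Information Processing, 2017, 31(5): 64-73
Authors: WANG Weilan  LU Xiaobao  CAI Zhengqi  SHEN Wentao  FU Ji  CAIKE Zhaxi
Affiliations: 1. School of Mathematics and Computer Science, Northwest University for Nationalities, Lanzhou, Gansu 730030;
2. Baiyin Central Sub-branch, People's Bank of China, Baiyin, Gansu 730900
Funding: National Natural Science Foundation of China (61375029); Leading Talent Program of the State Ethnic Affairs Commission; Fundamental Research Funds for the Central Universities, Northwest University for Nationalities (31920170142).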
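Abstract: "Tibetan-Sanskrit" comprises more than 500 modern Tibetan characters and more than 6,000 Sanskrit-transliteration Tibetan characters. In character recognition this counts as a large-category character set, so collecting online handwritten samples is a large and complex undertaking. To address this, the paper presents a method for generating handwritten Tibetan-Sanskrit samples based on component combination. The method consists of four main steps: (1) determining the Tibetan-Sanskrit character set and component set; (2) obtaining the position information of the components within each Tibetan-Sanskrit character; (3) collecting online handwritten samples of the Tibetan-Sanskrit components; (4) generating a sample database for the online handwritten Tibetan-Sanskrit character set. The paper provides training and test sample databases for research on online handwritten Tibetan-Sanskrit recognition. It improves the efficiency of collecting handwritten Sanskrit-transliteration Tibetan samples, addresses the problems of sample quantity and diversity, reduces collection cost, and lays a foundation for further research and system development in online handwritten Sanskrit-transliteration Tibetan recognition.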

Keywords: online handwriting  Tibetan-Sanskrit  character set  component combination  sample generation

Online Handwritten Sample Generated Based on Component Combination for Tibetan-Sanskrit
WANG Weilan,LU Xiaobao,CAI Zhengqi,SHEN Wentao,FU Ji,CAIKE Zhaxi. Online Handwritten Sample Generated Based on Component Combination for Tibetan-Sanskrit[J]. Journal of Chinese Information Processing, 2017, 31(5): 64-73
Authors:WANG Weilan  LU Xiaobao  CAI Zhengqi  SHEN Wentao  FU Ji  CAIKE Zhaxi
Affiliation:1.Department of Math and Computer Science, Northwest University for Nationalities, Lanzhou, Gansu 730030, China;
2.Baiyin Center Subbranch, People's Bank of China, Baiyin, Gansu 730900, China
Abstract: Tibetan-Sanskrit includes more than 500 Tibetan characters and more than 6,000 Sanskrit characters. Because this is a large-category character set, collecting online handwritten samples is a large and complex project. We present a method for generating online handwritten character samples for Tibetan-Sanskrit based on component combination. The proposed method has four main parts: (1) determining the Tibetan-Sanskrit character set and component set; (2) obtaining the location information of the components within Tibetan-Sanskrit characters; (3) collecting online handwritten samples of the Tibetan-Sanskrit component set; and (4) generating a sample database for the online handwritten Tibetan-Sanskrit character set. The result provides character training and test sample sets for online handwritten Tibetan-Sanskrit recognition.
Keywords:online handwritten    Tibetan-Sanskrit    character set    component combination    sample generation  
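
The abstract does not give the data format or the combination procedure, so the following is only a minimal sketch of how step (4) might look under simple assumptions. It assumes each handwritten component sample is a list of strokes (each stroke a list of (x, y) points), and that the position information from step (2) is a box (x, y, width, height) inside a unit character frame. The names `place_component` and `compose_character`, the box format, and the example component names are all hypothetical and not taken from the paper.

```python
from typing import Dict, List, Tuple

Point = Tuple[float, float]
Stroke = List[Point]
# Hypothetical position format: (x, y, width, height) in a unit character frame.
Box = Tuple[float, float, float, float]


def place_component(strokes: List[Stroke], box: Box) -> List[Stroke]:
    """Scale and translate a component so its bounding box fills `box`."""
    xs = [x for stroke in strokes for x, _ in stroke]
    ys = [y for stroke in strokes for _, y in stroke]
    min_x, min_y = min(xs), min(ys)
    # Fall back to 1.0 for degenerate components (e.g. a perfectly flat stroke)
    # so that the division below is always defined.
    width = (max(xs) - min_x) or 1.0
    height = (max(ys) - min_y) or 1.0
    bx, by, bw, bh = box
    return [
        [(bx + (x - min_x) / width * bw, by + (y - min_y) / height * bh)
         for x, y in stroke]
        for stroke in strokes
    ]


def compose_character(
    layout: List[Tuple[str, Box]],
    component_samples: Dict[str, List[Stroke]],
) -> List[Stroke]:
    """Build one character sample by placing each component in its box, in order."""
    character: List[Stroke] = []
    for name, box in layout:
        character.extend(place_component(component_samples[name], box))
    return character


# Hypothetical two-component character: one component stacked above another.
component_samples = {
    "top": [[(0, 0), (10, 0)]],
    "bottom": [[(0, 0), (4, 8)]],
}
layout = [("top", (0.0, 0.0, 1.0, 0.4)), ("bottom", (0.0, 0.4, 1.0, 0.6))]
sample = compose_character(layout, component_samples)
```

Under these assumptions, pairing different writers' samples of each component with the same layout would yield many distinct character samples from a comparatively small set of collected components, which is the kind of saving the abstract describes.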