首页 | 本学科首页   官方微博 | 高级检索  
     

基于加权贝叶斯的脱机手写阿文单词识别
引用本文:许亚美,何继爱.基于加权贝叶斯的脱机手写阿文单词识别[J].中文信息学报,2021,35(2):133-140.
作者姓名:许亚美  何继爱
作者单位:兰州理工大学 计算机与通信学院,甘肃 兰州 730050
基金项目:国家自然科学基金(61562058,61561031)
摘    要:针对手写阿拉伯单词书写连笔,且相似词较多的特点,该文提出一种新的脱机手写文字识别算法。该算法以固定组件为成分拆分阿拉伯单词,构建自组件特征至单词类别的加权贝叶斯推理模型。算法结合单词组件分割、多级混合式组件识别、组件加权系数估计等,计算单词类别的后验概率并得到单词识别结果。在IFN/ENIT库上的实验,获得了90.03%的单词识别率,证实组件分解对笔画连写具有鲁棒性,组件识别能提高相似词的辨别能力,而且该算法所需训练类别少,易向大词汇量识别扩展。

关 键 词:手写文字识别  阿拉伯文  单词识别  加权贝叶斯  
收稿时间:2020-01-12

Offline Handwritten Arabic Word Recognition Based on Weighted Bayesian
XU Yamei,HE Ji'ai.Offline Handwritten Arabic Word Recognition Based on Weighted Bayesian[J].Journal of Chinese Information Processing,2021,35(2):133-140.
Authors:XU Yamei  HE Ji'ai
Affiliation:School of Computer and Communication, Lanzhou University of Technology, Lanzhou, Gansu 730050, China
Abstract:A new offline handwritten Arabic word recognition algorithm is proposed to deal with its connected writing strokes and more similar words. The algorithm first establishes a structure model with fixed graphemes for each Arabic word category to be recognized, then segments the word samples into graphemes. Then a weighted Bayesian inference model is constructed from the grapheme features to word categories. The word recognition results are obtained by calculating the posterior probabilities of word categories. On the IFN/ENIT database, the proposed algorithm achieves as high as 90.03% accuracy.
Keywords:
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号