首页 | 本学科首页   官方微博 | 高级检索  
     

汉字识别研究的回顾
引用本文:丁晓青.汉字识别研究的回顾[J].电子学报,2002,30(9):1364-1368.
作者姓名:丁晓青
作者单位:清华大学电子工程系智能技术与系统国家重点实验室,北京,100084
基金项目:国家863高技术计划(No.2001AAll4081),国家自然科学基金(No.69972024)
摘    要:本文回顾了汉字识别研究的历史。根据模仿人类视觉模型,基于文字图像的统计模式识别方法是文字识别取得瞩目进展的基础。模式识别信息熵理论揭示了模式分类的信息过程和理论极限,本文讨论了从汉字图像中提取特征以及文字识别分类器设计和学习的各种方法。介绍了文本识别必须解决的文字切分,版面分析、理解和重构,及提高识别性能等重点问题,最后,总结了文字识别研究的重要进展和对今后的展望。

关 键 词:汉字识别  文本识别  视觉感知  特征提取  分类器设计  版面分析
文章编号:0372-2112(2002)09-1364-05

Chinese Character Recognition:A Review
DING,Xiao-qing.Chinese Character Recognition:A Review[J].Acta Electronica Sinica,2002,30(9):1364-1368.
Authors:DING  Xiao-qing
Abstract:A review for the research on Chinese character recognition is reported in this paper. Based on the theory of Visual thinking of human, more excellent progresses for Chinese character recognition have been achieved by the statistical pattern recognition method on the character image. The information theory on pattern recognition has discovered the nature and limitation of statistical pattern classification capability. The feature extraction and selection from character image, also the classifier design and learning methods are introduced. Besides, the important problems in document recognition such as layout analysis, understanding, and reconstruction, character segmentation, context postprocessing by language model etc. have been discussed. At last some conclusion and prospect for the progress on CCR are reported.
Keywords:Chinese character recognition  document recognition  visual perception  feature extraction  classifier design  layout analysis and understanding
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号