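Mathematical Expression Recognition System: MatheReader
Cite this article: JIN Jian-Ming, JIANG Hong-Ying, WANG Qing-Ren. Mathematical Expression Recognition System: MatheReader[J]. Chinese Journal of Computers, 2006, 29(11): 2018-2026.
Authors: JIN Jian-Ming  JIANG Hong-Ying  WANG Qing-Ren
Affiliation: Institute of Machine Intelligence, Nankai University, Tianjin 300071
Abstract: Mathematical expressions appear widely in all kinds of documents, but recognizing them is far harder than recognizing paragraphs of text. This article presents MatheReader, a system for recognizing mathematical expressions in images, and focuses on its technical approach to expression extraction and expression analysis. For expression extraction, layout features are extracted and a Parzen classifier separates isolated expressions from ordinary text lines; embedded expressions are then located by detecting two-dimensional structures within the ordinary text lines. For expression analysis, eleven basic expression types are defined, production rules restrict each type to a unique decomposition, and an analysis method is proposed that first recognizes the expression type and then decomposes the expression into sub-expressions. Compared with existing systems, MatheReader is more capable and handles a richer variety of expressions.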

Keywords: expression extraction  expression recognition  expression analysis  automatic performance evaluation  document image processing
Received: 2005-03-11
Revised: 2006-06-24
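
The abstract names a Parzen classifier over layout features but does not list the features or the window width. The following is a minimal sketch of a Parzen-window classifier under stated assumptions: the three features (relative line height, relative indentation, relative white space), the training values, the Gaussian window, and the width `h` are all hypothetical and are not taken from the paper.

```python
import math


def gaussian_kernel(x, center, h):
    """Isotropic Gaussian window of width h centred on one training sample."""
    d = len(x)
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, center))
    norm = (2.0 * math.pi * h * h) ** (d / 2.0)
    return math.exp(-sq_dist / (2.0 * h * h)) / norm


def parzen_density(x, samples, h):
    """Parzen-window estimate of p(x | class): mean kernel value over the class samples."""
    return sum(gaussian_kernel(x, s, h) for s in samples) / len(samples)


def classify_line(x, training, h=0.2):
    """Return the label whose prior-weighted Parzen density at x is largest."""
    total = sum(len(samples) for samples in training.values())
    scores = {
        label: (len(samples) / total) * parzen_density(x, samples, h)
        for label, samples in training.items()
    }
    return max(scores, key=scores.get)


# Hypothetical feature vectors: (relative height, relative indentation, relative white space).
TRAINING = {
    "isolated_expression": [(1.8, 0.40, 0.90), (2.1, 0.35, 0.80), (1.6, 0.45, 0.85)],
    "text_line": [(1.0, 0.00, 0.20), (1.1, 0.05, 0.25), (0.9, 0.00, 0.15)],
}

print(classify_line((1.9, 0.38, 0.88), TRAINING))
print(classify_line((1.0, 0.02, 0.20), TRAINING))
```

A Parzen estimate needs no parametric model of each class, which may suit layout features whose distribution is irregular; the cost is that every training sample is visited at classification time.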

Mathematical Expression Recognition System: MatheReader
JIN Jian-Ming, JIANG Hong-Ying, WANG Qing-Ren. Mathematical Expression Recognition System: MatheReader[J]. Chinese Journal of Computers, 2006, 29(11): 2018-2026.
Authors: JIN Jian-Ming  JIANG Hong-Ying  WANG Qing-Ren
Affiliation: Institute of Machine Intelligence, Nankai University, Tianjin 300071
Abstract: Mathematical expressions appear in all kinds of documents, but recognizing them is far more difficult than recognizing ordinary text. This paper presents a mathematical expression recognition system, MatheReader, and describes its schemes for expression extraction and expression analysis in detail. For expression extraction, isolated expressions and normal text lines are distinguished by a Parzen classifier based on layout features, and embedded expressions are extracted by detecting 2-D structures within normal text lines. For expression analysis, eleven basic expression types are defined, and a set of production rules fixes a unique decomposition for each type. The proposed analysis scheme first recognizes the type of an expression and then decomposes the expression into sub-expressions according to that type. Compared with earlier systems, MatheReader is more capable and can recognize more kinds of expressions.
Keywords: expression extraction  expression recognition  expression analysis  automatic performance evaluation  document image processing
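
The type-first analysis described in the abstract (recognize the expression type, then apply the one decomposition rule for that type, then recurse on the sub-expressions) can be illustrated with a toy sketch. The abstract does not enumerate the eleven types or their production rules, so the three types used here (`atom`, `fraction`, `sequence`), the `Symbol` record, and the bounding-box tests are hypothetical stand-ins rather than MatheReader's actual rules.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Symbol:
    label: str  # recognized character; "-" doubles as a horizontal bar
    x0: float
    y0: float   # top edge (image coordinates, y grows downwards)
    x1: float
    y1: float


def find_bar(symbols):
    """Return a bar spanning the full width of the expression, or None."""
    left = min(s.x0 for s in symbols)
    right = max(s.x1 for s in symbols)
    for s in symbols:
        if s.label == "-" and s.x0 <= left and s.x1 >= right:
            return s
    return None


def detect_type(symbols):
    """Step 1: decide the expression type before any decomposition."""
    if len(symbols) == 1:
        return "atom"
    return "fraction" if find_bar(symbols) is not None else "sequence"


def decompose(symbols, kind):
    """Step 2: apply the single decomposition rule attached to the type."""
    if kind == "fraction":
        bar = find_bar(symbols)
        above = [s for s in symbols if s.y1 <= bar.y0]
        below = [s for s in symbols if s.y0 >= bar.y1]
        return [above, below]
    # sequence: one sub-expression per group of horizontally overlapping symbols
    groups, right = [], float("-inf")
    for s in sorted(symbols, key=lambda t: t.x0):
        if s.x0 < right:
            groups[-1].append(s)
        else:
            groups.append([s])
        right = max(right, s.x1)
    if len(groups) < 2:
        raise ValueError("layout not covered by this toy rule set")
    return groups


def analyze(symbols):
    """Recursively turn a symbol layout into a LaTeX-like string."""
    if not symbols:
        raise ValueError("empty sub-expression")
    kind = detect_type(symbols)
    if kind == "atom":
        return symbols[0].label
    parts = [analyze(p) for p in decompose(symbols, kind)]
    if kind == "fraction":
        return "\\frac{%s}{%s}" % (parts[0], parts[1])
    return " ".join(parts)


# Toy layout for "a + b/c" with the fraction set vertically.
SYMBOLS = [
    Symbol("a", 0, 10, 8, 20),
    Symbol("+", 10, 10, 18, 20),
    Symbol("b", 22, 0, 28, 10),
    Symbol("-", 20, 13, 30, 15),
    Symbol("c", 22, 18, 28, 28),
]

print(analyze(SYMBOLS))
```

For the toy layout above the call yields `a + \frac{b}{c}`. Because each type has exactly one decomposition rule, the recursion never has to backtrack, which is the property the abstract attributes to the production rules.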
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.