首页 | 本学科首页   官方微博 | 高级检索  
     

基于自适应尺度边缘特征的建筑施工图重叠字符识别方法研究
作者姓名:王正  邓雪原
作者单位:1. 上海交通大学船舶海洋与建筑工程学院,上海 200240; 2. 上海市公共建筑和基础设施数字化运维重点实验室,上海 200240
基金项目:“十三五”国家重点研发计划项目(2016YFC0702001)
摘    要:目前非重叠字符的识别技术已趋于完善,但难以识别建筑工程图纸标注等场景中的重叠字符,阻碍了基于二维扫描图纸的自动建模技术的突破。针对传统字符识别方法无法识别重叠字符的现状,提出了一套基于自适应尺度边缘特征的建筑施工图重叠字符识别新方法。基于像素空间分布特征初步确定重叠字符区域,定义并提取字符的自适应尺度边缘特征;借助双变量匹配概率函数筛选“位置+内容”的结果组合,并以全局最优原则代替绝对阈值作为识别标准,最终输出正确的识别结果。不同于先修复后识别的常规思路,该方法将特征匹配与干扰过滤相结合、字符定位与字符识别相关联,能解决百度等成熟商用 OCR 无法解决的重叠字符识别问题,且经数据实验证实具备较高的识别准确率。

关 键 词:重叠字符  字符识别  自适应尺度  分布概率  投影分割  

Research on recognition method of overlapped characters in constructiondrawings based on adaptive scale edge feature
Authors:WANG Zheng  DENG Xue-yuan
Affiliation:1. School of Naval Architecture, Ocean & Civil Engineering, Shanghai Jiao Tong University, Shanghai 200240, China; 2. Shanghai Key Laboratory for Digital Maintenance of Buildings and Infrastructure, Shanghai 200240, China
Abstract:At present, the recognition technology of non-overlapped characters has been perfected, but it remains difficult to solve the recognition problem of common overlapped characters in scenarios such as the annotation of architectural engineering drawings, which hinders the breakthrough of automatic modeling technology based on 2D scanned drawings. To address the incapability of traditional character recognition methods to recognize overlapped characters, a new method was proposed for overlapped characters recognition in construction drawings based on adaptive scale edge features. Based on the spatial distribution characteristics of pixels, the overlapped character areas were preliminarily determined, and the adaptive scale edge features of characters were defined and extracted. The result combination of “position + content” was screened with the help of the bivariate matching probability function, and the global optimal principle was used instead of the absolute threshold as the identification standard. Finally, the correct recognition of overlapped characters was achieved. Different from the conventional idea of recognizing after repairing, the new method combined feature matching and interference filtering, character positioning and character recognition. The proposed method can solve the overlapping character recognition problem insolvable for mature commercial OCR such as Baidu, and the data experiment proves that this method is of high recognition accuracy.
Keywords:overlapped characters  character recognition  adaptive scale  distribution probability  projection segmentation  
点击此处可从《》浏览原始摘要信息
点击此处可从《》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号