Similar literature
1.
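Objective: Handwritten text line extraction is a fundamental step in document image processing. In unconstrained handwritten text images, text lines are skewed, curved, crossing, and touching to varying degrees, and traditional geometric segmentation or clustering methods often cannot segment text line boundaries precisely. To address these problems, a handwritten text line extraction method based on a joint text line regression-clustering framework is proposed. Methods: First, a bank of anisotropic Gaussian filters is used for multi-scale, multi-orientation analysis of the image; the smearing effect is used to detect ridge structures and extract the main body regions of text lines, which are then skeletonized to obtain text line regression models. Next, a superpixel representation is built with connected components as the basic image units. To cluster the superpixels, a hierarchical random field model associating pixels, superpixels, and text lines is established, and energy function optimization is used to cluster the superpixels and label the text line each belongs to. On this basis, all character blocks that touch across lines are detected, and a regression-line-based k-means clustering algorithm, guided by the regression models, clusters the pixels of touching characters, thereby segmenting them and labeling their text lines. Finally, text line label switches are used to control the display and targeted extraction of text line pixels, so geometric segmentation is no longer needed. Results: In text line extraction tests on the HIT-MW offline handwritten Chinese document dataset, the detection rate (DR) is 99.83% and the recognition accuracy (RA) is 99.92%. Conclusion: Experiments show that, compared with traditional methods such as piecewise projection analysis, minimum spanning tree clustering, and Seam Carving, the proposed joint regression-clustering analysis framework improves the controllability and segmentation accuracy of text line boundaries. It extracts handwritten text lines efficiently while avoiding interference from adjacent text lines as far as possible, and has high accuracy and robustness.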
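The regression-line-guided clustering of touching-character pixels mentioned in this abstract can be illustrated with a minimal sketch. It assumes each text line regression model is simplified to a straight line y = a*x + b (the abstract derives the models from skeletons, which may be curved), and it shows only the assignment step of a k-means-style procedure with the lines as fixed cluster centers. The names `point_line_distance` and `assign_pixels` are hypothetical.

```python
from typing import List, Tuple

Line = Tuple[float, float]  # (slope a, intercept b) for y = a * x + b (assumed line model)
Pixel = Tuple[int, int]     # (x, y) image coordinates


def point_line_distance(p: Pixel, line: Line) -> float:
    """Perpendicular distance from pixel p to the line y = a*x + b."""
    a, b = line
    x, y = p
    return abs(a * x - y + b) / (a * a + 1.0) ** 0.5


def assign_pixels(pixels: List[Pixel], lines: List[Line]) -> List[int]:
    """Label each pixel with the index of its nearest regression line."""
    return [
        min(range(len(lines)), key=lambda k: point_line_distance(p, lines[k]))
        for p in pixels
    ]
```

For a character block that touches two lines, calling `assign_pixels` with the two neighbouring regression lines splits its pixels between them; a fuller implementation would presumably iterate or add smoothness constraints.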

2.
In this paper, we present an effective approach for grouping text lines in online handwritten Japanese documents by combining temporal and spatial information. With decision functions optimized by supervised learning, the approach has few artificial parameters and utilizes little prior knowledge. First, the strokes in the document are grouped into text line strings according to off-stroke distances. Each text line string, which may contain multiple lines, is segmented by optimizing a cost function trained by the minimum classification error (MCE) method. At the temporal merge stage, over-segmented text lines (caused by stroke classification errors) are merged with a support vector machine (SVM) classifier for making merge/non-merge decisions. Last, a spatial merge module corrects the segmentation errors caused by delayed strokes. Misclassified text/non-text strokes (stroke type classification precedes text line grouping) can be corrected at the temporal merge stage. To evaluate the performance of text line grouping, we provide a set of performance metrics for evaluating from multiple aspects. In experiments on a large number of free-form documents in the Tokyo University of Agriculture and Technology (TUAT) Kondate database, the proposed approach achieves an entity detection metric (EDM) rate of 0.8992 and an edit-distance rate (EDR) of 0.1114. For grouping of pure text strokes, the performance reaches EDM of 0.9591 and EDR of 0.0669.

3.
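Line contour tracing method in engineering drawing vectorization   Total citations: 10 (self-citations: 0, citations by others: 10)
The concepts of the average chain code and the line are introduced into engineering drawing vectorization, and the relationship between the average chain code and the straight-line chain code is explained. Methods are proposed for determining the tangential and normal directions of a line's boundary, for line detection, for synchronously tracing both sides of a line, and for handling turns. These allow a straight or slightly curved line to be extracted as a whole. Graphics and text are vectorized during tracing, with considerable improvements in speed and quality.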

4.
A new signal processing method is developed for estimating the skew angle in text document images. Detection of the skew angle is an important step in text processing tasks such as optical character recognition (OCR) and computerized filing. Based on a recently introduced multiline-fitting algorithm, the proposed method reformulates the skew detection problem into a special parameter-estimation framework such that a signal structure similar to the one in the field of sensor array processing is obtained. In this framework, straight lines in an image are modeled as wavefronts of propagating planar waves. Certain measurements are defined in this virtual propagation environment such that the large amount of coherency that exists between the locations of the pixels on parallel lines is exploited to enhance a subspace in the space spanned by the measurements. The well-studied techniques of sensor array processing (e.g., the ESPRIT algorithm) are then exploited to produce a closed form and high-resolution estimate for the skew angle.

5.
Extracting curved text lines using local linearity of the text line   Total citations: 1 (self-citations: 0, citations by others: 1)
In order to enhance the ability of document analysis systems, we need a text line extraction method which can handle not only straight text lines but also text lines in various shapes. This paper proposes a new method called Extended Linear Segment Linking (ELSL for short), which is able to extract text lines in arbitrary orientations and curved text lines. We also consider the existence of both horizontally and vertically printed text lines on the same page. The new method can produce text line candidates for multiple orientations. We verify the ability of the method by some experiments as well. Received December 21, 1998 / Revised version September 2, 1999

6.
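A new method for automatic extraction of urban buildings from aerial images   Total citations: 12 (self-citations: 0, citations by others: 12)
A new method is proposed for automatically extracting rectangular buildings from aerial images of urban areas. Starting from edges extracted from the aerial urban image, the method performs contour tracing and uses the Splitting method to extract straight lines, yielding the corresponding line-based geometric figures. Given the complexity of aerial images and the shortcomings of existing edge detection algorithms, a series of line processing methods (such as line classification, sorting, merging, and adjustment) is proposed, which effectively compensates for the deficiencies of the preceding processing. To improve the accuracy of rectangular building extraction, knowledge is introduced to define several approximate rectangular structures. Using geometric structural element analysis, the basic structural elements that make up rectangles are extracted from the figure and then merged into rectangular structures by a merging algorithm according to merging criteria for structural elements. Extensive experimental results show that the method extracts rectangular buildings with high accuracy, is robust and fast, and has strong practical value.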

7.
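A fast text skew detection method   Total citations: 2 (self-citations: 0, citations by others: 2)
Skew detection is the first step in converting text into digital form, and an important one, because much of the subsequent processing relies on deskewed text. This paper proposes a completely new skew detection and correction method. Its features are: first, it is independent of the texture of the text, so it copes with complex cases such as mixed graphics and text and coexisting writing directions; second, its computational cost is low, requiring only one rotation and four partial projections of the image.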
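As background for the projection idea, here is a minimal sketch of a generic projection-profile skew search. It is not the single-rotation, four-projection procedure of this abstract, whose details are not given here; it is the textbook brute-force variant, shown only to make "projection" concrete. It assumes the foreground pixels are available as (x, y) coordinates, and the function names are hypothetical.

```python
import math
from collections import Counter
from typing import Iterable, List, Tuple


def profile_score(points: List[Tuple[int, int]], angle_deg: float) -> int:
    """Sharpness of the row projection after rotating the points by -angle_deg.

    When the rotation cancels the skew, pixels of one text line fall into few
    rows, so the sum of squared row counts is large.
    """
    t = math.radians(angle_deg)
    rows = Counter(round(-x * math.sin(t) + y * math.cos(t)) for x, y in points)
    return sum(c * c for c in rows.values())


def estimate_skew(points: List[Tuple[int, int]], angles: Iterable[float]) -> float:
    """Return the candidate angle (degrees) with the sharpest row projection."""
    return max(angles, key=lambda a: profile_score(points, a))
```

The cost grows with the number of candidate angles, which is the kind of expense a low-computation method such as the one summarized above aims to avoid.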

8.
This paper presents a morphology-based text line extraction algorithm for extracting text regions from cluttered images. First of all, the method defines a novel set of morphological operations for extracting important contrast regions as possible text line candidates. The contrast feature is robust to lighting changes and invariant against different image transformations like image scaling, translation, and skewing. In order to detect skewed text lines, a moment-based method is then used for estimating their orientations. According to the orientation, an x-projection technique can be applied to extract various text geometries from the text-analogue segments for text verification. However, due to noise, a text line region is often fragmented to different pieces of segments. Therefore, after the projection, a novel recovery algorithm is then proposed for recovering a complete text line from its pieces of segments. After that, a verification scheme is then proposed for verifying all extracted potential text lines according to their text geometries. Experimental results show that the proposed method improves the state-of-the-art work in terms of effectiveness and robustness for text line detection.

9.
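Image processing (such as OCR) places strict requirements on image orientation, yet the orientation of a text image is uncertain. To address this, a fast algorithm for detecting upside-down Chinese text images is proposed. Projection is used to locate the text characters; the structural features of Chinese characters and punctuation marks are used to pick out the punctuation marks in the text image; the type of each punctuation mark is determined from its pixel distribution; and, combined with the usage conventions of punctuation marks, a statistical method decides whether the Chinese text image is upside down. Experimental results show that the projection method is fast and efficient without relying on content, and that the statistical method ensures the discrimination rate. The method can be used in OCR preprocessing.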

10.
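Table analysis is the process of recognizing the basic structure and shape of a table, and it determines whether text can later be extracted correctly from the table cells. Taking the characteristics of tables into account, a method combining table line detection and processing is used to obtain the table frame lines. During table line detection, a main table line length is defined to speed up scanning. For table line processing, the removal of spurious lines, the adjustment of table lines, and the final recovery of the table structure are discussed systematically. Extensive experimental results show that the proposed method is feasible.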

11.
In this paper, we present a new text line detection method for handwritten documents. The proposed technique is based on a strategy that consists of three distinct steps. The first step includes image binarization and enhancement, connected component extraction, partitioning of the connected component domain into three spatial sub-domains and average character height estimation. In the second step, a block-based Hough transform is used for the detection of potential text lines while a third step is used to correct possible splitting, to detect text lines that the previous step did not reveal and, finally, to separate vertically connected characters and assign them to text lines. The performance evaluation of the proposed approach is based on a consistent and concrete evaluation methodology.
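The Hough voting that underlies the second step can be sketched as follows. This is a plain point-based Hough transform, not the block-based variant of the abstract: it assumes each connected component is reduced to a single representative point (for example its centroid), and `hough_peak` is a hypothetical name.

```python
import math
from collections import Counter
from typing import Iterable, List, Tuple


def hough_peak(
    points: List[Tuple[float, float]],
    theta_degs: Iterable[int],
    rho_step: float = 1.0,
) -> Tuple[int, float, int]:
    """Vote in (theta, rho) space and return the strongest line.

    A line is parameterised as rho = x*cos(theta) + y*sin(theta).
    Returns (theta in degrees, rho, number of votes).
    """
    thetas = list(theta_degs)
    votes: Counter = Counter()
    for x, y in points:
        for th in thetas:
            t = math.radians(th)
            rho = x * math.cos(t) + y * math.sin(t)
            votes[(th, round(rho / rho_step))] += 1
    (theta, rho_bin), count = votes.most_common(1)[0]
    return theta, rho_bin * rho_step, count
```

Restricting `theta_degs` to a band around 90 degrees searches only for near-horizontal lines; components that vote for the winning cell would then be grouped into one candidate text line.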

12.
In today's real world, scene text detection and recognition is an important research area in image processing. Scene text can be in different languages, fonts, sizes, colours, orientations and structures. Moreover, the aspect ratios and layouts of scene text may differ significantly. All these variations pose significant challenges for algorithms that detect and recognize text in natural scenes. In this paper, a new intelligent method is proposed for detecting text in natural scenes and for recognizing it by applying the newly proposed Conditional Random Field-based fuzzy rules incorporated Convolutional Neural Network (CR-CNN). Moreover, we have recommended a new text detection method for detecting the exact text from the input natural scene images. To enhance the edge detection process, image pre-processing activities such as edge detection and color modeling have been applied in this work. In addition, we have generated new fuzzy rules for making effective decisions on the processes of text detection and recognition. The experiments have been conducted using standard benchmark datasets such as the ICDAR 2003, the ICDAR 2011, the ICDAR 2005 and the SVT, and have achieved better accuracy in text detection and recognition. By using these three datasets, five different experiments have been conducted for evaluating the proposed model. We have also compared the proposed system with other classifiers such as the SVM, the MLP and the CNN. In these comparisons, the proposed model has achieved better classification accuracy than the other existing works.

13.
In this paper, we present a segmentation methodology of handwritten documents in their distinct entities, namely, text lines and words. Text line segmentation is achieved by applying Hough transform on a subset of the document image connected components. A post-processing step includes the correction of possible false alarms, the detection of text lines that Hough transform failed to create and finally the efficient separation of vertically connected characters using a novel method based on skeletonization. Word segmentation is addressed as a two class problem. The distances between adjacent overlapped components in a text line are calculated using the combination of two distance metrics and each of them is categorized either as an inter- or an intra-word distance in a Gaussian mixture modeling framework. The performance of the proposed methodology is based on a consistent and concrete evaluation methodology that uses suitable performance measures in order to compare the text line segmentation and word segmentation results against the corresponding ground truth annotation. The efficiency of the proposed methodology is demonstrated by experimentation conducted on two different datasets: (a) on the test set of the ICDAR2007 handwriting segmentation competition and (b) on a set of historical handwritten documents.
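The two-class treatment of gaps can be illustrated with a small sketch. It assumes each gap between adjacent components is already reduced to one scalar distance (the abstract combines two distance metrics, which is not reproduced here) and fits a two-component one-dimensional Gaussian mixture with a basic EM loop. All function names, the initialisation, and the variance floor are illustrative assumptions.

```python
import math
from typing import List, Tuple


def posterior(g: float, mu: List[float], var: List[float], w: List[float]) -> List[float]:
    """Posterior probability of each of the two components for gap g."""
    p = [
        w[k] * math.exp(-((g - mu[k]) ** 2) / (2 * var[k])) / math.sqrt(2 * math.pi * var[k])
        for k in (0, 1)
    ]
    s = p[0] + p[1]
    return [p[0] / s, p[1] / s] if s > 0 else [0.5, 0.5]


def fit_two_gaussians(gaps: List[float], iters: int = 50) -> Tuple[List[float], List[float], List[float]]:
    """Fit a two-component 1-D Gaussian mixture to the gap widths with EM."""
    n = len(gaps)
    mean_all = sum(gaps) / n
    var_all = max(sum((g - mean_all) ** 2 for g in gaps) / n, 1e-2)
    mu = [float(min(gaps)), float(max(gaps))]  # start the components at the extremes
    var = [var_all, var_all]
    w = [0.5, 0.5]
    for _ in range(iters):
        resp = [posterior(g, mu, var, w) for g in gaps]  # E-step
        for k in (0, 1):                                 # M-step
            nk = sum(r[k] for r in resp)
            if nk < 1e-9:
                continue
            mu[k] = sum(r[k] * g for r, g in zip(resp, gaps)) / nk
            spread = sum(r[k] * (g - mu[k]) ** 2 for r, g in zip(resp, gaps)) / nk
            var[k] = max(spread, 1e-2)  # variance floor avoids a collapsed component
            w[k] = nk / n
    return mu, var, w


def is_inter_word(gaps: List[float], iters: int = 50) -> List[bool]:
    """True where a gap more likely belongs to the wider (inter-word) component."""
    mu, var, w = fit_two_gaussians(gaps, iters)
    wide = 0 if mu[0] > mu[1] else 1
    return [posterior(g, mu, var, w)[wide] > 0.5 for g in gaps]
```

Gaps flagged `True` would be treated as word boundaries; the mixture lets the threshold adapt to each writer's spacing instead of being fixed in advance.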

14.
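Improvement of the centroid detection method for text document watermarks   Total citations: 1 (self-citations: 0, citations by others: 1)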
戴祖旭  洪帆  李小刚  董洁 《计算机应用》2007,27(5):1064-1066
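The centroid detection method for text watermarks of Brassil et al. is improved. By simulating the extension of the initial text line and jointly using the profile information of the regenerated text line and the initial text line, a sequence of regenerated simulated text line centroids is constructed, and the sequence is proved to converge in probability to the centroid of the initial text line. Experimental results show that, when processing text document watermarks containing short lines, the improved detection method can halve the false detection probability compared with the Brassil method. Watermarks can therefore be embedded with line-shift coding without restriction on text line length, which increases document watermark capacity.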
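For readers unfamiliar with centroid detection, the basic quantity can be sketched as follows. The sketch assumes a text line is summarised by its horizontal projection profile (foreground pixel count per scan row) and that a marked line sits between two unshifted neighbours, which is how line-shift coding is commonly described. It does not implement the simulated line extension proposed in this abstract, and the function names and tolerance are hypothetical.

```python
from typing import Sequence


def profile_centroid(profile: Sequence[float], top: int = 0) -> float:
    """Vertical centroid of a text line from its horizontal projection profile.

    profile[i] is the foreground pixel count of scan row top + i.
    """
    total = sum(profile)
    if total == 0:
        raise ValueError("empty profile")
    return top + sum(i * h for i, h in enumerate(profile)) / total


def shift_direction(c_above: float, c_mid: float, c_below: float, tol: float = 0.5) -> str:
    """Guess the shift of the middle line from its centroid spacing to both neighbours."""
    gap_up = c_mid - c_above
    gap_down = c_below - c_mid
    if abs(gap_up - gap_down) <= tol:
        return "none"
    return "up" if gap_up < gap_down else "down"
```

A short line has few pixels in its profile, so noise moves its centroid more; this is the failure case the abstract's improvement targets.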

15.
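To improve the automatic detection of the operational reliability of high-voltage transmission lines under icing conditions, an automatic detection method based on association-rule information fusion is proposed. Autocorrelation feature detection is used to mine the operating state of iced high-voltage transmission lines and extract their operational reliability data. Information fusion and quantitative regression analysis are performed on the extracted data and, combined with abnormal spectral density feature detection, the operational reliability data are mined. A feature distribution model of the operating state of iced high-voltage transmission lines is then constructed and, combined with association-rule information fusion, automatic detection of operational reliability is achieved. Simulation results show that the method detects the operational reliability of iced high-voltage transmission lines with high accuracy and good interference resistance.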

16.
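As major power companies vigorously promote unmanned aerial vehicle (UAV) inspection, "UAV inspection first, manual inspection as a supplement" has become the main operation and maintenance mode for power line inspection in China. Power line detection, a key technology of power inspection, plays an important role in UAV autonomous navigation, low-altitude obstacle avoidance, and the safe and stable operation of transmission lines. Many researchers use UAV aerial images of transmission lines for line equipment recognition and fault diagnosis; machine vision methods dominate research on power line detection and are the main direction of future development. This paper reviews progress over the past 10 years in power line detection methods for UAV aerial images. First, the characteristics of power lines are briefly described, and the general workflow of traditional power line detection methods and the challenges they face are explained. Then the principles of power line detection using traditional image processing methods and deep learning methods are described in detail. The former include methods based on the Hough transform, the Radon transform, the LSD (line segment detector), and scan labeling, as well as other detection methods; the latter are divided, according to the structure of the deep convolutional neural network (DCNN), into DCNN-based classification methods and DCNN-based semantic segmentation methods. The advantages and disadvantages of each class of methods are reviewed, analyzed, and compared: relative to traditional image processing, deep learning methods detect power lines in aerial images more effectively, and DCNN-based semantic segmentation methods play an important role in the intelligent recognition and analysis of power line targets. Commonly used datasets and performance evaluation metrics for power line detection are then introduced. Finally, in view of the current problems of power line detection methods, future research directions are discussed.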

17.
18.
19.
谢凤英  姜志国  汪雷 《计算机应用》2006,26(7):1587-1589
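For complex text images with variable scanning backgrounds that contain figures and tables, an effective skew detection method is proposed. The method first selects, adaptively, feature sub-regions containing text through statistical analysis of the gradient image. Within a feature sub-region, the blank strip between text lines is treated as an implicit line, and optimization theory is used to compute the skew angle of the blank strip, which is also the skew angle of the text. Experimental results show that the method is not limited by scanning background, border size, text layout, or line spacing, and is fast, accurate, and highly adaptable.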

20.
The topic of this paper is machine translation (MT) from French text into French sign language (LSF). After arguing in favour of a rule-based method, it presents the architecture of an original MT system, built on two distinct efforts: formalising LSF production rules and triggering them with text processing. The former is made without any concern for text or translation and involves corpus analysis to link LSF form features to linguistic functions. It produces a set of production rules which may constitute a full LSF production grammar. The latter is an information extraction task from text, broken down into as many subtasks as there are rules in the grammar. After discussing this architecture, comparing it to the traditional methods and presenting the methodology for each task, the paper presents the set of production rules found to govern event precedence and duration in LSF and gives a progress report on the implementation of the rule triggering system. With this proposal, it is also hoped to show how MT can benefit today from sign language processing.
