首页 | 本学科首页   官方微博 | 高级检索  
     

基于灰度直方图和谱聚类的文本图像二值化方法
引用本文:吴锐,黄剑华,唐降龙,刘家锋.基于灰度直方图和谱聚类的文本图像二值化方法[J].电子与信息学报,2009,31(10):2460-2464.
作者姓名:吴锐  黄剑华  唐降龙  刘家锋
作者单位:哈尔滨工业大学计算机科学与技术学院,哈尔滨,150001
基金项目:国家自然科学基金(60672090)资助课题 
摘    要:在自动文本提取中,经定位获得的字符区域需二值化后方能有效识别,由于背景的复杂,常用的阈值化方法不能有效分割自然环境下的字符图像。该文提出了一种基于谱聚类的图像二值化方法,该方法利用规范化切痕(Normalized cut, Ncut)作为谱聚类测度,结合灰度直方图计算相似性矩阵,并通过实验确定最佳的直方图等级数,与通常基于像素级相似矩阵相比,算法的空间复杂度和计算复杂性都大为降低。实验结果表明,针对自然场景下的字符图像,该文方法的二值化结果优于常用的阈值分割结果。

关 键 词:图像处理    文本图像    二值化    图分割    谱聚类
收稿时间:2008-10-9
修稿时间:2009-3-17

Method of Text Image Binarization Processing Using Histogram and Spectral Clustering
Wu Rui Huang Jian-hua Tang Xiang-long Liu Jia-feng.Method of Text Image Binarization Processing Using Histogram and Spectral Clustering[J].Journal of Electronics & Information Technology,2009,31(10):2460-2464.
Authors:Wu Rui Huang Jian-hua Tang Xiang-long Liu Jia-feng
Affiliation:School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
Abstract:The located text regions need to be binarized for accurate recognition in automatic textural extraction. Due to the complex backgrounds, traditional thresholding methods can not segment the character image effectively from natural scenes. A novel approach of binarization is proposed for gray images. The proposed algorithm uses the Normalized graph cut(Ncut) as the measure for spectral clustering, and the weighted matrices used in evaluating the graph cuts are based on the gray levels of an image, rather than the commonly used image pixels. Thus, the proposed algorithm requires much smaller spatial costs and much lower computation complexity. Experiments on text images in natural scene show the superior performance of the proposed method compared to the typical thresholding algorithms.
Keywords:Image processing  Text image  Binarization processing  Graph partition  Spectral clustering
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号