一种新的多尺度深度学习图像语义理解方法研究 A new multi-scale image semantic understanding method based on deep learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

一种新的多尺度深度学习图像语义理解方法研究

引用本文：	蒋应锋,张桦,薛彦兵,周冕,徐光平,高赞.一种新的多尺度深度学习图像语义理解方法研究[J].光电子．激光,2016(2):224-230.

作者姓名：	蒋应锋张桦薛彦兵周冕徐光平高赞

作者单位：	天津理工大学天津市智能计算及软件新技术重点实验室,计算机视觉与系统省部共建教育部重点实验室,天津 300384;天津理工大学天津市智能计算及软件新技术重点实验室,计算机视觉与系统省部共建教育部重点实验室,天津 300384;天津理工大学天津市智能计算及软件新技术重点实验室,计算机视觉与系统省部共建教育部重点实验室,天津 300384;天津理工大学天津市智能计算及软件新技术重点实验室,计算机视觉与系统省部共建教育部重点实验室,天津 300384;天津理工大学天津市智能计算及软件新技术重点实验室,计算机视觉与系统省部共建教育部重点实验室,天津 300384;天津理工大学天津市智能计算及软件新技术重点实验室,计算机视觉与系统省部共建教育部重点实验室,天津 300384

基金项目：	国家自然科学基金(61202168,1,61472278)、天津市自然科学基金重点(14JCZDJC31700)和天津市高校发展基金(20120802,4)资助项目 (天津理工大学天津市智能计算及软件新技术重点实验室,计算机视觉与系统省部共建教育部重点实验室,天津 300384)

摘要：	如何在深度学习中融合图像的多尺度信息,是基于深度学习的视觉算法需要解决的一个关键问题。本文提出一种基于多尺度交替迭代训练的深度学习方法,并应用于图像的语义理解。算法采用卷积神经网络(CNN)从原始图像中提取稠密性特征来编码以每个像素为中心的矩形区域,将多个尺度图像交替迭代训练,能够捕获不同尺度下的纹理、颜色和边缘等重要信息。在深度学习提取特征分类结果的基础上,提出了一种结合超像素分割的方法,统计超像素块的主导类别,来校正分类错误的像素类别,同时描绘出目标区域边界轮廓,完成最终的语义理解。在Stanford Background Dataset 8类数据集上验证了本文方法的有效性,准确率达到77.4%。
关键词：	图像语义理解超像素分割卷积神经网络(CNN) 深度学习
收稿时间：	2015/9/25 0:00:00
A new multi-scale image semantic understanding method based on deep learning

Affiliation:	Key Laboratory of Computer Vision and System,Ministry of Education of China,Tia njin Key Laboratory of Intelligence Computing and Novel Software Technology,Tian jin Universi ty of Technology,Tianjin 300384,China;Key Laboratory of Computer Vision and System,Ministry of Education of China,Tia njin Key Laboratory of Intelligence Computing and Novel Software Technology,Tian jin Universi ty of Technology,Tianjin 300384,China;Key Laboratory of Computer Vision and System,Ministry of Education of China,Tia njin Key Laboratory of Intelligence Computing and Novel Software Technology,Tian jin Universi ty of Technology,Tianjin 300384,China;Key Laboratory of Computer Vision and System,Ministry of Education of China,Tia njin Key Laboratory of Intelligence Computing and Novel Software Technology,Tian jin Universi ty of Technology,Tianjin 300384,China;Key Laboratory of Computer Vision and System,Ministry of Education of China,Tia njin Key Laboratory of Intelligence Computing and Novel Software Technology,Tian jin Universi ty of Technology,Tianjin 300384,China;Key Laboratory of Computer Vision and System,Ministry of Education of China,Tia njin Key Laboratory of Intelligence Computing and Novel Software Technology,Tian jin Universi ty of Technology,Tianjin 300384,China

Abstract:	How to fuse multi-scale information o f image in deep learning is a key problem to be solved.To solve these problems,this paper proposed a deep learning method base d on multi-scale iterative training for image semantic understanding.The algorithm uses the convolution neu ral network (CNN) to extract dense feature vectors from raw pixel for encoding regions centered on each pixel.The multi-scale iterative training captures different scales of textures,colors,edges and other important informa tion.A new method combined with superpixel segmentation is proposed,to estimate the leading category of superpi xel block and to correct the pixel classification error.It can depict the outline of the target area boundary a nd complete the final semantic understanding.The experiments on Stanford Background Dataset-8verify the effectiveness of the proposed method,and the accuracy rate is 77.4%.

Keywords:	image semantic understanding superpixel segmentation convolutional neural netw ork (CNN) deep learning

	点击此处可从《光电子．激光》浏览原始摘要信息
	点击此处可从《光电子．激光》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏