基于融合注意力机制的图像标题生成 Image caption generation based on fusion attention mechanism期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于融合注意力机制的图像标题生成

引用本文：	侯一雯,田玉玲.基于融合注意力机制的图像标题生成[J].计算机应用研究,2021,38(7):2209-2212.

作者姓名：	侯一雯田玉玲

作者单位：	太原理工大学信息与计算机学院,太原030000

基金项目：	国家自然科学基金资助项目(61472271)

摘要：	图像标题生成利用机器自动产生描述图像的句子,属于计算机视觉与自然语言处理的交叉领域.传统基于注意力机制的算法侧重特征图不同区域,忽略特征图通道,易造成注意偏差.该模型通过当前嵌入单词与隐藏层状态的耦合度来赋予特征图不同通道相应权重,并将其与传统方法结合为融合注意力机制,准确定位注意位置.实验结果均在指定的评估方法上有一定的提升,表明该模型可以生成更加流利准确的自然语句.
关键词：	图像标题生成注意偏差通道耦合度融合注意力
收稿时间：	2020/6/29 0:00:00
修稿时间：	2021/6/18 0:00:00
Image caption generation based on fusion attention mechanism

Hou,Yiwen and Tian yuling.Image caption generation based on fusion attention mechanism[J].Application Research of Computers,2021,38(7):2209-2212.

Authors:	Hou Yiwen and Tian yuling

Affiliation:	Dept of Information Computer,Taiyuan University of Technology,:Taiyuan Shanxi;Dept of Information Computer,Taiyuan University of Technology,:Taiyuan Shanxi,

Abstract:	Image caption generation makes machine to automatically describe the content of an image, which belongs to a crossing domain of computer vision and natural language processing. Traditional algorithm based on attention mechanism focus on the different sub-regions of the feature maps, without considering the different channels of feature maps, which is easy to cause attention deviation. To solve this problem, the proposed model assigned corresponding weights to different channel feature maps by the degree of coupling between the currently embedded word and the state of the hidden layer, and combined it with the traditional method as a fusion attention mechanism to accurately locate the attention position. The experimental results have a certain improvement on the specified evaluation method, indicating that the model can generate more fluent and accurate natural sentences.

Keywords:	image caption generation attention deviation channel coupling fusion attention
本文献已被万方数据等数据库收录！
	点击此处可从《计算机应用研究》浏览原始摘要信息
	点击此处可从《计算机应用研究》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏