首页 | 本学科首页   官方微博 | 高级检索  
     

基于融合注意力机制的图像标题生成
引用本文:侯一雯,田玉玲.基于融合注意力机制的图像标题生成[J].计算机应用研究,2021,38(7):2209-2212.
作者姓名:侯一雯  田玉玲
作者单位:太原理工大学 信息与计算机学院,太原030000
基金项目:国家自然科学基金资助项目(61472271)
摘    要:图像标题生成利用机器自动产生描述图像的句子,属于计算机视觉与自然语言处理的交叉领域.传统基于注意力机制的算法侧重特征图不同区域,忽略特征图通道,易造成注意偏差.该模型通过当前嵌入单词与隐藏层状态的耦合度来赋予特征图不同通道相应权重,并将其与传统方法结合为融合注意力机制,准确定位注意位置.实验结果均在指定的评估方法上有一定的提升,表明该模型可以生成更加流利准确的自然语句.

关 键 词:图像标题生成  注意偏差  通道  耦合度  融合注意力
收稿时间:2020/6/29 0:00:00
修稿时间:2021/6/18 0:00:00

Image caption generation based on fusion attention mechanism
Hou,Yiwen and Tian yuling.Image caption generation based on fusion attention mechanism[J].Application Research of Computers,2021,38(7):2209-2212.
Authors:Hou  Yiwen and Tian yuling
Affiliation:Dept of Information Computer,Taiyuan University of Technology,:Taiyuan Shanxi;Dept of Information Computer,Taiyuan University of Technology,:Taiyuan Shanxi,
Abstract:Image caption generation makes machine to automatically describe the content of an image, which belongs to a crossing domain of computer vision and natural language processing. Traditional algorithm based on attention mechanism focus on the different sub-regions of the feature maps, without considering the different channels of feature maps, which is easy to cause attention deviation. To solve this problem, the proposed model assigned corresponding weights to different channel feature maps by the degree of coupling between the currently embedded word and the state of the hidden layer, and combined it with the traditional method as a fusion attention mechanism to accurately locate the attention position. The experimental results have a certain improvement on the specified evaluation method, indicating that the model can generate more fluent and accurate natural sentences.
Keywords:image caption generation  attention deviation  channel  coupling  fusion attention
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号