Foveated convolutional neural networks for video summarization
Authors: Jiaxin Wu, Sheng-hua Zhong, Zheng Ma, Stephen J. Heinen, Jianmin Jiang
Affiliation: 1. The College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China; 2. Smith-Kettlewell Eye Research Institute, San Francisco, USA
Abstract: With the proliferation of video data, video summarization has become a valuable tool that lets users browse video content rapidly. In this paper, we propose a novel foveated convolutional neural network for dynamic video summarization. We are the first to integrate gaze information into a deep learning network for video summarization. Foveated images are constructed from subjects’ eye movements to represent the spatial information of the input video. Multi-frame motion vectors are stacked across several adjacent frames to convey motion cues. To evaluate the proposed method, experiments are conducted on two video summarization benchmark datasets. The experimental results validate the effectiveness of gaze information for video summarization, even though the eye movements were collected from subjects other than those who generated the summaries. Empirical validation also demonstrates that the proposed foveated convolutional neural network achieves state-of-the-art performance on these benchmark datasets.
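The two inputs described in the abstract, foveated images and stacked multi-frame motion vectors, can be illustrated with a minimal pure-Python sketch. The abstract does not say how foveation is computed or how many frames are stacked, so everything here is an assumption made for illustration: the Gaussian falloff, the box blur standing in for peripheral degradation, and the hypothetical names `box_blur`, `foveate`, `stack_motion`, `sigma`, and `length`. It is not the authors' implementation.

```python
import math


def box_blur(img, radius=1):
    """Mean-filter a 2D grayscale image (list of rows); the window is clipped at edges."""
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            total, count = 0.0, 0
            for yy in range(max(0, y - radius), min(h, y + radius + 1)):
                for xx in range(max(0, x - radius), min(w, x + radius + 1)):
                    total += img[yy][xx]
                    count += 1
            out[y][x] = total / count
    return out


def foveate(img, gaze_xy, sigma=2.0):
    """Keep pixels sharp near the gaze point and blend toward a blurred copy elsewhere.

    Assumption: a Gaussian weight around the fixation; the paper may use another scheme.
    """
    gx, gy = gaze_xy
    blurred = box_blur(img)
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            d2 = (x - gx) ** 2 + (y - gy) ** 2
            weight = math.exp(-d2 / (2.0 * sigma ** 2))  # 1 at fixation, near 0 far away
            out[y][x] = weight * img[y][x] + (1.0 - weight) * blurred[y][x]
    return out


def stack_motion(flows, t, length):
    """Stack the (dx, dy) fields of `length` adjacent frames, starting at frame t,
    into a list of 2 * length channels ordered dx_t, dy_t, dx_{t+1}, dy_{t+1}, ...
    """
    if t < 0 or t + length > len(flows):
        raise ValueError("window exceeds available frames")
    channels = []
    for dx, dy in flows[t:t + length]:
        channels.append(dx)
        channels.append(dy)
    return channels
```

In such a setup, the foveated frame would feed a spatial stream and the stacked channels a temporal stream; how the paper's network actually combines them is not stated in the abstract.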
This article is indexed in SpringerLink and other databases.