Deep learning for content-based video retrieval in film and television production期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Deep learning for content-based video retrieval in film and television production

Authors:	Markus Mühling Nikolaus Korfhage Eric Müller Christian Otto Matthias Springstein Thomas Langelage Uli Veith Ralph Ewerth Bernd Freisleben

Affiliation:	1.Department of Mathematics and Computer Science,University of Marburg,Marburg,Germany;2.taglicht media Film- & Fernsehproduktion GmbH,K?ln,Germany;3.German National Library of Science and Technology (TIB),Hannover,Germany;4.L3S Research Center,Leibniz Universit?t Hannover,Hannover,Germany

Abstract:	While digitization has changed the workflow of professional media production, the content-based labeling of image sequences and video footage, necessary for all subsequent stages of film and television production, archival or marketing is typically still performed manually and thus quite time-consuming. In this paper, we present deep learning approaches to support professional media production. In particular, novel algorithms for visual concept detection, similarity search, face detection, face recognition and face clustering are combined in a multimedia tool for effective video inspection and retrieval. The analysis algorithms for concept detection and similarity search are combined in a multi-task learning approach to share network weights, saving almost half of the computation time. Furthermore, a new visual concept lexicon tailored to fast video retrieval for media production and novel visualization components are introduced. Experimental results show the quality of the proposed approaches. For example, concept detection achieves a mean average precision of approximately 90% on the top-100 video shots, and face recognition clearly outperforms the baseline on the public Movie Trailers Face Dataset.

Keywords:
本文献已被 SpringerLink 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏