首页 | 本学科首页   官方微博 | 高级检索  
     


Deep learning for content-based video retrieval in film and television production
Authors:Markus Mühling  Nikolaus Korfhage  Eric Müller  Christian Otto  Matthias Springstein  Thomas Langelage  Uli Veith  Ralph Ewerth  Bernd Freisleben
Affiliation:1.Department of Mathematics and Computer Science,University of Marburg,Marburg,Germany;2.taglicht media Film- & Fernsehproduktion GmbH,K?ln,Germany;3.German National Library of Science and Technology (TIB),Hannover,Germany;4.L3S Research Center,Leibniz Universit?t Hannover,Hannover,Germany
Abstract:While digitization has changed the workflow of professional media production, the content-based labeling of image sequences and video footage, necessary for all subsequent stages of film and television production, archival or marketing is typically still performed manually and thus quite time-consuming. In this paper, we present deep learning approaches to support professional media production. In particular, novel algorithms for visual concept detection, similarity search, face detection, face recognition and face clustering are combined in a multimedia tool for effective video inspection and retrieval. The analysis algorithms for concept detection and similarity search are combined in a multi-task learning approach to share network weights, saving almost half of the computation time. Furthermore, a new visual concept lexicon tailored to fast video retrieval for media production and novel visualization components are introduced. Experimental results show the quality of the proposed approaches. For example, concept detection achieves a mean average precision of approximately 90% on the top-100 video shots, and face recognition clearly outperforms the baseline on the public Movie Trailers Face Dataset.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号