Spatio-Temporal Tube data representation and Kernel design for SVM-based video object retrieval system期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Spatio-Temporal Tube data representation and Kernel design for SVM-based video object retrieval system

Authors:	Shuji Zhao Frédéric Precioso Matthieu Cord

Affiliation:	1.ETIS Lab, CNRS/ENSEA/Univ Cergy-Pontoise,Cergy-Pontoise,France;2.UPMC-Sorbonne Universités – LIP6,Paris,France

Abstract:	In this article, we propose a new video object retrieval system. Our approach is based on a Spatio-Temporal data representation, a dedicated kernel design and a statistical learning toolbox for video object recognition and retrieval. Using state-of-the-art video object detection algorithms (for faces or cars, for example) we segment video object tracks from real movies video shots. We then extract, from these tracks, sets of spatio-temporally coherent features that we call Spatio-Temporal Tubes. To compare these complex tube objects, we design a Spatio-Temporal Tube Kernel (STTK) function. Based on this kernel similarity we present both supervised and active learning strategies embedded in Support Vector Machine framework. Additionally, we propose a multi-class classification framework dealing with unbalanced data. Our approach is successfully evaluated on two real movies databases, the french movie “L’esquive” and episodes from “Buffy, the Vampire Slayer” TV series. Our method is also tested on a car database (from real movies) and shows promising results for car identification task.

Keywords:
本文献已被 SpringerLink 等数据库收录！