Object and Action Classification with Latent Window Parameters |
| |
Authors: | Hakan Bilen, Vinay P. Namboodiri, Luc J. Van Gool |
| |
Affiliation: | 1. ESAT-PSI/iMinds, KU Leuven, Kasteelpark Arenberg 10, 3001, Heverlee, Belgium; 2. Alcatel-Lucent Bell Labs, Copernicuslaan 50, 2018, Antwerp, Belgium; 3. Computer Vision Laboratory, ETH Zürich, Sternwartstrasse 7, 8092, Zurich, Switzerland
|
| |
Abstract: | In this paper, we propose a generic framework to incorporate unobserved auxiliary information for classifying objects and actions. This framework allows us to automatically select the bounding box and its quadrants from which features are best extracted. These spatial subdivisions are learnt as latent variables. The paper is an extended version of our earlier work (Bilen et al., Proceedings of the British Machine Vision Conference, 2011), complemented with additional ideas, experiments and analysis. We approach the classification problem in a discriminative setting, as learning a max-margin classifier that infers the class label along with the latent variables. We make the following contributions: (a) we provide a method for incorporating latent variables into object and action classification; (b) these variables determine the relative focus on foreground versus background information that is taken into account; (c) we design an objective function to learn more effectively from unbalanced data sets; (d) we learn a better classifier by iterative expansion of the latent parameter space. We demonstrate the performance of our approach through experimental evaluation on a number of standard object and action recognition data sets. |
| |
Keywords: | |
This article is indexed in SpringerLink and other databases. |
|