Nearest-Neighbor Automatic Sound Annotation with a WordNet Taxonomy |
| |
Authors: | Pedro Cano Markus Koppenberger Sylvain Le Groux Julien Ricard Nicolas Wack Perfecto Herrera |
| |
Affiliation: | (1) Music Technology Group, Institut Universitari de lAudiovisual, Universitat Pompeu Fabra, 08003 Barcelona, Spain |
| |
Abstract: | Sound engineers need to access vast collections of sound effects for their film and video productions. Sound effects providers rely on text-retrieval techniques to give access to their collections. Currently, audio content is annotated manually, which is an arduous task. Automatic annotation methods, normally fine-tuned to reduced domains such as musical instruments or limited sound effects taxonomies, are not mature enough for labeling with great detail any possible sound. A general sound recognition tool would require first, a taxonomy that represents the world and, second, thousands of classifiers, each specialized in distinguishing little details. We report experimental results on a general sound annotator. To tackle the taxonomy definition problem we use WordNet, a semantic network that organizes real world knowledge. In order to overcome the need of a huge number of classifiers to distinguish many different sound classes, we use a nearest-neighbor classifier with a database of isolated sounds unambiguously linked to WordNet concepts. A 30% concept prediction is achieved on a database of over 50,000 sounds and over 1600 concepts.Part of the contents of this paper has been published in the Proceedings of the 2004 IEEE International Workshop on Machine Learning for Signal Processing. |
| |
Keywords: | audio identification WordNet nearest-neighbor everyday sound knowledge management |
本文献已被 SpringerLink 等数据库收录! |
|