首页 | 本学科首页   官方微博 | 高级检索  
     


Nearest-Neighbor Automatic Sound Annotation with a WordNet Taxonomy
Authors:Pedro Cano  Markus Koppenberger  Sylvain Le Groux  Julien Ricard  Nicolas Wack  Perfecto Herrera
Affiliation:(1) Music Technology Group, Institut Universitari de l"rsquo"Audiovisual, Universitat Pompeu Fabra, 08003 Barcelona, Spain
Abstract:Sound engineers need to access vast collections of sound effects for their film and video productions. Sound effects providers rely on text-retrieval techniques to give access to their collections. Currently, audio content is annotated manually, which is an arduous task. Automatic annotation methods, normally fine-tuned to reduced domains such as musical instruments or limited sound effects taxonomies, are not mature enough for labeling with great detail any possible sound. A general sound recognition tool would require first, a taxonomy that represents the world and, second, thousands of classifiers, each specialized in distinguishing little details. We report experimental results on a general sound annotator. To tackle the taxonomy definition problem we use WordNet, a semantic network that organizes real world knowledge. In order to overcome the need of a huge number of classifiers to distinguish many different sound classes, we use a nearest-neighbor classifier with a database of isolated sounds unambiguously linked to WordNet concepts. A 30% concept prediction is achieved on a database of over 50,000 sounds and over 1600 concepts.Part of the contents of this paper has been published in the Proceedings of the 2004 IEEE International Workshop on Machine Learning for Signal Processing.
Keywords:audio identification  WordNet  nearest-neighbor  everyday sound  knowledge management
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号