首页 | 本学科首页   官方微博 | 高级检索  
     

基于支持向量机的音频分类与分割
引用本文:白亮 老松杨 陈剑赟 吴玲达. 基于支持向量机的音频分类与分割[J]. 计算机科学, 2005, 32(4): 87-90
作者姓名:白亮 老松杨 陈剑赟 吴玲达
作者单位:国防科技大学多媒体研发中心,长沙,410073;国防科技大学多媒体研发中心,长沙,410073;国防科技大学多媒体研发中心,长沙,410073;国防科技大学多媒体研发中心,长沙,410073
摘    要:音频分类与分割是提取音频结构和内容语义的重要手段,是基于内容的音频、视频检索和分析的基础。支持向量机(SVM)是一种有效的统计学习方法。本文提出了一种基于SVM的音频分类算法。将音频分为5类:静音、噪音、音乐、纯语音和带背景音的语音。在分类的基础上,采用3个平滑规则对分类结果进行平滑。分析了SVM分类嚣的分类性能,同时也评估了本文提出的新的音频特征在SVM分类嚣上的分类效果。实验结果显示,基于SVM的音频分类算法分类效果良好,平滑处理后的音频分割结果比较准确。

关 键 词:音频分类与分割  支持向量机

Audio Classification and Segmentation Based on Support Vector Machines
BAI Liang,LAO Song-Yang,CHEN Jian-Yun,WU Ling-Da. Audio Classification and Segmentation Based on Support Vector Machines[J]. Computer Science, 2005, 32(4): 87-90
Authors:BAI Liang  LAO Song-Yang  CHEN Jian-Yun  WU Ling-Da
Affiliation:BAI Liang,LAO Song-Yang,CHEN Jian-Yun,WU Ling-Da Multimedia Research and Development Center,National Univ. of Defense Technology,Changsha 410073
Abstract:Audio classification and segmentation are an important access to extract audio structure and content, and are a basis for further audio/video retrieval and analysis. Support vector machines (SVM) is a valid statistic learning method. In this paper, the work on audio classification based on SVM is presented. Five audio classes are considered in this paper: silence, noise, music, pure speech and speech over background sound. Three smooth rules are present- ed and applied in the final segmentation. The performance of SVM on audio classification is evaluated. The effective- ness of some new proposed features is also evaluated. Experiment results show that SVM performs very well for au- dio classification and segmentation accuracy is good with the proposed three smooth rules.
Keywords:Audio classification and segmentation  Support vector machine
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号