
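Sound recognition based on optimized orthogonal matching pursuit and deep belief network
Cite this article:CHEN Qiuju, LI Ying. Sound recognition based on optimized orthogonal matching pursuit and deep belief network[J]. Journal of Computer Applications (计算机应用), 2017, 37(2): 505-511. DOI: 10.11772/j.issn.1001-9081.2017.02.0505
Authors:CHEN Qiuju (陈秋菊)  LI Ying (李应)
Affiliation:College of Mathematics and Computer Science, Fuzhou University, Fuzhou 350116
Funding:National Natural Science Foundation of China (61075022).
Abstract:To address the effect of varied environmental sounds on sound event recognition, a sound event recognition method based on optimized orthogonal matching pursuit (OOMP) and a deep belief network (DBN) is proposed. First, the particle swarm optimization (PSO) algorithm is used to optimize the OMP sparse decomposition, so that the orthogonal matching pursuit (OMP) sparse decomposition runs fast while the main body of the sound signal is retained and the influence of noise on the signal is suppressed. Next, Mel-frequency cepstral coefficients (MFCC), OMP time-frequency features and the fundamental frequency (pitch) feature are extracted from the reconstructed sound signal and combined into the OOMP composite feature. Finally, a DBN performs feature learning on the extracted OOMP features, and 40 kinds of sound events are recognized in different environments at different signal-to-noise ratios. The experimental results show that combining OOMP features with a DBN is suitable for sound event recognition against various environmental sounds and recognizes sound events effectively in various environments; even at a signal-to-noise ratio (SNR) of 0 dB, it still maintains an average recognition rate of 60%.

Keywords:sound event recognition  orthogonal matching pursuit  sparse decomposition  particle swarm optimization  deep belief network
Received:2016-06-12
Revised:2016-08-04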

Sound recognition based on optimized orthogonal matching pursuit and deep belief network
CHEN Qiuju,LI Ying. Sound recognition based on optimized orthogonal matching pursuit and deep belief network[J]. Journal of Computer Applications, 2017, 37(2): 505-511. DOI: 10.11772/j.issn.1001-9081.2017.02.0505
Authors:CHEN Qiuju  LI Ying
Affiliation:College of Mathematics and Computer Science, Fuzhou University, Fuzhou, Fujian 350116, China
Abstract:To address the influence of environmental background sounds on sound event recognition, a sound event recognition method based on Optimized Orthogonal Matching Pursuit (OOMP) and Deep Belief Network (DBN) was proposed. Firstly, the Particle Swarm Optimization (PSO) algorithm was used to optimize the Orthogonal Matching Pursuit (OMP) sparse decomposition of the sound signal, which achieved fast OMP sparse decomposition while preserving the main body of the sound signal and reducing the influence of noise. Then, a composite feature, called the OOMP feature, was formed from the Mel-Frequency Cepstral Coefficient (MFCC), time-frequency OMP feature and Pitch feature extracted from the reconstructed sound signal. Finally, the DBN was employed to learn the OOMP feature and to recognize 40 classes of sound events in different environments and at different Signal-to-Noise Ratios (SNR). The experimental results show that the proposed method, which combines the OOMP feature with DBN, is suitable for sound event recognition in various environments and can effectively recognize sound events in them; it still maintains an average accuracy rate of 60% even when the SNR is 0 dB.
Keywords:sound event recognition  Orthogonal Matching Pursuit (OMP)  sparse decomposition  Particle Swarm Optimization (PSO)  Deep Belief Network (DBN)  
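
The sparse decomposition step named in the abstract can be illustrated with a minimal sketch of plain OMP. This is not the authors' OOMP: the atom search here is exhaustive, whereas the paper uses PSO to speed it up, and the cosine dictionary, the test signal, and the helper names (`dot`, `omp`, `cosine_atom`) are all assumptions made for illustration. Atoms are assumed to have unit norm.

```python
import math
import random

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def omp(signal, atoms, n_atoms, tol=1e-12):
    """Plain OMP; returns (chosen atom indices, reconstruction, residual)."""
    residual = list(signal)
    basis = []    # orthonormalised copies of the chosen atoms
    chosen = []
    for _ in range(n_atoms):
        candidates = [k for k in range(len(atoms)) if k not in chosen]
        if not candidates:
            break
        # Exhaustive search for the atom most correlated with the residual
        # (assumption: this stands in for the PSO search of the paper).
        best = max(candidates, key=lambda k: abs(dot(residual, atoms[k])))
        # Gram-Schmidt: drop the part of the new atom already spanned.
        q = list(atoms[best])
        for b in basis:
            c = dot(q, b)
            q = [qi - c * bi for qi, bi in zip(q, b)]
        norm = math.sqrt(dot(q, q))
        if norm < tol:
            break
        q = [qi / norm for qi in q]
        basis.append(q)
        chosen.append(best)
        # Orthogonal projection step: the residual becomes orthogonal
        # to every atom chosen so far.
        c = dot(residual, q)
        residual = [ri - c * qi for ri, qi in zip(residual, q)]
    reconstruction = [s - r for s, r in zip(signal, residual)]
    return chosen, reconstruction, residual

def cosine_atom(k, n):
    """Unit-norm DCT-II atom (hypothetical dictionary, for illustration)."""
    scale = math.sqrt((1.0 if k == 0 else 2.0) / n)
    return [scale * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n)]

N = 32
atoms = [cosine_atom(k, N) for k in range(N)]

# Synthetic signal: two planted atoms plus weak Gaussian noise.
rng = random.Random(0)
signal = [3.0 * a + 1.5 * b + rng.gauss(0.0, 0.05)
          for a, b in zip(atoms[2], atoms[5])]

chosen, reconstruction, residual = omp(signal, atoms, n_atoms=2)
print(chosen)
```

With only two atoms kept, the reconstruction retains the planted components and most of the noise stays in the residual, which is the denoising effect the abstract attributes to the sparse decomposition.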