首页 | 本学科首页   官方微博 | 高级检索  
     

基于听皮层神经元感受野的强噪声环境下说话人识别
引用本文:牛晓可,黄伊鑫,徐华兴,蒋震阳.基于听皮层神经元感受野的强噪声环境下说话人识别[J].计算机应用,2020,40(10):3034-3040.
作者姓名:牛晓可  黄伊鑫  徐华兴  蒋震阳
作者单位:1. 郑州大学 电气工程学院, 郑州 450001;2. 河南省脑科学与脑机接口技术重点实验室(郑州大学), 郑州 450001
摘    要:针对说话人识别易受环境噪声影响的问题,借鉴生物听皮层神经元频谱-时间感受野(STRF)的时空滤波机制,提出一种新的声纹特征提取方法。在该方法中,对基于STRF获得的听觉尺度-速率图进行了二次特征提取,并与传统梅尔倒谱系数(MFCC)进行组合,获得了对环境噪声具有强容忍的声纹特征。采用支持向量机(SVM)作为分类器,对不同信噪比(SNR)语音数据进行测试的结果表明,基于STRF的特征对噪声的鲁棒性普遍高于MFCC系数,但识别正确率较低;组合特征提升了语音识别的正确率,同时对环境噪声具有良好的鲁棒性。该结果说明所提方法在强噪声环境下说话人识别上是有效的。

关 键 词:听皮层  频谱-时间感受野  梅尔倒谱系数  含噪说话人识别  支持向量机  
收稿时间:2020-04-12
修稿时间:2020-06-01

Speaker recognition in strong noise environment based on auditory cortical neuronal receptive field
NIU Xiaoke,HUANG Yixin,XU Huaxing,JIANG Zhenyang.Speaker recognition in strong noise environment based on auditory cortical neuronal receptive field[J].journal of Computer Applications,2020,40(10):3034-3040.
Authors:NIU Xiaoke  HUANG Yixin  XU Huaxing  JIANG Zhenyang
Affiliation:1. School of Electrical Engineering, Zhengzhou University, Zhengzhou Henan 450001, China;2. Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology(Zhengzhou University), Zhengzhou Henan 450001, China
Abstract:Aiming at the problem that speaker recognition is susceptible to environmental noise, a new voiceprint extraction method was proposed based on the spatial-temporal filtering mechanism of Spectra-Temporal Receptive Field (STRF) of biological auditory cortex neurons. In the method, the quadratic characteristics were extracted from the auditory scale-rate map based on STRF, and the traditional Mel-Frequency Cepstral Coefficient (MFCC) was combined to obtain the voiceprint features with strong tolerance to environmental noise. Using Support Vector Machine (SVM) as feature classifier, the testing results on speech data with different Signal-to-Noise Ratios (SNR) showed that the STRF-based features were more robust to noise than MFCC coefficient, but had lower recognition accuracy; the combined features improved the accuracy of speech recognition and had good robustness to noise. The results verify the effectiveness of the proposed method in speaker recognition under strong noise environment.
Keywords:auditory cortex  Spectral-Temporal Receptive Field (STRF)  Mel-Frequency Ceptral Coefficient (MFCC)  noisy speaker recognition  Support Vector Machine (SVM)  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号