基于DWT-TEO的说话人识别 Speaker Recognition Based on DWT-TEO期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于DWT-TEO的说话人识别

引用本文：	邱政权, 尹俊勋, 薛丽萍. 基于DWT-TEO的说话人识别. 自动化学报, 2006, 32(5): 753-759.

作者姓名：	邱政权尹俊勋薛丽萍

作者单位：	1.华南理工大学电信学院, 广州 510640

摘要：	针对在噪声环境下的说话人识别系统,做了两点改进．第一,为了提高系统的鲁棒性,通过不同尺度的小波基,把含有噪声的信号分解于不同频段中,然后在各个频段分别通过TEO(Teager能量算子)去噪．针对说话人识别的特点,在小波重构时对各小波系数进行了加权处理．再把各个频段的输出通过小波重构恢复信号．最后通过Mel滤波器组把小波系数转换成MFCC．第二,为了进一步提高识别性能和训练速度,在识别阶段采用了改进的OGMM(正交高斯混合模型),即把正交变换改到EM算法之前进行,这样就不必要在EM迭代过程中每次都进行正交运算了．从实验得出,采用本文提出的DWT-TEO参数对于说话人识别的效果较好．采用改进的OGMM进一步提高了识别性能和训练速度．
关键词：	小波变换 TEO DWT-TEO OGMM
收稿时间：	2005-06-29
修稿时间：	2006-03-27
Speaker Recognition Based on DWT-TEO

QIU Zheng-Quan, YIN Jun-Xun, XUE Li-Ping. Speaker Recognition Based on DWT-TEO. ACTA AUTOMATICA SINICA, 2006, 32(5): 753-759.

Authors:	QIU Zheng-Quan YIN Jun-Xun XUE Li-Ping

Affiliation:	1. School of Electronics and Information Engineering, South China University of Technology, Guangzhou 510640

Abstract:	Two modifications for speaker recognition system in noise environment are described. First, in order to improve the robustness of the system, noisy speech is decomposed into various frequency bands and de-noising is The wavelet coefficient is weighted according carried out by TEO in every frequency band. to the characteristics of speaker recognition, and is then transformed into MFCC. Second, in order to improve recognition performance and training speed, a modified OGMM that orthogonal transform is performed before EM arithmetic is applied at the recognition stage. Thus, it is not necessary to do orthogonal operation during every EM iterative process. The experimental results show that the parameters proposed have produced good effect and that modified OGMM can further improve recognition performance and training speed.

Keywords:	TEO DWT-TEO OGMM
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《自动化学报》浏览原始摘要信息
	点击此处可从《自动化学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏