Using redundant parallel architecture to improve speaker recognition performance期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Using redundant parallel architecture to improve speaker recognition performance

Authors:	Zhengquan QIU Junxun YIN Caiyun FAN

Affiliation:	1. School Electronic and Information Engineering,South China University of Technolog,Guangzhou Guangdong 510640,China 2. School of Mathematical Sciences,South China University of Technology,Guangzhou Guangdong 510640,China

Abstract:	In this Paper, we propose two kinds of modifications in speaker recognition.First,the correlations between frequency channels are of prime importance for speaker recognition.Some of these correlations are lost when the frequency domain is divided into sub-bands.Consequently we propose a particularly redundant parallel architecture for which most of the correlations are kept.Second,generally a log transformation used to modify the power spectrum is done after the filter-bank in the classical spectrum calculation.We will see that performing this transformation before the filter bank is more interesting in our case.In the processing of recognition,the Gaussian mixture model(GMM)recognition arithmetic is adopted.Experiments on speech corrupted by noise show a better adaptability of this approach in noisy environments,compared with a conventional device,especially when pruning of some recognizers is performed.

Keywords:	Correlations Redundant parallel architecture Log transformation GMM
本文献已被 CNKI 维普万方数据 SpringerLink 等数据库收录！
	点击此处可从《控制理论与应用(英文版)》浏览原始摘要信息
	点击此处可从《控制理论与应用(英文版)》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏