首页 | 本学科首页   官方微博 | 高级检索  
     


Using redundant parallel architecture to improve speaker recognition performance
Authors:Zhengquan QIU  Junxun YIN  Caiyun FAN
Affiliation:1. School Electronic and Information Engineering,South China University of Technolog,Guangzhou Guangdong 510640,China
2. School of Mathematical Sciences,South China University of Technology,Guangzhou Guangdong 510640,China
Abstract:In this Paper, we propose two kinds of modifications in speaker recognition.First,the correlations between frequency channels are of prime importance for speaker recognition.Some of these correlations are lost when the frequency domain is divided into sub-bands.Consequently we propose a particularly redundant parallel architecture for which most of the correlations are kept.Second,generally a log transformation used to modify the power spectrum is done after the filter-bank in the classical spectrum calculation.We will see that performing this transformation before the filter bank is more interesting in our case.In the processing of recognition,the Gaussian mixture model(GMM)recognition arithmetic is adopted.Experiments on speech corrupted by noise show a better adaptability of this approach in noisy environments,compared with a conventional device,especially when pruning of some recognizers is performed.
Keywords:Correlations  Redundant parallel architecture  Log transformation  GMM
本文献已被 CNKI 维普 万方数据 SpringerLink 等数据库收录!
点击此处可从《控制理论与应用(英文版)》浏览原始摘要信息
点击此处可从《控制理论与应用(英文版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号