首页 | 本学科首页   官方微博 | 高级检索  
     

基于群稀疏约束的语音识别特征混合判别分析
引用本文:陈斌,陈琦,张连海,屈丹,李弼程.基于群稀疏约束的语音识别特征混合判别分析[J].四川大学学报(工程科学版),2015,47(5):139-145.
作者姓名:陈斌  陈琦  张连海  屈丹  李弼程
作者单位:解放军信息工程大学信息系统工程学院,解放军信息工程大学信息系统工程学院,解放军信息工程大学信息系统工程学院,解放军信息工程大学信息系统工程学院,解放军信息工程大学信息系统工程学院
基金项目:国家自然科学基金:基于分段条件随机场的连续语音识别技术(61175017);基于声学空间非线性流形结构的低资源语音识别(61403415)
摘    要:为了克服因数据不足,而造成较难提取稳定的长时特征的问题,提出了一种基于群稀疏约束的混合判别分析方法。该方法首先采用高斯混合模型描述数据的分布,在此基础上利用二次变分的形式进行群稀疏的表示,得到基于群稀疏约束的混合判别分析目标函数。接着,通过定义模糊响应矩阵(blurred response matrix),有效地结合最优化得分方法求解判别分析变换矩阵。最后,拼接相邻帧梅尔滤波器组输出组成超矢量,采用变换矩阵进行变换降维,提取时频特征。实验结果表明,本文方法能够得到稀疏的变换矩阵,相比于PLDA(Penalized LDA)和SLDA(Sparse LDA)判别分析方法,识别准确率分别提高了0.71%和1.53%,且在数据不足的条件下,本文方法能获得更高的识别性能。

关 键 词:混合判别分析  群稀疏  特征变换  语音识别
收稿时间:2014/10/30 0:00:00
修稿时间:3/2/2015 12:00:00 AM

Group-Lasso based Mixture Discriminant Analysis Method for Speech Recognition Feature
chen bin,and.Group-Lasso based Mixture Discriminant Analysis Method for Speech Recognition Feature[J].Journal of Sichuan University (Engineering Science Edition),2015,47(5):139-145.
Authors:chen bin  and
Abstract:In order to extract the stable long-term features when the data is insufficient, a group-Lasso based mixture discriminant analysis method is proposed. Firstly, the method uses the Gaussian mixture model to describe the distribution of data, and the objective function of group-Lasso based mixture discriminant analysis is got based on the quadratic variational form of the group-Lasso. Subsequently, through defining the fuzzy response matrix, the problem of solving the discriminant analysis transform matrix is figured out by effectively combined with the optimal scoring method. Finally, the super-vector is obtained by conjoined the adjacent frames Mel filter bank output, and the time-frequency feature is extracted after the dimensionality of super-vector reduced using the transform matrix. The experimental results show that, this method can obtain sparse transformation matrix. Compared to the PLDA and SLDA discriminant analysis method, the recognition accuracy rate is increased by 0.71% and 1.53% respectively, and when lack of data, this method can achieve higher recognition performance.
Keywords:mixture discriminant analysis  group-Lasso  feature transformation  speech recognition
本文献已被 万方数据 等数据库收录!
点击此处可从《四川大学学报(工程科学版)》浏览原始摘要信息
点击此处可从《四川大学学报(工程科学版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号