Multi-speaker recognition based on MFCC and motion intensity clustering initialization
Cite this article: CAO Jie, YU Li-zhen. Multi-speaker recognition based on MFCC and motion intensity clustering initialization[J]. Application Research of Computers, 2012, 29(9): 3295-3298.
Authors: CAO Jie, YU Li-zhen
Affiliations: 1. College of Computer & Communication, Lanzhou University of Technology, Lanzhou 730050, China
2. College of Electrical & Information Engineering, Lanzhou University of Technology, Lanzhou 730050, China
Funding: Natural Science Foundation of Gansu Province (1014ZSB064); Gansu Provincial Department of Finance project (0914ZTB148)
Abstract: Common audio-feature-based methods for initializing multi-speaker clustering have low accuracy. To address this problem, the paper proposes a new method based on the video signal. The method uses the motion intensity feature of the video signal in each time frame to select the initial speaker clusters during clustering initialization, which effectively improves the purity of the initial speaker clusters. The method is then applied to a Gaussian mixture model (GMM) multi-speaker recognition system. Experimental results show that, across the entire meeting set, the method improves substantially on the other methods: the recognition error rate is on average 19.436% lower than that of the linear initialization system and 16.618% lower than that of the improved linear initialization system.

Keywords: multi-speaker recognition; clustering initialization; motion intensity feature; motion intensity initialization
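
The abstract does not say how motion intensity is computed or how the initial clusters are chosen from it, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes that each participant has a fixed grayscale video region, that motion intensity is the mean absolute difference between consecutive frames, and that a time step is assigned to a participant only when that participant's intensity clearly dominates. The function names and the `margin` parameter are hypothetical.

```python
import numpy as np


def motion_intensity(frames):
    """Mean absolute difference between consecutive grayscale frames.

    frames: array of shape (T, H, W). Returns an array of shape (T - 1,).
    Assumption: the paper's exact motion intensity definition is not given
    in the abstract; frame differencing is a common stand-in.
    """
    frames = np.asarray(frames, dtype=np.float64)
    return np.abs(np.diff(frames, axis=0)).mean(axis=(1, 2))


def initial_clusters(region_frames, margin=1.5):
    """Label each time step with the participant whose region moves most.

    region_frames: list of at least two (T, H, W) arrays, one per participant.
    A step is labeled -1 (ambiguous) unless the largest intensity exceeds
    `margin` times the second largest. `margin` is a hypothetical parameter.
    """
    intensity = np.stack([motion_intensity(f) for f in region_frames])  # (S, T-1)
    ranked = np.sort(intensity, axis=0)
    top, second = ranked[-1], ranked[-2]
    labels = intensity.argmax(axis=0)
    labels[top <= margin * second] = -1  # no clearly dominant participant
    return labels


def group_features(mfcc, labels, n_speakers):
    """Collect MFCC vectors per initial cluster.

    mfcc: array of shape (T - 1, D), assumed to be aligned with the labels.
    """
    return {s: mfcc[labels == s] for s in range(n_speakers)}


# Synthetic example: participant 0 moves in the first half, participant 1 after.
t = np.arange(11, dtype=np.float64)[:, None, None] * np.ones((11, 4, 4))
region0 = np.minimum(t, 5.0)  # changes for frames 0..5, then static
region1 = np.maximum(t, 5.0)  # static for frames 0..5, then changes
labels = initial_clusters([region0, region1])
```

Each group returned by `group_features` could then seed one speaker model, for example as the training data for that speaker's initial GMM, in place of the uniform time slices that a linear initialization would use.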
This article is indexed in CNKI, Wanfang Data, and other databases.