首页 | 本学科首页   官方微博 | 高级检索  
     

基于聚类分析与说话人识别的语音跟踪
引用本文:郝敏,刘航,李扬,简单,王俊影.基于聚类分析与说话人识别的语音跟踪[J].计算机与现代化,2020,0(4):7-13,18.
作者姓名:郝敏  刘航  李扬  简单  王俊影
作者单位:广东工业大学机电工程学院,广东广州510006;广东工业大学机电工程学院,广东广州510006;广东工业大学机电工程学院,广东广州510006;广东工业大学机电工程学院,广东广州510006;广东工业大学机电工程学院,广东广州510006
基金项目:广东省省级科技计划;佛山市产学研专项
摘    要:目前语音跟踪在说话人干扰的条件下,即一段语音中存在多个说话人的混合语音信号时,语音跟踪质量会严重下降。针对这种情况,提出一种基于聚类分析与说话人识别的语音跟踪算法。算法首先使用改进的聚类分析方法进行语音分离,具体包括在K-means聚类中对质心进行缓存并降低采样率,以及在embedding特征空间引入正则项。其次,算法采用GMM-UBM说话人模型进行语音跟踪。实验结果表明改进的聚类分析方法可以有效提高算法的实时性及其语音分离质量,GMM-UBM模型在3 s语音的测试中具有84%的识别率。

关 键 词:单信道语音跟踪  智能语音  聚类分析  高斯混合模型  长短期记忆网络  
收稿时间:2020-04-24

Speech Tracking Based on Cluster Analysis and Speaker Recognition
HAO Min,LIU Hang,LI Yang,JIAN Dan,WANG Jun-ying.Speech Tracking Based on Cluster Analysis and Speaker Recognition[J].Computer and Modernization,2020,0(4):7-13,18.
Authors:HAO Min  LIU Hang  LI Yang  JIAN Dan  WANG Jun-ying
Abstract:At present, the speech tracking quality will be seriously reduced under the condition of speaker interference, that is, mixed speech signals of multiple speakers in a speech segment. Aiming at this situation, a speech tracking algorithm based on cluster analysis and speaker recognition is proposed. Firstly, the improved clustering analysis method is used for speech separation. Specifically, it includes caching the center of mass and lowering the sampling rate in K-means clustering, and introducing regular terms into embedding feature space. Secondly, the GMM-UBM speaker model is used for speech tracking. The experimental results show that the improved cluster analysis method can effectively improve the real-time performance of the algorithm and the quality of speech separation, the GMM-UBM model has an 84% recognition rate in 3 s speech test.
Keywords:single channel speech track  intelligent speech  clustering analysis  Gaussian mixture model  LSTM  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机与现代化》浏览原始摘要信息
点击此处可从《计算机与现代化》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号