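An Improved Viterbi-Based Speech Segmentation Algorithm
Cite this article: LI Huan-huan, WANG Jin-ming, YIN Hai-ming, XU Zhi-jun, KONG Lei, ZHANG Kai-li. An Improved Viterbi-Based Speech Segmentation Algorithm [J]. Communications Technology (通信技术), 2015, 48(9): 1027-1031.
Authors: LI Huan-huan, WANG Jin-ming, YIN Hai-ming, XU Zhi-jun, KONG Lei, ZHANG Kai-li
Affiliations: 1. College of Communication Engineering, PLA University of Science and Technology, Nanjing, Jiangsu 210007; 2. Department of Information Service, Xi'an Communication Institute, Xi'an, Shaanxi 710000
Abstract (translated from the Chinese): To meet the high segmentation accuracy required by text-prompted speaker recognition, this paper builds on Viterbi-based speech segmentation and proposes a method that refines the boundaries by a backward smoothed search for the multi-frame energy minimum. The algorithm first builds a model for each digit from 0 to 9, then applies the Viterbi algorithm to a random digit string to obtain the initial segmentation points, and finally updates those points by searching for the multi-frame energy minimum. Experiments show that, for errors under 20 ms, the improved algorithm raises segmentation accuracy from 82.1% to 88% compared with the traditional segmentation algorithm.

Keywords: speech segmentation; Viterbi; multi-frame energy minimum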

An Improved Speech Segmentation Algorithm based on Viterbi
LI Huan-huan, WANG Jin-ming, YIN Hai-ming, XU Zhi-jun, KONG Lei, ZHANG Kai-li. An Improved Speech Segmentation Algorithm based on Viterbi [J]. Communications Technology, 2015, 48(9): 1027-1031.
Authors: LI Huan-huan, WANG Jin-ming, YIN Hai-ming, XU Zhi-jun, KONG Lei, ZHANG Kai-li
Affiliation: 1. College of Communication Engineering, PLA University of Science & Technology, Nanjing, Jiangsu 210007, China; 2. Department of Information Service, Xi'an Communication Institute, Xi'an, Shaanxi 710000, China
Abstract: An improved speech segmentation algorithm is proposed to meet the high segmentation accuracy required in text-prompted speaker recognition. Building on Viterbi-based segmentation, the method refines the segment boundaries by a backward smoothed search for the multi-frame energy minimum. First, a model is trained for each digit from 0 to 9; then the Viterbi algorithm is used to segment a random digit string, which yields the initial segmentation points; finally, these points are updated by searching for the multi-frame energy minimum. Experimental results show that, within an error tolerance of 20 ms, the proposed algorithm raises segmentation accuracy from 82.1% to 88% compared with the traditional algorithm.
Keywords:speech segmentation  Viterbi  minimum frame energy  
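
The refinement step described in the abstract can be illustrated with a short Python sketch. This is not the authors' implementation: the abstract gives neither the window sizes nor the exact search direction, so the function names and the parameters `look_back`, `look_ahead`, and `width` are all hypothetical. The sketch assumes the initial boundaries (as frame indices) have already been produced by a Viterbi alignment, and it reads "multi-frame energy" as the mean short-time energy over a few neighbouring frames. The last function shows one plausible reading of "accuracy within 20 ms": the fraction of boundaries whose error is under the tolerance.

```python
from typing import List, Sequence


def frame_energies(samples: Sequence[float], frame_len: int, hop: int) -> List[float]:
    """Short-time energy (sum of squared samples) of each full frame."""
    energies = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        energies.append(sum(x * x for x in frame))
    return energies


def multi_frame_energy(energies: Sequence[float], center: int, width: int) -> float:
    """Mean energy of up to `width` frames centred on `center` (clipped at the edges)."""
    half = width // 2
    lo = max(0, center - half)
    hi = min(len(energies), center + half + 1)
    return sum(energies[lo:hi]) / (hi - lo)


def refine_boundary(energies: Sequence[float], init_frame: int,
                    look_back: int = 5, look_ahead: int = 5, width: int = 3) -> int:
    """Move an initial boundary to the nearby frame with the lowest smoothed energy.

    Assumption: the search window and smoothing width are free parameters;
    the abstract does not state the values used in the paper.
    """
    lo = max(0, init_frame - look_back)
    hi = min(len(energies) - 1, init_frame + look_ahead)
    candidates = range(lo, hi + 1)
    # On ties, min() keeps the earliest candidate frame.
    return min(candidates, key=lambda i: multi_frame_energy(energies, i, width))


def boundary_accuracy(predicted_ms: Sequence[float], reference_ms: Sequence[float],
                      tol_ms: float = 20.0) -> float:
    """Fraction of boundaries whose absolute error is below `tol_ms`."""
    hits = sum(1 for p, r in zip(predicted_ms, reference_ms) if abs(p - r) < tol_ms)
    return hits / len(reference_ms)
```

In use, each Viterbi boundary would be passed through `refine_boundary` and the refined positions, converted from frame indices to milliseconds, compared against hand-labelled references with `boundary_accuracy`. Averaging over several frames rather than taking the single lowest-energy frame makes the search less sensitive to an isolated quiet frame inside a digit.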