首页 | 本学科首页   官方微博 | 高级检索  
     

修正倒谱和动态规划的基频估计算法
引用本文:金学成,解岭,汪增福.修正倒谱和动态规划的基频估计算法[J].声学技术,2008,27(1):79-86.
作者姓名:金学成  解岭  汪增福
作者单位:中国科学技术大学自动化系,安徽,合肥,230027
基金项目:中国科技大学校科研和教改项目 , 国家重点实验室基金 , 中国科学技术大学/中国科学院自动化研究所智能科学与技术联合实验室开放基金
摘    要:基音频率是语音信号处理中的一个重要参数。倍频、半频错误以及清浊音判决的可靠性等问题一直是基频估计中的难点问题。在对语音信号的倒谱进行适当修正的基础上,提出了一种高精度的基频估计算法。该算法根据倒谱、短时能量和短时过零率在清音段和浊音段的不同表现,构造了一个清浊音判决函数,大大提高了清浊音判决精度;然后利用动态规划技术进行基频跟踪。在构造代价函数时.充分考虑了基频连续性的影响,从而使该算法既能有效地避免倍频和半频错误,又能体现出基频的自然加倍和减半。通过与现有的几种效果较好的方法进行对比实验,结果表明该算法具有准确率高、基频轨迹平滑的优点,利用该算法得到的基频轨迹基本不需要进行后期平滑处理。

关 键 词:基频提取  倒谱  动态规划  清浊音判决
文章编号:1000-3630(2008)-01-0079-08
收稿时间:2007-01-29
修稿时间:2007-05-05

A modified cepstrum-based algorithm for fundamental frequency estimation using dynamic programming
JIN Xue-cheng,XIE Ling and WANG Zeng-fu.A modified cepstrum-based algorithm for fundamental frequency estimation using dynamic programming[J].Technical Acoustics,2008,27(1):79-86.
Authors:JIN Xue-cheng  XIE Ling and WANG Zeng-fu
Affiliation:(Department of Automation, University of Science and Technology of China, Hefei, Anhui, 230027, china)
Abstract:Fundamental frequency (F0) is a key parameter in speech signals processing. Pitch doubling, pitch halving and the reliability of voicing decision are the most difficult problems in the estimation of fundamental frequency. An algorithm based on the modified cepstrum is proposed for the estimation of the fundamental frequency (F0) of speech signals Voicing decisions are made by using a decision function composed of cepstral peak, zero-crossing rate, and energy of short-time segments of speech signals. An accurate voiced/unvoiced classification is obtained based on this decision function. Then a dynamic programming method is used to realize pitch tracking. The consecution of F0 is considered sufficiently in the cost function. The proposed algorithm can avoid the problem concerning with pitch doubling and pitch halving effectively, as well as preserve the natural doubling and halving of F0. The comparing experiments with several other well-known methods show that the algorithm in this paper has some desirable advantages such as high accuracy and smooth F0 contour, which needs no postsmoother.
Keywords:fundamental frequency estimation  cepstrum  dynamic programming  voicing detection
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《声学技术》浏览原始摘要信息
点击此处可从《声学技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号