
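A Unified Speech and Audio Coding Method Using Empirical Mode Decomposition
Cite this article: Li Xiaoming, Bao Changchun. A unified speech and audio coding method using empirical mode decomposition [J]. 信号处理 (Signal Processing), 2013, 29(10): 1274-1282.
Authors: Li Xiaoming, Bao Changchun
Affiliation: Speech and Audio Signal Processing Lab, School of Electronic Information and Control Engineering, Beijing University of Technology
Funding: National Natural Science Foundation of China (61072089, 61201197); Key Project of the Science and Technology Development Program of the Beijing Municipal Education Commission (KZ201110005005)
Abstract: Existing single-model coders cannot deliver high-quality unified coding of both speech and audio signals at low and medium bitrates. To address this, the paper exploits the harmonic characteristics of speech and audio signals to build a single coding method for both. First, Empirical Mode Decomposition (EMD) is used to extract the harmonic components of the input signal. Second, the parameters of the harmonic components are extracted and quantized with a perceptual matching pursuit algorithm combined with sinusoidal parametric modeling. Third, the residual remaining after harmonic quantization is coded with dithered lattice vector quantization to improve the subjective listening quality of the reconstructed audio, yielding a wideband unified speech and audio coder that operates at 24 kbps and 32 kbps. Finally, the proposed algorithm is evaluated with objective PESQ/PEAQ measures and subjective A/B tests and compared with the ITU-T G.722.1 and G.722.2 coders. The results show that the proposed coder achieves better coding quality than the reference coders for both speech and audio signals.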
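The harmonic parameter extraction step can be illustrated with a small sketch. It is not the authors' implementation: it uses plain (unweighted) matching pursuit over a cosine/sine dictionary, whereas the paper uses a perceptual variant, and the function name `sinusoid_matching_pursuit`, the candidate frequency grid, and the frame length are all assumptions made for illustration.

```python
import math

def sinusoid_matching_pursuit(frame, sample_rate, candidate_freqs, num_components):
    """Greedy matching pursuit over a cosine/sine dictionary.

    Assumption: no perceptual weighting is applied; atoms are ranked by the
    energy they remove from the residual.
    Returns (components, residual), where each component is
    (frequency_hz, amplitude, phase_rad) for A * cos(w * t + phase).
    """
    n = len(frame)
    residual = list(frame)
    components = []
    for _ in range(num_components):
        best = None
        for freq in candidate_freqs:
            w = 2.0 * math.pi * freq / sample_rate
            cos_atom = [math.cos(w * t) for t in range(n)]
            sin_atom = [math.sin(w * t) for t in range(n)]
            # Normal equations for the least-squares fit a*cos + b*sin.
            cc = sum(c * c for c in cos_atom)
            ss = sum(s * s for s in sin_atom)
            cs = sum(c * s for c, s in zip(cos_atom, sin_atom))
            rc = sum(r * c for r, c in zip(residual, cos_atom))
            rs = sum(r * s for r, s in zip(residual, sin_atom))
            det = cc * ss - cs * cs
            if det < 1e-9:
                continue  # degenerate atom pair, e.g. 0 Hz or Nyquist
            a = (rc * ss - rs * cs) / det
            b = (rs * cc - rc * cs) / det
            captured = a * rc + b * rs  # energy removed by this atom pair
            if best is None or captured > best[0]:
                best = (captured, freq, a, b, cos_atom, sin_atom)
        if best is None:
            break
        _, freq, a, b, cos_atom, sin_atom = best
        residual = [r - a * c - b * s
                    for r, c, s in zip(residual, cos_atom, sin_atom)]
        # a*cos(wt) + b*sin(wt) == A*cos(wt + phase)
        components.append((freq, math.hypot(a, b), math.atan2(-b, a)))
    return components, residual
```

Each iteration picks the sinusoid that removes the most energy from the current residual and subtracts it, so the strongest partials are found first. The returned residual plays the role of the signal that a later stage would quantize.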

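Keywords: speech coding; audio coding; empirical mode decomposition; perceptual matching pursuit; dithered lattice vector quantization
Received: 2013-07-09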

A Unified Speech and Audio Coding Method with Empirical Mode Decomposition
Affiliation: Speech and Audio Signal Processing Lab, School of Electronic Information and Control Engineering, Beijing University of Technology
Abstract: This paper proposes a unified speech and audio coding method based on Empirical Mode Decomposition (EMD) that exploits the harmonic structure of the input signal. The coder achieves high quality for both speech and audio signals at low and medium bitrates, which codecs built on a single analysis model cannot do. Before quantization, EMD is used to extract the harmonic components of the input signal; the extracted harmonic signal is then modeled and quantized with a sinusoidal model and perceptually weighted matching pursuit. The residual left after quantizing the harmonic signal is coded with dithered lattice vector quantization to improve subjective quality. Both objective PESQ/PEAQ results and subjective A/B listening tests show that the proposed coder outperforms the ITU-T G.722.1 and G.722.2 codecs.
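The idea behind dithered quantization of the residual can be sketched as follows. This is a simplified illustration, not the coder described above: it assumes the scaled integer lattice Z^n (the abstract does not say which lattice is used), subtractive dither, and a shared random seed so that encoder and decoder generate the same dither. The names `dlvq_encode`, `dlvq_decode`, `step`, and `seed` are hypothetical.

```python
import random

def _dither(length, step, seed):
    """Uniform dither over one lattice cell, reproducible from a shared seed."""
    rng = random.Random(seed)
    return [rng.uniform(-step / 2.0, step / 2.0) for _ in range(length)]

def dlvq_encode(vector, step, seed):
    """Add dither, then map to the nearest point of the lattice step * Z^n.

    Returns the integer lattice indices that would be transmitted.
    """
    dither = _dither(len(vector), step, seed)
    return [int(round((x + d) / step)) for x, d in zip(vector, dither)]

def dlvq_decode(indices, step, seed):
    """Rebuild the lattice point and subtract the same dither (subtractive dither)."""
    dither = _dither(len(indices), step, seed)
    return [i * step - d for i, d in zip(indices, dither)]
```

Because the decoder subtracts the dither that the encoder added, the reconstruction error in each dimension stays within half a step, while the dither decorrelates that error from the input. A denser lattice would replace the per-coordinate rounding with a nearest-lattice-point search.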