首页 | 本学科首页   官方微博 | 高级检索  
     

基于多任务疏表达的二元麦克风小阵列语音增强算法
引用本文:杨立春,叶敏超,钱沄涛.基于多任务疏表达的二元麦克风小阵列语音增强算法[J].通信学报,2014,35(2):12-94.
作者姓名:杨立春  叶敏超  钱沄涛
作者单位:1. 浙江大学 计算机科学与技术学院,浙江 杭州 310027;2. 浙江万里学院 智能控制技术研究所,浙江 宁波 315101
基金项目:国家自然科学基金资助项目(61171151);国家重点基础研究发展计划(“973”计划)基金资助项目(2012CB316400);国家科技支撑计划基金资助项目(2011BAD24B03)
摘    要:针对常规二元麦克风小阵列话音增强算法通常需要话音活动检测技术支持,并且难以有效抑制第一帧含目标信号的噪声。提出了一种基于多任务稀疏表达的二元麦克风小阵列话音增强算法,首先利用字典学习方法分别获得目标信号和噪声信号的过完备字典,然后利用 混合范数对信号在其字典上的表示系数进行正则化稀疏约束,使得2个阵元接收到信号中的噪声信号被抑制,而话音信号尽量保持不变,从而达到话音增强的目标。仿真和实验数据表明,无论开始位置是否含有目标话音信号,所提出的非话音活动检测支持的二元麦克风小阵列话音增强算法均能有效实现话音增强的目标。

关 键 词:麦克风小阵列  话音增强  字典学习  多任务稀疏表达

Speech enhancement based on multi-task sparse representation for dual small microphone arrays
Li-chun YANG,Min-chao YE,Yun-tao QIAN.Speech enhancement based on multi-task sparse representation for dual small microphone arrays[J].Journal on Communications,2014,35(2):12-94.
Authors:Li-chun YANG  Min-chao YE  Yun-tao QIAN
Affiliation:1. College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China;2. Intelligent Control Research Institute, Zhejiang Wanli University, Ningbo 315101, China
Abstract:Speech enhancement algorithms for dual small microphone arrays usually rely on the voice activity detection(VAD), and they may fail in some cases when target speech signal is included in the first frame. A multi-task sparse representation based speech enhancement algorithm was proposed. First, dictionaries for signal and noise were respectively formed via dictionary learning. Then the noise in signals obtain from two microphones was reduced by regularized sparse representation on the over-complete dictionary, while the target speech signals were mostly preserved, hence the speech signals were enhanced. Experimental results from synthetic and real-world data show that the proposed speech enhancement algorithm without VAD works well in all cases no matter speech signal is included in the first frame or not.
Keywords:small microphone arrays  speech enhancement  dictionary learning  multi-task sparse representation
本文献已被 CNKI 等数据库收录!
点击此处可从《通信学报》浏览原始摘要信息
点击此处可从《通信学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号