首页 | 本学科首页   官方微博 | 高级检索  
     

复杂环境下基于时延估计的声源定位技术研究
引用本文:张大威,鲍长春,夏丙寅. 复杂环境下基于时延估计的声源定位技术研究[J]. 通信学报, 2014, 35(1): 183-190. DOI: 10.3969/j.issn.1000-436x.2014.01.021
作者姓名:张大威  鲍长春  夏丙寅
作者单位:北京工业大学 电子信息与控制工程学院 话音与音频信号处理研究室,北京 100124
基金项目:北京市教育委员会科技发展计划重点基金资助项目(KZ201110005005);国家自然科学基金资助项目(61072089)
摘    要:
为了改善在复杂环境下声源定位算法的性能,提出了一种新的时延估计(TDE)方法,即基于传递函数比的统计模型方法(ATFR-SM)。该方法采用统计模型去除噪声对传递函数(ATF)的影响,在计算传递函数时对功率谱密度(PSD)进行平滑和“白化”,以去除混响对传递函数的影响。同时,算法中引入话音激活检测(VAD)去除对求取传递函数无用的噪声段,以提高时延估计的准确性。此外,将所提时延估计方法与线性定位法相结合,构成一套完整的声源定位方法。实验结果表明,在复杂环境下,时延估计方法具有更低的异常点百分比(PAP)和均方根误差(RMSE),且明显优于传统的参考算法,同时声源定位方法具有更高的定位精度。

关 键 词:时延估计;传递函数比;VAD;统计模型;声源定位

Source localization based on time delayestimation in complex environment
Da-wei ZHANG,Chang-chun BAO,Bing-yin XIA. Source localization based on time delayestimation in complex environment[J]. Journal on Communications, 2014, 35(1): 183-190. DOI: 10.3969/j.issn.1000-436x.2014.01.021
Authors:Da-wei ZHANG  Chang-chun BAO  Bing-yin XIA
Affiliation:Speech and Audio Signal Processing Lab,School of Electronic Information and Control Engineering,Beijing University of Technology,Beijing 100124,China
Abstract:
In order to improve the performance of source localization in noisy and reverberant environments, a novel time delay estimation (TDE) method was proposed. This method is called acoustical transfer function ratio based on statistical model (ATFR-SM). In the proposed algorithm, the noise reduction method based on the statistical model was adopted to reduce the effect of noise on acoustical transfer Function (ATF). In the ATF method, the power spectral density (PSD) was smoothed and whitened to reduce the effect of reverberations. voice activity detection (VAD) was used to distinguish the speech period from the noise period, and the TDE was performed in the speech period to improve the estimation accuracy. Moreover, the proposed TDE method and the linear closed-form method for source localization were combined to constitute a source localization system. The results of performance evaluation show that, in both the noisy and reverberant conditions, the lower percentage of abnormal points (PAP) and lower root mean square error (RMSE) can be achieved by the proposed TDE method than those of the reference methods. Meanwhile, the source localization has higher accuracy than the reference methods.
Keywords:TDE   ATF ratio   VAD   statistical model   source localization
本文献已被 CNKI 等数据库收录!
点击此处可从《通信学报》浏览原始摘要信息
点击此处可从《通信学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号