首页 | 本学科首页   官方微博 | 高级检索  
     

噪声谱估计算法对语音可懂度的影响
引用本文:张建伟,陶亮,周健,王华彬.噪声谱估计算法对语音可懂度的影响[J].声学技术,2015,34(5):424-430.
作者姓名:张建伟  陶亮  周健  王华彬
作者单位:安徽大学计算智能与信号处理教育部重点实验室, 安徽合肥 230031;安徽大学计算智能与信号处理教育部重点实验室, 安徽合肥 230031;安徽大学计算智能与信号处理教育部重点实验室, 安徽合肥 230031;安徽大学计算智能与信号处理教育部重点实验室, 安徽合肥 230031
基金项目:国家自然科学基金(61301219、61003131)、安徽省自然科学基金(1408085MF113)资助项目。
摘    要:噪声谱估计是单通道语音增强算法的关键步骤,当前大部分语音增强算法旨在提高语音质量,提高语音可懂度的算法却很少。在传统的单通道语音增强算法中,语音质量的提高往往是以牺牲语音的可懂度为代价的。对目前主流的几种噪声谱估计算法对语音可懂度影响进行分析。在不同噪声背景、不同信噪比情况下进行噪声谱估计,并采用谱减法对含噪语音信号作去噪处理,对比分析不同噪声、不同信噪比下增强前后语音的短时客观可懂度(Short-Time Objective Intelligibility,STOI)值,最后根据信噪比,对比分析了不同噪声环境下,语音增强前后语音能量高于噪声能量的时频块所占比例。实验表明,相比其他噪声估计算法,最小统计(Minima Statistics,MS)算法由于保留了更多的以语音能量为主的时频块,使得去噪后的语音有较高的可懂度。

关 键 词:噪声谱估计  谱减法  时频块  最小统计  短时客观可懂度  语音可懂度
收稿时间:2014/12/15 0:00:00
修稿时间:2015/3/29 0:00:00

Effects of noise spectrum estimation algorithms on speech intelligibility
ZHANG Jian-wei,TAO Liang,ZHOU Jian and WANG Hua-bin.Effects of noise spectrum estimation algorithms on speech intelligibility[J].Technical Acoustics,2015,34(5):424-430.
Authors:ZHANG Jian-wei  TAO Liang  ZHOU Jian and WANG Hua-bin
Affiliation:Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Anhui University, Hefei 230031, Anhui, China;Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Anhui University, Hefei 230031, Anhui, China;Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Anhui University, Hefei 230031, Anhui, China;Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Anhui University, Hefei 230031, Anhui, China
Abstract:Noise spectrum estimation is a key step in single channel speech enhancement algorithms. Most of current speech enhancement algorithms are designed to improve speech quality, however, algorithms for increasing speech intelligibility are few. The traditional speech enhancement algorithms improve speech quality, while sacrificing speech intelligibility. In this paper, classical noise spectrum estimation algorithms are evaluated for their effects on speech intelligibility. Noise spectrum is estimated in different noise environments with SNRs between -9 dB and 3 dB. The spectral subtraction is thereafter used for speech denoising. The STOI(Short-Time Objective Intelligibility) value of the enhanced speech is computed. At last, according to the signal-to-noise ratio, the proportions of speech dominated time-frequency blocks under different noise environments are analyzed. Experimental results show that, compared with other noise estimation algorithms, the minimum statistics (MS) obtains high speech intelligibility because it retains more speech dominated time-frequency blocks after speech denoising.
Keywords:noise spectrum estimation  spectrum subtraction  time-frequency blocks  Minima Statistics(MS)  Short-Time Objective Intelligibility(STOI)  speech intelligibility
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《声学技术》浏览原始摘要信息
点击此处可从《声学技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号