首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到17条相似文献,搜索用时 125 毫秒
1.
针对麦克风阵列后滤波语音增强算法的不足, 结合人耳的听觉掩蔽效应, 提出了改进的后滤波语音增强算法. 提出了最大化目标语音存在概率来确定信号子空间维度的方法, 在噪声子空间上, 利用条件概率估计出噪声功率谱. 基于人耳的听觉掩蔽效应, 提出了后滤波器的一种合理的设计方法. 实验证明, 所提的噪声估计方法比传统方法更加准确, 所提的后滤波算法比传统的后滤波算法更好, 在多项语音评价指标上, 都取得了更好的实验效果.  相似文献   

2.
针对现有语音增强算法面临残留噪声这一问题,提出一种结合人耳听觉掩蔽的改进算法。将MMSE-LSA谱估计法和一种最优感知增强滤波器融入一个两极语音增强算法框架,利用人耳听觉掩蔽去除残留噪声;给出算法实施的具体步骤和最优感知滤波器的理论推导。实验结果表明,在非平稳噪声环境下,该算法可以有效降低语音失真和残余噪声,提升增强语音信号的主观和客观质量。  相似文献   

3.
针对MMSE语音增强算法低信噪比时产生较大的语音畸变的缺点,提出了一种结合人耳听觉掩蔽效应的MMSE语音增强算法。该算法利用掩蔽阈值来调整MMSE算法中的增益值,使得增强后的语音信号残留噪声和语音畸变较小。通过计算机仿真对增强前后语音信号的信噪比分析以及主观试听表明:改进的MMSE语音增强算法不仅提高了语音信号的信噪比,而且减少了语音畸变,提高了语音的可懂度。  相似文献   

4.
改进的子空间语音增强算法   总被引:1,自引:0,他引:1  
单通道子空间语音增强算法在加性噪声为白噪声的情况下,效果比较理想.加性噪声为有色噪声的情况下,通常用广义奇异值分解算法来进行处理.为了降低低信噪比情况下残留的音乐噪声,结合人耳的听觉掩蔽效应,提出了一种基于感官抑制的广义奇异值分解算法.实验结果显示,该算法能够明显地提高语音质量、可懂度和识别率,特别是在加性噪声是有色噪声的情况下实验结果明显优于其他的语音增强算法.  相似文献   

5.
对于低信噪比环境下的语音信号,传统谱减法残留的背景噪声较大。针对该问题,基于听觉掩蔽效应提出一种改进的语音增强算法。将人耳听觉掩蔽特性与功率谱减法相结合,设计一种时域递归平均算法对噪声进行估计,同时对带噪语音信号做频谱相减处理,从听觉的角度出发,利用估计的语音信号功率谱计算掩蔽阈值,并引入谱减功率修正系数和谱减噪声系数,实现带噪语音的信号增强。利用Matlab 2012b进行仿真,实验结果表明,该算法在低信噪比条件下能够较好地抑制背景噪声,改善语音质量,且与改进自适应滤波算法相比,其输出信号的信噪比可提高5%左右。  相似文献   

6.
为降低装甲车辆内部强噪声对话音通信的影响,结合Mel算法、频率短时能量差、听觉掩蔽效应和经过改进的谱减法,提出了一种语音激活检测和语音增强方法,对传统谱减法噪声估计不精确和语音失真等问题进行了改进.该方法采用Mel频率对带噪语音进行语音激活检测,对噪声进行保守估计,替代一般谱减法采用的噪声统计均值.结合听觉掩蔽阈值对谱减法的相关系数进行动态调整,避免传统算法系数保持不变的不合理性.实验结果表明,该方法能很好抑制音乐噪声,提高带噪语音信噪比,改善语音的清晰度和可懂度.  相似文献   

7.
提出一种基于谱减法和听觉掩蔽效应的改进的卡尔曼滤波语音增强算法.引入基于谱减法的AR参数估计使卡尔曼算法降低了复杂度和计算量从而易于实现.用卡尔曼滤波滤除噪声的同时结合人耳听觉掩蔽特性设计一个后置感知滤波器,使得从卡尔曼滤波获得的估计误差低于人耳掩蔽阈值,在去噪和语音失真之间取较好的折中.仿真结果表明所提方法优于传统的卡尔曼滤波增强法,能够有效地减少语音失真,并且更符合人耳听觉特性,特别是在低信噪比的情况下,语音具有更好的清晰度和可懂度.  相似文献   

8.
针对非平稳噪声环境和低信噪比的情况,提出了一种基于低频区语音特性的非平稳噪声估计方法,通过构造一个时变的权值,实现对噪声的实时估计,同时结合人耳听觉掩蔽效应,利用估计出的噪声自适应设定增强系数。仿真实验表明,该方法能够较好地抑制背景噪声,提高信噪比,减少语音失真。  相似文献   

9.
为了减小传统谱减法引入的音乐噪声,提出了一种将多频带谱减和听觉掩蔽效应相结合的语音增强算法.用加权递归平滑的方法估计噪声的功率谱,对带噪的语音信号进行多频带谱减,计算听觉掩蔽阈值,再根据掩蔽阈值动态地调节谱减因子,通过增益函数得到增强后语音信号的频谱.仿真实验结果表明,与传统的谱减法相比,该算法在信噪比较低情况下,背景噪声和残余噪声得到了有效的抑制,语音信号的清晰度和可懂度也有了明显提升.  相似文献   

10.
联合听觉掩蔽效应的子空间语音增强算法   总被引:1,自引:0,他引:1       下载免费PDF全文
在经典子空间语音增强算法中,因语音特征值估计偏差会造成语音失真和音乐噪声。针对该问题,提出一种联合听觉掩蔽效应的语音增强算法。该算法联合掩蔽阈值自适应调节噪声特征值的抑制系数,并利用维纳滤波对音乐噪声的抑制性,对该特征值并行修正,最终还原出纯净的语音。实验结果证明,该算法在白噪声和有色噪声的背景下,与经典子空间的语音增强算法相比,能提高信噪比,减少语音失真和音乐噪声。  相似文献   

11.
A signal subspace scheme based on masking properties is proposed for enhancement of speech degraded by additive noise. Since the masking properties are related to the critical frequency band that is derived from the characteristics of human cochlea, the incorporation of masking threshold into a subspace technique requires the transformation between the frequency and eigen domains. We present and apply an invertible transformation between the frequency and eigen domains. In this paper, we use masking properties of the human auditory system to define the audible noise quantity in the eigendomain. We derive the eigen-decomposition of the estimated speech autocorrelation matrix with the assumption of white noise. Subsequently, an audible noise reduction scheme is developed based on a signal subspace technique, and the implementation of our proposed scheme is outlined. We further extend the scheme to the colored noise case. Simulation results show the superiority of our proposed scheme over other existing subspace methods in terms of segmental signal-to-noise ratio (SNR), perceptual evaluation of speech quality (PESQ), modified Bark spectral distortion (MBSD), spectrogram and informal listening tests.  相似文献   

12.
基于感知掩蔽深度神经网络的单通道语音增强方法   总被引:1,自引:0,他引:1  
本文将心理声学掩蔽特性应用于基于深度神经网络(Deep neural network,DNN)的单通道语音增强任务中,提出了一种具有感知掩蔽特性的DNN结构.首先,提出的DNN对带噪语音幅度谱特征进行训练并分别得到纯净语音和噪声的幅度谱估计.其次,利用估计的纯净语音幅度谱计算噪声掩蔽阈值.然后,将噪声掩蔽阈值和估计的噪声幅度谱联合计算得到一个感知增益函数.最后,利用感知增益函数从带噪语音幅度谱中估计出增强语音幅度谱.在TIMIT数据库上,对不同信噪比下的20种噪声进行的仿真实验表明,无论噪声类型是否在语音的训练集中出现,所提出的感知掩蔽DNN都能够在有效去除噪声的同时保持较小的语音失真,增强效果明显优于常见的DNN增强方法以及NMF(Nonnegative matrix factorization)增强方法.  相似文献   

13.
The singular value decomposition (SVD)-based method for single-channel speech enhancement has been shown to be very useful when the additive noise is white. For colored noise, with this approach, one needs to whiten the noise spectrum prior to SVD-based approach and perform the inverse whitening processing afterwards. A truncated quotient SVD (QSVD)-based approach has been proposed to handle this problem and found very useful. In this paper, a generalized SVD (GSVD)-based subspace approach for speech enhancement is first extended from the concept of the truncated QSVD-based approach, in which the dimension of the signal subspace can be precisely and automatically determined for each frame of the noisy signal. But with this new approach some residual noise is still perceivable under lower signal-to-noise ratio conditions. Therefore a perceptually constrained GSVD (PCGSVD)-based approach is further proposed to incorporate the masking properties of human auditory system to make sure the undesired residual noise to be nearly un-perceivable. Closed-form solutions are obtained for both the GSVD- and PCGSVD-based enhancement approaches. Very carefully performed objective evaluations and subjective listening tests show that the PCGSVD-based approach proposed here can offer improved speech quality, intelligibility and recognition accuracy, whether the noise is stationary or nonstationary, especially when the additive noise is nonwhite  相似文献   

14.
针对频域受限子空间语音增强在构造增强矩阵时,采用固定拉格朗日乘子,使得减小语音畸变和提高语音可懂度的过程中,有音乐噪声残留,提出一种变拉格朗日乘子的算法。利用听觉特性中较强的频率成分对噪声进行掩蔽,通过掩蔽阈值的频率域与子空间特征值之间的变换算法,用变量控制子空间拉格朗日乘子计算增益函数的对角矩阵。对比实验和试听结果表明,提出算法增强的语音信号不仅信噪比有较大提高,语音质量主观感知度也有明显改善。  相似文献   

15.
针对复杂背景噪声下语音增强后带有音乐噪声的问题,提出一种子空间与维纳滤波相结合的语音增强方法。对带噪语音进行KL变换,估计出纯净语音的特征值,再利用子空间域中的信噪比计算公式构成一个维纳滤波器,使该特征值通过这个滤波器,从而得到新的纯净语音特征值,由KL逆变换还原出纯净语音。仿真结果表明,在白噪声和火车噪声的背景下,信噪比都比传统子空间方法有明显提高,并有效抑制了增强后产生的音乐噪声。  相似文献   

16.
提出一种符合人耳听觉感知的语音增强方法,使电子耳蜗能在噪声环境下获得准确的语音信息。利用Bark子波变换实现电子耳蜗中的语音处理,结合人耳听觉系统特性实现语音增强。使用根据人耳听觉掩蔽效应提出的自适应减参数。实验结果表明该算法在低信噪比情况下,信噪比可提高30 dB左右,更好地抑制了残留噪声和背景噪声,合成的语音具有较好清晰度和可懂度。  相似文献   

17.
We present a new speech enhancement scheme for a single-microphone system to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-to-noise ratio. A psychoacoustic model is incorporated into the generalized perceptual wavelet denoising method to reduce the residual noise and improve the intelligibility of speech. The proposed method is a generalized time-frequency subtraction algorithm, which advantageously exploits the wavelet multirate signal representation to preserve the critical transient information. Simultaneous masking and temporal masking of the human auditory system are modeled by the perceptual wavelet packet transform via the frequency and temporal localization of speech components. The wavelet coefficients are used to calculate the Bark spreading energy and temporal spreading energy, from which a time-frequency masking threshold is deduced to adaptively adjust the subtraction parameters of the proposed method. An unvoiced speech enhancement algorithm is also integrated into the system to improve the intelligibility of speech. Through rigorous objective and subjective evaluations, it is shown that the proposed speech enhancement system is capable of reducing noise with little speech degradation in adverse noise environments and the overall performance is superior to several competitive methods.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号