共查询到20条相似文献,搜索用时 78 毫秒
1.
为了解决在受阻塞干扰的跳频信道上语音质量受影响严重的问题,对以传输语音为主的数字跳频系统采用CVSD(连续可变斜率增量调制)编码时的编码算法进行了优化研究。在传统CVSD编码基本原理的基础上,提出了一种新的CVSD编码优化算法,并给出了数字跳频系统中语音质量衡量准则和基于优化算法的数字跳频系统CVSD基带仿真模型,具体分析了在部分频带噪声干扰下优化算法对系统接收语音信号恶化量的影响。仿真结果表明,优化算法较之传统CVSD编码算法能有效地控制接收语音信号恶化量,即使在受阻塞干扰十分严重的情况下也能获得较好的语音质量。 相似文献
2.
Chong Un Hwang Lee 《Communications, IEEE Transactions on》1980,28(1):96-101
A performance comparison of three representative ADM systems has been made by computer simulation using real speech. The three systems studied are continuously variable slope delta modulation (CVSD), Jayant's constant factor delta modulation (CFDM), and a modified version of Un and Magill's hybrid companding delta modulation (HCDM). Among the three systems, HCDM yields the best performance in signal-to-quantization noise ratio (SQNR) and dynamic range regardless of the channel bit error rate. Comparing CVSD and CFDM in an ideal channel, the dynamic range of the latter is significantly wider than that of the former, although their peak SQNR's are almost the same. In a noisy channel, CFDM degrades more rapidly than the other two as the bit error rate increases. In the channel with an error rate above 10-3, the use of CFDM appears to be impractical when the bit rate is below 16 khits/s. However, intelligible speech transmission is possible with HCDM or CVSD even at the error rate of as high as 10-1. 相似文献
3.
Subjective quality measurements on three digital speech coders, simulated with mobile radio channel transmission, were performed using the "mean opinion score (MOS)" method. The three speech coding methods tested were: continuously variable slope deltamodulation (CVSD) coding, adaptive predictive coding (APC), and residually excited linear predictive (RELP) coding. Several versions of each coder, with transmission rates in the range of 7.3 to 16.1 kbits/s, were simulated. Five different channel conditions, including three derived from land mobile radio field experiments, were applied to the speech coders' encoded output to study the effects. The results show that of the three coders, the CVSD coder is the most robust to channel errors, but produces reconstructed output speech of unacceptable quality. The 14.4 kbit/s RELP coder produces relatively good Output speech quality, exhibits a mild degree of robustness to mobile radio channel errors, and is slightly less complex than the APC coder. Of the three digital speech coders tested, the RELP coder appears the most suitable for use with land mobile radio. However none of the three coders was able to produce speech of telephone toll quality in a mobile radio environment. 相似文献
4.
The authors discuss a method for spectral analysis of noise corrupted signals using statistical properties of the zero-crossing intervals. It is shown that an initial stage of filter-bank analysis is effective for achieving noise robustness. The technique is compared with currently popular spectral analysis techniques based on singular value decomposition and is found to provide generally better resolution and lower variance at low signal to noise ratios (SNRs). These techniques, along with three established methods and three variations of these method, are further evaluated for their effectiveness for formant frequency estimation of noise corrupted speech. The theoretical results predict and experimental results confirm that the zero-crossing method performs well for estimating low frequencies and hence for first formant frequency estimation in speech at high noise levels (~0 dB SNR). Otherwise, J.A. Cadzow's high performance method (1983) is found to be a close alternative for reliable spectral estimation. As expected the overall performance of all techniques is found to degrade for speech data. The standard autocorrelation-LPC method is found best for clean speech and all methods deteriorate roughly equally in noise 相似文献
5.
基于听觉感知的LSA-MMSE改进型语音增强方法 总被引:3,自引:0,他引:3
传统增强方法的增益函数对每个频点都进行估计,必然会引进相对较多的语音失真.为了提高低信噪比下的语音增强效果,提出了一种计算掩蔽概率的方法,得到优化的语音增强方法.基于听觉感知特性,对噪声被掩蔽部分的带噪语音谱和未掩蔽部分采用不同处理方法.增强后的语音可以表示为这两个状态下单独估计的加权和,其中权重与噪声被掩蔽概率有关.通过与Virag的方法、LSA-MMSE估计等方法进行比较,实验结果表明所提的增强方法能在低信噪比下有效地抑制残留噪声的同时保持更小的语音失真. 相似文献
6.
7.
Speech enhancement using constrained spectral amplitude subtraction based on noncausal a priori SNR 总被引:3,自引:0,他引:3
Wu Hongwei Wu Zhenyang 《电子科学学刊(英文版)》2006,23(6):937-942
Two gain forms of spectral amplitude subtraction are derived theoretically without neglecting the correlation of speech and noise spectrum during the period of a fralne. In the implementation, the constrained gain is expressed as a function of noncausal a priori SNR (Signal-to-Noise Ratio). Noise and noncausal a priori SNR are estimated from the multitaper spectrum of the noisy signal with algorithms modified to be suitable for the multitaper spectruln. Objective evaluations show that in case of white Gaussian noise the proposed method outperforms some methods based on LSA (Log Spectral Amplitude) in terms of MBSD (Modified Bark Spectral Distortion), segmental SNR and overall SNR, and informal listening tests show that speech reconstructed in this way has little speech distortion and musical noise is nearly inaudible even at low SNR. 相似文献
8.
相位谱补偿语音增强算法通过调整相位谱对噪声进行压缩,提高重构信号的质量。针对传统的相位谱补偿(phase spectrum compensation, PSC)语音增强算法采用固定的相位补偿因子,且算法的性能易受噪声估计准确性的影响,提出了一种基于稀疏性的相位谱补偿(sparsity-based phase spectrum compensation, SPSC)语音增强算法。首先,利用噪声估计算法得到噪声幅度谱,利用基于幅度谱的语音增强算法得到目标语音幅度谱;接着,通过噪声和目标语音幅度谱之间的局部信噪比(Signal-to-Noise Ratio, SNR)来估计谱时间稀疏性;然后,利用sigmoid函数改进相位补偿因子,联合补偿因子和谱时间稀疏性,得到SPSC函数。最后,使用SPSC函数对相位谱中的谱分量进行补偿,通过短时傅里叶逆变换得到最终增强后的语音信号。仿真实验表明,在四种不同背景噪声的低信噪比下,新的相位谱补偿算法使增强语音获得了更好的LSD、PESQ和segSNR指标,说明新的算法在低信噪比下,可以有效恢复带噪语音中的语音成分,对噪声抑制效果明显,增强语音的质量和听感均有一定提升。 相似文献
9.
A new idea, enhancing speech based on auditory evidence, is explored for the problem of enhancing speech degraded by stationary and nonstationary additive white noise. Distinguishing different objectives for heavy and light noise interference, two related algorithms are developed. For speech degraded by heavy noise, the improvement in signal-to-noise ratio (SNR) is as high as 12 dB; for lightly noisy speech, the improvement is modest and decreases as the SNR of the noisy speech increases. Quantizing noise is used to assess the capacity for reducing nonstationary noise using these algorithms; a significant reduction of such noise and an improvement in speech quality are achieved. The advantages of the proposed algorithms for speech enhancement include no need for prior knowledge of the noise and only a modest computational requirement 相似文献
10.
In this paper, the authors present optimal multichannel frequency domain estimators for minimum mean-square error (MMSE) short-time spectral amplitude (STSA), log-spectral amplitude (LSA), and spectral phase estimation in a widely distributed microphone configuration. The estimators utilize Rayleigh and Gaussian statistical models for the speech prior and noise likelihood with a diffuse noise field for the surrounding environment. Based on the Signal-to-Noise Ratio (SNR) and Segmental Signal-to-Noise Ratio (SSNR) along with the Log-Likelihood Ratio (LLR) and Perceptual Evaluation of Speech Quality (PESQ) as objective metrics, the multichannel LSA estimator decreases background noise and speech distortion and increases speech quality compared to the baseline single channel STSA and LSA estimators, where the optimal multichannel spectral phase estimator serves as a significant quantity to the improvements, and demonstrates robustness due to time alignment and attenuation factor estimation. Overall, the optimal distributed microphone spectral estimators show strong results in noisy environments with application to many consumer, industrial, and military products. 相似文献
11.
为解决传统算法对噪声适应性较差,残留音乐噪声较强的问题,本文提出了一种基于自适应噪声估计的宽带语音增强算法。该算法可应用于宽带语音编码器,以提升在噪声环境下的编码质量。本文所提算法利用谱熵对噪声类型进行有效的判别,将背景噪声分为白噪声和有色噪声两类,并根据噪声特性选择适当的噪声估计方法。在白噪声背景下,选择一种谱平滑的方法;在有色噪声背景下,则选择经典的最小值控制递归平均算法。在此基础上结合经典的统计模型方法,构建一种具有较强噪声鲁棒性的宽带语音增强算法。在ITU-T G.160标准下对算法进行性能测试,测试结果表明,在不同强度的背景噪声环境下,增强语音的信噪比提高都较为明显。同时,在低信噪比情况下,该算法有效的抑制了严重影响听觉质量的音乐噪声现象。 相似文献
12.
一种低信噪比语音的增强算法 总被引:2,自引:0,他引:2
为改善低信噪比环境下语音的质量,论文提出了一种新的语音增强算法。算法首先根据噪声频谱的高斯统计模型得到用先验信噪比形式表示的噪声频谱估计值,然后利用帧内、帧间平滑算法估计每一个频点的先验信噪比,从而能够更好地跟踪先验信噪比的变化。算法接着引入一种简便的估计语音在每一个频点出现概率的方法,得出一种新的语音增强算法。客观测试和非正式听音测试表明:该算法在几乎不损伤语音清晰度的前提下,能够更好地抑制低信噪比语音增强所产生的音乐噪声,同时使语音信噪比得到了明显提高。 相似文献
13.
In this paper, a smoothing approach for enhancing speech signals degraded by statistically independent additive nonstationary noise is developed. The autoregressive hidden Markov model (ARHMM) is used for modeling the statistical characteristics of both the clean speech signal and the nonstationary noise process. In this case, the speech enhancement comprises a weighted sum of the conditional mean estimators for the composite states of the models for the speech and noise, where the weights are equal to the posterior probabilities of the composite states, given the noisy speech. The conditional mean estimators use a smoothing approach based on two Kalman filters with Markovian switching coefficients, where one of the filters propagates in the forward-time direction and the other propagates in the backward-time direction with one frame. The proposed method is tested on speech signals degraded by Gaussian colored noise or nonstationary noise at various input signal-to-noise ratios. An approximate improvement of 4.7–5.2 dB in SNR is achieved at input SNR 10 and 15 dB. Also, in comparison with conventional method (Ephraim, IEEE Trans. Signal Process. SP-41 (April 1992) 725–735), our proposed method shows improvement of about 0.3 dB in SNR. 相似文献
14.
15.
Filtering of colored noise for speech enhancement and coding 总被引:6,自引:0,他引:6
Scalar and vector Kalman filters are implemented for filtering speech contaminated by additive white noise or colored noise, and an iterative signal and parameter estimator which can be used for both noise types is presented. Particular emphasis is placed on the removal of colored noise, such as helicopter noise, by using state-of-the-art colored-noise-assumption Kalman filters. The results indicate that the colored noise Kalman filters provide a significant gain in signal-to-noise ratio (SNR), a visible improvement in the sound spectrogram, and an audible improvement in output speech quality, none of which are available with white-noise-assumption Kalman and Wiener filters. When the filter is used as a prefilter for linear predictive coding, the coded output speech quality and intelligibility are enhanced in comparison to direct coding of the noisy speech 相似文献
16.
The letter describes a method of improving the dynamic range of a continuously variable slope delta modulator (CVSD). This is achieved by modifying the basic step size ?0 Compared to the CVSD algorithm, the modified CVSD (MCVSD) algorithm yields about 15?20 dB dynamic range improvement without degrading the peak SNR and the bit error rate tolerance. 相似文献
17.
18.
19.
讨论了一种基于传统谱相减算法的改进方法。利用语音的短时平稳性,通过先验幅度比来连续更新噪声谱的估计,从而代替复杂的VAD(话音活性检测)。计算机仿真结果表明,这种改进方法有效抑制了噪声干扰,语音得到了增强,在极大地提高信噪比的同时,将残留的音乐噪声和语音失真保持在人耳听觉容忍的范围以内,从而较好的保持了语音自然度。 相似文献
20.
谱减法是常用的单通道语音降噪方法,传统谱减法在抑制背景噪声的同时引入了“音乐噪声”,影响听觉效果。为了抑制音乐噪声,提出了一种基于后验信噪比的频域语音增强新方法,当后验信噪比较高时,采用基于后验信噪比的谱减法增强语音信号;当后验信噪比较低时,采用基于后验信噪比的谱衰减方法对含噪语音信号谱线进行衰减,达到语音增强的目的。仿真结果表明,基于后验信噪比的频域语音增强法具有较好的背景噪声和音乐噪声抑制效果,并保持了较好语音可懂度。 相似文献