共查询到20条相似文献,搜索用时 15 毫秒
1.
基于最小统计噪声估计的信号子空间语音增强 总被引:1,自引:0,他引:1
针对传统子空间方法中,采用语音活动检测(Voice activity detection,VAD)估计噪声的缺陷,提出了一种基于子空间域的最小统计噪声估计算法。噪声估计通过跟踪带噪语音协方差矩阵用每个特征向量上的特征值的最小值来获得,该方法不需要VAD明确区分语音段和噪声段,能够在整个信号期间实现噪声的连续估计和不断更新。实验结果表明,相对于传统的基于VAD的子空间方法,本文提出的算法对语音增强效果有非常显著的提高。 相似文献
2.
3.
Karsten Vandborg Sorensen Sren Vang Andersen 《IEEE transactions on audio, speech, and language processing》2007,15(3):901-917
In this paper, we propose a new statistical model for noise periodogram modeling and estimation. The proposed model is a hidden Markov model (HMM) with a Rayleigh mixture model (RMM) in each state. For this new model, we derive an expectation-maximization (EM) training algorithm and a minimum mean-square error (MMSE) noise periodogram estimator. It is shown that when compared to the Gaussian mixture model (GMM)-based HMM, the RMM-based HMM has less computationally complex EM iterations and gives a better fit of the noise periodograms when the mixture models has a low number of components. Furthermore, we propose a specialization of the proposed model, which is shown to provide better MMSE noise periodogram estimates than any other of the tested HMM initializations for cyclo-stationary noise types 相似文献
4.
5.
针对复杂噪声干扰环境中语音特征参数会发生改变,引起训练模型和测试语音之间的失配,使语音识别系统的识别率降低,为提高语音特征参数在色噪声环境中提取的鲁棒性,提出了基于总体最小二乘旋转不变子空间技术(TLS-ESPRIT)谐波倒谱加权谱鲁棒特征参数提取方法.运用TLS-SVD方法对观测数据矩阵进行广义特征值分解估计谐波模型的参数,实现了有色噪声背景下语音信号的最优估计.在重建语音的过程中根据谐波能量与带噪语音能量的比值,对重建谐波的各个谐波峰给予不同的加权和语音建模,并进行仿真,结果实现了鲁棒性特征参数的提取,解决了模型之间的失配问题. 相似文献
6.
噪音环境下的语音识别一直是语音识别的难点,本文采用了谱减法进行去噪,进行孤立词(数字0-9)的识别,提高系统的识别率 相似文献
7.
噪音环境下的语音识别一直是语音识别的难点,本文采用了谱减法进行去噪,进行孤立词(数字0-9)的识别,提高系统的识别率. 相似文献
8.
9.
10.
11.
Han-Fu Chen 《Automatic Control, IEEE Transactions on》2007,52(4):703-709
For Hammerstein and Wiener systems observed with additive noises the adaptive regulation control is produced by a truncated stochastic approximation (SA) algorithm with truncation regions expanding with a prescribed rate. It is proved that the stochastic adaptive control given in this note is optimal in the sense that it minimizes the long run average of regulation errors a.s 相似文献
12.
Zhao D.Y. Kleijn W.B. Ypma A. de Vries B. 《IEEE transactions on audio, speech, and language processing》2008,16(4):835-846
We propose a noise estimation algorithm for single-channel noise suppression in dynamic noisy environments. A stochastic-gain hidden Markov model (SG-HMM) is used to model the statistics of nonstationary noise with time-varying energy. The noise model is adaptive and the model parameters are estimated online from noisy observations using a recursive estimation algorithm. The parameter estimation is derived for the maximum-likelihood criterion and the algorithm is based on the recursive expectation maximization (EM) framework. The proposed method facilitates continuous adaptation to changes of both noise spectral shapes and noise energy levels, e.g., due to movement of the noise source. Using the estimated noise model, we also develop an estimator of the noise power spectral density (PSD) based on recursive averaging of estimated noise sample spectra. We demonstrate that the proposed scheme achieves more accurate estimates of the noise model and noise PSD, and as part of a speech enhancement system facilitates a lower level of residual noise. 相似文献
13.
Feature Enhancement for Noisy Speech Recognition With a Time-Variant Linear Predictive HMM Structure
Jianping Deng Bouchard M. Tet Hin Yeap 《IEEE transactions on audio, speech, and language processing》2008,16(5):891-899
This paper presents a new approach for speech feature enhancement in the log-spectral domain for noisy speech recognition. A switching linear dynamic model (SLDM) is explored as a parametric model for the clean speech distribution. Each multivariate linear dynamic model (LDM) is associated with the hidden state of a hidden Markov model (HMM) as an attempt to describe the temporal correlations among adjacent frames of speech features. The state transition on the Markov chain is the process of activating a different LDM or activating some of them simultaneously by different probabilities generated by the HMM. Rather than holding a transition probability for the whole process, a connectionist model is employed to learn the time variant transition probabilities. With the resulting SLDM as the speech model and with a model for the noise, speech and noise are jointly tracked by means of switching Kalman filtering. Comprehensive experiments are carried out using the Aurora2 database to evaluate the new algorithm. The results show that the new SLDM approach can further improve the speech feature enhancement performance in terms of noise-robust recognition accuracy, since the transition probabilities among the LDMs can be described more precisely at each time point. 相似文献
14.
15.
16.
端点检测是语音识别申的一项关键技术,端点检测的准确性对语音识别的性能有很大影响。论文对基于短时能量和短时过零率及基于LPC倒谱特征的端点检测算法进行了研究,给出改进的基于LPC美尔倒谱特征的端点检测算法,并通过实验证明其在低信噪比下具有较好的检测性能。随着语音识别技术的发展,这种算法在实际应用中的高效率、实时、准确性会逐渐显现出。 相似文献
17.
18.
卡尔曼滤波是惯导系统(INS)/GPS组合导航的主要算法之一,Sage-Husa算法是在卡尔曼滤波基础上,为减少系统噪声和量测噪声的不确定性对误差估计的影响而采用的自适应估计方法.对Sage-Husa算法提出了4条改进措施;并通过在3种数据扰动情形下的仿真计算发现,只对一类噪声做自适应估计更容易产生较大的偏差,对系统噪声和量测噪声两类噪声同时做自适应估计,其效果要优于只对一类噪声做自适应估计,把此现象定义为卡尔曼滤波的系统和量测噪声自适应估计的关联性.这个结果不同于一些文献的观点.此项研究对自适应卡尔曼滤波在INS/GPS组合导航的工程化应用有较高的实用价值. 相似文献
19.
《IEEE transactions on audio, speech, and language processing》2009,17(8):1577-1590