首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 515 毫秒
1.
A simple method is proposed for estimation of amplitudes of multiple sinusoids. The estimation is based on the existing adaptive identifier which offers the globally convergent estimate of sinusoidal frequencies. To deal with possible singularities of the amplitude estimation, adaptive observers are also proposed for estimation of sinusoidal amplitudes. Simulations illustrate the results  相似文献   

2.
3.
A speech enhancement algorithm that takes advantage of the time and frequency dependencies of speech signals is presented in this paper. The above dependencies are incorporated in the statistical model using concepts from the theory of Markov Random Fields. In particular, the speech short-time Fourier transform (STFT) amplitude samples are modeled with a novel Chi Markov Random Field prior, which is then used for the development of an estimator based on the Iterated Conditional Modes method. The novel prior is also coupled with a ‘harmonic’ neighborhood, which apart from the immediately adjacent samples on the time frequency plane, also considers samples which are one pitch frequency apart, so as to take advantage of the rich structure of the voiced speech time frames. Additionally, central to the development of the algorithm is the adaptive estimation of the weights that determine the interaction between neighboring samples, which allows the restoration of weak speech spectral components, while maintaining a low level of uniform residual noise. Results that illustrate the improvements achieved with the proposed algorithm, and a comparison with other established speech enhancement schemes are also given.   相似文献   

4.
The goal of signal processing is to estimate the contained frequencies and extract subtle changes in the signals. In this paper, a new adaptive multiple signal classification-empirical wavelet transform (MUSIC-EWT) methodology is presented for accurate time–frequency representation of noisy non-stationary and nonlinear signals. It uses the MUSIC algorithm to estimate the contained frequencies in the signal and build the appropriate boundaries to create the wavelet filter bank. Then, the EWT decomposes the time-series signal into a set of frequency bands according to the estimated boundaries. Finally, the Hilbert transform is applied to observe the evolution of calculated frequency bands over time. The usefulness and effectiveness of the proposed methodology are validated using two simulated signals and an ECG signal obtained experimentally. The results demonstrate clearly that the proposed methodology is immune to noise and capable of estimating the optimal boundaries to isolate the frequencies from noise and estimate the main frequencies with high accuracy, especially the closely-spaced frequencies.  相似文献   

5.
Several algorithms have been developed for tracking formant frequency trajectories of speech signals, however most of these algorithms are either not robust in real-life noise environments or are not suitable for real-time implementation. The algorithm presented in this paper obtains formant frequency estimates from voiced segments of continuous speech by using a time-varying adaptive filterbank to track individual formant frequencies. The formant tracker incorporates an adaptive voicing detector and a gender detector for formant extraction from continuous speech, for both male and female speakers. The algorithm has a low signal delay and provides smooth and accurate estimates for the first four formant frequencies at moderate and high signal-to-noise ratios. Thorough testing of the algorithm has shown that it is robust over a wide range of signal-to-noise ratios for various types of background noises.  相似文献   

6.
本文在分析基于短时能量的语音端点检测算法局限的基础上,引入短时信噪比SNR估计方法,并设计自适应的判决门限,提出一种自适应语音端点检测算法.通过对平稳高斯白噪声环境下信噪比从-10dB到20dB的带噪语音信号进行的仿真实验表明,所提方法能更为准确地检测到语音的端点.  相似文献   

7.
This paper presents a new approach to speech enhancement based on modified least mean square-multi notch adaptive digital filter (MNADF). This approach differs from traditional speech enhancement methods since no a priori knowledge of the noise source statistics is required. Specifically, the proposed method is applied to the case where speech quality and intelligibility deteriorates in the presence of background noise. Speech coders and automatic speech recognition systems are designed to act on clean speech signals. Therefore, corrupted speech signals by the noise must be enhanced before their processing. The proposed method uses a primary input containing the corrupted speech signal and a reference input containing noise only. The new computationally efficient algorithm is developed here based on tracking significant frequencies of the noise and implementing MNADF at those frequencies. To track frequencies of the noise time-frequency analysis method such as short time frequency transform is used. Different types of noises from Noisex-92 database are used to degrade real speech signals. Objective measures, the study of the speech spectrograms and global signal-to-noise ratio (SNR), segmental SNR (segSNR) as well as subjective listing test demonstrate consistently superior enhancement performance of the proposed method over tradition speech enhancement method such as spectral subtraction.  相似文献   

8.
This paper presents a geostatistical model as a new approach to the linear prediction analysis of speech. The autocorrelation method of autoregressive modeling, which is widely applied in the linear predictive coding of speech, is used as a benchmark for comparison with the present algorithm. Before discussing the proposed model, we will briefly describe the concepts of linear prediction analysis of speech and how this is solved by the well-known method of autocorrelation. Following is the introduction of geostatistics including the ideas of regionalized variables, semi-variograms and kriging equations. We then propose a geostatistical model to the linear prediction modeling of speech signals. Examples on speech data are given to illustrate the effectiveness of the present algorithm in comparison with the autocorrelation method. Advantages offered by the proposed geostatistical algorithm over the autocorrelation method in the linear prediction analysis of speech are summarized as follows: (1) it is more effective due to the optimization of the kriging equations taking into account the biased condition; (2) it is more flexible by allowing different biased values for the fitting of the signal spectrum, and therefore may provide a means for adaptive LPC; (3) it can give a good estimate of the number of poles used in the LPC by means of the theoretical semi-variogram.  相似文献   

9.
基于自适应滤波器的时频分析   总被引:1,自引:0,他引:1  
储昭碧  张崇巍  冯小英 《自动化学报》2009,35(11):1420-1428
采用最小方差原则和梯度下降方法, 经过旋转变换, 获得频率参数和带宽参数可调的自适应二维线性正弦跟踪滤波器, 实现信号跟随与幅值估计. 把多个跟踪器并联, 形成带宽可调的多频点梳状滤波器, 得到多维线性常系数微分动力系统. 用不变原理证明梳状滤波器是一致渐近稳定的. 用拉普拉斯变换导出频率特性, 单正弦输入时为向量形式, 针对多正弦分量时为频率特性矩阵. 当输入信号所有正弦分量的频率都等于梳状滤波器频率参数时, 本算法能够同时准确跟随所有分量及其幅值. 分析了算法的频域格栅效应以及带宽参数对稳态精度的影响, 通过仿真说明了算法的有效性.  相似文献   

10.
In this paper, we consider the parameters estimation of a model of superimposed exponential signals in multiplicative and additive noise when some observations are missing randomly. The least squares estimators (LSEs) and asymptotic Cramer–Rao low bound (ACRLB) for the considered model are studied and the asymptotic distributions of the LSEs for parameters of frequencies, phases and amplitudes of the considered model are also derived and obtained. An adaptive and computationally efficient iterative algorithm is proposed to estimate the frequencies of the considered model. It can be seen that the iterative algorithm works quite well in terms of biases and mean squared errors and the refined estimators by three iterations are observed to be asymptotically unbiased and consistent. The statistics for iteration are designed to change adaptively according to different missing distributions of time points so as to keep the estimators of frequencies to be asymptotically unbiased. Moreover, the proposed estimators attain the same convergence rate and asymptotic distribution as those of LSEs which are used to obtain the confident intervals and coverage probabilities of the frequencies for finite sample. Since the iterative algorithm needs only three iterations to work, it saves much computation time. So the proposed estimators are LSEs equivalent while avoid the heavy computation cost of LSEs. Finally, several simulation experiments are performed to verify the effectiveness of the proposed algorithm. To examine the robustness of the proposed algorithm, we also test the algorithm on the dual tone multi-frequency (DTMF) signal with observations missing in block and symmetric α-stable (SaS) noise condition, as well as on sinusoidal frequency modulated signals.  相似文献   

11.
广义Gamma模型是近年来新提出的一种语音分布模型,相对于传统的高斯或超高斯模型具有更好的普适性和灵活性,提出一种基于广义Gamma语音模型和语音存在概率修正的语音增强算法。在假设语音和噪声的幅度谱系数分别服从广义Gamma分布和Gaussian分布的基础上,推导了语音信号对数谱的最小均方误差估计式;在该模型下进一步推导了语音存在概率,对最小均方误差估计进行修正。仿真结果表明,与传统的短时谱估计算法相比,该算法不仅能够进一步提高增强语音的信噪比,而且可以有效减小增强语音的失真度,提高增强语音的主观感知质量。  相似文献   

12.
In this paper, an automatic adaptive method for identification and separation of the useful information content, from the background noise of time–frequency distributions (TFD) of multicomponent nonstationary signals, is presented. The method is based on an initial segmentation of the TFD information content by the K-means clustering algorithm, that partitions the initial data set in order to obtain K classes containing elements with similar amplitudes. It is shown that the local Rényi entropy (LRE) can accurately distinguish classes containing noise from classes with the useful information content, as a consequence of their basic structural differences in the time–frequency plane. Simulations are run to compare the performance of the proposed adaptive algorithm for blind separation of useful information from background noise (i.e. blind amplitude threshold) and non-adaptive (hard) amplitude TFD threshold procedures. Simulation results indicate that the proposed method performs better or closely to the best of five blindly chosen hard thresholds. The limitation of efficient hard-thresholding is the need of previous knowledge of the signal's structure and SNR or visual evaluation.  相似文献   

13.
Most speech enhancement algorithms are based on the assumption that speech and noise are both Gaussian in the discrete cosine transform (DCT) domain. For further enhancement of noisy speech in the DCT domain, we consider multiple statistical distributions (i.e., Gaussian, Laplacian and Gamma) as a set of candidates to model the noise and speech. We first use the goodness-of-fit (GOF) test in order to measure how far the assumed model deviate from the actual distribution for each DCT component of noisy speech. Our evaluations illustrate that the best candidate is assigned to each frequency bin depending on the Signal-to-Noise-Ratio (SNR) and the Power Spectral Flatness Measure (PSFM). In particular, since the PSFM exhibits a strong relation with the best statistical fit we employ a simple recursive estimation of the PSFM in the model selection. The proposed speech enhancement algorithm employs a soft estimate of the speech absence probability (SAP) separately for each frequency bin according to the selected distribution. Both objective and subjective tests are performed for the evaluation of the proposed algorithms on a large speech database, for various SNR values and types of background noise. Our evaluations show that the proposed soft decision scheme based on multiple statistical modeling or the PSFM provides further speech quality enhancement compared with recent methods through a number of subjective and objective tests.  相似文献   

14.
In this paper, an intelligent speaker identification system is presented for speaker identification by using speech/voice signal. This study includes both combination of the adaptive feature extraction and classification by using optimum wavelet entropy parameter values. These optimum wavelet entropy values are obtained from measured Turkish speech/voice signal waveforms using speech experimental set. It is developed a genetic wavelet adaptive network based on fuzzy inference system (GWANFIS) model in this study. This model consists of three layers which are genetic algorithm, wavelet and adaptive network based on fuzzy inference system (ANFIS). The genetic algorithm layer is used for selecting of the feature extraction method and obtaining the optimum wavelet entropy parameter values. In this study, one of the eight different feature extraction methods is selected by using genetic algorithm. Alternative feature extraction methods are wavelet decomposition, wavelet decomposition – short time Fourier transform, wavelet decomposition – Born–Jordan time–frequency representation, wavelet decomposition – Choi–Williams time–frequency representation, wavelet decomposition – Margenau–Hill time–frequency representation, wavelet decomposition – Wigner–Ville time–frequency representation, wavelet decomposition – Page time–frequency representation, wavelet decomposition – Zhao–Atlas–Marks time–frequency representation. The wavelet layer is used for optimum feature extraction in the time–frequency domain and is composed of wavelet decomposition and wavelet entropies. The ANFIS approach is used for evaluating to fitness function of the genetic algorithm and for classification speakers. It has been evaluated the performance of the developed system by using noisy Turkish speech/voice signals. The test results showed that this system is effective in detecting real speech signals. The correct classification rate is about 91% for speaker classification.  相似文献   

15.
This paper addresses the problem of speech enhancement and acoustic noise reduction by adaptive filtering algorithms. Recently, we have proposed a new Forward blind source separation algorithm that enhances very noisy speech signals with a subband approach. In this paper, we propose a new variable subband step-sizes algorithm that allows improving the previous algorithm behaviour when the number of subband is selected high. This new proposed algorithm is based on recursive formulas to compute the new variable step-sizes of the cross-coupling filters by using the decorrelation criterion between the estimated sub-signals at each subband output. This new algorithm has shown an important improvement in the steady state and the mean square error values. Along this paper, we present the obtained simulation results by the proposed algorithm that confirm its superiority in comparison with its original version that employs fixed step-sizes of the cross-coupling adaptive filters and with another fullband algorithm.  相似文献   

16.
This paper presents a novel method for the enhancement of independent components of mixed speech signal segregated by the frequency domain independent component analysis (FDICA) algorithm. The enhancement algorithm proposed here is based on maximum a posteriori (MAP) estimation of the speech spectral components using generalized Gaussian distribution (GGD) function as the statistical model for the time–frequency series of speech (TFSS) signal. The proposed MAP estimator has been used and evaluated as the post-processing stage for the separation of convolutive mixture of speech signals by the fixed-point FDICA algorithm. It has been found that the combination of separation algorithm with the proposed enhancement algorithm provides better separation performance under both the reverberant and non-reverberant conditions.  相似文献   

17.
提出了一种基于自适应加权谱内插(STRAIGHT)的宽带语音编码算法。输入的语音信号首先经过STRAIGHT分析得到精确的基频参数和谱参数,然后通过时域抽取和频域建模实现有效的编码压缩。在时域抽取时采用的区别于传统编码算法固定帧长的自适应可变帧长方法,使得编码存储量可以根据实际语音变化情况得到更加合理的分配。主观测听结果表明,该算法针对16kHz采样的语音信号,在6kbps码率上可以取得与AMR-WB(G.722.2)在8.85kbps时的相当的音质效果。此外,该算法还具有对恢复语音的时长、基频以及谱参数较强的调整能力。  相似文献   

18.
A new method is proposed in this article for fast-varying AM–FM components extraction. There are two prominent characteristics in this method. Firstly, a new evaluation method for the instantaneous bandwidth is established, which is based on the instantaneous slope of the time-frequency curve with respect to the AM–FM component. Secondly, a new adaptive STFT algorithm is established, which adjusts the window width by adapting to the instantaneous bandwidth at each frequency position. In order to extract multiple AM–FM components from a signal, the width of the reconstruction area is required to be determined efficiently to avoid the interference caused by adjacent components. Simulations are given in the end, which show that the proposed method has good performance for fast-varying AM–FM components extraction from noisy signals.  相似文献   

19.
α稳定分布下Volterra滤波器的自适应数据块算法   总被引:1,自引:0,他引:1  
基于分数低阶统计量原理提出了α稳定分布下Volterra滤波器的数据块滤波算法。该算法对Volterra滤波器权向量的线性项部分和非线性项部分分别采用不同的收敛因子,克服了传统只采用一个收敛因子的Volterra滤波器算法收敛性能差缺点,利用更多的输入信号和误差信号信息,更好地估计梯度,更精确地调节自适应滤波器权向量,提高了收敛速度。仿真结果验证了该方法的优越性。  相似文献   

20.
针对强背景环境噪声对语音信号的影响问题,根据短时分形维数能够反映信号动态特征的特性,提出了一种基于分形维数的自适应语音降噪改进算法.实际应用表明,该算法能够有效提升非平稳噪声环境下的语音增强效果.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号