Speech enhancement using Teager energy operated ERB-like perceptual wavelet packet decomposition |
| |
Authors: | Anirban Bhowmick Mahesh Chandra Astik Biswas |
| |
Affiliation: | 1.Department of Electronics & Communication,BIT,Mesra,India;2.Department of Electronics & Communication,ABES Engineering College,Ghaziabad,India |
| |
Abstract: | In recent past, wavelet packet (WP) based speech enhancement techniques have been gaining popularity due to their inherent nature of noise minimization. WP based techniques appeared as more robust and efficient than short-time Fourier transform based methods. In the present work, a speech enhancement method using Teager energy operated equal rectangular bandwidth (ERB)-like WP decomposition has been proposed. Twenty four sub-band perceptual wavelet packet decomposition (PWPD) structure is implemented according to the auditory ERB scale. ERB scale based decomposition structure is used because the central frequency of the ERB scale distribution is similar to the frequency response of the human cochlea. Teager energy operator is applied to estimate the threshold value for the PWPD coefficients. Lastly, Wiener filtering is applied to remove the low frequency noise before final reconstruction stage. The proposed method has been applied to evaluate the Hindi sentences database, corrupted with six noise conditions. The proposed method’s performance is analysed with respect to several speech quality parameters and output signal to noise ratio levels. Performance indicates that the proposed technique outperforms some traditional speech enhancement algorithms at all SNR levels. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|