Speech Enhancement Using Phase‐Dependent A Priori SNR Estimator in Log‐Mel Spectral Domain
Authors: Yun‐Kyung Lee, Jeon Gue Park, Yun Keun Lee, Oh‐Wook Kwon
Affiliation: 1. Yun‐Kyung Lee (yunklee@etri.re.kr), Jeon Gue Park (jgp@etri.re.kr), and Yun Keun Lee (yklee@etri.re.kr) are with the SW·Content Research Laboratory, ETRI, Daejeon, Rep. of Korea. 2. Oh‐Wook Kwon (corresponding author, owkwon@cbnu.ac.kr) is with the School of Electronics Engineering, Chungbuk National University, Cheongju, Rep. of Korea.
Abstract: We propose a novel phase‐based method for single‐channel speech enhancement that uses phase information to extract and enhance the desired signal in noisy environments. In this method, a phase‐dependent a priori signal‐to‐noise ratio (SNR) is estimated in the log‐mel spectral domain so that both the magnitude and the phase of the input speech signal are used. The phase‐dependent estimator is incorporated into the conventional magnitude‐based decision‐directed approach, which recursively computes the a priori SNR from noisy speech. Additionally, we reduce the performance degradation caused by the one‐frame delay of the estimated phase‐dependent a priori SNR by using minimum mean square error (MMSE)‐based and maximum a posteriori (MAP)‐based estimators. In our speech enhancement experiments, the proposed phase‐dependent a priori SNR estimator improves the output SNR by 2.6 dB over a conventional magnitude‐based estimator in both the MMSE‐based and the MAP‐based cases.
Keywords: Phase modeling, speech enhancement, speech separation, decision‐directed approach, minimum mean square error estimator
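
For readers unfamiliar with the conventional magnitude-based decision-directed approach that the abstract builds on, the following is a minimal sketch of that baseline recursion for a single frequency bin. It is not the proposed phase-dependent, log-mel-domain estimator. The function name `decision_directed_snr`, the smoothing factor `alpha=0.98`, the floor `xi_min`, the constant known noise power, and the Wiener gain are all assumptions made for illustration and are not taken from the paper.

```python
def decision_directed_snr(noisy_power, noise_power, alpha=0.98, xi_min=1e-3):
    """Baseline decision-directed a priori SNR for one frequency bin.

    noisy_power: per-frame noisy power |Y_t|^2 (hypothetical input)
    noise_power: noise power, assumed known and constant here
    Returns (a priori SNR per frame, enhanced power per frame).
    """
    xi_track = []
    enhanced_power = []
    prev_clean_power = None  # clean-speech power estimate from the previous frame

    for y_pow in noisy_power:
        gamma = y_pow / noise_power           # a posteriori SNR
        ml_term = max(gamma - 1.0, 0.0)       # instantaneous SNR estimate
        if prev_clean_power is None:
            xi = ml_term                      # first frame: no history yet
        else:
            # Weighted mix of the previous frame's estimate and the current frame.
            xi = alpha * prev_clean_power / noise_power + (1.0 - alpha) * ml_term
        xi = max(xi, xi_min)                  # floor to avoid a zero gain

        gain = xi / (1.0 + xi)                # Wiener gain (an assumed choice)
        prev_clean_power = gain * gain * y_pow

        xi_track.append(xi)
        enhanced_power.append(prev_clean_power)

    return xi_track, enhanced_power


# Speech onset in the third frame: the estimate lags the instantaneous SNR.
xi, enhanced = decision_directed_snr([1.0, 1.0, 9.0], noise_power=1.0)
```

Because the recursion leans heavily on the previous frame, the a priori SNR at a speech onset stays well below the instantaneous value. This is the one-frame delay that the abstract says degrades performance.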