Iterative reconstruction of speech from short-time Fourier transform phase and magnitude spectra

Affiliation:	1. Faculty of Electrical Engineering, University of Montenegro, 20000 Podgorica, Montenegro; 2. The CAC, Villanova University, PA, USA; 1. Université de Moncton, Campus de Shippagan, Canada; 2. LCPTS, FEI, USTHB, B.P. 32 El Alia, Bab-Ezzouar, 16111, Algeria; 1. Shanghai Institute of Applied Mathematics and Mechanics, Shanghai Key Laboratory of Mechanics in Energy Engineering, Shanghai University, Shanghai 200072, China; 2. Ningbo Institute of Technology, Zhejiang University, Ningbo 315100, China; 3. School of Science, East China University of Science and Technology, Shanghai 200237, China; 1. York University, Department of Mathematics & Statistics, Canada; 2. University of Saskatchewan, Department of Medical Imaging, Canada

Abstract:	In this paper, we consider the iterative reconstruction of one-dimensional signals (specifically speech signals) from the magnitude spectrum and the phase spectrum. While this topic has been extensively researched and documented, we wish to recast some well-established results for the benefit of new researchers and those who desire a short, yet comprehensive, review of the subject. The three main points of the review are: (i) a signal can be reconstructed to within a scale factor from its phase spectrum, (ii) a signal cannot be reconstructed to within a scale factor from its magnitude spectrum, and (iii) a signal can be reconstructed to within a scale factor from its magnitude spectrum when the phase-sign (i.e., one bit of phase spectrum information) is known. Through a number of illustrative examples, we first demonstrate how the algorithms work when the spectral information is determined over the entire duration of the signal. We then demonstrate that the algorithms are equally valid for reconstructing a signal from the spectra obtained from short-time segments. In addition, we present the results of further experiments in which we attempted to reconstruct a speech signal from only partial phase spectrum information (in the absence of all magnitude spectrum information). We make the following observations: (i) intelligible (albeit noisy) signal reconstruction is possible from knowledge of only the phase spectrum sign information, (ii) an intelligible signal cannot be reconstructed from knowledge of only the phase spectrum frequency-derivative or only the phase spectrum time-derivative, and (iii) an intelligible signal can be reconstructed from the combined knowledge of both the phase spectrum frequency-derivative and time-derivative.
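Point (i) of the abstract can be illustrated with a small sketch of phase-only iterative reconstruction. This is not the paper's own code: it is a generic alternating-constraint iteration that assumes a real signal of known finite length, a DFT at least twice that length, and the usual uniqueness conditions. The function names (`dft`, `idft`, `reconstruct_from_phase`, `correlation`) and the test signal are hypothetical and chosen only for illustration.

```python
import cmath
import math


def dft(x, m):
    """Naive m-point DFT of x, zero-padded to length m (illustrative helper)."""
    padded = list(x) + [0.0] * (m - len(x))
    return [sum(padded[n] * cmath.exp(-2j * math.pi * k * n / m) for n in range(m))
            for k in range(m)]


def idft(X):
    """Naive inverse DFT matching dft() above."""
    m = len(X)
    return [sum(X[k] * cmath.exp(2j * math.pi * k * n / m) for k in range(m)) / m
            for n in range(m)]


def reconstruct_from_phase(phase, n, iterations=100):
    """Estimate a real length-n signal from its m-point DFT phase alone.

    Assumption: m = len(phase) is at least 2 * n. The result can match the
    original only up to a positive scale factor.
    """
    m = len(phase)
    # Initial guess: unit magnitude combined with the known phase.
    X = [cmath.exp(1j * p) for p in phase]
    x = [0.0] * n
    for _ in range(iterations):
        # Time-domain constraint: real-valued and zero outside 0..n-1.
        x = [v.real for v in idft(X)[:n]]
        # Frequency-domain constraint: keep the current magnitude,
        # replace the phase with the known phase.
        X = [abs(c) * cmath.exp(1j * p) for c, p in zip(dft(x, m), phase)]
    return x


def correlation(a, b):
    """Normalized correlation, which is insensitive to the unknown scale factor."""
    dot = sum(p * q for p, q in zip(a, b))
    return dot / math.sqrt(sum(p * p for p in a) * sum(q * q for q in b))


# Hypothetical 8-sample test signal, analysed with a 32-point DFT.
signal = [0.9, -0.4, 0.7, 0.2, -0.6, 0.3, 0.5, -0.1]
known_phase = [cmath.phase(c) for c in dft(signal, 32)]
estimate = reconstruct_from_phase(known_phase, len(signal))
print("normalized correlation with the original:", correlation(signal, estimate))
```

Because only the phase is imposed, the estimate is compared with the original through a normalized correlation rather than sample by sample. Each step of the loop is non-expansive with respect to the true signal, so the distance to the original should not grow from one iteration to the next. A short-time variant would apply the same loop to each windowed segment before overlap-adding the results.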

This article is indexed in ScienceDirect and other databases.