Speech enhancement based on a sinusoidal model期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Speech enhancement based on a sinusoidal model

Authors:	JM Kates

Affiliation:	Center for Research in Speech and Hearing Sciences City University of New York.

Abstract:	Sinusoidal modeling is a new procedure for representing the speech signal. In this approach, the signal is divided into overlapping segments, the Fourier transform computed for each segment, and a set of desired spectral peaks is identified. The speech is then resynthesized using sinusoids that have the frequency, amplitude, and phase of the selected peaks, with the remaining spectral information being discarded. Using a limited number of sinusoids to reproduce speech in a background of multi-talker speech babble results in a speech signal that has an improved signal-to-noise ratio and enhanced spectral contrast. The more intense spectral components, assumed to be primarily the desired speech, are reproduced, whereas the less intense components, assumed to be primarily background noise, are not. To test the effectiveness of this processing approach as a noise suppression technique, both consonant recognition and perceived speech intelligibility were determined in quiet and in noise for a group of subjects with normal hearing as the number of sinusoids used to represent isolated speech tokens was varied. The results show that reducing the number of sinusoids used to represent the speech causes reduced consonant recognition and perceived intelligibility both in quiet and in noise, and suggests that similar results would be expected for listeners with hearing impairments.

Keywords:
本文献已被 PubMed 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏