Perceptual Audio Coding Using Sinusoidal/Optimum Wavelet Representation |
| |
Authors: | PS Sathidevi, Y Venkataramani |
| |
Affiliation: | (1) Department of Electronics Engineering, National Institute of Technology, Calicut, Kerala, 673 601, India |
| |
Abstract: | A perceptual audio coder, in which each audio segment is
adaptively analyzed using either a sinusoidal or an optimum wavelet basis
according to the time-varying characteristics of the audio signals, has been
constructed. The basis optimization is achieved by a novel switched filter
bank scheme, which switches between a uniform filter bank structure
(discrete cosine transform) and a non-uniform filter bank structure
(discrete wavelet transform). Pre-echo distortion, a major artifact of the
International Organization for Standardization/Moving Picture Experts Group
(ISO/MPEG) audio coding standard (MPEG-I layers 1 and 2), which uses a
uniform filter bank structure for audio signal analysis, is almost
eliminated in the proposed coder. A
perceptual masking model implemented using a high-resolution wavelet packet
filter bank with 27 subbands, closely mimicking the critical bands
of the human auditory system, is employed in this audio coder. The resulting
scheme is a variable bit-rate audio coder, which provides compression ratios
comparable to MPEG-I layers 1 and 2 with almost transparent quality. |
| |
Keywords: | |
This article is indexed in SpringerLink and other databases. |
|
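
The switched filter bank idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state how the coder decides between the two bases, so the energy-jump detector `is_transient` and its parameters `n_blocks` and `ratio` are hypothetical, and a plain Haar transform stands in for the paper's optimum wavelet basis. The sketch only shows the general pattern of analyzing steady frames with a uniform (DCT) basis and transient frames with a non-uniform (wavelet) basis. Frame lengths are assumed to be powers of two.

```python
import numpy as np

def dct_ii(x):
    """Orthonormal DCT-II of a 1-D frame (uniform frequency resolution)."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    k = np.arange(n)[:, None]          # frequency index
    m = np.arange(n)[None, :]          # time index
    basis = np.cos(np.pi * (2 * m + 1) * k / (2 * n)) * np.sqrt(2.0 / n)
    basis[0, :] /= np.sqrt(2.0)        # scale the DC row for orthonormality
    return basis @ x

def haar_dwt(x):
    """Full-depth orthonormal Haar DWT (octave-band, non-uniform resolution).

    Stand-in for the paper's optimum wavelet basis (assumption).
    """
    approx = np.asarray(x, dtype=float)
    details = []
    while len(approx) > 1:
        even, odd = approx[0::2], approx[1::2]
        details.append((even - odd) / np.sqrt(2.0))
        approx = (even + odd) / np.sqrt(2.0)
    # Coarsest coefficients first, finest detail band last.
    return np.concatenate([approx] + details[::-1])

def is_transient(frame, n_blocks=8, ratio=10.0):
    """Hypothetical detector: True if energy jumps sharply between sub-blocks."""
    blocks = np.asarray(frame, dtype=float).reshape(n_blocks, -1)
    energy = (blocks ** 2).sum(axis=1) + 1e-12   # avoid division by zero
    return bool(np.max(energy[1:] / energy[:-1]) > ratio)

def analyze(frame):
    """Return the chosen basis name and the transform coefficients."""
    if is_transient(frame):
        return "wavelet", haar_dwt(frame)
    return "sinusoidal", dct_ii(frame)

# Two illustrative 256-sample frames: a steady tone and a sudden onset.
n = np.arange(256)
tone = np.sin(2 * np.pi * 8 * n / 256)
click = np.zeros(256)
click[200:] = 1.0

for name, frame in (("tone", tone), ("click", click)):
    basis, coeffs = analyze(frame)
    print(name, basis, len(coeffs))
```

Both transforms here are orthonormal, so the coefficient energy equals the frame energy whichever basis is selected. A real coder would also need to signal the chosen basis to the decoder for each frame.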