Scalable perceptual audio representation with an adaptive three time-scale sinusoidal signal model |
| |
Authors: | Al-Moussawy Raed Yin Junxun Song Shaopeng |
| |
Affiliation: | College of Electronic and Information Eng., South China Univ. of Tech., Guangzhou 510640 |
| |
Abstract: | This work is concerned with the development and optimization of a signal model for scalable perceptual audio coding at low bit rates. A complementary two-part signal model consisting of Sines plus Noise (SN) is described. The paper presents essentially a fundamental enhancement to the sinusoidal modeling component. The enhancement involves an audio signal scheme based on carrying out overlap-add sinusoidal modeling at three successive time scales, large, medium, and small. The sinusoidal modeling is done in an analysis-by-synthesis overlap- add manner across the three scales by using a psychoacoustically weighted matching pursuits. The sinusoidal modeling residual at the first scale is passed to the smaller scales to allow for the modeling of various signal features at appropriate resolutions.This approach greatly helps to correct the pre-echo inherent in the sinusoidal model. This improves the perceptual audio quality upon our previous work of sinusoidal modeling while using tile same number of sinusoids. Tile most obvious application for the SN model is in scalable, high fidelity audio coding and signal modification. |
| |
Keywords: | Multiresolution sinusoidal modeling Parametric audio coding Low-rate audio coding Signal modifications |
本文献已被 CNKI 维普 万方数据 SpringerLink 等数据库收录! |
| 点击此处可从《电子科学学刊(英文版)》浏览原始摘要信息 |
|
点击此处可从《电子科学学刊(英文版)》下载全文 |