Stereo audio source separation based on time–frequency masking and multilevel thresholding |
| |
Authors: | Maximo ,Jos J. |
| |
Affiliation: | aTechnical University of Valencia, Institute for Telecommunications and Multimedia Applications (iTEAM), Camino de Vera s/n, Valencia, Spain |
| |
Abstract: | Source separation and up-mixing in real commercial music recordings is a challenging problem. In the last few years, some algorithms have provided interesting results, but the problem remains unsolved. In this paper we describe a method for separating the sources present in a two channel mixture based on the panning coefficients used in the stereo mixdown. The sources are separated by estimating time–frequency masks using the multilevel extension of the Otsu thresholding algorithm used in image segmentation. A refinement step is also carried out for extraction and reassignment of inter-source residuals. Examples of application and performance evaluation are also discussed. |
| |
Keywords: | Sound source separation Stereo music mixtures Multilevel thresholding |
本文献已被 ScienceDirect 等数据库收录! |
|