Stereo audio source separation based on time–frequency masking and multilevel thresholding期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Stereo audio source separation based on time–frequency masking and multilevel thresholding

Authors:	Maximo ,Jos J.

Affiliation:	^aTechnical University of Valencia, Institute for Telecommunications and Multimedia Applications (iTEAM), Camino de Vera s/n, Valencia, Spain

Abstract:	Source separation and up-mixing in real commercial music recordings is a challenging problem. In the last few years, some algorithms have provided interesting results, but the problem remains unsolved. In this paper we describe a method for separating the sources present in a two channel mixture based on the panning coefficients used in the stereo mixdown. The sources are separated by estimating time–frequency masks using the multilevel extension of the Otsu thresholding algorithm used in image segmentation. A refinement step is also carried out for extraction and reassignment of inter-source residuals. Examples of application and performance evaluation are also discussed.

Keywords:	Sound source separation Stereo music mixtures Multilevel thresholding
本文献已被 ScienceDirect 等数据库收录！