首页 | 本学科首页   官方微博 | 高级检索  
     


Stereo audio source separation based on time–frequency masking and multilevel thresholding
Authors:Maximo ,Jos   J.
Affiliation:aTechnical University of Valencia, Institute for Telecommunications and Multimedia Applications (iTEAM), Camino de Vera s/n, Valencia, Spain
Abstract:Source separation and up-mixing in real commercial music recordings is a challenging problem. In the last few years, some algorithms have provided interesting results, but the problem remains unsolved. In this paper we describe a method for separating the sources present in a two channel mixture based on the panning coefficients used in the stereo mixdown. The sources are separated by estimating time–frequency masks using the multilevel extension of the Otsu thresholding algorithm used in image segmentation. A refinement step is also carried out for extraction and reassignment of inter-source residuals. Examples of application and performance evaluation are also discussed.
Keywords:Sound source separation   Stereo music mixtures   Multilevel thresholding
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号