首页 | 本学科首页   官方微博 | 高级检索  
     

利用概率混合模型的理想比率掩蔽多声源分离方法
引用本文:贾怡恬,杨淇善,贾懋珅,许文杰,鲍长春. 利用概率混合模型的理想比率掩蔽多声源分离方法[J]. 信号处理, 2021, 37(10): 1806-1815. DOI: 10.16798/j.issn.1003-0530.2021.10.003
作者姓名:贾怡恬  杨淇善  贾懋珅  许文杰  鲍长春
作者单位:北京工业大学信息学部
基金项目:国家自然科学基金(61971015)
摘    要:针对基于时频掩蔽的分离方法在多声源场景下的分离效果不佳的问题,论文提出一种利用概率混合模型的理想比率掩蔽多声源分离方法。首先,利用冯·米塞斯分布对时频点处方位角估计进行拟合以及拉普拉斯分布对归一化压力梯度信号向量进行拟合,由此建立概率混合模型。其次,利用期望最大化算法对模型参数进行求解,估计各声源对应的理想比率掩蔽。最后,利用估计出的理想比率掩蔽,从麦克风采集信号中分离得到各声源信号。实验结果表明,与现有基于时频掩蔽的多声源分离方法相比,论文所提方法在欠定场景下具有更好的分离效果。 

关 键 词:理想比率掩蔽   多声源分离   概率混合模型
收稿时间:2021-08-12

Multiple Sound Source Separation via Ideal Ratio Masking by Using Probability Mixture Model
Affiliation:Faculty of Information Technology, Beijing University of Technology
Abstract:The separation method based on time-frequency masking is liable to be invalid under multi-sound source scenes, this paper presents a method of multiple sound source separation via ideal ratio masking by using probability mixture model. First, the von Mises distribution is used to fit the angle estimates at each time-frequency point and the Laplacian distribution is used to fit the vector of normalized pressure gradient signals. Thereby, this paper establishing a probabilistic mixture model. Secondly, the expectation maximization algorithm is used to estimate the model parameters, and the ideal ratio masking corresponding to each source is obtained. Finally, with the estimated ideal ratio masking, source signals are separated from the microphone signals. Experimental results reveal that the performance of the proposed separated method is better than the existing separation method based on time-frequency masking. 
Keywords:
点击此处可从《信号处理》浏览原始摘要信息
点击此处可从《信号处理》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号