A time-frequency blind separation method for underdetermined speech mixtures |
| |
Authors: | Yao Lv Shuangtian Li |
| |
Affiliation: | Graduate University of the Chinese Academy of Sciences, Beijing 100049, China |
| |
Abstract: | The proposed Blind Source Separation method (BSS),based on sparse representations,fuses time-frequency analysis and the clustering approach to separate underdetermined speech mixtures in the anechoic case regardless of the number of sources. The method remedies the insufficiency of the Degenerate Unmixing Estimation Technique (DUET) which assumes the number of sources a priori. In the proposed algorithm,the Short-Time Fourier Transform (STFT) is used to obtain the sparse rep-resentations,a clustering method called Unsupervised Robust C-Prototypes (URCP) which can accurately identify multiple clusters regardless of the number of them is adopted to replace the histo-gram-based technique in DUET,and the binary time-frequency masks are constructed to separate the mixtures. Experimental results indicate that the proposed method results in a substantial increase in the average Signal-to-Interference Ratio (SIR),and maintains good speech quality in the separation results. |
| |
Keywords: | Blind Source Separation (BSS) Sparse signal Unsupervised Robust C-Prototypes (URCP) |
本文献已被 CNKI 维普 万方数据 SpringerLink 等数据库收录! |
| 点击此处可从《电子科学学刊(英文版)》浏览原始摘要信息 |
|
点击此处可从《电子科学学刊(英文版)》下载全文 |