
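Speaker clustering algorithm based on cross log-likelihood ratio and Bayesian information criterion
Cite this article: LIU Tan-tan, PAN Jie-lin, SUO Hong-bin, YAN Yong-hong. Speaker clustering algorithm based on cross log-likelihood ratio and Bayesian information criterion[J]. Technical Acoustics, 2007, 26(6): 1181-1185.
Authors: LIU Tan-tan, PAN Jie-lin, SUO Hong-bin, YAN Yong-hong
Affiliation: Think IT Speech Laboratory, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100080, China
Funding: National Basic Research Program of China (973 Program); National Natural Science Foundation of China; research project of the Beijing Municipal Science and Technology Commission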
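Abstract: The task of speaker segmentation and clustering is to group together the speech uttered by the same speaker within a recording. This paper proposes a speaker clustering algorithm that combines the Cross Log-likelihood Ratio (CLR) with the Bayesian information criterion (BIC). The CLR is used to compute the similarity between speech segments, while the BIC provides a suitable criterion for stopping the clustering. The algorithm combines the advantages of both methods and performs well in unsupervised speaker clustering. Experimental results show that the speaker clustering method based on CLR and BIC is more accurate than the method that uses CLR alone.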

Keywords: speaker clustering; cross log-likelihood ratio; Bayesian criterion; clustering
Article ID: 1000-3630(2007)-06-1181-05
Received: 2006-09-28
Revised: 2007-02-07

Speaker diarization algorithm based on CLR and BIC
LIU Tan-tan, PAN Jie-lin, SUO Hong-bin and YAN Yong-hong. Speaker diarization algorithm based on CLR and BIC[J]. Technical Acoustics, 2007, 26(6): 1181-1185.
Authors: LIU Tan-tan, PAN Jie-lin, SUO Hong-bin and YAN Yong-hong
Affiliation: Think IT Speech Laboratory, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100080, China
Abstract: The task of speaker diarization is to group together speech segments uttered by the same speaker. This paper presents an approach to speaker diarization based on a novel combination of the Cross Log-likelihood Ratio (CLR) and the standard Bayesian information criterion (BIC). The CLR provides an inter-cluster distance measure, while the BIC provides a proper stopping criterion for clustering. The method combines the advantages of the two and yields favorable performance in unsupervised speaker diarization. Experimental results show that the proposed approach, which combines CLR and BIC, performs better than an approach based on CLR clustering alone.
Keywords:speaker diarization  CLR  BIC  clustering  
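
The abstract gives no formulas, so the following is only a minimal sketch of the general idea it describes: agglomerative clustering in which a CLR-style score selects the pair of clusters to merge and a delta-BIC test decides when to stop. It is not the authors' implementation. Several things here are assumptions made for illustration: each cluster is modelled by a single full-covariance Gaussian (practical systems typically use Gaussian mixture models), the background model is a Gaussian fitted to all pooled frames, and the function names and the penalty weight `lam` are hypothetical.

```python
import numpy as np

def fit_gaussian(x):
    """Fit a full-covariance Gaussian; a tiny ridge keeps the covariance invertible."""
    d = x.shape[1]
    return x.mean(axis=0), np.cov(x, rowvar=False, bias=True) + 1e-6 * np.eye(d)

def avg_loglik(x, model):
    """Average per-frame log-likelihood of frames x under a Gaussian model."""
    mean, cov = model
    d = x.shape[1]
    diff = x - mean
    _, logdet = np.linalg.slogdet(cov)
    maha = np.einsum("ij,jk,ik->i", diff, np.linalg.inv(cov), diff)
    return float(np.mean(-0.5 * (d * np.log(2 * np.pi) + logdet + maha)))

def clr(xi, xj, background):
    """CLR-style similarity (assumed form): larger means the clusters are more alike."""
    mi, mj = fit_gaussian(xi), fit_gaussian(xj)
    return (avg_loglik(xi, mj) - avg_loglik(xi, background)
            + avg_loglik(xj, mi) - avg_loglik(xj, background))

def delta_bic(xi, xj, lam=1.0):
    """Delta-BIC for a candidate merge: positive favours keeping the clusters apart."""
    ni, nj, d = len(xi), len(xj), xi.shape[1]
    n = ni + nj

    def logdet(x):
        return np.linalg.slogdet(fit_gaussian(x)[1])[1]

    # Penalty counts the extra parameters of two Gaussians versus one.
    penalty = 0.5 * lam * (d + 0.5 * d * (d + 1)) * np.log(n)
    gain = 0.5 * (n * logdet(np.vstack([xi, xj])) - ni * logdet(xi) - nj * logdet(xj))
    return gain - penalty

def cluster(segments, lam=1.0):
    """Merge the most similar pair (by CLR) until delta-BIC rejects the merge."""
    clusters = [np.asarray(s, dtype=float) for s in segments]
    background = fit_gaussian(np.vstack(clusters))
    while len(clusters) > 1:
        pairs = [(i, j) for i in range(len(clusters))
                 for j in range(i + 1, len(clusters))]
        i, j = max(pairs, key=lambda p: clr(clusters[p[0]], clusters[p[1]], background))
        if delta_bic(clusters[i], clusters[j], lam) > 0:
            break  # BIC prefers two separate models: stop clustering
        merged = np.vstack([clusters[i], clusters[j]])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return clusters
```

Each segment is passed as an array of feature frames with shape `(frames, dims)`, for example `cluster([seg_a, seg_b, seg_c])`, and the function returns the merged clusters as a list of arrays. Raising `lam` makes the stopping test more willing to merge, lowering it makes the test stop earlier.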
This article is indexed in databases including CNKI, VIP, and Wanfang Data.