首页 | 本学科首页   官方微博 | 高级检索  
     

一种改进的基于说话者的语音分割算法
引用本文:卢坚,毛兵,孙正兴,张福炎.一种改进的基于说话者的语音分割算法[J].软件学报,2002,13(2):274-279.
作者姓名:卢坚  毛兵  孙正兴  张福炎
作者单位:南京大学,计算机科学与技术系,江苏,南京,210093,南京大学,计算机软件新技术国家重点实验室,江苏,南京,210093
基金项目:国家自然科学基金资助项目(69903006;60073030)
摘    要:语音分割是语音识别和语音文档检索等众多语音应用的基础.提出一种改进的基于说话者的语音分割算法,对GLR和BIC相结合的算法作进一步的改进:(1) 基于GLR距离方差的自适应阈值调整算法改进了不同声学特征下基于距离的语音分割算法中的阈值选取方法;(2) 引入BIC可测度概念来度量其适用范围;(3) BIC信息准则校准非冗余的候选分割点的偏差.实验结果表明,此改进算法优于原算法.

关 键 词:基于说话者的语音分割  贝叶斯信息准则(BIC)  一般似然比(GLR)  mel-frequency  cepstral  coefficient  (MFCC)  假设检验
文章编号:1000-9825/2002/13(02)0274-06
收稿时间:2000/5/10 0:00:00
修稿时间:8/3/2000 12:00:00 AM

An Improved Speaker Based Speech Segmentation Algorithm
LU Jian,MAO Bing,SUN Zheng-xing and ZHANG Fu-yan.An Improved Speaker Based Speech Segmentation Algorithm[J].Journal of Software,2002,13(2):274-279.
Authors:LU Jian  MAO Bing  SUN Zheng-xing and ZHANG Fu-yan
Abstract:Speech segmentation is the foundation of some applications such as speech recognition and spoken document retrieval. An improved algorithm is proposed here which include: (1) GLR variance based threshold adaptive algorithm is to improve the threshold selection approach in speaker based speech segmentation under various acoustic environments;(2) BICs Detection Ability is referred to determine when BIC is effective;(3) Besides to verify the candidate segmentation points, BIC is used to calibrate their bias caused by GLR variance.Experimental results indicate that the improved algorithm is prior to the original one.
Keywords:speaker-based speech segmentation  Bayesian information criterion (BIC)  generalized likelihood ratio (GLR)  mel-frequency cepstral coefficient (MFCC)  hypothesis testing  
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号