
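Research on Hierarchical Speaker Recognition Based on Speaker Clustering Technology
Cite this article: LIU Wen-ju, SUN Bing, ZHONG Qiu-hai. Research on Hierarchical Speaker Recognition Based on Speaker Clustering Technology [J]. Acta Electronica Sinica, 2005, 33(7): 1230-1233.
Authors: LIU Wen-ju, SUN Bing, ZHONG Qiu-hai
Affiliations: 1. National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100080; 2. Department of Automatic Control, Beijing Institute of Technology, Beijing 100081
Funding: National Natural Science Foundation of China; Beijing Natural Science Foundation
Abstract: Recognition accuracy and noise robustness are certainly central to speaker recognition research, but response speed is also key to whether a system is practical. This paper proposes a hierarchical speaker identification method based on speaker clustering that greatly increases system running speed; compared with conventional speaker identification, its advantage becomes more pronounced as the number of registered speakers grows. In speaker verification, the method further improves verification accuracy and effectively lowers the false acceptance and false rejection rates. The reliability scoring method proposed here also improves system performance to some extent. Experiments show that the speaker-clustering-based identification method ran 3.5 times faster on average, and that the equal error rate and minimum error rate for speaker verification fell by 53.75% on average.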

Keywords: speaker identification; speaker verification; speaker clustering; cohort set; reliability scoring
Article ID: 0372-2112(2005)05-1230-04
Received: 2004-09-17
Revised: 2004-11-29

Research on Hierarchical Speaker Recognition Based on Speaker Clustering Technology
LIU Wen-ju, SUN Bing, ZHONG Qiu-hai. Research on Hierarchical Speaker Recognition Based on Speaker Clustering Technology [J]. Acta Electronica Sinica, 2005, 33(7): 1230-1233.
Authors: LIU Wen-ju, SUN Bing, ZHONG Qiu-hai
Affiliation: 1. National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100080, China; 2. Department of Automatic Control, Beijing Institute of Technology, Beijing 100081, China
Abstract: Recognition accuracy and noise robustness are important in speaker recognition research, but response speed is also a key factor in whether a speaker recognition system is practical in real-world use. We therefore propose a speaker identification approach based on speaker clustering, Hierarchical Speaker Identification (HSI). HSI greatly increases the running speed of speaker identification systems, and its speed advantage over Conventional Speaker Identification (CSI) grows as the number of registered speakers increases. Its counterpart for speaker verification, also based on speaker clustering, improves verification accuracy by effectively reducing the false rejection and false acceptance rates. We also present a new method called reliability scoring. Experiments show that, on average, the speaker-clustering-based algorithms run 3.5 times faster than the original approach for speaker identification and reduce the equal error rate and minimum error rate for speaker verification by 53.75%.
Keywords: speaker identification; speaker verification; speaker clustering; cohort set; reliability scoring
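
The abstract does not spell out the HSI algorithm, so the following is only a minimal sketch of the general idea of clustering-based hierarchical identification, not the authors' implementation. It assumes, purely for illustration, that each registered speaker is represented by a single mean feature vector, that speakers are grouped with plain k-means, and that a test utterance is scored by negative mean squared distance. The function names (`cluster_speakers`, `identify_flat`, `identify_hierarchical`) are hypothetical. The sketch compares how many models must be scored in a flat search versus a two-stage search that first picks a cluster and then scores only the speakers inside it.

```python
import numpy as np

def cluster_speakers(models, k, iters=20, seed=0):
    """Group speaker models with plain k-means (an assumed stand-in for the
    paper's clustering step). Returns (centroids, labels)."""
    rng = np.random.default_rng(seed)
    centroids = models[rng.choice(len(models), size=k, replace=False)].copy()
    for _ in range(iters):
        dist = np.linalg.norm(models[:, None, :] - centroids[None, :, :], axis=2)
        labels = dist.argmin(axis=1)
        for c in range(k):
            members = models[labels == c]
            if len(members):  # keep the old centroid if a cluster empties
                centroids[c] = members.mean(axis=0)
    # Final assignment against the final centroids.
    dist = np.linalg.norm(models[:, None, :] - centroids[None, :, :], axis=2)
    return centroids, dist.argmin(axis=1)

def score(frames, model):
    """Toy similarity: negative mean squared distance from frames to a model."""
    return -np.mean(np.sum((frames - model) ** 2, axis=1))

def identify_flat(frames, models):
    """Conventional search: score every registered speaker."""
    scores = [score(frames, m) for m in models]
    return int(np.argmax(scores)), len(models)

def identify_hierarchical(frames, models, centroids, labels):
    """Two-stage search: pick the best non-empty cluster, then score only
    the speakers assigned to it. Returns (speaker index, models scored)."""
    nonempty = np.unique(labels)
    cluster_scores = [score(frames, centroids[c]) for c in nonempty]
    best = nonempty[int(np.argmax(cluster_scores))]
    candidates = np.flatnonzero(labels == best)
    scores = [score(frames, models[i]) for i in candidates]
    winner = int(candidates[int(np.argmax(scores))])
    return winner, len(nonempty) + len(candidates)

if __name__ == "__main__":
    rng = np.random.default_rng(42)
    models = rng.normal(scale=5.0, size=(200, 12))   # 200 synthetic speakers
    centroids, labels = cluster_speakers(models, k=14)
    target = 17
    frames = models[target] + rng.normal(scale=0.5, size=(50, 12))
    flat_id, flat_evals = identify_flat(frames, models)
    hier_id, hier_evals = identify_hierarchical(frames, models, centroids, labels)
    print("flat:", flat_id, "models scored:", flat_evals)
    print("hierarchical:", hier_id, "models scored:", hier_evals)
```

In this toy setting the flat search always scores every registered speaker, while the two-stage search scores the cluster centroids plus the members of one cluster, so the gap widens as more speakers are registered. The trade-off is that a wrong first-stage cluster choice cannot be recovered in the second stage. The synthetic data and cluster count here are arbitrary and say nothing about the 3.5-times figure reported in the abstract.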
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.