首页 | 本学科首页   官方微博 | 高级检索  
     

基于矢量量化方法的说话人识别技术
引用本文:张一清,李轶.基于矢量量化方法的说话人识别技术[J].杭州电子科技大学学报,2005,25(4):58-61.
作者姓名:张一清  李轶
作者单位:杭州电子科技大学自动化学院,浙江,杭州,310018
摘    要:说话人识别是一项通过语音来识别说话人身份的技术,它在保安、司法、军事、财经和信息服务等领域都具有广泛的应用前景。该文采用线性预测倒谱系数和美尔倒谱系数特征相结合,基于矢量量化聚类方法建立了一个与文本无关的、连续语音发音的说话人识别系统。只要矢量量化聚类法码本大小选择合适,该说话人识别系统就可以获得较好的识别效果。当阈值恰当选取时,该系统具备拒绝识别集外人的功能。

关 键 词:矢量量化  说话人识别  线性预测倒谱系数  美尔倒谱系数
文章编号:1001-9146(2005)04-0058-04
收稿时间:2005-07-08
修稿时间:2005-07-08

Speaker Recognition Technology Based on VQ
ZHANG Yi-qing,LI Yi.Speaker Recognition Technology Based on VQ[J].Journal of Hangzhou Dianzi University,2005,25(4):58-61.
Authors:ZHANG Yi-qing  LI Yi
Abstract:Speaker recognition is a kind of technology to judge the speaker's identify according to his voice. It has good prospect in many areas such as security, judicatory, and military. One speaker identification system by extracting MFCC as feature vector and using VQ in match phase is constructed. The results of the experiment indicate that, the speaker recognition model based on VQ is effective; the advantage is correct classifying, small memory need and rapid judging.
Keywords:vector quantization(VQ)  speaker identification  LPCC cepstrum  MFCC cepstrum
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号