首页 | 本学科首页   官方微博 | 高级检索  
     

基于分段线性频谱弯折函数的说话人归一化方法
引用本文:卢正鼎,丰洪才. 基于分段线性频谱弯折函数的说话人归一化方法[J]. 小型微型计算机系统, 2004, 25(12): 2232-2236
作者姓名:卢正鼎  丰洪才
作者单位:华中科技大学,计算机学院,湖北,武汉,430074
基金项目:2003年国家星火计划项目(2003EA760004)资助.
摘    要:在传统的声道长度归一化方法中 ,基于声道无损级联短管模型假设 ,用一个简单的声道因子来确定频谱弯折函数 ,无法描述出不同说话人的频谱差异的细节 .针对这一缺陷 ,提出用细致的分段线性频谱弯折函数来描述说话人差异 ,在适当的频谱分段下 ,较好地完成了频谱对齐的任务 .此外 ,由于利用了与模型无关的频谱弯折函数 ,该方法被证明是一种快速的、尤其适用于无监督模式的说话人鲁棒性方法

关 键 词:语音识别  说话人归一化  频谱弯折
文章编号:1000-1220(2004)12-2232-05

Speaker Normalization Method Based on the Piece-Wise Linear Frequency Warping
LU Zheng-ding,FENG Hong-cai. Speaker Normalization Method Based on the Piece-Wise Linear Frequency Warping[J]. Mini-micro Systems, 2004, 25(12): 2232-2236
Authors:LU Zheng-ding  FENG Hong-cai
Abstract:In traditional vocal tract length normalization (VTLN) method, the details of the differences of spectral among speakers can not be modeled, because only a simple vocal tract length factor is regarded as the absolutely indicator of the speaker specific attribute, according to the assumption of lossless multi-tube vocal tract model. In this paper, the piece-wise frequency warping function was adopted to describe the speaker specific character in detail. With an appropriate partition of frequency axis, the differences of spectral can be removed well. Due to the model-independent warping function, this method is proved to be a quite fast adaptation technique, and especially suitable for the unsupervised adaptation.
Keywords:speech recognition  speaker normalization  frequency warping  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号