基于分段线性频谱弯折函数的说话人归一化方法 Speaker Normalization Method Based on the Piece-Wise Linear Frequency Warping期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于分段线性频谱弯折函数的说话人归一化方法

引用本文：	卢正鼎,丰洪才. 基于分段线性频谱弯折函数的说话人归一化方法[J]. 小型微型计算机系统, 2004, 25(12): 2232-2236

作者姓名：	卢正鼎丰洪才

作者单位：	华中科技大学,计算机学院,湖北,武汉,430074

基金项目：	2003年国家星火计划项目(2003EA760004)资助.

摘要：	在传统的声道长度归一化方法中 ,基于声道无损级联短管模型假设 ,用一个简单的声道因子来确定频谱弯折函数 ,无法描述出不同说话人的频谱差异的细节 .针对这一缺陷 ,提出用细致的分段线性频谱弯折函数来描述说话人差异 ,在适当的频谱分段下 ,较好地完成了频谱对齐的任务 .此外 ,由于利用了与模型无关的频谱弯折函数 ,该方法被证明是一种快速的、尤其适用于无监督模式的说话人鲁棒性方法
关键词：	语音识别说话人归一化频谱弯折
文章编号：	1000-1220(2004)12-2232-05
Speaker Normalization Method Based on the Piece-Wise Linear Frequency Warping

LU Zheng-ding,FENG Hong-cai. Speaker Normalization Method Based on the Piece-Wise Linear Frequency Warping[J]. Mini-micro Systems, 2004, 25(12): 2232-2236

Authors:	LU Zheng-ding FENG Hong-cai

Abstract:	In traditional vocal tract length normalization (VTLN) method, the details of the differences of spectral among speakers can not be modeled, because only a simple vocal tract length factor is regarded as the absolutely indicator of the speaker specific attribute, according to the assumption of lossless multi-tube vocal tract model. In this paper, the piece-wise frequency warping function was adopted to describe the speaker specific character in detail. With an appropriate partition of frequency axis, the differences of spectral can be removed well. Due to the model-independent warping function, this method is proved to be a quite fast adaptation technique, and especially suitable for the unsupervised adaptation.

Keywords:	speech recognition speaker normalization frequency warping
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏