首页 | 本学科首页   官方微博 | 高级检索  
     

基于CDCPM的维吾尔语非特定人语音识别
引用本文:王昆仑.基于CDCPM的维吾尔语非特定人语音识别[J].计算机研究与发展,2001,38(10):1242-1245.
作者姓名:王昆仑
作者单位:新疆师范大学计算机科学系
基金项目:国家自然科学基金 ( 695 62 0 0 1),新疆维吾尔自治区“九五”重点攻关科研项目基金 ( G95 32 60 3)资助,荣获 1998年新疆维吾尔自治区人民政府“科学技术进步奖”三等奖
摘    要:现代维吾尔语语音识别研究尚处于超始阶段,在此介绍了基于中心距离连续概率模型(CDCPM)的维吾尔语非特定人语音识别。CDCPM用中心距离正态(CDN)分布描述模型特征空间,去掉了HMM的状态转移概率矩阵A,对HMM进行了简化和改进。在维吾尔语综合语音库上进行的实验表明:恰当地估计模型状态数和模型混合密度数,当模型数为525个,模型状态数为16,混合密度数为24,维吾尔语非特定人语音识别首选正识率达到97.90%(集内)和94.76%(集外),取得了较好的识别效果。同时,指出了进一步开展维吾尔语语音识别研究的几个问题。

关 键 词:维吾尔语  语音识别  中心距离连续概率模型  CDCPM

UIGHUR SPEAKER-INDEPENDENT SPEECH RECOGNITION BASED ON CDCPM
WANG Kun-Lun.UIGHUR SPEAKER-INDEPENDENT SPEECH RECOGNITION BASED ON CDCPM[J].Journal of Computer Research and Development,2001,38(10):1242-1245.
Authors:WANG Kun-Lun
Abstract:The Uighur speech recognition research is in the starting stage. Introduced in this paper is Uighur speaker independent speech recognition based on the center distance continuance probability model (CDCPM). CDCPM describes the feature space of model by center distance normal distribution (CDN), and simplifies and improves the HMM efficiently by getting rid of the state transition probability matrix A. A large amount of experimentation carried out with the Uighur synthetic speech database shows that numbers of state and amalgamate density for models can be adjusted adequately. When the number of model is 525, state number of model is 16, and the amalgamate density number of model is 24, the rate of first correct recognition is up to 94 76% (in set) and 97.90%(out set) on the Uighur speaker independent speech recognition. Recognition result with good performance is derived. At the same time, some problems about Uighur speech recognition research are pointed out.
Keywords:Uighur  speaker  independent  speech recognition  Uighur synthetic speech database  center  distance continuance probability model  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号