首页 | 本学科首页   官方微博 | 高级检索  
     

Mandarin Digits Speech Recognition Using Support Vector Machines
引用本文:谢湘,匡镜明. Mandarin Digits Speech Recognition Using Support Vector Machines[J]. 北京理工大学学报(英文版), 2005, 14(1): 9-12
作者姓名:谢湘  匡镜明
作者单位:SchoolofInformationScienceandTechnology,BeijingInstituteofTechnology,Beijing100081,China
摘    要:A method of applying support vector machine (SVM) in speech recognition was proposed, and a speech recognition system for mandarin digits was built up by SVMs. In the system, vectors were linearly extracted from speech feature sequence to make up time-aligned input patterns for SVM, and the decisions of several 2-class SVM classifiers were employed for constructing an N-class classifier. Four kinds of SVM kernel functions were compared in the experiments of speaker-independent speech recognition of mandarin digits. And the kernel of radial basis function has the highest accurate rate of 99.33 %, which is better than that of the baseline system based on hidden Markov models (HMM) (97.08%). And the experiments also show that SVM can outperform HMM especially when the samples for learning were very limited.

关 键 词:语音识别 普通话 演讲 无线电机器
收稿时间:2003-08-28

Mandarin Digits Speech Recognition Using Support Vector Machines
XIE Xiang and KUANG Jing-ming. Mandarin Digits Speech Recognition Using Support Vector Machines[J]. Journal of Beijing Institute of Technology, 2005, 14(1): 9-12
Authors:XIE Xiang and KUANG Jing-ming
Affiliation:School of Information Science and Technology, Beijing Institute of Technology, Beijing 100081, China
Abstract:A method of applying support vector machine (SVM) in speech recognition was proposed, and a speech recognition system for mandarin digits was built up by SVMs. In the system, vectors were linearly extracted from speech feature sequence to make up time-aligned input patterns for SVM, and the decisions of several 2-class SVM classifiers were employed for constructing an N-class classifier. Four kinds of SVM kernel functions were compared in the experiments of speaker-independent speech recognition of mandarin digits. And the kernel of radial basis function has the highest accurate rate of 99.33%, which is better than that of the baseline system based on hidden Markov models (HMM) (97.08%). And the experiments also show that SVM can outperform HMM especially when the samples for learning were very limited.
Keywords:speech recognition  support vector machine (SVM)  kernel function
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《北京理工大学学报(英文版)》浏览原始摘要信息
点击此处可从《北京理工大学学报(英文版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号