Mandarin Digits Speech Recognition Using Support Vector Machines Mandarin Digits Speech Recognition Using Support Vector Machines期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Mandarin Digits Speech Recognition Using Support Vector Machines

作者姓名：	谢湘匡镜明

作者单位：	SchoolofInformationScienceandTechnology,BeijingInstituteofTechnology,Beijing100081,China

摘要：	A method of applying support vector machine (SVM) in speech recognition was proposed, and a speech recognition system for mandarin digits was built up by SVMs. In the system, vectors were linearly extracted from speech feature sequence to make up time-aligned input patterns for SVM, and the decisions of several 2-class SVM classifiers were employed for constructing an N-class classifier. Four kinds of SVM kernel functions were compared in the experiments of speaker-independent speech recognition of mandarin digits. And the kernel of radial basis function has the highest accurate rate of 99.33 %, which is better than that of the baseline system based on hidden Markov models (HMM) (97.08%). And the experiments also show that SVM can outperform HMM especially when the samples for learning were very limited.
关键词：	语音识别普通话演讲无线电机器
收稿时间：	2003/8/28 0:00:00
Mandarin Digits Speech Recognition Using Support Vector Machines

XIE Xiang and KUANG Jing-ming.Mandarin Digits Speech Recognition Using Support Vector Machines[J].Journal of Beijing Institute of Technology,2005,14(1):9-12.

Authors:	XIE Xiang and KUANG Jing-ming

Affiliation:	School of Information Science and Technology, Beijing Institute of Technology, Beijing 100081, China

Abstract:	A method of applying support vector machine (SVM) in speech recognition was proposed, and a speech recognition system for mandarin digits was built up by SVMs. In the system, vectors were linearly extracted from speech feature sequence to make up time-aligned input patterns for SVM, and the decisions of several 2-class SVM classifiers were employed for constructing an N-class classifier. Four kinds of SVM kernel functions were compared in the experiments of speaker-independent speech recognition of mandarin digits. And the kernel of radial basis function has the highest accurate rate of 99.33%, which is better than that of the baseline system based on hidden Markov models (HMM) (97.08%). And the experiments also show that SVM can outperform HMM especially when the samples for learning were very limited.

Keywords:	speech recognition support vector machine (SVM) kernel function
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《北京理工大学学报(英文版)》浏览原始摘要信息
	点击此处可从《北京理工大学学报(英文版)》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏