低资源语音识别若干关键技术研究进展 Research Progress on Key Technologies of Low Resource Speech Recognition期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

低资源语音识别若干关键技术研究进展

引用本文：	刘加张卫强. 低资源语音识别若干关键技术研究进展[J]. 数据采集与处理, 2017, 32(2): 205-220

作者姓名：	刘加张卫强

作者单位：	清华大学电子工程系，北京，100084

摘要：	低资源语音识别是当今语音界研究的热点问题之一，也是多语言小语种语音识别技术在实际应用中所面临的重要挑战之一。本文回顾并总结了低资源语音识别的发展历史和研究现状，重点介绍了低资源语音识别在声学特征、声学模型和语言模型方面的若干关键技术研究进展。具体内容包括发音特征、多语言瓶颈特征、子空间高斯混合模型、卷积神经网络声学模型和递归神经网络语言模型，然后介绍了针对低资源语音识别的公开关键词搜索(Open keyword search,OpenKWS)评测，最后对低资源语音识别进行了总结和展望。
关键词：	语音识别；低资源；声学模型；语言模型
Research Progress on Key Technologies of Low Resource Speech Recognition

Liu Ji,Zhang Weiqiang. Research Progress on Key Technologies of Low Resource Speech Recognition[J]. Journal of Data Acquisition & Processing, 2017, 32(2): 205-220

Authors:	Liu Ji Zhang Weiqiang

Affiliation:	Department of Electronic Engineering, Tsinghua University, Beijing, 100084, China

Abstract:	Low resource speech recognition is one of currently researching hotspots in speech recognition community, and is also one of the important challenges for the application of multilingual and minority language speech recognition technologies. This paper summarizes and reviews the current states and history of low resource speech recognition, and introduces several key technologies, including articulatory feature, multilingual bottleneck feature, subspace Gaussian mixture model, convolutional neural network based acoustic model and recurrent neural network based language model. After that the open keyword search (OpenKWS) evaluation is introduced. Finally, the prospective of low resource speech recognition is presented.

Keywords:	speech recognition low resource acoustic model language model

	点击此处可从《数据采集与处理》浏览原始摘要信息
	点击此处可从《数据采集与处理》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏