首页 | 本学科首页   官方微博 | 高级检索  
     

基于Kaldi的语音识别
引用本文:王凯,马明栋. 基于Kaldi的语音识别[J]. 计算机技术与发展, 2021, 0(1)
作者姓名:王凯  马明栋
作者单位:南京邮电大学通信与信息工程学院;南京邮电大学地理与生物信息学院
基金项目:江苏省自然科学基金-青年基金项目(BK20140868)
摘    要:人工智能技术是当前计算机科学的研究热点,人机通信是人工智能技术的重要组成之一。作为人机通信主要方法之一的语音交互也一直是科学家的研究热点,语音交互技术的关键是语音识别。而目前大多语音识别软件要么功能单一,要么价格昂贵,Kaldi作为新兴的开源语音识别工具,凭借其强大的功能和简单的获取渠道逐渐流行。该文介绍了语音识别技术的发展历程,Kadli软件的基本架构和其所具有的独特优势,语音识别的一般处理流程,多层神经网络的基本结构以及多层神经网络在语音识别当中的应用。对基于Kaldi软件当中的HMM-DNN模型,使用中文数据集训练该模型,搭建一个完整的语音识别系统。通过该系统,不仅能展现出Kaldi软件丰富强大的功能,同时也为语音识别研究人员选择合适的工具提供了新的思路。

关 键 词:人机通信  语音识别  Kaldi  多层神经网络  HMM-DNN

Speech Recognition Based on Kaldi
WANG Kai,MA Ming-dong. Speech Recognition Based on Kaldi[J]. Computer Technology and Development, 2021, 0(1)
Authors:WANG Kai  MA Ming-dong
Affiliation:(School of Telecommunications&Information Engineering,Nanjing University of Posts and Telecommunications,Nanjing 210003,China;School of Geographical and Biological Information,Nanjing University of Posts and Telecommunications,Nanjing 210003,China)
Abstract:Artificial intelligence technology is the current research hotspot of computer science,and human-machine communication is one of the important components of artificial intelligence technology.As one of the main methods of human-computer communication,speech interaction has always been a hot topic among scientists.The key of speech interaction technology is speech recognition.The current speech recognition software is either single-function or expensive.As an emerging open source speech recognition tool,Kaldi is gradually popular with its powerful functions and simple access channels.We describe the development of speech recognition technology,the basic architecture of Kaldi software and its unique advantages,the general processing flow of speech recognition,the basic structure of multi-layer neural networks and the application of multi-layer neural networks in speech recognition.The HMM-DNN model in Kaldi software is trained by Chinese data sets,and a complete speech recognition system can be built.This system not only shows the rich and powerful functions of Kaldi software,but also provides a new idea for speech recognition researchers to select the right tool.
Keywords:man-machine communication  speech recognition  Kaldi  multi-layer neural network  HMM-DNN
本文献已被 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号