首页 | 本学科首页   官方微博 | 高级检索  
     

语音分割与端点检测研究综述
引用本文:杨健,李振鹏,苏鹏.语音分割与端点检测研究综述[J].计算机应用,2020,40(1):1-7.
作者姓名:杨健  李振鹏  苏鹏
作者单位:大理大学 数学与计算机学院, 云南 大理 671003
基金项目:国家自然科学基金资助项目(71661001);云南省哲学社会科学规划项目(YB2017072)。
摘    要:语音分割是语音识别和语音合成中必不可少的基础性工作,其质量对后续系统的影响巨大。使用手工分割和标注虽然精度高,但费时费力,同时需要熟练的领域专家来完成,自动语音分割因此成为语音处理的研究热点。首先针对自动语音分割目前的研究进展,介绍了语音分割的不同分类方法;然后分别介绍了基于对齐的方法和基于边界检测的方法,并详细介绍了可以应用在上述两种框架下的神经网络语音分割方法;接着介绍了基于生物激励信号以及博弈论等方法的新型语音分割技术,并给出了领域内广泛使用的性能评估度量,并对这些评估指标进行比较和分析;最后总结并提出语音分割研究未来发展的重要方向。

关 键 词:语音分割  端点检测  语音合成  信号特征  人工神经网络  
收稿时间:2019-06-24
修稿时间:2019-09-04

Review of speech segmentation and endpoint detection
YANG Jian,LI Zhenpeng,SU Peng.Review of speech segmentation and endpoint detection[J].journal of Computer Applications,2020,40(1):1-7.
Authors:YANG Jian  LI Zhenpeng  SU Peng
Affiliation:School of Mathematics and Computer Science, Dali University, Dali Yunnan 671003, China
Abstract:Speech segmentation is an indispensable basic work in speech recognition and speech synthesis, and its quality has a great impact on the following system. Although manual segmentation and labeling is of high accuracy, it is quite time-consuming and laborious, and requires domain experts to deal with. As a result, automatic speech segmentation has become a research hotspot in speech processing. Firstly, aiming at current progress of automatic speech segmentation, several different classification methods of speech segmentation were explained. The alignment-based methods and boundary detection-based methods were introduced respectively, and the neural network speech segmentation methods, which can be applied in the above two frameworks, were expounded in detail. Then, some new speech segmentation technologies based on the methods such as bio-inspiration signal and game theory were introduced, and the performance evaluation metrics widely used in the speech segmentation field were given, and these evaluation metrics were compared and analyzed. Finally, the above contents were summarized and the future important research directions of speech segmentation were put forward.
Keywords:speech segmentation                                                                                                                        endpoint detection                                                                                                                        speech synthesis                                                                                                                        signal feature                                                                                                                        Artificial Neural Network (ANN)
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号