声韵母约束扩展识别网络的发音偏误检测 Mispronunciation Detection with Extended Recognition Network of Initials and Finals Constraint期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

声韵母约束扩展识别网络的发音偏误检测

引用本文：	董文伟,解焱陆,林举. 声韵母约束扩展识别网络的发音偏误检测[J]. 信号处理, 2020, 36(6): 977-983. DOI: 10.16798/j.issn.1003-0530.2020.06.020

作者姓名：	董文伟解焱陆林举

作者单位：	北京语言大学信息科学学院

基金项目：	国家社科基金项目（18BYY124）;语言资源高精尖创新中心项目(KYR17005);北京语言大学梧桐创新平台项目（中央高校基本科研业务费专项资金）（19PT04）

摘要：	发音偏误检测是计算机辅助发音训练（Computer Aided Pronunciation Training ，CAPT）的重要组成部分。为了在机器辅助语料标注任务或者缺少标注语料的偏误检测任务上提高性能，本文提出解码时使用声韵母约束的扩展识别网络方法。该方法将传统的语音识别中解码的自由文法循环（free grammar loop）部分换成结合声韵母交替以及字数限制规则的扩展识别网络，可以对全音素进行偏误检测, 并且不会出现插入删除错误。相比于传统的扩展识别网络，这种约束的扩展识别网络不需要大量的语料标注和分析。相对于传统的发音良好度评价方法（Goodness of Pronunciation, GOP）, 基于这种拓展识别网络的方法不仅可以对二语学习者的发音进行正误的检测，还能给出具体的错误反馈。实验结果表明，本文提出的基于声韵母约束拓展识别网络的方法在挑错任务上优于传统的发音质量评估（GOP）的方法，其错误接受率为29.2%，错误拒绝率为22.9%，诊断准确率为76.6%。比GOP方法的诊断准确率相对提升15.5%，并且模型相较于无标注经验汉语母语者能检测出更多偏误。
关键词：	扩展识别网络发音质量评估计算机辅助发音训练
收稿时间：	2020-03-02
Mispronunciation Detection with Extended Recognition Network of Initials and Finals Constraint

Affiliation:	Beijing Language and Culture University

Abstract:	Mispronunciation detection is an important part of computer aided pronunciation training (CAPT). In order to improve the performance of machine aided corpus annotation task or error detection task without annotating corpus. In this paper, we proposed a method of combining Deep Neural Network with initials and finals constrained extended recognition network (cERN), which replaced traditional ASR decoding’s free grammar loop part with cERN. The proposed model can detect errors from all kind of phones and without insertion, deletion errors. Compared with the traditional extended recognition network, this constrained extended recognition network does not need a lot of corpus annotation and analysis. Compared with Goodness of Pronunciation (GOP), this method can not only detect whether pronunciation is correct, but also can give learners error details. The experiment result shows that cERN is better than GOP in the task of pronunciation detection. The false accept rate is 29.2% and false reject rate is 22.9%. The accuracy rate of cERN is 76.6%, which improve 15.5% than GOP’s accuracy rate and is better than the result of untrained annotates.

Keywords:

	点击此处可从《信号处理》浏览原始摘要信息
	点击此处可从《信号处理》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏