首页 | 本学科首页   官方微博 | 高级检索  
     

训练数据有限的英文语音重音标注研究
引用本文:赖 珉,陈一宁,初 敏,胡访宇.训练数据有限的英文语音重音标注研究[J].计算机工程与应用,2007,43(33):48-50.
作者姓名:赖 珉  陈一宁  初 敏  胡访宇
作者单位:[1]中国科学技术大学电子工程与信息科学系,合肥230027 [2]微软亚洲研究院,北京100000
摘    要:大规模语料库的手工韵律标注消耗大量的时间和人力。这篇论文的目的在于研究如何充分利用少量的手工标注数据训练得到尽可能精确的语音重音自动标注器。论文列举并对比了四种训练方法的效果。在训练中结合声学分类器和语言学分类器,同时使用了综合分类器做后期优化。在实验中,使用机器数据训练声学分类器,并将有限的手工数据用于后期综合分类器能得到最佳的标注正确率。最终的正确率达到了94.0%,与手工标注的正确率上限97.2%比较接近。

关 键 词:自动重音检测  自动韵律标注  自动语音识别
文章编号:1002-8331(2007)33-0048-03
修稿时间:2007年5月1日

Stress detection in English sentences with limited training data
LAI Min,CHEN Yi-ning,CHU Min,HU Fang-yu.Stress detection in English sentences with limited training data[J].Computer Engineering and Applications,2007,43(33):48-50.
Authors:LAI Min  CHEN Yi-ning  CHU Min  HU Fang-yu
Affiliation:1.Department of Electronic Engineering and Information Science,University of Science and Technology of China,Hefei 230027,China 2.Microsoft Research Asia,Beijing 100000,China
Abstract:It is money and labor consuming to label stressed syllables manually,especially when the speech database is very large.An efficient and reliable automatic prosody labeler is always desired.When training data is limited,how to get the best use of it? This paper proposes the optimization in using training data for automatic stress detection in English speech utterances.The detector consists of a linguistic classifier,an acoustic classifier and an AdaBoost classifier that can improve the accuracy by using more features and manual labels.The best result we obtained is 94.0%,which is approaching to the self-agreement ratio (97.2%) of the same annotator,or the upper bound of the performance.
Keywords:automatic stress detection  automatic prosody labeler  automatic speech recognition
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号