首页 | 本学科首页   官方微博 | 高级检索  
     

条件随机场模型在韵律结构预测中的应用
引用本文:董远,周涛,董乘宇,王海拉. 条件随机场模型在韵律结构预测中的应用[J]. 北京邮电大学学报, 2009, 32(5): 36-40
作者姓名:董远  周涛  董乘宇  王海拉
作者单位:北京邮电大学,信息与通信工程学院,北京,100876;法国电信北京研发中心,北京,100190
基金项目:教育部科学技术研究重点项目 
摘    要:为提高中文语音合成的自然度,对文本的韵律结构体系进行研究,并提出一种基于条件随机场(CRF)的韵律结构预测方法. 从一个大规模人工标注的语料库中,选取由机器生成的分词标注特征和分级的韵律边界信息,利用CRF算法进行机器学习产生韵律词和韵律短语的CRF模型,并用于韵律结构的预测中. 实验结果表明,韵律词和韵律短语的F-score分别达到90.67%和80.05%,相比于基于最大熵(ME)模型的韵律结构预测方法分别提高了3.62%和5.65%,同时准确率和召回率也有较大提高.

关 键 词:语音合成  韵律结构  条件随机场  机器学习
收稿时间:2009-03-11
修稿时间:2009-08-03

Prosodic Structure Prediction Based on Conditional Random Field Model
DONG Yuan,ZHOU Tao,DONG Cheng-yu,WANG Hai-la. Prosodic Structure Prediction Based on Conditional Random Field Model[J]. Journal of Beijing University of Posts and Telecommunications, 2009, 32(5): 36-40
Authors:DONG Yuan  ZHOU Tao  DONG Cheng-yu  WANG Hai-la
Affiliation:DONG Yuan1,〓ZHOU Tao1,DONG Cheng-yu2,WANG Hai-la2
Abstract:Prosodic structure prediction is an important component in mandarin text-to-speech (TTS) system. A prosodic structure prediction method is proposed, based on the conditional random field (CRF) algorithm. Prosodic word model and prosodic phrase model utilize CRF method for machine learning based on automatically segmented and tagged features and hierarchal prosodic structure information extracted from a large-scale manually labeled speech corpus. The approach achieves F-score of 90.67% in prosody word prediction and 80.05% in prosody phrase prediction, 3.62% and 5.65% higher than that of max entropy (ME) algorithm based method. Experiment results show that the approach of CRF based method makes considerable improvement in prosodic structure prediction, and works well in real mandarin TTS system.
Keywords:text-to-speech  prosodic structure  conditional random field  machine learning
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《北京邮电大学学报》浏览原始摘要信息
点击此处可从《北京邮电大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号