条件随机场模型在韵律结构预测中的应用 Prosodic Structure Prediction Based on Conditional Random Field Model期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

条件随机场模型在韵律结构预测中的应用

引用本文：	董远,周涛,董乘宇,王海拉. 条件随机场模型在韵律结构预测中的应用[J]. 北京邮电大学学报, 2009, 32(5): 36-40

作者姓名：	董远周涛董乘宇王海拉

作者单位：	北京邮电大学,信息与通信工程学院,北京,100876;法国电信北京研发中心,北京,100190

基金项目：	教育部科学技术研究重点项目

摘要：	为提高中文语音合成的自然度，对文本的韵律结构体系进行研究，并提出一种基于条件随机场(CRF)的韵律结构预测方法. 从一个大规模人工标注的语料库中，选取由机器生成的分词标注特征和分级的韵律边界信息，利用CRF算法进行机器学习产生韵律词和韵律短语的CRF模型，并用于韵律结构的预测中. 实验结果表明，韵律词和韵律短语的F-score分别达到90.67%和80.05%，相比于基于最大熵(ME)模型的韵律结构预测方法分别提高了3.62%和5.65%，同时准确率和召回率也有较大提高.
关键词：	语音合成韵律结构条件随机场机器学习
收稿时间：	2009-03-11
修稿时间：	2009-08-03
Prosodic Structure Prediction Based on Conditional Random Field Model

DONG Yuan,ZHOU Tao,DONG Cheng-yu,WANG Hai-la. Prosodic Structure Prediction Based on Conditional Random Field Model[J]. Journal of Beijing University of Posts and Telecommunications, 2009, 32(5): 36-40

Authors:	DONG Yuan ZHOU Tao DONG Cheng-yu WANG Hai-la

Affiliation:	DONG Yuan1,〓ZHOU Tao1,DONG Cheng-yu2,WANG Hai-la2

Abstract:	Prosodic structure prediction is an important component in mandarin text-to-speech (TTS) system. A prosodic structure prediction method is proposed, based on the conditional random field (CRF) algorithm. Prosodic word model and prosodic phrase model utilize CRF method for machine learning based on automatically segmented and tagged features and hierarchal prosodic structure information extracted from a large-scale manually labeled speech corpus. The approach achieves F-score of 90.67% in prosody word prediction and 80.05% in prosody phrase prediction, 3.62% and 5.65% higher than that of max entropy (ME) algorithm based method. Experiment results show that the approach of CRF based method makes considerable improvement in prosodic structure prediction, and works well in real mandarin TTS system.

Keywords:	text-to-speech prosodic structure conditional random field machine learning
本文献已被 CNKI 万方数据等数据库收录！
	点击此处可从《北京邮电大学学报》浏览原始摘要信息
	点击此处可从《北京邮电大学学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏