基于BTSM和DBN模型的唇读和视素切分研究 BTSM AND DBN MODEL FOR CONTINUOUS SPEECH RECOGNITION AND VISEME SEGMENTATION期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于BTSM和DBN模型的唇读和视素切分研究

引用本文：	吕国云赵荣椿蒋冬梅蒋晓悦侯云舒 H.Sahli.基于BTSM和DBN模型的唇读和视素切分研究[J].计算机工程与应用,2007,43(14):21-24.

作者姓名：	吕国云赵荣椿蒋冬梅蒋晓悦侯云舒 H.Sahli

作者单位：	[1]西北工业大学计算机学院,西安710072 [2]布鲁塞尔自由大学电子信息系,Pleinlaan2,1050Brussel

基金项目：	中国-比利时合作项目 , 西北工业大学校科研和校改项目

摘要：	为实现文本/语音驱动的说话人头部动画,提出基于贝叶斯切线形状模型的口形轮廓特征提取方法和基于动态贝叶斯网络(Dynamic Bayesian Network,DBN)模型的唇读系统。在描述词与它的组成视素关系的基础上,得到视素时间切分序列。为比较性能,音素DBN模型和HMM的音素识别结果被影射成视素序列。在评价准则上,提出绝对视素切分正确性和基于图像与嘴唇几何特征两种相对视素切分正确性的评价标准。实验表明,DBN模型识别性能优于HMM,而基于视素的DBN模型能为说话人头部动画提供最好的口形。
关键词：	动态贝叶斯网络贝叶斯切线形状模型语音识别视觉语音
文章编号：	1002-8331（2007）14-0021-04
收稿时间：	2007-2-2
修稿时间：	2007-01
BTSM AND DBN MODEL FOR CONTINUOUS SPEECH RECOGNITION AND VISEME SEGMENTATION

Dongmei Jiang xiaoyue jiang yunshu hou hichem sahli.BTSM AND DBN MODEL FOR CONTINUOUS SPEECH RECOGNITION AND VISEME SEGMENTATION[J].Computer Engineering and Applications,2007,43(14):21-24.

Authors:	Dongmei Jiang xiaoyue jiang yunshu hou hichem sahli

Abstract:	A mouth outline feature extraction based on Bayesian Tangent Shape Model(BTSM) and a lip-reading system based on Dynamic Bayesian Network(DBN) is proposed for a talking head in this paper.This model describes the relationship of the word and its corresponding composed viseme,as a result,viseme segmentation sequence with time boundary is achieved.As a comparison,a DBN model based on word-phone relationship and a tri-phone HMM are used.For the system evaluation,an absolute Viseme Segmentation Accuracy(VSA) and two relative VSA based on image and geometrical feature of lip are brought out.The experiments show that DBN model has the better performance than HMM,and DBN model based on viseme can provide the best mouth shape for talking head.

Keywords:	dynamic Bayesian network Bayesian tangent shape model speech recognition visual speech
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《计算机工程与应用》浏览原始摘要信息
	点击此处可从《计算机工程与应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏