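Lip-Reading and Viseme Segmentation Based on BTSM and DBN Models
Cite this article: Guoyun Lü, Rongchun Zhao, Dongmei Jiang, Xiaoyue Jiang, Yunshu Hou, H. Sahli. Lip-Reading and Viseme Segmentation Based on BTSM and DBN Models[J]. Computer Engineering and Applications, 2007, 43(14): 21-24.
Authors: Guoyun Lü, Rongchun Zhao, Dongmei Jiang, Xiaoyue Jiang, Yunshu Hou, H. Sahli
Affiliations: [1] School of Computer Science, Northwestern Polytechnical University, Xi'an 710072 [2] Department of Electronics and Informatics, Vrije Universiteit Brussel (Free University of Brussels), Pleinlaan 2, 1050 Brussel
Funding: China-Belgium cooperation project; Northwestern Polytechnical University research and reform projects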
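Abstract: To realize text/speech-driven talking-head animation, this paper proposes a mouth-contour feature extraction method based on the Bayesian Tangent Shape Model (BTSM) and a lip-reading system based on a Dynamic Bayesian Network (DBN) model. By describing the relationship between a word and its constituent visemes, the model yields a time-aligned viseme segmentation sequence. For performance comparison, the phoneme recognition results of a phoneme-based DBN model and of an HMM are mapped to viseme sequences. For evaluation, an absolute viseme segmentation accuracy criterion and two relative viseme segmentation accuracy criteria, based on image features and on lip geometric features, are proposed. Experiments show that the DBN models outperform the HMM in recognition, and that the viseme-based DBN model provides the best mouth shapes for talking-head animation.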

Keywords: dynamic Bayesian network  Bayesian tangent shape model  speech recognition  visual speech
Article ID: 1002-8331(2007)14-0021-04
Received: 2007-2-2
Revised: 2007-01

BTSM AND DBN MODEL FOR CONTINUOUS SPEECH RECOGNITION AND VISEME SEGMENTATION
Dongmei Jiang, Xiaoyue Jiang, Yunshu Hou, Hichem Sahli. BTSM AND DBN MODEL FOR CONTINUOUS SPEECH RECOGNITION AND VISEME SEGMENTATION[J]. Computer Engineering and Applications, 2007, 43(14): 21-24.
Authors: Dongmei Jiang, Xiaoyue Jiang, Yunshu Hou, Hichem Sahli
Abstract: This paper proposes a mouth outline feature extraction method based on the Bayesian Tangent Shape Model (BTSM) and a lip-reading system based on a Dynamic Bayesian Network (DBN) for a talking head. The DBN model describes the relationship between a word and its constituent visemes, and thereby yields a viseme segmentation sequence with time boundaries. For comparison, a DBN model based on the word-phone relationship and a tri-phone HMM are used. For system evaluation, an absolute Viseme Segmentation Accuracy (VSA) and two relative VSAs, based on image features and on geometrical features of the lips, are proposed. The experiments show that the DBN models perform better than the HMM, and that the viseme-based DBN model provides the best mouth shapes for the talking head.
Keywords:dynamic Bayesian network  Bayesian tangent shape model  speech recognition  visual speech
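
The abstract mentions two steps that a short sketch may make more concrete: mapping phoneme recognition results to a viseme sequence, and scoring a viseme segmentation against a reference. The Python below is only an illustration under stated assumptions. The `PHONE_TO_VISEME` table, the viseme class names, and the frame-level accuracy measure are all hypothetical; the abstract gives neither the paper's actual viseme inventory nor its exact VSA definition.

```python
from itertools import groupby

# Hypothetical many-to-one phoneme-to-viseme table (illustrative only;
# the paper's actual viseme classes are not given in the abstract).
PHONE_TO_VISEME = {
    "p": "V_bilabial", "b": "V_bilabial", "m": "V_bilabial",
    "f": "V_labiodental", "v": "V_labiodental",
    "aa": "V_open", "ae": "V_open",
    "uw": "V_round", "ow": "V_round",
    "sil": "V_sil",
}


def phones_to_visemes(segments):
    """Map (phone, start, end) segments to viseme segments.

    Frame indices are half-open: a segment covers start..end-1.
    Adjacent segments that map to the same viseme are merged.
    """
    mapped = [(PHONE_TO_VISEME[p], s, e) for p, s, e in segments]
    merged = []
    for viseme, group in groupby(mapped, key=lambda seg: seg[0]):
        group = list(group)
        merged.append((viseme, group[0][1], group[-1][2]))
    return merged


def frame_labels(segments, n_frames):
    """Expand (label, start, end) segments into one label per frame."""
    labels = [None] * n_frames
    for label, start, end in segments:
        for t in range(start, min(end, n_frames)):
            labels[t] = label
    return labels


def frame_accuracy(reference, hypothesis, n_frames):
    """Fraction of frames whose viseme label matches the reference.

    Assumption: this frame-level score is a simple stand-in, not the
    paper's VSA definition.
    """
    ref = frame_labels(reference, n_frames)
    hyp = frame_labels(hypothesis, n_frames)
    correct = sum(1 for r, h in zip(ref, hyp) if r is not None and r == h)
    return correct / n_frames


# Toy phoneme segmentation, as a recognizer might output it.
phones = [("sil", 0, 3), ("b", 3, 5), ("m", 5, 6), ("aa", 6, 10)]
hyp_visemes = phones_to_visemes(phones)

# Toy hand-labelled reference viseme segmentation.
ref_visemes = [("V_sil", 0, 2), ("V_bilabial", 2, 6), ("V_open", 6, 10)]

accuracy = frame_accuracy(ref_visemes, hyp_visemes, n_frames=10)
```

In this toy case the two bilabial phones merge into a single viseme segment, and the hypothesis differs from the reference only at frame 2, where the reference boundary falls one frame earlier.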
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.