
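Speech-Driven Facial Animation Synthesis Based on a State-Asynchronous DBN
Cite this article: ZHAO Yong, JIANG Dong-mei, Sahli Hichem. Speech Driven Facial Animation Synthesis Based on State Asynchronous DBN[J]. Computer Engineering, 2014(2):180-183,188.
Authors: ZHAO Yong  JIANG Dong-mei  Sahli Hichem
Affiliations: [1] School of Computer Science, Northwestern Polytechnical University, Xi'an 710072, China [2] Department of Electronics and Informatics (ETRO), Vrije Universiteit Brussel, Brussels 1050, Belgium
Funding: National Natural Science Foundation of China (61273265); Key Project of the Shaanxi Province International Science and Technology Cooperation Fund (2011KW-04)
Abstract: A speech-driven facial animation synthesis method based on a state-asynchronous dynamic Bayesian network model (SA-DBN) is proposed. Perceptual linear prediction (PLP) features of the audio and active appearance model (AAM) features of the face images in an audio-visual speech database are extracted to train the model parameters. For a given input speech signal, the corresponding optimal AAM feature sequence is learned according to the maximum likelihood estimation (MLE) principle, and the facial image sequence and facial animation are synthesized from it. Subjective evaluation of the synthesized animations shows that, compared with a DBN model whose audio and visual states are synchronous, SA-DBN, by limiting the maximum degree of asynchrony between the audio speech states and the visual speech states, produces clear and natural facial animations whose mouth movements are highly consistent with the input speech.

Keywords: facial animation synthesis  state-asynchronous dynamic Bayesian network model  asynchrony constraint  active appearance model  perceptual linear prediction  maximum likelihood estimation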

Speech Driven Facial Animation Synthesis Based on State Asynchronous DBN
Authors and affiliations:ZHAO Yong, JIANG Dong-mei, Sahli Hichem (1. School of Computer Science, Northwestern Polytechnical University, Xi'an 710072, China; 2. ETRO Department, Vrije Universiteit Brussel, Brussels 1050, Belgium)
Abstract:An audio-visual Dynamic Bayesian Network model with State Asynchrony (SA-DBN) that transforms acoustic speech into photo-realistic facial animation is proposed. Perceptual Linear Prediction (PLP) features from the audio speech and Active Appearance Model (AAM) features from the face images of an audio-visual speech database are used to train the parameters of the proposed SA-DBN. Given an input audio stream, the optimal AAM visual features are learned with the SA-DBN model under the Maximum Likelihood Estimation (MLE) criterion and are then used to construct the facial images for the animation. A subjective evaluation compares the proposed constrained state-asynchrony DBN with a state-synchronous audio-visual DBN model. Experimental results show that the SA-DBN model yields high-quality facial animations whose mouth movements match the input speech.
Keywords:facial animation synthesis  Dynamic Bayesian Network model with State Asynchrony (SA-DBN)  asynchrony constraint  Active Appearance Model (AAM)  Perceptual Linear Prediction (PLP)  Maximum Likelihood Estimation (MLE)
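
The asynchrony constraint named in the abstract can be illustrated with a small sketch. The abstract does not give the model's structure, so everything here is an assumption made for illustration: audio and visual states are taken to be integer indices within the same speech unit, asynchrony is taken to be the absolute difference of those indices, and the names `allowed_joint_states` and `max_async` are hypothetical. Under those assumptions, a state-synchronous model corresponds to a bound of zero, and a bounded-asynchrony model admits a limited band of additional joint states.

```python
from itertools import product


def allowed_joint_states(num_audio_states, num_visual_states, max_async):
    """Enumerate (audio, visual) state-index pairs that satisfy the bound.

    Assumption (not from the paper): asynchrony is measured as the absolute
    difference between the audio and visual state indices.
    """
    return [
        (a, v)
        for a, v in product(range(num_audio_states), range(num_visual_states))
        if abs(a - v) <= max_async
    ]


# A bound of 0 forces both streams to occupy the same state index.
sync_states = allowed_joint_states(3, 3, max_async=0)

# A bound of 1 lets one stream lead or lag the other by a single state.
async_states = allowed_joint_states(3, 3, max_async=1)

print(len(sync_states), len(async_states))
```

With three states per stream, the synchronous case keeps only the diagonal pairs, while a bound of one adds the pairs immediately off the diagonal. Pairs such as (0, 2), where one stream is two states ahead, remain excluded.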
This article is indexed in CNKI, VIP, and other databases.