
Study of Text to Visual Speech in Chinese (汉语文本-可视语音转换的研究)

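Cite this article:	WANG Zhi-ming, CAI Lian-hong, WU Zhi-yong, TAO Jian-hua. Study of Text to Visual Speech in Chinese[J]. Mini-micro Systems (小型微型计算机系统), 2002, 23(4): 474-477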

Authors:	WANG Zhi-ming (王志明), CAI Lian-hong (蔡莲红), WU Zhi-yong (吴志勇), TAO Jian-hua (陶建华)

Affiliation:	Department of Computer Science and Technology, Tsinghua University, Beijing 100084

Funding:	Supported by the Doctoral Program Foundation of Institutions of Higher Education (20010003049)

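Abstract:	By studying the movements of a speaker's visible articulatory organs, this paper extracts 26 basic mouth shapes of Chinese pronunciation from the visual side and describes them with the facial animation parameters (FAPs) defined in MPEG-4, yielding visual parameters for Chinese pronunciation that conform to an international standard. We also study how these parameters vary in continuous speech and how coarticulation affects mouth shape. Building on an existing Chinese text-to-speech system (Sonic) and a 2-D mesh face model (Plane Face), we implement a Chinese text-to-visual-speech system (TTVS).
Keywords:	visual speech; facial animation parameter (FAP); text-to-speech system (TTS); text-to-visual-speech system (TTVS); coarticulation
Article ID:	1000-1220(2002)04-0474-04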
Study of Text to Visual Speech in Chinese

WANG Zhi-ming, CAI Lian-hong, WU Zhi-yong, TAO Jian-hua. Study of Text to Visual Speech in Chinese[J]. Mini-micro Systems, 2002, 23(4): 474-477

Authors:	WANG Zhi-ming, CAI Lian-hong, WU Zhi-yong, TAO Jian-hua

Abstract:	After studying the motion of the speaker's visible articulatory organs, we divide Chinese phonemes into 26 basic visual classes. We describe these classes with the FAPs defined by MPEG-4, obtaining standard-conformant visual parameters for Chinese phonemes. We also study how these parameters change in continuous speech and under coarticulation. Based on our TTS system (Sonic) and image warping on our 2-D mesh model (PlaneFace), we implement a TTVS system.

Keywords:	visual speech; facial animation parameter (FAP); text-to-speech (TTS); text-to-audiovisual speech (TTVS); coarticulation
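
The abstract describes per-class mouth-shape parameters that are modified in continuous speech by coarticulation. As a rough illustration only, the Python sketch below turns a timed sequence of mouth-shape classes into per-frame parameter values. Everything in it is an assumption rather than the paper's method: the class names, the two stand-in parameters (jaw opening, lip width) and their values are placeholders instead of real MPEG-4 FAPs, and the neighbour-blending rule and linear interpolation are simple substitutes for whatever coarticulation model the authors used.

```python
from dataclasses import dataclass

# Hypothetical targets: mouth-shape class -> (jaw opening, lip width).
# Placeholder values in arbitrary units, not taken from the paper.
TARGETS = {
    "closed": (0.0, 0.0),
    "a": (1.0, 0.2),
    "u": (0.3, -0.6),
}


@dataclass
class Segment:
    shape: str    # mouth-shape class name (key of TARGETS)
    start: float  # start time in seconds
    end: float    # end time in seconds, must be greater than start


def coarticulated_targets(segments, weight=0.25):
    """Pull each segment's target toward the mean of its neighbours by `weight`."""
    raw = [TARGETS[s.shape] for s in segments]
    out = []
    for i, cur in enumerate(raw):
        neighbours = [raw[j] for j in (i - 1, i + 1) if 0 <= j < len(raw)]
        if not neighbours:
            out.append(cur)
            continue
        mean = tuple(sum(v) / len(neighbours) for v in zip(*neighbours))
        out.append(tuple((1 - weight) * c + weight * m for c, m in zip(cur, mean)))
    return out


def frame_parameters(segments, fps=25, weight=0.25):
    """Interpolate linearly between key-frames placed at segment midpoints."""
    targets = coarticulated_targets(segments, weight)
    keys = [((s.start + s.end) / 2, t) for s, t in zip(segments, targets)]
    n_frames = int(round(segments[-1].end * fps))
    frames = []
    for f in range(n_frames + 1):
        t = f / fps
        if t <= keys[0][0]:
            frames.append(keys[0][1])    # hold the first key-frame
        elif t >= keys[-1][0]:
            frames.append(keys[-1][1])   # hold the last key-frame
        else:
            for (t0, p0), (t1, p1) in zip(keys, keys[1:]):
                if t0 <= t <= t1:
                    a = (t - t0) / (t1 - t0)
                    frames.append(tuple((1 - a) * x + a * y for x, y in zip(p0, p1)))
                    break
    return frames


segments = [Segment("closed", 0.0, 0.2), Segment("a", 0.2, 0.6), Segment("u", 0.6, 1.0)]
frames = frame_parameters(segments)
```

In a full system the segment timings would come from the speech synthesizer and each frame's values would drive the face model; here `frames` is simply a list of parameter tuples, one per video frame at the assumed 25 fps.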
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.
