首页 | 本学科首页   官方微博 | 高级检索  
     

唇读研究进展与展望
引用本文:陈小鼎, 盛常冲, 匡纲要, 刘丽. 唇读研究进展与展望. 自动化学报, 2020, 46(11): 2275−2301 doi: 10.16383/j.aas.c190531
作者姓名:陈小鼎  盛常冲  匡纲要  刘丽
作者单位:1.国防科技大学电子科学学院 长沙 410073;2.国防科技大学系统工程学院 长沙 410073
基金项目:国家自然科学基金(61872379)资助
摘    要:唇读, 也称视觉语言识别, 旨在通过说话者嘴唇运动的视觉信息, 解码出其所说文本内容. 唇读是计算机视觉和模式识别领域的一个重要问题, 在公共安防、医疗、国防军事和影视娱乐等领域有着广泛的应用价值. 近年来, 深度学习技术极大地推动了唇读研究进展. 本文首先阐述了唇读研究的内容和意义, 并深入剖析了唇读研究面临的难点与挑战; 然后介绍了目前唇读研究的现状与发展水平, 对近期主流唇读方法进行了梳理、归类和评述, 包括传统方法和近期的基于深度学习的方法; 最后, 探讨唇读研究潜在的问题和可能的研究方向. 以期引起大家对唇读问题的关注与兴趣, 并推动与此相关问题的研究进展.

关 键 词:唇读   视觉语言识别   时空特征提取   计算机视觉   深度学习
收稿时间:2019-07-16

The State of the Art and Prospects of Lip Reading
Chen Xiao-Ding, Sheng Chang-Chong, Kuang Gang-Yao, Liu Li. The state of the art and prospects of lip reading. Acta Automatica Sinica, 2020, 46(11): 2275−2301 doi: 10.16383/j.aas.c190531
Authors:CHEN Xiao-Ding  SHENG Chang-Chong  KUANG Gang-Yao  LIU Li
Affiliation:1. College of Electronic Science, National University of Defense Technology, Changsha 410073;2. College of Systems Engineering, National University of Defense Technology, Changsha 410073
Abstract:Lip reading, also known as visual speech recognition, aims to infer the content of a speech through the motion of the speaker′s mouth. Lip reading is an important issue in the field of computer vision and pattern recognition. It has a wide range of applications in the fields of public security, medical, defense military and professional filming. In recent years, deep learning technology has greatly promoted the progress of lip reading research. Starting from the definition of lip reading problem, this paper first expounds the content and significance of lip reading research, and deeply analyzes the difficulties and challenges of lip reading research. Then, the recent achievements of lip reading research are introduced, and the current mainstream lip reading methods are combed, categorized and reviewed as well, including traditional methods and recent methods based on deep learning. Finally, the potential problems and possible research directions of lip reading research are discussed to arouse the attention and interest of this research, and promote the research progress of related issues.
Keywords:Lip reading  visual speech recognition  spatiotemporal feature extraction  computer vision  deep learning
点击此处可从《自动化学报》浏览原始摘要信息
点击此处可从《自动化学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号