首页 | 本学科首页   官方微博 | 高级检索  
     

一种快速嘴部检测方法在视听语音识别的应用
引用本文:刘家涛,;陈一民. 一种快速嘴部检测方法在视听语音识别的应用[J]. 微机发展, 2008, 0(10): 16-19
作者姓名:刘家涛,  陈一民
作者单位:上海大学计算机科学与工程学院
基金项目:基金项目:上海市科技基金资助项目(7A07094)
摘    要:在改进噪音环境下的语音识别率中,来自于说话人嘴部的可视化语音信息有着显著的作用。介绍了在视听语音识别(AVSR)中的重要组成部分之一:可视化信息的前端设计;描述了一种用于快速处理图像并能达到较高识别率的人脸嘴部检测的机器学习方法,此方法引入了旋转Hart-like特征在积分图像中的应用,在基于AdaBoost学习算法上通过使用单值分类作为基础特征分类器,以级联的方式合并强分类器,最后划分检测区域用于嘴部定位。将上述方法应用于AVSR系统中,基本上达到了对人脸嘴部实时准确的检测效果。

关 键 词:模态  视听语音识别  Harr-like特征  重要区域  积分图像  区域划分

Fast Mouth Detection Approach Applied in Audio-Visual Speech Recognition
LIU Jia-tao,CHEN Yi-min. Fast Mouth Detection Approach Applied in Audio-Visual Speech Recognition[J]. Microcomputer Development, 2008, 0(10): 16-19
Authors:LIU Jia-tao  CHEN Yi-min
Affiliation:LIU Jia-tao, CHEN Yi-min (Seh. of Computer Sci. & Eng.,Shanghai Univ. of Science and Tech., Shanghai 200072,China)
Abstract:The visual information comes from speaker's mouth had proved very useful in improving speech recognition,especially in noise environment.In this paper,first introduced one of the main components in audio-visual speech recognition system: visual front end design then proved a machine learning method for mouth region detection which could rapidly process image with high detection rates.This approach includes the introduction of rotated Harr-like feature in integral image,a learning algorithm based on Adaboost with sign value trees as base classifiers,combination of complex classifiers in cascade and regionalization of the face area.At the end,applied this scheme in AVSR system yield high detection rates which may reaches basically real time requirement.
Keywords:modality  audio-visual speech recognition  Harr-like feature  region of interest  integral image  regionalization
本文献已被 CNKI 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号