Computer Engineering and Applications ›› 2009, Vol. 45 ›› Issue (32): 150-152. DOI: 10.3778/j.issn.1002-8331.2009.32.047

• Graphics, Image, and Pattern Recognition •

LDA based feature extraction method in DCT domain in lipreading

HE Jun, ZHANG Hua, LIU Ji-zhong

  1. Key Laboratory of Robot & Welding, Nanchang University, Nanchang 330031, China
  • Received: 2008-06-18 Revised: 2008-10-15 Online: 2009-11-11 Published: 2009-11-11
  • Contact: HE Jun

Abstract: Extracting visual speech features is the most critical problem in lipreading. To address it, a new feature extraction method based on DCT and LDA is proposed. To obtain the feature vector that best discriminates between mouth shapes, the DCT is first applied to the visual speech region to transform it and reduce its dimensionality; LDA is then used to extract, from the DCT coefficients, the feature vector with the best mouth-shape classification performance. Experiments on speaker-dependent and speaker-independent lipreading databases, and in real-time lipreading recognition, show that the method achieves a markedly higher recognition rate than both the traditional manual selection of DCT coefficients and PCA-based feature extraction.

Key words: lipreading, feature extraction, Discrete Cosine Transform (DCT), Linear Discriminant Analysis (LDA)
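
The two-stage idea summarized in the abstract (a DCT of the mouth region followed by LDA on the DCT coefficients) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the synthetic 16×16 "mouth" images, the 6×6 low-frequency block, the regularization constant, the nearest-class-mean classifier, and all function names (`dct_matrix`, `dct_features`, `fit_lda`, `make_mouths`, `run_demo`) are assumptions chosen only for illustration.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II basis matrix of size n x n."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    c = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    c[0, :] = np.sqrt(1.0 / n)
    return c

def dct_features(images, keep=6):
    """2-D DCT of each image; keep the top-left keep x keep block.

    Keeping only low-frequency coefficients is the (assumed) dimensionality
    reduction step; the block size is a hypothetical choice.
    """
    n, h, w = images.shape
    coeffs = dct_matrix(h) @ images @ dct_matrix(w).T  # broadcasts over images
    return coeffs[:, :keep, :keep].reshape(n, -1)

def fit_lda(x, y, n_components, reg=1e-6):
    """Return an LDA projection matrix maximizing between/within-class scatter."""
    d = x.shape[1]
    mean = x.mean(axis=0)
    sw = reg * np.eye(d)           # within-class scatter (regularized, assumption)
    sb = np.zeros((d, d))          # between-class scatter
    for c in np.unique(y):
        xc = x[y == c]
        mc = xc.mean(axis=0)
        sw += (xc - mc).T @ (xc - mc)
        diff = (mc - mean)[:, None]
        sb += len(xc) * (diff @ diff.T)
    # Solve Sb w = lambda Sw w by whitening with the Cholesky factor of Sw.
    l_inv = np.linalg.inv(np.linalg.cholesky(sw))
    evals, evecs = np.linalg.eigh(l_inv @ sb @ l_inv.T)
    order = np.argsort(evals)[::-1][:n_components]
    return l_inv.T @ evecs[:, order]

def make_mouths(rng, per_class=40, size=16, noise=0.3):
    """Synthetic stand-in for mouth images: 3 classes differing in opening height."""
    xs, ys = [], []
    for c in range(3):
        template = np.zeros((size, size))
        template[7 - c:9 + c, 3:13] = 1.0
        xs.append(template + noise * rng.standard_normal((per_class, size, size)))
        ys.append(np.full(per_class, c))
    return np.concatenate(xs), np.concatenate(ys)

def run_demo(seed=0):
    """Train on one synthetic set, classify another by nearest class mean."""
    rng = np.random.default_rng(seed)
    x_train, y_train = make_mouths(rng)
    x_test, y_test = make_mouths(rng)
    f_train, f_test = dct_features(x_train), dct_features(x_test)
    w = fit_lda(f_train, y_train, n_components=2)  # at most (classes - 1) axes
    z_train, z_test = f_train @ w, f_test @ w
    means = np.stack([z_train[y_train == c].mean(axis=0) for c in range(3)])
    dists = np.linalg.norm(z_test[:, None, :] - means[None, :, :], axis=2)
    return z_train.shape, float((dists.argmin(axis=1) == y_test).mean())

if __name__ == "__main__":
    shape, accuracy = run_demo()
    print("projected training features:", shape, "test accuracy:", accuracy)
```

With three classes, LDA yields at most two discriminant directions, so each 256-pixel image is reduced first to 36 DCT coefficients and then to a 2-dimensional feature vector. In contrast to manually picking DCT coefficients, the projection here is learned from labeled data, which is the distinction the abstract draws.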
