首页 | 本学科首页   官方微博 | 高级检索  
     

一种识别说话者的新方法
引用本文:刘琪. 一种识别说话者的新方法[J]. 智能计算机与应用, 2013, 0(6): 85-87
作者姓名:刘琪
作者单位:谢菲尔德大学计算机科学学院,英国谢菲尔德S14DP
摘    要:在能够自动识别视频中的说话者的系统中,大部分采用的是声音和唇部运动相结合的方法。文中则采用了另一种方法有效地达到了目的,即通过检测人体头部和手部的运动来鉴别说话者。基于演讲者在说话时通常会伴有头部运动或是手部运动,该方法既能实现说话者的检测,又能避免由于观测点过远而导致无法判断人唇部运动的局限性。在系统的实施过程中,运用了多种图像处理方法,并且对三帧差运动法做出了改善,使其能更高效、更准确地检测到头部和手部的运动。经过多个不同的视频测试后,本系统的F1 score高达91.91%,从而验证了该系统的可行性。

关 键 词:图像处理  脸部检测  手部检测  运动检测  F1  score

A New Method of Identifying A Speaker
LIU Qi. A New Method of Identifying A Speaker[J]. INTELLIGENT COMPUTER AND APPLICATIONS, 2013, 0(6): 85-87
Authors:LIU Qi
Affiliation:LIU Qi ( College of Computer Science, The University of Sheffield, Sheffield, S1 4DP, UK)
Abstract:In systems that are able to detect speakers in a video automatically, a method based on the voice and lip move- merit is proposed generally. But in this paper, another technique, which is against detecting head movement and hand movement, is employed. Since those two kinds of body movements are always accompanied when a person is speaking, this method effectively detects a speaker and avoids the limitation of accurately identifying a lip movement because of a far ob- serving distance. During the process of implementation, a combination of several different image processing algorithms is a- dopted, and the three -frame difference algorithm is modified in order to reach a further effectiveness. The average of Fl score of this system is 9t. 91% after testing several different videos, which verifies the feasibility and accuracy.
Keywords:Image Processing  Face Detection  Hand Detection  Movement Detection  F1 score
本文献已被 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号