
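A Chinese Sign Language Recognition System Based on Sliding-Window Segmentation
Cite this article: WANG Xin-yan, WANG Qing-shan, MA Xiao-di, LIU Peng, DAI Hai-peng. A Chinese sign language recognition system based on sliding-window segmentation[J]. Journal of Beijing University of Posts and Telecommunications, 2021, 44(5): 48-54.
Authors: WANG Xin-yan  WANG Qing-shan  MA Xiao-di  LIU Peng  DAI Hai-peng
Affiliations: 1. Institute of Mathematics, Hefei University of Technology, Hefei 230601; 2. School of Computer Science, Hangzhou Dianzi University, Hangzhou 310018; 3. School of Computer Science and Technology, Nanjing University, Nanjing 210023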
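Abstract: People with hearing impairment make up a large share of the world's disabled population. They can communicate with hearing people through sign language, but because sign language is not widely understood by the general public, a substantial communication barrier remains. To address this, a continuous Chinese sign language recognition system based on a split sliding window (SSW) is proposed to recognize sign language automatically. The SSW system splits the sign language signal selected by the sliding window into equal groups and deletes each group in turn to obtain new data, which is fed to the sign language recognition neural network for training to produce a predicted gesture for each individual sign language word. Finally, a threshold-based multi-voting strategy judges the predicted values to yield the recognition result. The SSW system was trained on 30 sign language sentences collected from 20 volunteers. The results show that its average accuracy on the test set reaches 83.9%, which is 16.7% higher than the long short-term memory network model.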

Keywords: sliding window  bi-directional long short-term memory network  threshold  data segmentation  sign language recognition
Received: 2021-03-26

A Split Sliding Window-Based Continuous Chinese Sign Language Recognition System
WANG Xin-yan,WANG Qing-shan,MA Xiao-di,LIU Peng,DAI Hai-peng.A Split Sliding Window-Based Continuous Chinese Sign Language Recognition System[J].Journal of Beijing University of Posts and Telecommunications,2021,44(5):48-54.
Authors:WANG Xin-yan  WANG Qing-shan  MA Xiao-di  LIU Peng  DAI Hai-peng
Affiliation:1. Institute of Mathematics, Hefei University of Technology, Hefei 230601, China;2. School of Computer Science, Hangzhou Dianzi University, Hangzhou 310018, China;3. School of Computer Science and Technology, Nanjing University, Nanjing 210023, China
Abstract: Individuals with hearing impairment account for a large proportion of the world's disabled population. They can communicate with others through sign language, but because sign language is not mastered by the general public, significant obstacles remain between individuals with hearing impairment and hearing people. A continuous Chinese sign language recognition system based on a split sliding window (SSW) is proposed to realize automatic sign language recognition. The SSW system divides the sign language signal selected by the sliding window into equal groups and deletes one group at a time, keeping the remaining data in the original order. The resulting data is input to the sign language recognition neural network for training to obtain the predicted gesture for a single sign language word. Finally, a threshold-based majority voting strategy is used to judge the predicted values. The SSW system is trained on 30 sign language sentences collected from 20 volunteers. The results show that the average accuracy of the SSW system reaches 83.9% on the test dataset, which is 16.7% higher than the long short-term memory model.
Keywords:sliding window  bi-directional long short-term memory network  threshold  data segmentation  sign language recognition  
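
The split-and-delete step and the threshold-based vote described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `split_window_variants` and `threshold_vote`, the number of groups, the threshold value, and the choice to return `None` when no label reaches the threshold are all assumptions made for illustration, since the abstract does not specify them. The recognition network itself is omitted.

```python
from collections import Counter
from typing import Hashable, List, Optional, Sequence


def split_window_variants(window: Sequence, num_groups: int) -> List[list]:
    """Split one window into equal groups and drop each group in turn.

    Returns num_groups new sequences, each missing exactly one group,
    with the remaining samples kept in their original order.
    (Hypothetical helper; the group count is an assumed parameter.)
    """
    if num_groups < 2 or len(window) % num_groups != 0:
        raise ValueError("window length must be a multiple of num_groups >= 2")
    size = len(window) // num_groups
    groups = [list(window[i * size:(i + 1) * size]) for i in range(num_groups)]
    variants = []
    for skip in range(num_groups):
        # Concatenate every group except the one being deleted.
        variants.append([s for i, g in enumerate(groups) if i != skip for s in g])
    return variants


def threshold_vote(predictions: Sequence[Hashable],
                   threshold: float) -> Optional[Hashable]:
    """Majority vote that accepts the winner only above a vote-share threshold.

    Returns the most frequent label if its share of the votes is at least
    `threshold`, otherwise None (an assumed way to signal "no decision").
    """
    if not predictions:
        return None
    label, count = Counter(predictions).most_common(1)[0]
    return label if count / len(predictions) >= threshold else None


# Toy window of six samples split into three groups of two.
variants = split_window_variants([1, 2, 3, 4, 5, 6], num_groups=3)

# Pretend a classifier labelled the three variants; vote with a 0.6 threshold.
decision = threshold_vote(["hello", "hello", "thanks"], threshold=0.6)
```

In this sketch each window yields as many variants as there are groups, so the per-variant predictions naturally form the pool of votes for one sign language word.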
This article is indexed in Wanfang Data and other databases.