基于深度可分卷积神经网络的实时人脸表情和性别分类 Real-time facial expression and gender recognition based on depthwise separable convolutional neural network期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于深度可分卷积神经网络的实时人脸表情和性别分类

引用本文：	刘尚旺,刘承伟,张爱丽.基于深度可分卷积神经网络的实时人脸表情和性别分类[J].计算机应用,2020,40(4):990-995.

作者姓名：	刘尚旺刘承伟张爱丽

作者单位：	河南师范大学计算机与信息工程学院, 河南新乡 453007

基金项目：	河南省科技攻关项目（192102210290）；河南省高等学校重点科研项目基础研究计划（18A510014）。

摘要：	针对目前普通卷积神经网络（CNN）在表情和性别识别任务中出现的训练过程复杂、耗时过长、实时性差等问题，提出一种深度可分卷积神经网络的实时人脸表情和性别识别模型。首先，利用多任务级联卷积网络（MTCNN）对不同尺度输入图像进行人脸检测，并利用核相关滤波（KCF）对检测到的人脸位置进行跟踪进而提高检测速度。然后，设置不同尺度卷积核的瓶颈层，用通道合并的特征融合方式形成核卷积单元，以具有残差块和可分卷积单元的深度可分卷积神经网络提取多样化特征，并减少参数数量，轻量化模型结构；使用实时启用的反向传播可视化来揭示权重动态的变化并评估了学习的特征。最后，将表情识别和性别识别两个网络并联融合，实现表情和性别的实时识别。实验结果表明，所提出的网络模型在FER-2013数据集上取得73.8%的识别率，在CK+数据集上的识别率达到96%，在IMDB数据集中性别分类的准确率达到96%；模型的整体处理帧率达到80 frame/s，与结合支持向量机的全连接卷积神经网络方法所得结果相比，有着1.5倍的提升。因此针对数量、分辨率、大小等差异较大的数据集，该网络模型检测快，训练时间短，特征提取简单，具有较高的识别率和实时性。
关键词：	深度可分卷积神经网络面部检测性别分类情感分类特征提取
收稿时间：	2019-08-19
修稿时间：	2019-11-01
Real-time facial expression and gender recognition based on depthwise separable convolutional neural network

LIU Shangwang,LIU Chengwei,ZHANG Aili.Real-time facial expression and gender recognition based on depthwise separable convolutional neural network[J].journal of Computer Applications,2020,40(4):990-995.

Authors:	LIU Shangwang LIU Chengwei ZHANG Aili

Affiliation:	College of Computer and Information Engineering, Henan Normal University, Xinxiang Henan 453007, China

Abstract:	Aiming at the problem of the current common Convolutional Neural Network(CNN)in the expression and gender recognition tasks,that is training process is complicated,time-consuming,and poor in real-time performance,a realtime facial expression and gender recognition model based on depthwise separable convolutional neural network was proposed. Firstly,the Multi-Task Convolutional Neural Network(MTCNN)was used to detect faces in different scale input images,and the detected face positions were tracked by Kernelized Correlation Filter(KCF)to increase the detection speed. Then,the bottleneck layers of convolution kernels of different scales were set,the kernel convolution units were formed by the feature fusion method of channel combination,the diversified features were extracted by the depthwise separable convolutional neural network with residual blocks and separable convolution units,and the number of parameters was reduced to lightweight the model structure. Besides,real-time enabled backpropagation visualization was used to reveal the dynamic changes of the weights and characteristics of learning. Finally,the two networks of expression recognition and gender recognition were combined in parallel to realize real-time recognition of expression and gender. Experimental results show that the proposed network model has a recognition rate of 73. 8% on the FER-2013 dataset,a recognition rate of 96% on the CK+ dataset,the accuracy of gender classification on the IMDB dataset reaches 96%;and this model has the overall processing speed reached 70 frames per second,which is improved by 1. 5 times compared with the method of common convolutional neural network combined with support vector machine. Therefore,for datasets with large differences in quantity,resolution and size,the proposed network model has fast detection,short training time,simple feature extraction, and high recognition rate and real-time performance.

Keywords:	depthwise separable convolutional neural network face detection gender recognition facial expression recognition feature extraction
本文献已被维普万方数据等数据库收录！
	点击此处可从《计算机应用》浏览原始摘要信息
	点击此处可从《计算机应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏