首页 | 本学科首页   官方微博 | 高级检索  
     

基于最小类内方差的伪三维残差网络
引用本文:谢超宇,秦玉,张开放,王晓明.基于最小类内方差的伪三维残差网络[J].计算机应用研究,2021,38(12):3801-3807.
作者姓名:谢超宇  秦玉  张开放  王晓明
作者单位:西华大学 计算机与软件工程学院,成都610039
基金项目:西华大学研究生创新基金资助项目(ycjj2019085)
摘    要:作为一种提取视频时空特征的深度学习方法,伪三维残差网络(pseudo-3D residual net,P3D ResNet)利用SVM目标函数来驱动深度网络学习,这样该方法继承了SVM的不足——仅考虑了不同类别间的间隔,忽略了同类样本数据的分布信息.针对该问题,提出了基于最小类内方差的伪三维残差网络方法,不仅体现了大间隔原理,同时又利用了样本数据的分布信息.该方法首先使用P3D ResNet提取的特征向量计算类内散度矩阵;然后利用该矩阵构建了新的目标函数;最后通过新构建的目标函数来驱动P3D ResNet的学习.将该方法应用到行为识别领域,多个数据集上的实验结果表明,相比于传统的P3D ResNet,所提出的方法获得了更高的识别准确率,体现出了更好的泛化性能.

关 键 词:深度学习  伪三维残差网络  支持向量机  类内散度矩阵  行为识别
收稿时间:2021/4/12 0:00:00
修稿时间:2021/11/18 0:00:00

Pseudo-3D residual network based on minimum intra-class variance
Xie Chaoyu,Qin Yu,Zhang Kaifang and Wang Xiaoming.Pseudo-3D residual network based on minimum intra-class variance[J].Application Research of Computers,2021,38(12):3801-3807.
Authors:Xie Chaoyu  Qin Yu  Zhang Kaifang and Wang Xiaoming
Affiliation:School of Computer & Software Engineer, Xihua University,,,
Abstract:As a deep learning method for extracting video spatio-temporal features, pseudo-3D residual net(P3D ResNet) uses the objective function of SVM to drive the learning of deep network. In this way, this method inherits the insufficiency of SVM, which only considers the interval between different categories, and ignores the distribution information of similar samples. Aiming at this problem, this paper proposed an improved method called P3D ResNet based on minimum intra-class variance. This method not only embodied the principle of large interval, but also used the distribution information of sample data. First, the method used the feature vector extracted by P3D ResNet to calculated the intra-class divergence matrix. Then it used the matrix to construct a new objective function. Finally, it drove the learning of P3D ResNet by the newly constructed objective function. This paper applied the method to the field of behavior recognition. Experimental results on multiple datasets show that compared with the traditional P3D ResNet, the proposed method achieves higher recognition accuracy and shows better generalization performance.
Keywords:deep learning  pseudo-3D residual network  SVM  intra-class divergence matrix  action recognition
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号