首页 | 本学科首页   官方微博 | 高级检索  
     

改进聚类的深度神经网络压缩实现方法
引用本文:刘涵,王宇,马琰.改进聚类的深度神经网络压缩实现方法[J].控制理论与应用,2019,36(7):1130-1136.
作者姓名:刘涵  王宇  马琰
作者单位:西安理工大学自动化与信息工程学院,陕西西安,710048;西安理工大学自动化与信息工程学院,陕西西安,710048;西安理工大学自动化与信息工程学院,陕西西安,710048
基金项目:国家自然科学基金重点项目(61833013), 陕西省重点研发计划重点项目(2018ZDXM-GY-089), 陕西省现代装备绿色制造协同创新中心研究计划(304-210891704), 陕西省教育厅科学研究计划(2017JS088), 西安理工大学特色研究计划(2016TS023)
摘    要:深度神经网络通常是过参数化的,并且深度学习模型存在严重冗余,这导致了计算和存储的巨大浪费.针对这个问题,本文提出了一种基于改进聚类的方法来对深度神经网络进行压缩.首先通过剪枝策略对正常训练后的网络进行修剪,然后通过K-Means++聚类得到每层权重的聚类中心从而实现权值共享,最后进行各层权重的量化.本文在LeNet,AlexNet和VGG-16上分别进行了实验,提出的方法最终将深度神经网络整体压缩了30到40倍,并且没有精度损失.实验结果表明通过基于改进聚类的压缩方法,深度神经网络在不损失精度的条件下实现了有效压缩,这使得深度网络在移动端的部署成为了可能.

关 键 词:深度神经网络  剪枝  K—Means++聚类  深度网络压缩
收稿时间:2017/12/22 0:00:00
修稿时间:2018/11/12 0:00:00

Deep neural networks compression based on improved clustering
LIU Han,WANG Yu and MA Yan.Deep neural networks compression based on improved clustering[J].Control Theory & Applications,2019,36(7):1130-1136.
Authors:LIU Han  WANG Yu and MA Yan
Abstract:Deep neural networks are typically over-parametrized and there is significant redundancy for deep learning models, which results in a waste of both computation and memory usage. In order to solve the problem, a new method based on improved clustering to compress the deep neural network is proposed. First of all, the network is pruned after the normal training. Then through the K-Means++ clustering the clustering center of each layer is gotten to achieve weight sharing. After the first two steps network weight quantization are also performed. The experiments on LeNet, AlexNet and VGG-16 are carried out, in which the deep neural network are compressed by 30 to 40 times without any loss of precision. The experimental results show that the deep neural network achieves effective compression without loss of accuracy through the method based on improved clustering, which makes the deployment of deep network on the mobile end possible.
Keywords:deep neural networks  pruning  K-Means ++  deep network compression
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《控制理论与应用》浏览原始摘要信息
点击此处可从《控制理论与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号