基于多隐层Gibbs采样的深度信念网络训练方法 A Deep Belief Networks Training Strategy Based on Multi-hidden Layer Gibbs Sampling期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于多隐层Gibbs采样的深度信念网络训练方法

引用本文：	史科,陆阳,刘广亮,毕翔,王辉.基于多隐层Gibbs采样的深度信念网络训练方法[J].自动化学报,2019,45(5):975-984.

作者姓名：	史科陆阳刘广亮毕翔王辉

作者单位：	1.合肥工业大学计算机与信息学院合肥 230009;;2.安全关键工业测控技术教育部工程研究中心合肥 230009

基金项目：	国家自然科学基金61572167国家重点研发计划专项2016YFC0801405国家重点研发计划专项2016YFC0801804

摘要：	深度信念网络（Deep belief network，DBN）作为一类非常重要的概率生成模型，在多个领域都有着广泛的用途.现有深度信念网的训练分为两个阶段，首先是对受限玻尔兹曼机（Restricted Boltzmann machine，RBM）层自底向上逐层进行的贪婪预训练，使得每层的重构误差最小，这个阶段是无监督的；随后再对整体的权值使用有监督的反向传播方法进行精调.本文提出了一种新的DBN训练方法，通过多隐层的Gibbs采样，将局部RBM层组合，并在原有的逐层预训练和整体精调之间进行额外的预训练，有效地提高了DBN的精度.本文同时比较了多种隐层的组合方式，在MNIST和ShapeSet以及Cifar10数据集上的实验表明，使用两两嵌套组合方式比传统的方法错误率更低.新的训练方法可以在更少的神经元上获得比以往的训练方法更好的准确度，有着更高的算法效率.
关键词：	深度信念网络受限玻尔兹曼机 Gibbs采样对比散度
收稿时间：	2017-11-22
A Deep Belief Networks Training Strategy Based on Multi-hidden Layer Gibbs Sampling

SHI Ke,LU Yang,LIU Guang-Liang,BI Xiang,WANG Hui.A Deep Belief Networks Training Strategy Based on Multi-hidden Layer Gibbs Sampling[J].Acta Automatica Sinica,2019,45(5):975-984.

Authors:	SHI Ke LU Yang LIU Guang-Liang BI Xiang WANG Hui

Affiliation:	1. School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230009;;2. Engineering Research Center of Safety Critical Industry Measure and Control Technology, Ministry of Education, Hefei 230009

Abstract:	Deep belief network (DBN) is a very important probabilistic generative model that can be used in many areas. The current training approach of DBN involves two phases. The first is a fully unsupervised pre-training process, which is a down-top and layer-by-layer one to train the restricted Boltzmann machine (RBM) layers, making the reconstruction error of each layer minimal. The second is a supervised stage which uses the back propagation to fine-tune the entire parameters of the model. In this paper, a new training strategy for DBN is proposed. Between the current two training phases, this paper introduces another training strategy to combine multiple local RBMs into an overall probability model for multi hidden layer Gibbs sampling, which effectively improves the accuracy of DBN. This paper has compared a variety of combinations of RBM layers, experiments on the MNIST, ShapeSet and Cifar10 dataset show that our method outperforms the existing training algorithms for DBN. The new algorithm can achieve better accuracy with fewer neurons, also achieves higher algorithm efficiency.

Keywords:	Deep belief network(DBN) restricted Boltzmann machine(RBM) Gibbs sampling contrastive divergence(CD)
本文献已被维普等数据库收录！
	点击此处可从《自动化学报》浏览原始摘要信息
	点击此处可从《自动化学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏