首页 | 本学科首页   官方微博 | 高级检索  
     

Spark下BP神经网络并行化算法研究
引用本文:吴正江,陈如校,张霄宏.Spark下BP神经网络并行化算法研究[J].测控技术,2018,37(3):23-27.
作者姓名:吴正江  陈如校  张霄宏
作者单位:河南理工大学计算机科学与技术学院,河南焦作,454003
基金项目:国家自然科学基金资助项目(11601129)
摘    要:BP算法(反向传播算法)以其良好的非线性逼近能力、泛化能力以及实用性成为了人工神经网络训练算法中应用最为广泛的算法.但同时使用BP算法又存在收敛速度较慢、易陷入局部极小值等问题.为了将BP算法用于大规模数据分类问题,采用MapReduce思想,将大数据集切分成若干小的数据集来并行加速处理,同时引入Bagging算法的思想来综合并行结果,提高分类的准确率.通过在各个节点上根据子数据集独立地训练各个BP神经网络,直至各网络收敛,再将各节点上的网络收集起来进行集成,形成最终的分类器.基于Spark平台的实验表明,本文提出的算法具有良好的并行加速性能,且具有较高的分类准确率.

关 键 词:神经网络算法  并行  Spark  Bagging  neural  network  algorithm  parallel  Spark  Bagging

Research on Parallel Algorithm of BP Neural Network Under Spark
WU Zheng-jiang,CHEN Ru-xiao,ZHANG Xiao-hong.Research on Parallel Algorithm of BP Neural Network Under Spark[J].Measurement & Control Technology,2018,37(3):23-27.
Authors:WU Zheng-jiang  CHEN Ru-xiao  ZHANG Xiao-hong
Abstract:BP algorithm has become the most widely used algorithm in artificial neural network training algorithm with its good nonlinear approximation ability,generalization ability and practicability,but it has some problems such as slow convergence rate and easy to fall into local minimum.In order to use the BP algorithm for large-scale data classification,MapReduce was used to divide large data set into several small data set to accelerate parallel processing.At the same time,Bagging algorithm was introduced to synthesize the parallel result and improve the classification accuracy.By training each BP neural network independently on each node according to the sub-data set until the network convergence,the network on each node was collected and integrated to form the final classifier.Experiments based on Spark platform show that the proposed algorithm has good parallel acceleration performance and high classification accuracy.
Keywords:neural network algorithm  parallel  Spark  Bagging
本文献已被 万方数据 等数据库收录!
点击此处可从《测控技术》浏览原始摘要信息
点击此处可从《测控技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号