首页 | 本学科首页   官方微博 | 高级检索  
     

基于Spark的矩阵分解推荐算法
引用本文:郑凤飞,黄文培,贾明正.基于Spark的矩阵分解推荐算法[J].计算机应用,2015,35(10):2781-2783.
作者姓名:郑凤飞  黄文培  贾明正
作者单位:西南交通大学 信息科学与技术学院, 成都 611756
摘    要:针对传统矩阵分解算法在处理海量数据信息时所面临的处理速度和计算资源的瓶颈问题,利用Spark在内存计算和迭代计算上的优势,提出了Spark框架下的矩阵分解并行化算法。首先,依据历史数据矩阵初始化用户因子矩阵和项目因子矩阵;其次,迭代更新因子矩阵,将迭代结果置于内存中作为下次迭代的输入;最后,迭代结束时得到矩阵推荐模型。通过在GroupLens网站上提供的MovieLens数据集上的实验结果表明,加速比(Speedup)值达到了线性的结果,该算法可以提高协同过滤推荐算法在大数据规模下的执行效率。

关 键 词:协同过滤  推荐算法  矩阵分解  迭代最小二乘法  Spark  
收稿时间:2015-06-01
修稿时间:2015-07-06

Matrix factorization recommendation algorithm based on Spark
ZHENG Fengfei,HUANG Wenpei,JIA Mingzheng.Matrix factorization recommendation algorithm based on Spark[J].journal of Computer Applications,2015,35(10):2781-2783.
Authors:ZHENG Fengfei  HUANG Wenpei  JIA Mingzheng
Affiliation:School of Information Science and Technology, Southwest Jiaotong University, Chengdu Sichuan 611756, China
Abstract:In order to solve the bottleneck problems of processing speed and resource allocation, a Spark based matrix factorization recommendation algorithm was proposed. Firstly, user factor matrix and item factor matrix were initialized according to historical data. Secondly, factor matrix was iteratively updated and the result was stored in memory as the input of next iteration. Finally, recommendation model was generated when iteration ended. The experiment on MovieLens shows that the speedup is linear and the proposed Spark based algorithm can save time and significantly improve the execution efficiency of collaborative filtering recommendation algorithm.
Keywords:collaborative filtering                                                                                                                        recommendation algorithm                                                                                                                        matrix factorization                                                                                                                        Alternating Least Square (ALS)                                                                                                                        Spark
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号