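Study on distributed improved clustering collaborative filtering recommendation algorithm based on Hadoop
Cite this article: SUN Tianhao, LI Anneng, LI Ming, ZHU Qingsheng. Study on distributed improved clustering collaborative filtering recommendation algorithm based on Hadoop[J]. Computer Engineering and Applications, 2015, 51(15): 124-128.
Authors: SUN Tianhao  LI Anneng  LI Ming  ZHU Qingsheng
Affiliation: College of Computer Science, Chongqing University, Chongqing 400044, China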
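Abstract: To mitigate the sparsity and scalability problems that collaborative filtering recommendation algorithms face on big data, this paper proposes a distributed improved clustering collaborative filtering recommendation algorithm built on the Hadoop platform. On the distributed platform, the high-dimensional sparse data is preprocessed offline with a matrix factorization algorithm; once sparsity has been reduced, an improved item clustering algorithm builds the clustering model; the recommendation candidate space is then formed from the clustering model and similarity computation, and recommendation is completed online. Experiments verify that the algorithm effectively improves the recommendation quality of the recommender system and greatly improves recommendation efficiency, while scaling well in cloud environments.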

Keywords: collaborative filtering  Hadoop  matrix factorization  clustering  distributed computing

Study on distributed improved clustering collaborative filtering algorithm based on Hadoop
SUN Tianhao,LI Anneng,LI Ming,ZHU Qingsheng.Study on distributed improved clustering collaborative filtering algorithm based on Hadoop[J].Computer Engineering and Applications,2015,51(15):124-128.
Authors:SUN Tianhao  LI Anneng  LI Ming  ZHU Qingsheng
Affiliation:College of Computer Science, Chongqing University, Chongqing 400044, China
Abstract: To address the data sparsity and scalability problems of collaborative filtering recommendation algorithms on big data, this paper proposes a distributed improved clustering collaborative filtering algorithm based on Hadoop that integrates matrix factorization with distributed computing. Offline, it uses the ALS matrix factorization algorithm to fill in the sparse data. The filled matrix is then clustered with an improved item clustering algorithm. Based on the clusters and the similarity computation, the algorithm builds the candidate set for recommendation, and recommendations are completed online. Experimental results show that the proposed algorithm effectively improves the quality of the recommendation system and scales well in cloud environments.
Keywords:collaborative filtering  Hadoop  matrix factorization  clustering  distributed computing  
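
The offline/online pipeline summarized in the abstract can be illustrated with a minimal single-machine sketch. It is not the authors' implementation: it assumes NumPy in place of Hadoop, plain k-means in place of the paper's improved item clustering (whose details the abstract does not give), and a cosine-weighted average as the similarity step. The function names `als_fill`, `kmeans`, and `recommend`, the parameter defaults, and the toy rating matrix are all hypothetical.

```python
import numpy as np

def als_fill(R, mask, rank=2, reg=0.1, iters=20, seed=0):
    """Offline step 1: ALS on observed ratings, then fill the missing entries."""
    rng = np.random.default_rng(seed)
    n_users, n_items = R.shape
    U = rng.normal(scale=0.1, size=(n_users, rank))
    V = rng.normal(scale=0.1, size=(n_items, rank))
    ridge = reg * np.eye(rank)
    for _ in range(iters):
        for u in range(n_users):          # fix V, solve each user's factors
            idx = mask[u]
            if idx.any():
                Vu = V[idx]
                U[u] = np.linalg.solve(Vu.T @ Vu + ridge, Vu.T @ R[u, idx])
        for i in range(n_items):          # fix U, solve each item's factors
            idx = mask[:, i]
            if idx.any():
                Ui = U[idx]
                V[i] = np.linalg.solve(Ui.T @ Ui + ridge, Ui.T @ R[idx, i])
    filled = U @ V.T
    filled[mask] = R[mask]                # keep the observed ratings unchanged
    return filled

def kmeans(X, k, iters=20, seed=0):
    """Offline step 2 (assumption): plain k-means over item vectors."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)].copy()
    labels = np.zeros(len(X), dtype=int)
    for _ in range(iters):
        dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for c in range(k):
            if (labels == c).any():
                centers[c] = X[labels == c].mean(axis=0)
    return labels

def cosine(a, b):
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0

def recommend(user, R, mask, filled, labels, top_n=2):
    """Online step: score unrated items using rated items in the same cluster."""
    rated = np.flatnonzero(mask[user])
    scores = {}
    for item in np.flatnonzero(~mask[user]):
        candidates = [j for j in rated if labels[j] == labels[item]]
        if not candidates:
            continue                      # no rated neighbour in this cluster
        sims = np.array([cosine(filled[:, item], filled[:, j]) for j in candidates])
        weight = np.abs(sims).sum()
        if weight > 0:
            scores[int(item)] = float(sims @ R[user, candidates] / weight)
    return sorted(scores, key=scores.get, reverse=True)[:top_n]

# Toy data (hypothetical): rows are users, columns are items, 0 means unrated.
R = np.array([[5, 4, 0, 0],
              [4, 5, 0, 1],
              [0, 0, 5, 4],
              [1, 0, 4, 5]], dtype=float)
mask = R > 0
filled = als_fill(R, mask)
labels = kmeans(filled.T, k=2)            # cluster items by their filled columns
recs = recommend(0, R, mask, filled, labels)
print(recs)
```

Restricting the neighbour search to one cluster is what keeps the online step cheap in this sketch; in a Hadoop deployment the two offline steps would presumably run as distributed jobs, which the sketch does not attempt to reproduce.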
This article is indexed in Wanfang Data and other databases.