首页 | 本学科首页   官方微博 | 高级检索  
     


Exploit latent Dirichlet allocation for collaborative filtering
Authors:Zhoujun Li  Haijun Zhang  Senzhang Wang  Feiran Huang  Zhenping Li  Jianshe Zhou
Affiliation:1.State Key Laboratory of Software Development Environment,Beihang University,Beijing,China;2.School of Information,Beijing Wuzi University,Beijing,China;3.College of Computer Science and Technology,Nanjing University of Aeronautics and Astronautics,Nanjing,China;4.Collaborative Innovation Center of Novel Software Technology and Industrialization,Nanjing,China;5.Beijing Advanced Innovation Center for Imaging Technology,Capital Normal University,Beijing,China
Abstract:Previous work on the one-class collaborative filtering (OCCF) problem can be roughly categorized into pointwise methods, pairwise methods, and content-based methods. A fundamental assumption of these approaches is that all missing values in the user-item rating matrix are considered negative. However, this assumption may not hold because the missing values may contain negative and positive examples. For example, a user who fails to give positive feedback about an item may not necessarily dislike it; he may simply be unfamiliar with it. Meanwhile, content-based methods, e.g. collaborative topic regression (CTR), usually require textual content information of the items, and thus their applicability is largely limited when the text information is not available. In this paper, we propose to apply the latent Dirichlet allocation (LDA) model on OCCF to address the above-mentioned problems. The basic idea of this approach is that items are regarded as words, users are considered as documents, and the user-item feedback matrix constitutes the corpus. Our model drops the strong assumption that missing values are all negative and only utilizes the observed data to predict a user’s interest. Additionally, the proposed model does not need content information of the items. Experimental results indicate that the proposed method outperforms previous methods on various ranking-oriented evaluation metrics. We further combine this method with a matrix factorization-based method to tackle the multi-class collaborative filtering (MCCF) problem, which also achieves better performance on predicting user ratings.
Keywords:latent Dirichlet allocation  one-class collaborative filtering  multi-class collaborative filtering  
本文献已被 SpringerLink 等数据库收录!
点击此处可从《Frontiers of Computer Science》浏览原始摘要信息
点击此处可从《Frontiers of Computer Science》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号