首页 | 本学科首页   官方微博 | 高级检索  
     

R-means:以关联规则为簇中心的文本聚类
引用本文:龙昊,冯剑琳,李曲.R-means:以关联规则为簇中心的文本聚类[J].计算机科学,2005,32(9):156-159.
作者姓名:龙昊  冯剑琳  李曲
作者单位:华中科技大学计算机科学与技术系,武汉,430074
基金项目:本文受国家自然科学基金(编号60303030)和重庆自然科学基金(编号8721)支持.
摘    要:本文将k-means与关联规则(或频繁项目集)相结合,提出了一种新的文本聚类算法R-means.R-means算法以关联规则作为簇中心,通过类似于k-meams的迭代优化得到最终的簇.因此R-means不仅继承了k-means的简单性,而且用关联规则产生的簇描述易于为人们所理解.在几个实际数据集上的实验表明该算法可以得到高精度和高性能.

关 键 词:关联规则  频繁项目集  簇中心  关联文本聚类  R-means算法  信息检索

R-means: Exploiting Association Rules as Means for Text Clustering
LONG Hao,FENG Jian-Lin,LI Qu.R-means: Exploiting Association Rules as Means for Text Clustering[J].Computer Science,2005,32(9):156-159.
Authors:LONG Hao  FENG Jian-Lin  LI Qu
Affiliation:Department of Computer Science, Huazhong University of Science and Technology, Wuhan 430074
Abstract:This paper proposes a new text clustering algorithm called R-means which integrates k-means with associa- tion rule (or frequent itemset). R-means exploits association rules as means of clusters and refines clusters by an itera- tive procedure which is similar to that of k-means. R-means not only inherits the simplicity of k-means, but also gener- ates more comprehensive cluster labels which are described by association rules. The experiments with several real data sets have demonstrated that R-means can achieve quite well precision and high performance.
Keywords:Association rules  Frequent itemset  Means of clusters  Associative text clustering
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机科学》浏览原始摘要信息
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号