首页 | 本学科首页   官方微博 | 高级检索  
     

基于文本聚类的映射聚类算法研究
引用本文:黄建春,邹汉斌,李晓峰.基于文本聚类的映射聚类算法研究[J].计算机工程与设计,2007,28(6):1264-1266.
作者姓名:黄建春  邹汉斌  李晓峰
作者单位:1. 湖南文理学院,电气工程系,湖南,常德,415000
2. 湖南文理学院,计算机系,湖南,常德,415000
摘    要:文本聚类在信息过滤和网页分类等方面有着较好的应用,可是它面临数据维数很高的难点.由于维度很高,使得经典的聚类算法难以有效处理.针对这个问题给出了一种快速鲁棒的映射聚类算法,其中利用关联规则查询簇的相关维,然后使用相关维进行进一步的分析.实验结果说明了该算法具有速度快以及较好的鲁棒性等特点,可以应用在文本聚类中.

关 键 词:文本聚类  映射聚类  关联规则  鲁棒  高维  文本聚类  映射聚类  算法研究  clustering  algorithm  text  鲁棒性  速度  结果  实验  分析  使用  相关维  规则查询  关联  利用  快速  问题  处理  聚类算法  数据维数
文章编号:1000-7024(2007)06-1264-03
修稿时间:2006-04-11

Research on text clustering-based projected clustering algorithm
HUANG Jian-chun,ZOU Han-bin,LI Xiao-feng.Research on text clustering-based projected clustering algorithm[J].Computer Engineering and Design,2007,28(6):1264-1266.
Authors:HUANG Jian-chun  ZOU Han-bin  LI Xiao-feng
Affiliation:1. Department of Electrial Engineering, Hunan University of Arts and Science, Changde 415000, China; 2. Department of Computer Science, Hunan University of Arts and Science, Changde 415000, China
Abstract:Document clustering had been employed in information filtering,web page classification and so on.The difficulty of it is that the dimension of the data set is so high,which makes some algorithms difficult to be implemented.Therefore,a fast and robust projected clustering algorithm is presented here.The clustering algorithm first utilize the association rule method to get the relevant di-mensions of each cluster,and then further adopt these relevant dimensions to find the proper clusters.The experimental results demonstrate the proposed algorithm is fast and robust and can be applied in document clustering.
Keywords:text clustering  projected clustering  association rule  robust  high dimension
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号