
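Text Clustering Based on an Improved DBSCAN Algorithm
Cite this article: CAI Yue, YUAN Jin-sheng. Text Clustering Based on Improved DBSCAN Algorithm[J]. Computer Engineering, 2011, 37(12): 50-52.
Authors: CAI Yue, YUAN Jin-sheng
Affiliation: School of Information, Beijing Forestry University, Beijing 100083, China
Abstract: Most current clustering algorithms do not adapt well to the need for fast, self-adaptive text clustering. To address this, the paper discusses the basic principle and implementation of the DBSCAN algorithm and proposes a text clustering algorithm based on an improved DBSCAN. The algorithm uses the least squares method to reduce the dimensionality of the text vectors and creates a cluster relation tree structure for use with DBSCAN. Experimental results show that the algorithm can cluster text adaptively and achieves higher accuracy than DBSCAN.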

Keywords: DBSCAN algorithm; text clustering; least squares method; cluster relation tree
Received: 2010-12-10

Text Clustering Based on Improved DBSCAN Algorithm
CAI Yue,YUAN Jin-sheng.Text Clustering Based on Improved DBSCAN Algorithm[J].Computer Engineering,2011,37(12):50-52.
Authors:CAI Yue  YUAN Jin-sheng
Affiliation:(School of Information,Beijing Forestry University,Beijing 100083,China)
Abstract:Most clustering algorithms cannot meet the speed and self-adaptation demands of text clustering. This paper therefore reviews the basic principle and implementation of the DBSCAN algorithm and proposes a text clustering algorithm based on an improved DBSCAN. The least squares method is used to reduce the dimensionality of the text vectors, and a cluster-tree structure is created to make the algorithm strongly self-adapting. Experimental results show that the algorithm clusters text adaptively and is more accurate than DBSCAN.
Keywords:DBSCAN algorithm  text clustering  least square method  cluster-tree
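
For readers unfamiliar with the baseline, the following is a minimal sketch of standard DBSCAN in plain Python. It is not the improved algorithm proposed in the paper: the least squares dimensionality reduction and the cluster-tree are not reproduced here. The function names `region_query` and `dbscan`, the parameters `eps` and `min_pts`, the use of Euclidean distance, and the toy 2-D points in the usage note are all assumptions made for illustration.

```python
from math import dist  # Euclidean distance, Python 3.8+

NOISE = -1  # label assumed here for points that belong to no cluster


def region_query(points, i, eps):
    """Return indices of all points within eps of points[i], including i itself."""
    return [j for j, q in enumerate(points) if dist(points[i], q) <= eps]


def dbscan(points, eps, min_pts):
    """Label each point with a cluster id (0, 1, ...) or NOISE.

    A point is a core point if its eps-neighborhood holds at least
    min_pts points; clusters grow outward from core points.
    """
    labels = [None] * len(points)  # None means "not yet visited"
    cluster_id = -1

    for i in range(len(points)):
        if labels[i] is not None:
            continue

        neighbors = region_query(points, i, eps)
        if len(neighbors) < min_pts:
            # Provisionally noise; may later be claimed as a border point.
            labels[i] = NOISE
            continue

        # points[i] is a core point: start a new cluster and expand it.
        cluster_id += 1
        labels[i] = cluster_id
        seeds = list(neighbors)
        k = 0
        while k < len(seeds):
            j = seeds[k]
            k += 1
            if labels[j] == NOISE:
                labels[j] = cluster_id  # border point reached from a core point
            if labels[j] is not None:
                continue  # already assigned to a cluster
            labels[j] = cluster_id
            j_neighbors = region_query(points, j, eps)
            if len(j_neighbors) >= min_pts:
                # points[j] is also a core point, so keep expanding through it.
                seeds.extend(j_neighbors)

    return labels
```

As a usage example under these assumptions, calling `dbscan([(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10), (50, 50)], eps=1.5, min_pts=3)` returns `[0, 0, 0, 1, 1, 1, -1]`: two dense groups and one noise point. The result depends on the hand-chosen `eps` and `min_pts`, which is the kind of manual tuning that an adaptive variant would aim to reduce.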
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.