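Text density clustering algorithm with optimized threshold values
Cite this article: MA Suqin, SHI Huaji. Text density clustering algorithm with optimized threshold values[J]. Computer Engineering and Applications, 2011, 47(17): 134-136.
Authors: MA Suqin, SHI Huaji
Affiliation: School of Computer Science and Telecommunication Engineering, Jiangsu University, Zhenjiang, Jiangsu 212013
Funding: National Natural Science Foundation of China; National Torch Program project
Abstract: The clustering performance of the DBSCAN algorithm degrades under the influence of a single global threshold. To address this, a text density clustering algorithm with optimized thresholds is proposed. The algorithm sorts the objects by k-nearest-neighbor distance, uses quantiles to separate the sequences of differing density, finds the optimized threshold corresponding to each sequence, and then clusters the objects with a density clustering method using those optimized thresholds. The improved algorithm overcomes the influence of threshold selection on the clustering result and improves clustering accuracy and time efficiency. Clusters are stored in a tree structure, which makes them more readable. Experimental results demonstrate the effectiveness of the algorithm.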

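Keywords: text mining; text clustering; a density clustering method based on high-density connected regions (DBSCAN); a text density clustering algorithm with optimized threshold values; quantile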

Text density clustering algorithm with optimized threshold values
MA Suqin, SHI Huaji. Text density clustering algorithm with optimized threshold values[J]. Computer Engineering and Applications, 2011, 47(17): 134-136.
Authors: MA Suqin, SHI Huaji
Affiliation: School of Computer Science and Telecommunication Engineering, Jiangsu University, Zhenjiang, Jiangsu 212013, China
Abstract: A text density clustering algorithm with optimized threshold values is proposed to address the reduced clustering performance of the DBSCAN algorithm caused by its global threshold values. The proposed algorithm sorts objects by k-nearest-neighbor distance, uses quantiles to distinguish sequences of different densities, finds the optimized threshold value corresponding to each sequence, and then clusters the objects with a density clustering algorithm based on those optimized threshold values. The improved algorithm overcomes the dependence of clustering performance on threshold selection and improves clustering accuracy and time efficiency. Clusters are stored in a tree structure, which makes them easier to read. The experimental results show the effectiveness of the algorithm.
Keywords: text mining; text clustering; Density-Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm; Text Density Clustering Algorithm with Optimized Threshold Values (TDCAOTV) algorithm; quantile
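
The pipeline the abstract describes (sort by k-nearest-neighbor distance, split the sorted sequence by quantiles, derive one threshold per density level, then run density clustering per threshold) can be sketched roughly as below. This is an illustrative reading only, not the authors' TDCAOTV implementation: the abstract does not give the exact threshold rule, so the sketch assumes that each equal-frequency quantile band contributes its largest k-distance as eps, that min_pts is k + 1, and that levels are processed from densest to sparsest. All function names are hypothetical, and 2-D Euclidean points stand in for text feature vectors.

```python
import math

def k_distances(points, k):
    """Distance from each point to its k-th nearest neighbour (brute force)."""
    out = []
    for i, p in enumerate(points):
        d = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        out.append(d[k - 1])
    return out

def eps_per_band(kdist, n_bands):
    """ASSUMED rule: sort the k-distances, cut them into equal-frequency
    (quantile) bands, and take the largest k-distance in each band as eps."""
    s = sorted(kdist)
    n = len(s)
    bounds = [max(1, round(n * b / n_bands)) for b in range(1, n_bands + 1)]
    return [s[hi - 1] for hi in bounds]

def dbscan_pass(points, idx, eps, min_pts, labels, next_label):
    """Plain DBSCAN restricted to the indices in idx (unlabelled so far)."""
    def neighbours(i):
        return [j for j in idx if math.dist(points[i], points[j]) <= eps]

    for i in idx:
        if labels[i] is not None:
            continue
        seeds = neighbours(i)
        if len(seeds) < min_pts:
            continue  # not a core point at this eps; a later, larger eps may claim it
        labels[i] = next_label
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] is not None:
                continue
            labels[j] = next_label
            nb = neighbours(j)
            if len(nb) >= min_pts:  # only core points keep expanding the cluster
                queue.extend(nb)
        next_label += 1
    return next_label

def multi_eps_dbscan(points, k=3, n_bands=2):
    """Run one DBSCAN pass per eps, densest level first; None marks noise."""
    labels = [None] * len(points)
    next_label = 0
    for eps in eps_per_band(k_distances(points, k), n_bands):
        idx = [i for i, lab in enumerate(labels) if lab is None]
        next_label = dbscan_pass(points, idx, eps, k + 1, labels, next_label)
    return labels

# Two 3x3 grids with very different spacing: a dense and a sparse cluster.
dense = [(x * 0.1, y * 0.1) for x in range(3) for y in range(3)]
sparse = [(10.0 + x, 10.0 + y) for x in range(3) for y in range(3)]
labels = multi_eps_dbscan(dense + sparse, k=3, n_bands=2)
print(labels)
```

Taking the band maximum as eps is sensitive to outliers in the sparsest band; a lower quantile within each band would be a reasonable alternative under the same assumptions.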
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.