首页 | 本学科首页   官方微博 | 高级检索  
     

基于蚁群算法的文本分类和聚类
引用本文:李燕,张月国,李生红.基于蚁群算法的文本分类和聚类[J].信息安全与通信保密,2009(10):57-58.
作者姓名:李燕  张月国  李生红
作者单位:1. 上海交通大学信息安全工程学院,上海,200240
2. 上海交通大学信息安全工程学院,上海,200240;上海交通大学电子工程系,上海,200240
基金项目:国家自然科学基金资助项目,上海市曙光计划项目:863计划项目,教育部新世纪优秀人才支持计划项目 
摘    要:为了研究并提高文本的分类和聚类算法的性能,笔者根据蚁群算法在TSP问题中的应用方法,将其改进引用到文本的分聚类中。在文本聚类中,改变蚂蚁的信息素释放机制,道路节点的聚合方式,最终将相似文本进行聚合。在文本的分类中,将所需要的分类信息装入蚂蚁,蚂蚁根据系统外部所希望的方式将文本分类。实验结果证明,这种新的算法可以使文本分类和聚类的准确度提高,蚁群算法在文本分类聚类中的应用是可行的。

关 键 词:蚁群算法  文本分类  文本聚类

Text Classification and Clustering Based on Ant Colony Algorithms
LI Yan,ZHANG Yue-guo,LI Sheng-hong.Text Classification and Clustering Based on Ant Colony Algorithms[J].China Information Security,2009(10):57-58.
Authors:LI Yan  ZHANG Yue-guo  LI Sheng-hong
Affiliation:LI Yan, ZHANG Yue-guo, LI Sheng-hong(a. School of Information Security; b. School of Electronic, Information and Electrical Engineering Shanghai Jiaotong University, Shanghai 200240, China)
Abstract:In order to study and improve performance of text classification and clustering, the authors, based on the usage of ant colony algorithm in solving the TSP(travelling salesman problem), modify and use this algorithm in the text classification and clustering. When this algorithm is used to cluster texts, the way for releasing ants' pheromone, and the mode for clustering path-nodes as well should be changed, and finally the similar texts are placed together. In text classification, the information must be told to the ants, which indicates the final categories and is wanted before the process. The experiment indicates the facts that this new method could increase the rate of accuracy, and that the ant colony algorithm could be used in text classification and clustering.
Keywords:ant colony algorithm  text classification  text clustering
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号