首页 | 本学科首页   官方微博 | 高级检索  
     

基于KD-Tree的KNN文本分类算法
引用本文:刘忠,刘洋,建晓.基于KD-Tree的KNN文本分类算法[J].网络安全技术与应用,2012(5):38-40.
作者姓名:刘忠  刘洋  建晓
作者单位:桂林理工大学信息科学与工程学院,广西,541004
摘    要:本文将KD-Tree应用到KNN文本分类算法中,先对训练文本集建立一个KD-Tree,然后在KD-Tree中搜索测试文本的所有祖先节点文本,这些祖先节点文本集合就是待测文本的最邻近文本集合,与测试文本有最大相似度的祖先的文本类型就是待测试文本的类型,这种算法大大减少了参与比较的向量文本数目,时间复杂度仅为O(log2N)。实验表明,改进后的KNN文本分类算法具有比传统KNN文本分类法更高的分类效率。

关 键 词:KNN  KNN  文本分类

KNN Algorithm for Text Classification Based on KD-Tree
Liu Zhong,Liu Yang,Jian Xiao.KNN Algorithm for Text Classification Based on KD-Tree[J].Net Security Technologies and Application,2012(5):38-40.
Authors:Liu Zhong  Liu Yang  Jian Xiao
Affiliation:Information Science and Engineering College,Guilin University of Technology,Guangxi,541004,China
Abstract:This paper apply KD-Tree to KNN text classification algorithm,firstly put a training text set into a KD-Tree,then search KD-Tree for the all parents nodes of the tested text node,the set including these parents text nodes is the most nearest text set,the type of the tested text is the same as the type of the most nearest text which has the most similarity with the test text,this algorithm decreases the number of the compared texts,and the time complexity is o(log2N).Experiments show that the improved KNN text classification algorithm is better than the traditional KNN text classification in classification efficiency.
Keywords:KNN  KNN  Text Categorization
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号