首页 | 本学科首页   官方微博 | 高级检索  
     

基于中心法的多层次文本分类方法的研究
引用本文:何尧,陈治平,杨亚蕾.基于中心法的多层次文本分类方法的研究[J].信息技术,2007,31(12):116-118.
作者姓名:何尧  陈治平  杨亚蕾
作者单位:福建工程学院计算机系,福州,350014
摘    要:中心法运算速度快,效率高,而多层次分类器能有效地应对较多类别的分类任务,为此,提出了基于中心法的多层次分类法,通过分析大量类别之间的关系,把类别组织成树状结构,并在特征选择时,根据层次结构特色采取去根处理,在分类时采用中心法来进行分类。经过实验,与一般的层次分类算法、平面分类算法进行比较,该分类法具有较好的性能。

关 键 词:中心法  多层次分类  文本分类
文章编号:1009-2552(2007)12-0116-03
修稿时间:2007年8月6日

Research on a hierarchical text categorization method based on centroid
HE Yao,CHEN Zhi-ping,YANG Ya-lei.Research on a hierarchical text categorization method based on centroid[J].Information Technology,2007,31(12):116-118.
Authors:HE Yao  CHEN Zhi-ping  YANG Ya-lei
Abstract:Centroid-based methods is a high efficient class of methods for text categorization,and hierarchical classification can deal with the classification task of the many categories efficiently.So this paper presents a new approach that combines these two methods,through analyzing the relationship between many classis,organizing the categorize into a tree structure,reducing each word to its root to decrease the effect of word variations on the classification,and automatically classifies a document into one or more predefined categories using centroid-based method.Finally,the experiment results show that the new approach,proposed in this paper,outperforms the flat or generic hierarchical methods with improved accuracy.
Keywords:centroid-based methods  hierarchical classification  text categorization
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号