Large Quantity of Text Classification Based on the Improved KNFL Algorithm
Citation: Zhang Xianfei, Li Bicheng, Liu Anfei. Large Quantity of Text Classification Based on the Improved KNFL Algorithm[J]. Control & Automation (微计算机信息), 2005(21).
Authors: Zhang Xianfei, Li Bicheng, Liu Anfei
Affiliation: Department of Information Science, PLA Information Engineering Institute, Zhengzhou, Henan 450002 (all three authors)
Funding: Supported by the Henan Provincial Department of Education fund, project no. sp200303099
Abstract: K-nearest feature line (KNFL) is a classification algorithm proposed in recent years and widely used in face recognition. It holds that a line joining two points in a class space can represent the characteristics of that class better than the individual points within the class. Classifying by feature-line distance alone, however, can cause misclassification. To eliminate the influence of within-class outliers on classification, this paper introduces a weighting coefficient and combines it with the concept of class-center distance to form an improved algorithm, which is then applied to large-scale text classification. Experimental results show that the improved algorithm raises text classification accuracy and substantially reduces the classifier's dependence on training-set size.

Keywords: K-nearest feature line; outlier; class-center distance
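
For readers unfamiliar with the feature-line idea the abstract relies on, the following is a minimal sketch of the plain, unweighted K-nearest-feature-line rule: the query is projected onto the line through each pair of same-class training points, and the k closest lines vote. It is not the authors' improved algorithm. The abstract does not specify the weighting coefficient or how the class-center distance is combined, so neither is implemented here. The function names, the parameter `k`, and the toy vectors are assumptions made for illustration only.

```python
import math
from collections import Counter
from itertools import combinations


def feature_line_distance(x, a, b):
    """Distance from query x to the straight line through prototypes a and b."""
    direction = [bi - ai for ai, bi in zip(a, b)]
    offset = [xi - ai for xi, ai in zip(x, a)]
    denom = sum(d * d for d in direction)
    if denom == 0.0:
        # Degenerate line (identical prototypes): fall back to point distance.
        return math.sqrt(sum(o * o for o in offset))
    # mu locates the projection of x on the line: p = a + mu * (b - a).
    mu = sum(o * d for o, d in zip(offset, direction)) / denom
    residual = [o - mu * d for o, d in zip(offset, direction)]
    return math.sqrt(sum(r * r for r in residual))


def knfl_predict(x, prototypes_by_class, k=3):
    """Majority vote over the k nearest feature lines (plain K-NFL, no weighting)."""
    scored = []
    for label, points in prototypes_by_class.items():
        # Every pair of training points in a class defines one feature line.
        for a, b in combinations(points, 2):
            scored.append((feature_line_distance(x, a, b), label))
    scored.sort(key=lambda item: item[0])
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]


# Hypothetical two-dimensional toy data; real text vectors would be much longer.
train = {
    "a": [[0.0, 0.0], [1.0, 0.0], [2.0, 0.1]],
    "b": [[0.0, 5.0], [1.0, 5.0], [2.0, 5.2]],
}
predicted = knfl_predict([0.5, 0.2], train, k=3)
```

Because a feature line extends indefinitely beyond its two end points, a query can lie close to a line built from points that are themselves far away or atypical of the class. That is the kind of misclassification the abstract attributes to using feature-line distance alone.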
This article is indexed in CNKI and other databases.