首页 | 本学科首页   官方微博 | 高级检索  
     

基于Boosting算法集成遗传模糊分类器的文本分类
引用本文:罗军,况夯.基于Boosting算法集成遗传模糊分类器的文本分类[J].计算机应用,2008,28(9):2386-2388.
作者姓名:罗军  况夯
作者单位:重庆教育学院,计算机与现代教育技术系,重庆,400067
摘    要:提出一种新颖的基于Boosting模糊分类的文本分类方法。首先采用潜在语义索引(LSI)对文本特征进行选择;然后提出Boosting算法集成模糊分类器学习,在每轮迭代训练过程中,算法通过调整训练样本的分布,利用遗传算法产生分类规则。减少分类规则能够正确分类样本的权值,使得新产生的分类规则重点考虑难于分类的样本。实验结果表明,该文本分类算法具有良好分类的性能。

关 键 词:模糊分类  特征选择  潜在语义索引  Boosting算法  文本分类
收稿时间:2008-03-25
修稿时间:2008-05-09

Text categorization based on genetic fuzzy classification and Boosting method
LUO Jun,KUANG Hang.Text categorization based on genetic fuzzy classification and Boosting method[J].journal of Computer Applications,2008,28(9):2386-2388.
Authors:LUO Jun  KUANG Hang
Affiliation:LUO Jun,KUANG Hang(Department of Computer Science , Modern Educational Technology,Chongqing Education College,Chongqing 400067,China)
Abstract:A novel method for text categorization, which is based on boosting fuzzy classification, was proposed in the paper. Latent Semantic Index (LSI) was used to select text feature and then Boosting algorithm was proposed to integrate fuzzy classification. In each iteration training of boosting algorithm, the distribution of training instances was adjusted, and classification rules were created by genetic algorithm. The weights of the training instances that were classified correctly by available rules were reduced, so that the new fuzzy rule focuses on the misestimate or uncovered instances. Experimental results show that classifier based on fuzzy classification is effective and efficient.
Keywords:fuzzy classification  feature selection  Latent Semantic Index (LSI)  Boosting algorithm  text categorization
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号