首页 | 本学科首页   官方微博 | 高级检索  
     

基于粗糙集的特征选择方法的研究
引用本文:陈思睿,张永,杨志勇.基于粗糙集的特征选择方法的研究[J].计算机工程与应用,2006,42(21):159-161.
作者姓名:陈思睿  张永  杨志勇
作者单位:兰州理工大学计算机与通信学院,兰州,730050
摘    要:文本自动分类是指将文本按照一定的策略归于一个或多个类别中的应用技术。文本分类是文本挖掘的基础,而特征选择又是文本分类中的核心。论文分析了以前特征选择方法中由于特征数目过多而造成分类时间和精度不高的缺点,提出了一种基于粗糙集的特征选择方法,其特点是以特征在文本分类中的重要性对特征进行选择。最后通过实验验证了该算法,证明该方法是可行的。

关 键 词:特征选择  属性约简  文本挖掘
文章编号:1002-8331-(2006)21-0159-03
收稿时间:2005-07-01
修稿时间:2005-07-01

The Research of the Feature Selection Method Based on Rough Set
Chen Sirui,Zhang Yong,Yang Zhiyong.The Research of the Feature Selection Method Based on Rough Set[J].Computer Engineering and Applications,2006,42(21):159-161.
Authors:Chen Sirui  Zhang Yong  Yang Zhiyong
Affiliation:School of Computer and Communication,Lanzhou University of Technology,Lanzhou 730050
Abstract:The technique of text automatic category is to classify texts into one or more classes according to some strategy.Text categorization is the foundation of text mining,and feature selection is the core of text categorization.This paper analyses feature selection methods disadvantages which causes the low of times and accuracy of text categorization because of many features.We propose a feature selection method based on rough set.It is the characteristic of the method that features based on the importance in the text categorization are selected.It is shown by our experiment that this method is feasible.
Keywords:feature selection  reduction of attributes  text mining
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号