首页 | 本学科首页   官方微博 | 高级检索  
     

数据流分类中的增量特征选择算法
引用本文:李敏,王勇,蔡立军.数据流分类中的增量特征选择算法[J].计算机应用,2010,30(9):2321-2323.
作者姓名:李敏  王勇  蔡立军
作者单位:1. 西北工业大学理学院2. 西北工业大学 计算机学院3. 西北工业大学 理学院
基金项目:国家自然科学基金资助项目 
摘    要:概念流动的出现及数据的高维性增加了数据流特征选择的复杂性。信息增益是最有效的特征选择算法之一,但计算量大。对信息增益做了等价替换,提出一种基于改进信息增益的混合增量特征选择(IFS)算法。该算法首先利用与分类器无关的评价函数选出候选特征集合,然后将分类器作用于候选特征集合,利用分类精度作为评价标准去选择特征子集,在遇到概念漂移时重新选择特征子集。通过在超平面数据集和UCI数据集上的实验,表明基于IFS算法的分类器能够很快地适应概念漂移,并且比基于全部特征的分类算法有更高的精度。

关 键 词:数据流分类  信息增益  增量特征选择  概念漂移  
收稿时间:2010-03-12
修稿时间:2010-05-04

Incremental feature selection algorithm for data stream classification
LI Min,WANG Yong,CAI Li-jun.Incremental feature selection algorithm for data stream classification[J].journal of Computer Applications,2010,30(9):2321-2323.
Authors:LI Min  WANG Yong  CAI Li-jun
Abstract:The complexity of feature selection for real-world data stream will increase because of high-dimensional data and concept drifting. Information gain is one of the most effective feature selections, but its computation is too huge. In order to deal with the problem, the authors proposed an incremental feature selection algorithm based on improved information gain, named IFS. Firstly, the algorithm selected candidate feature set by using independent evaluation function; secondly, feature set was selected with classifer role in candidate feature set. Finally, it selected feature set again while encountering concept drifting. The experiment was operated on moving hyperplane data set and UCI data set. The experimental results show that the proposed approach can adapt to the concept drifting with higher speed and works much better than non-feature selection algorithms.
Keywords:data stream classification                                                                                                                        information gain                                                                                                                        Incremental Feature Selection (IFS)                                                                                                                        concept drifting
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号