首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于模糊粗糙集的快速特征选择算法
引用本文:张晓,杨燕燕. 一种基于模糊粗糙集的快速特征选择算法[J]. 数据采集与处理, 2019, 34(3): 538-547
作者姓名:张晓  杨燕燕
作者单位:1.西安理工大学应用数学系,西安,710048;2.清华大学自动化系,北京,100084
基金项目:国家自然科学基金61602372,61806108;西安理工大学博士研究启动基金109-256081504;中国博士后基金2018M631475国家自然科学基金(61602372,61806108)资助项目;西安理工大学博士研究启动基金(109-256081504) 资助项目;中国博士后基金(2018M631475)资助项目
摘    要:模糊粗糙集由于能够处理实数值数据,甚至是混合值数据中的不确定性受到人们的广泛关注,其最重要的应用之一是特征选择,相关的特征选择方法已有不少研究,但其快速的特征选择算法研究很少。实际中的数据一般含有噪声点或信息含量低的样例,如果对数据集先筛选出代表样例,再对筛选的样例集进行数据挖掘便会降低挖掘计算量。本文基于模糊粗糙集,先根据样例的模糊下近似值对样例进行筛选,然后利用筛选样例的模糊粗糙信息熵构造特征选择的评估度量,并给出相应的特征选择算法,从而降低了算法的计算复杂度。数值试验表明该快速算法具有有效性,并且对控制筛选样例个数的参数给出了建议。

关 键 词:模糊粗糙集  样例选择  特征选择  信息熵
收稿时间:2018-04-23
修稿时间:2019-02-19

Fast Feature Selection Algorithm Based on Fuzzy Rough Sets
Zhang Xiao,Yang Yanyan. Fast Feature Selection Algorithm Based on Fuzzy Rough Sets[J]. Journal of Data Acquisition & Processing, 2019, 34(3): 538-547
Authors:Zhang Xiao  Yang Yanyan
Affiliation:1.Department of Applied Mathematics, Xi''an University of Technology, Xi''an, 710048, China;2.Department of Automation, Tsinghua University, Beijing, 100084, China
Abstract:Fuzzy rough set theory has been paid much attention since it can be used to deal with the uncertainty in the real-valued data or even the mixed data. One of the most important applications of fuzzy rough sets is feature selection, and there have existed many related feature selection methods. However, little attention has been paid on fast feature selection algorithms. Data collected in practice generally include noises or possess some instances with less information. Considering to previously select representative instances from the original data set and perform data mining algorithms on the selected instances set, one may reduce the computation of the algorithms. In view of the advantage of instance selection, the instances are firstly selected based on fuzzy rough sets according to the values of the fuzzy lower approximation of instances in this paper. Then, the evaluation measure of feature selection is constructed by using fuzzy rough set-based information entropy of the selected instances, and the corresponding feature selection algorithm is provided to alleviate the computational complexity. Some numerical experiments are conducted to show the efficiency of the proposed fast algorithm, and the reasonable suggestion of the critical parameter is given to determine the number of the selected instances.
Keywords:fuzzy rough sets  instance selection  feature selection  information entropy
点击此处可从《数据采集与处理》浏览原始摘要信息
点击此处可从《数据采集与处理》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号