首页 | 本学科首页   官方微博 | 高级检索  
     

高维数据挖掘中特征选择的稳健方法
引用本文:李泽安 陈建平 章雅娟 赵为华. 高维数据挖掘中特征选择的稳健方法[J]. 计算机应用, 2013, 33(8): 2194-2197
作者姓名:李泽安 陈建平 章雅娟 赵为华
作者单位:1. 南通大学 计算机科学与技术学院,江苏 南通2260192. 南通大学 计算机科学与技术学院,江苏 南通2260193. 南通大学 理学院,江苏 南通 226019
基金项目:南通大学杏林学院自然科学基金资助项目;南通大学自然科学基金资助项目
摘    要:针对高维数据的特点,即数据中变量个数往往大于样本观测数目,并且数据往往具有异质性特点,基于众数回归分析和变量选择降维技术,提出了一种稳健有效的特征选择方法,利用局部二次逼近算法(LQA)和最大期望(EM)算法,给出估计算法和最优调节参数的选取方法。通过实验的模拟数据分析表明,所提出的特征提取选择方法整体优于基于最小二乘和中位数的正则化估计方法,特别当误差是非正态分布时,与已有方法相比具有较高的预测能力和稳健性。

关 键 词:高维数据  特征选择  众数回归  自适应LASSO  最大期望算法  
收稿时间:2013-03-11
修稿时间:2013-05-06

Robust feature selection method in high-dimensional data mining
LI Zhean CHEN Jianping ZHANG Yajuan ZHAO Weihua. Robust feature selection method in high-dimensional data mining[J]. Journal of Computer Applications, 2013, 33(8): 2194-2197
Authors:LI Zhean CHEN Jianping ZHANG Yajuan ZHAO Weihua
Affiliation:1. College of Computer Science and Technology, Nantong University, Nantong Jiangsu 226019, China
2. Colloge of Science, Nantong University, Nantong Jiangsu 226019, China
Abstract:According to the feature of high-dimensional data, the number of variables is usually larger than the sample size and the data are often heterogeneous, a robust and effective feature selection method was proposed by using the dimensional reduction technique of variable selection and the modal regression based estimation method. The estimation algorithm was given by using Local Quadratic Algorithm (LQA) and Expectation-Maximum (EM) algorithm, and the selection method of the parameter adjustment was also discussed. Data analysis of the simulation shows that the proposed method is overall better than the least square and median regression based regularized method. Compared with the existing methods, the proposed method has higher prediction ability and stronger robustness especially for the non-normal error distribution.
Keywords:high-dimensional data   feature selection   modal regression   adaptive Least Absolute Shrinkage and Selection Operator (LASSO)   Expectation-Maximum (EM) algorithm  
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号