首页 | 本学科首页   官方微博 | 高级检索  
     

特征的支持度与其分类能力的关系研究
引用本文:尹建芹,田国会,魏军,李金屏,林佳本.特征的支持度与其分类能力的关系研究[J].电子学报,2015,43(2):248-254.
作者姓名:尹建芹  田国会  魏军  李金屏  林佳本
作者单位:1. 济南大学信息科学与工程学院 山东省网络环境智能计算技术重点实验室, 山东济南 250022; 2. 中国科学院太阳活动重点实验室, 北京 100012; 3. 山东大学控制科学与工程学院, 山东济南 250061
基金项目:国家自然科学基金(No .61075092,No .61203341,No .61173079);山东省高等学校科技发展计划(No .TJY1112);中科院太阳活动重点实验室开放课题
摘    要:频繁模式挖掘在分类问题中得到了广泛的应用,大量的工作利用频繁模式挖掘对分类问题进行特征选择,但对于为什么频繁模式挖掘可以在分类问题中进行有效的特征选择则缺乏系统的研究.为了为频繁模式挖掘在分类问题中的特征选择应用提供理论基础,需要确立特征的支持度与特征分类能力之间的关系,本文以特征的信息增益作为分类能力的评价准则,讨论其与特征支持度之间的联系.首先证明了信息增益是特征支持度的上凸函数;然后,在二类问题和多类问题情况下,分别证明了具有低支持度或高支持度的特征具有有限的信息增益,即具有低支持度或高支持度的特征具有有限的分类能力.最后,通过仿真实验验证了支持度与信息增益之间的关系,为频繁模式挖掘在分类问题中的应用提供了理论基础.

关 键 词:频繁模式  分类  特征选择  信息增益  
收稿时间:2013-11-05

Research on the Relationship of the Support and the Discriminative Ability for Classification of Features
YIN Jian-qin,TIAN Guo-hui,WEI Jun,LI Jin-ping,LIN Jia-ben.Research on the Relationship of the Support and the Discriminative Ability for Classification of Features[J].Acta Electronica Sinica,2015,43(2):248-254.
Authors:YIN Jian-qin  TIAN Guo-hui  WEI Jun  LI Jin-ping  LIN Jia-ben
Affiliation:1. Shandong Provincial Key Laboratory of Network Based Intelligent Computing, School of Information Science and Engineering, University of Jinan, Jinan, Shandong 250022, China; 2. Key Laboratory of Solar Activity, National Astronomical Observatories, Chinese Academy of Sciences, Beijing 100012, China; 3. School of Control Science and Engineering, Shandong University, Jinan, Shandong 250061, China
Abstract:Frequent pattern mining is used widely in feature selection for classification problem.In order to provide theoretical basis for the application,we established the relationship between the classification discriminative ability and the support of the feature.Information gain was adopted as evaluation criteria,and we discussed the connection between the support of the feature and its discriminative ability.Firstly,we proved the information gain is a concave function about the support of the feature;secondly,we proved the conclusion that the feature with too-high or too-low support has limited discriminative ability under the two classes and multiple classes circumstances separately;Finally,simulation experiments validate our conclusions.And the conclusion provides a theoretical basis for the application of frequent pattern mining in classification problems.
Keywords:frequent pattern  classification  feature selection  information gain
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《电子学报》浏览原始摘要信息
点击此处可从《电子学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号