首页 | 本学科首页   官方微博 | 高级检索  
     

一种改进的基于条件互信息的特征选择算法
引用本文:王卫玲,刘培玉,初建崇.一种改进的基于条件互信息的特征选择算法[J].计算机应用,2007,27(2):433-435.
作者姓名:王卫玲  刘培玉  初建崇
作者单位:1. 山东师范大学,信息科学与工程学院,山东,济南,250014
2. 海军航空工程学院,训练部,山东,烟台,264001
摘    要:目前在文本分类领域较常用到的特征选择算法中,仅仅考虑了特征与类别之间的关联性,而对特征与特征之间的关联性没有予以足够的重视,这导致了特征之间预测能力的相互削弱,无法选出最有效的特征。提出了一种新的用于文本分类的特征选择算法(CMIM),它可以帮助选出区分能力强、弱相关的特征。经实验验证,CMIM比传统的特征选择算法具有更好的性能。

关 键 词:特征选择  文本分类  条件互信息
文章编号:1001-9081(2007)02-0433-03
收稿时间:2006-08-17
修稿时间:2006-08-22

Improved feature selection algorithm with conditional mutual information
WANG Wei-ling,LIU Pei-yu,CHU Jian-chong.Improved feature selection algorithm with conditional mutual information[J].journal of Computer Applications,2007,27(2):433-435.
Authors:WANG Wei-ling  LIU Pei-yu  CHU Jian-chong
Affiliation:1. College of Computer Science and Engineering, School of Shandong Normal University, Jinan Jiangsu 250014, China;2. Department of Training, Naval Aeronautical Engineering Institute, Yantai Shandong 264001, China
Abstract:Traditional feature selection algorithms have a common drawback, i.e. they do not consider the mutual relationships between features. It can result in that one feature's predictive power is weakened by others and the lost of efficiency. In this paper, we proposed a new feature selection method called Conditional Mutual Information Maximin (CMIM).It can select a set of individually discriminating and weakly dependent features. Simulation results demonstrate that the proposed method can improve the precision of text classification.
Keywords:feature selection  text categorization  conditional mutual information
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号