首页 | 本学科首页   官方微博 | 高级检索  
     

基于关联特征扩展的特征选择算法
引用本文:古平,朱庆生,何希平,李云峰.基于关联特征扩展的特征选择算法[J].计算机工程,2007,33(16):150-152.
作者姓名:古平  朱庆生  何希平  李云峰
作者单位:重庆大学计算机学院,重庆,400044
基金项目:国家自然科学基金 , 重庆市自然科学基金
摘    要:特征选择是文档分类中常见的预处理工作,通过对文档特征空间降维,可以提高文档的分类性能。针对多数特征选择算法不考虑特征词共现关系的问题,该文提出了一种利用关联特征来增强文档分类性能的方法,针对特征扩展后产生的高维向量空间设计了一种快速冗余特征去除和选择算法,以满足实际应用中对增强特征分类性能和执行效率的需要。实验采用朴素贝叶斯网作为分类器,从特征降维效果、分类性能以及算法执行效率等方面与其他算法进行了比较。

关 键 词:文档分类  特征选择  关联特征
文章编号:1000-3428(2007)16-0150-03
修稿时间:2006-08-30

Feature Selection Algorithm Based on Association Features Enhancement
GU Ping,ZHU Qing-sheng,HE Xi-ping,LI Yun-feng.Feature Selection Algorithm Based on Association Features Enhancement[J].Computer Engineering,2007,33(16):150-152.
Authors:GU Ping  ZHU Qing-sheng  HE Xi-ping  LI Yun-feng
Affiliation:School of Computer Science, Chongqing University, Chongqing 400044
Abstract:Feature selection is frequently used as a preprocessing step to text classification, which is effective in reducing dimensionality and increasing classification accuracy. However, most feature selection algorithms fail to take advantage of the co-occurrence of words. This paper explores the use of association features to enhance the performance of primitive features and proposes a new fast algorithm for identifying relevant features as well as redundancy among high dimensional features. The experiment are conducted with Naive Bayes, it compares the method with other feature selection algorithms with respect to the feature numbers, accuracy and effectiveness.
Keywords:text classification  feature selection  association feature
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号