首页 | 本学科首页   官方微博 | 高级检索  
     

基于Log似然比的特征选择算法
引用本文:林森,唐发根.基于Log似然比的特征选择算法[J].计算机工程,2009,35(19):56-58,6.
作者姓名:林森  唐发根
作者单位:北京航空航天大学计算机学院,北京,100083
摘    要:针对基于向量空间模型文本分类系统中特征选择算法存在的问题,提出一种基于Log似然比的特征选择算法,引进Log似然比统计量,在考虑稀有事件对分类结果产生正面影响的同时,较好地控制其对分类产生的负面影响。采用KNN分类方法,将Log似然比特征选择算法与典型特征算法进行比较,实验结果表明,该算法能够获得良好的性能。

关 键 词:文本分类  向量空间模型  特征选择
修稿时间: 

Feature Selection Algorithm Based on Log Likelihood Ratio
LIN Sen,TANG Fa-gen.Feature Selection Algorithm Based on Log Likelihood Ratio[J].Computer Engineering,2009,35(19):56-58,6.
Authors:LIN Sen  TANG Fa-gen
Affiliation:(School of Computer, Beijing University of Aeronautics and Astronautics, Beijing 100083)
Abstract:Aiming at the problems in feature selection algorithm of text classification system based on vector space model, a feature selection algorithm based on Log likelihood ratio is proposed, which introduces the Log likelihood ratio statistic, and considers the positive impact on classification results by uncommon events, while controlling the negative ones. It is compared with typical feature algorithm by using K Nearest Neighbor(KNN) method. Experimental results show this algorithm can obtain better performance.
Keywords:text categorization  vector space model  feature selection
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号