首页 | 本学科首页   官方微博 | 高级检索  
     

一种有效的垃圾邮件过滤新方法
引用本文:林琛,李弼程.一种有效的垃圾邮件过滤新方法[J].计算机应用,2006,26(8):1980-1982.
作者姓名:林琛  李弼程
作者单位:信息工程大学,信息工程学院,河南,郑州,450002
摘    要:受到信息粒度原理的启发,给出了一种有效的垃圾邮件过滤新方法。该方法训练过程是将训练样本集合中合法邮件类和垃圾邮件类拆分成四个小类,得到四个小类的类中心向量,从粒度原理角度来看,就是采用更细的粒度来描述训练样本的先验知识。过滤过程则将新进来的邮件分别与四个小类的类中心向量进行相似度比较,最终来判定所属类别。在公共垃圾邮件语料库上测试新方法,同时与目前过滤性能较高的KNN方法进行比较,结果显示新方法具有过滤精度高,过滤速度快等优点。

关 键 词:垃圾邮件过滤  粒度  KNN
文章编号:1001-9081(2006)08-1980-03
收稿时间:2006-02-13
修稿时间:2006-02-132006-05-09

New effective method for spam filtering
LIN Chen,LI Bi-cheng.New effective method for spam filtering[J].journal of Computer Applications,2006,26(8):1980-1982.
Authors:LIN Chen  LI Bi-cheng
Affiliation:College of Information Engineering, Information Engineering University, Zhengzhou Henan 450002, China
Abstract:A new effective method for spam filtering according to the principle of granularity was presented. First, this method divided spam class and legit class in train corpus into four small classes, and four center vectors were obtained. In the view of the principle of granularity, smaller granularity is used to describe knowledge in train corpus. When faltering, the new E-mail was compared with four center vectors respectively to decide which class it belonged to. This method was tested on spain corpus and compared with KNN. The results show that the new method has some advantages including high accuracy, high speed of filtering and so on.
Keywords:KNN
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号