首页 | 本学科首页   官方微博 | 高级检索  
     

距离函数分类法在垃圾邮件过滤中的应用
引用本文:林琛,李弼程.距离函数分类法在垃圾邮件过滤中的应用[J].计算机工程与设计,2007,28(2):322-323,447.
作者姓名:林琛  李弼程
作者单位:解放军信息工程大学信息工程学院,河南郑州450002
摘    要:为了得到实用性强的垃圾邮件过滤方法,将距离函数分类法首次引入到垃圾邮件过滤中.在通用邮件语料库上进行测试,并与目前过滤性能较好的KNN算法进行比较,实验结果显示距离函数分类法中的类中心向量法不适合用于垃圾邮件的过滤,而类重心向量法在保持较高过滤性能的同时,具有训练和过滤速度快的优点,是一种理想实用的垃圾邮件过滤方法.

关 键 词:垃圾邮件  分类  距离函数  类重心向量  垃圾邮件过滤  距离函数  分类法  垃圾邮件  过滤中  应用  filtering  spam  distance  function  based  categorization  过滤速度  训练  中心向量法  显示  结果  实验  比较  算法  过滤性能  测试
文章编号:1000-7024(2007)02-0322-02
修稿时间:2006-02-20

Application of categorization based on distance function in spam filtering
LIN Chen,LI Bi-cheng.Application of categorization based on distance function in spam filtering[J].Computer Engineering and Design,2007,28(2):322-323,447.
Authors:LIN Chen  LI Bi-cheng
Affiliation:College of Information Engineering, PLA Information Engineering University, Zhengzhou 450002, China
Abstract:In order to get applicable performance method of spam filtering, categorization based on distance function is firstly applied to filter spare. It is tested on e-mail corpus and compared with KNN method that is good method in spam filtering. Experimental result show categorization based on category center vector is bad method for spam filtering and categorization based on category centroid method not only is better than KNN in filtering performance, but also the speed of training and filtering is high. It is a good and useful method for spam filtering.
Keywords:spain  categorization  distance function  category centroid  spam filtering
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号