首页 | 本学科首页   官方微博 | 高级检索  
     

基于贝叶斯公式的最小损失垃圾邮件过滤算法
引用本文:谢金晶,张艺濒.基于贝叶斯公式的最小损失垃圾邮件过滤算法[J].现代电子技术,2006,29(24):55-57.
作者姓名:谢金晶  张艺濒
作者单位:武汉大学,计算机学院,湖北,武汉,430072
摘    要:为了减少将合法邮件误判为垃圾邮件的误报率及将垃圾邮件误判为合法邮件的漏报率的损失,首先基于现有的文本特征提取评估函数:期望交叉熵及互信息提出一种新的评估函数。利用此函数可提取到更具有代表性的邮件特征向量。在此之上提出一种基于贝叶斯公式可减少损失的垃圾邮件过滤方法。经过仿真测试后,发现基于新评估函数的新方法可有效降低误报率和漏报率。

关 键 词:贝叶斯公式  评估函数  最小损失  垃圾邮件
文章编号:1004-373X(2006)24-055-03
收稿时间:2006-06-11
修稿时间:2006年6月11日

Minimizing Cost Filtering Algorithm for Spam E-mail Based on Bayesian
XIE Jinjing,ZHANG Yibin.Minimizing Cost Filtering Algorithm for Spam E-mail Based on Bayesian[J].Modern Electronic Technique,2006,29(24):55-57.
Authors:XIE Jinjing  ZHANG Yibin
Affiliation:Computer College,Wuhan University,Wuhan,430072,China
Abstract:To minimize the cost of wrong report rate that mistake the legal mails as spare and missing report rate that mistake the spam as legal mails,flrst a new evaluation function which based on existing evaluation function of text feature extraetion; expectation cross entropy and mutual information is brought forward in this paper. Using this function,we can get more representational eigenvector from email. And then this paper presents a minimizing cost anti- spare filtering algorithm based on Bayesian. After some simulation tests, it found that new algorithm based on new evaluation function can cut down wrong report rate and missing report rate efficiently.
Keywords:Bayesian  evaluation function  cost minimizing  spam
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号