首页 | 本学科首页   官方微博 | 高级检索  
     

一种改进的贝叶斯邮件过滤算法
引用本文:夏超,徐德华.一种改进的贝叶斯邮件过滤算法[J].计算机与现代化,2010(10):125-128,132.
作者姓名:夏超  徐德华
作者单位:同济大学经济与管理学院,上海,200092
基金项目:国家自然科学基金资助项目 
摘    要:贝叶斯过滤算法是反垃圾邮件过滤技术中应用最为广泛的方法之一。考虑到邮件的错误分类对邮件接收者带来的损失不同,引入判定垃圾邮件是判定正常邮件的λ倍作为最终邮件分类依据;同时,为了提高贝叶斯过滤算法的分类质量,运用遗传算法来对邮件中正文和标题的特征词在邮件分类中不同的重要程度做区分。最后用实际的邮件样本对改进后的算法进行验证,验证结果表明,利用遗传算法优化配合贝叶斯过滤算法能有效提高邮件分类的质量。

关 键 词:贝叶斯  反垃圾邮件  遗传算法

An Improved Bayesian Mail Filtering Algorithm
XIA Chao,XU De-hua.An Improved Bayesian Mail Filtering Algorithm[J].Computer and Modernization,2010(10):125-128,132.
Authors:XIA Chao  XU De-hua
Affiliation:(College of Economics and Management,Tongji University,Shanghai 200092,China)
Abstract:Bayesian filtering algorithm is one of most widely used methods of anti-spam filtering technology.Taking into account the fact that the wrong classification of the mail causes different losses to recipients,so introducing a message that if judging as a spam mail is λ times that of judging as a normal mail,it can conclude that this is a spam mail.Meanwhile,in order to improve the quality of classification,the paper uses genetic algorithm to distinguish between tokens in the body and tokens in the subject.Finally,using the sample to validate the improved algorithm,the result shows that using new algorithm can improve the quality of the message classification.
Keywords:Bayesian  anti-spam mail  genetic algorithm
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号