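Simulation and Implementation of Spam Filter Based on Bayesian Algorithm
Cite this article: LIU Hong, CHEN Jing, ZHENG Jian. Simulation and Implementation of Spam Filter Based on Bayesian Algorithm[J]. Journal of Shanghai Dianji University, 2013(4): 224-228.
Authors: LIU Hong, CHEN Jing, ZHENG Jian
Affiliations: [1] School of Electronics and Information, Shanghai Dianji University, Shanghai 200240; [2] Criminal Investigation Department, The Third Research Institute of the Ministry of Public Security, Shanghai 200031
Funding: Shanghai Undergraduate Innovation Program (2012SCXl5); Shanghai Dianji University Research Start-up Fund (13DX02); Shanghai Dianji University Key Discipline Fund (13XKJ01)
Abstract: The Bayesian algorithm is analyzed and studied in depth. In designing the filtering algorithm, it was found that the error rate of a filter simulator based on the Bayesian algorithm depends on the number of sensitive words selected: the more sensitive words and training e-mails are used, the higher the accuracy of the resulting e-mail filter. Weighing practicality against cost, the size of the training set and the number of sensitive words were set to a moderate level suited to actual conditions, and a simulated spam filtering model based on the Bayesian algorithm was designed.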

Keywords: Internet; e-mail; spam filtering; Bayesian algorithm

Simulation and Implementation of Spam Filter Based on Bayesian Algorithm
LIU Hong, CHEN Jing, ZHENG Jian. Simulation and Implementation of Spam Filter Based on Bayesian Algorithm[J]. Journal of Shanghai Dianji University, 2013(4): 224-228.
Authors: LIU Hong, CHEN Jing, ZHENG Jian
Affiliation: 1. School of Electronics and Information, Shanghai Dianji University, Shanghai 200240, China; 2. Criminal Investigation Department, The Third Research Institute of Ministry of Public Security, Shanghai 200031, China
Abstract: This paper analyzes the Bayesian algorithm. It finds that the error rate of a filter simulator based on Bayesian filtering depends on the number of sensitive words selected: the more sensitive words and training e-mails are used, the higher the accuracy of the resulting spam filter. Weighing practicality against cost, the numbers of training e-mails and sensitive words are set according to actual conditions, and a spam filter model based on the Bayesian algorithm is designed.
Keywords: Internet; e-mail; spam filter; simple Bayesian algorithm
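
The dependence of accuracy on the sensitive-word list and the training set can be illustrated with a minimal naive Bayes sketch. This is not the authors' implementation, which is not given on this page; the function names `train` and `spam_probability`, the whitespace tokenizer, the word-presence model, and the Laplace smoothing are all assumptions made for illustration.

```python
import math
from collections import Counter


def train(messages, vocabulary):
    """Count, per class, how many messages contain each sensitive word.

    messages: list of (text, is_spam) pairs (hypothetical training set).
    vocabulary: set of selected sensitive words (hypothetical list).
    """
    doc_counts = {True: 0, False: 0}
    word_counts = {True: Counter(), False: Counter()}
    for text, is_spam in messages:
        doc_counts[is_spam] += 1
        # Each sensitive word is counted at most once per message.
        present = set(text.lower().split()) & vocabulary
        word_counts[is_spam].update(present)
    return doc_counts, word_counts


def spam_probability(text, vocabulary, doc_counts, word_counts):
    """Return P(spam | text) under a naive Bayes word-presence model.

    Assumes the training set contained at least one spam and one
    non-spam message, so both class priors are non-zero.
    """
    total = doc_counts[True] + doc_counts[False]
    tokens = set(text.lower().split())
    log_score = {}
    for label in (True, False):
        score = math.log(doc_counts[label] / total)  # log prior
        for word in vocabulary:
            # Laplace-smoothed P(word present | label).
            p = (word_counts[label][word] + 1) / (doc_counts[label] + 2)
            score += math.log(p if word in tokens else 1 - p)
        log_score[label] = score
    # Normalize in log space to avoid underflow with long word lists.
    top = max(log_score.values())
    weights = {k: math.exp(v - top) for k, v in log_score.items()}
    return weights[True] / (weights[True] + weights[False])


training_set = [
    ("win free prize now", True),
    ("free money offer", True),
    ("meeting schedule tomorrow", False),
    ("project report attached", False),
]
sensitive_words = {"free", "prize", "money", "meeting", "report"}
doc_counts, word_counts = train(training_set, sensitive_words)
```

Enlarging `sensitive_words` or `training_set` gives the model more evidence per message, but every extra word also adds work to each classification, which is the trade-off between accuracy and cost that the abstract describes.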
This article is indexed in VIP (维普) and other databases.