
Double-stage spam filtering method based on rough set
Cite this article: DENG Wei-bin, HONG Zhi-yong. Double-stage spam filtering method based on rough set[J]. Journal of Computer Applications (计算机应用), 2010, 30(8): 2006-2009
Authors: DENG Wei-bin  HONG Zhi-yong
Affiliations: 1. School of Economics and Management, Chongqing University of Posts and Telecommunications 2. Wuyi University, Jiangmen, Guangdong
Funding: Key Project of the Natural Science Foundation of Chongqing; Natural Science Foundation of Chongqing University of Posts and Telecommunications
Abstract: How to effectively combine the header information and the body content of an email for spam filtering has drawn considerable attention from researchers. Because rough sets handle uncertain information well, a two-stage email filtering method based on rough sets is proposed. In the first stage, emails are classified by their header information as legitimate, spam, or suspicious. In the second stage, the suspicious emails are classified by their body content as legitimate or spam. Experiments on an English and a Chinese email data set show that the proposed method not only improves spam filtering accuracy but also substantially reduces the false-positive rate, that is, the rate at which legitimate emails are misclassified as spam.
Keywords: rough set   naive Bayes   feature selection   spam filtering
Received: 2010-02-01
Revised: 2010-02-28
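
The two stages described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the header attributes, the attribute reduction, or the body classifier. The sketch assumes that stage one treats emails with identical header attribute values as one equivalence class, accepts a class only when all its training labels agree (the rough-set lower approximation), and sends everything else (the boundary region, plus unseen attribute combinations) to stage two. Using naive Bayes in stage two is also an assumption, suggested only by the keywords. All function names and the example attributes are hypothetical.

```python
import math
from collections import Counter, defaultdict


def train_header_stage(rows):
    """rows: iterable of (header_attrs_tuple, label), label in {'spam', 'ham'}.
    Emails with identical attribute values form one equivalence class;
    we record which labels occur in each class."""
    labels_by_class = defaultdict(set)
    for attrs, label in rows:
        labels_by_class[attrs].add(label)
    return dict(labels_by_class)


def classify_header(model, attrs):
    """Return 'spam' or 'ham' when the equivalence class is consistent
    (lower approximation); otherwise 'suspicious' (boundary or unseen)."""
    labels = model.get(attrs)
    if labels is not None and len(labels) == 1:
        return next(iter(labels))
    return "suspicious"


def train_body_stage(docs):
    """docs: iterable of (tokens, label). Multinomial naive Bayes counts.
    Assumes at least one training document per class."""
    word_counts = {"ham": Counter(), "spam": Counter()}
    doc_counts = Counter()
    for tokens, label in docs:
        word_counts[label].update(tokens)
        doc_counts[label] += 1
    vocab = set(word_counts["ham"]) | set(word_counts["spam"])
    return word_counts, doc_counts, vocab


def classify_body(model, tokens):
    """Pick the class with the higher log-posterior (Laplace smoothing).
    'ham' is scored first, so exact ties resolve to 'ham'."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    scores = {}
    for label in ("ham", "spam"):
        total_words = sum(word_counts[label].values())
        score = math.log(doc_counts[label] / total_docs)
        for token in tokens:
            if token in vocab:  # ignore words never seen in training
                score += math.log(
                    (word_counts[label][token] + 1) / (total_words + len(vocab))
                )
        scores[label] = score
    return max(scores, key=scores.get)


def two_stage_filter(header_model, body_model, attrs, tokens):
    """Stage 1 decides clear cases from headers; stage 2 resolves the rest."""
    first = classify_header(header_model, attrs)
    if first != "suspicious":
        return first
    return classify_body(body_model, tokens)
```

In this sketch an email reaches the body classifier only when its header evidence is contradictory or unseen, so a legitimate email with a consistently "ham" header pattern is never passed to the content stage. That is one plausible reading of how a two-stage design could lower the false-positive rate.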
This article is indexed in Wanfang Data (万方数据) and other databases.