首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于N-Gram的垃圾邮件过滤方法研究
引用本文:林伟,柳荣其,徐熙.一种基于N-Gram的垃圾邮件过滤方法研究[J].计算机应用与软件,2010,27(2):121-123.
作者姓名:林伟  柳荣其  徐熙
作者单位:1. 四川警察学院计算机系,四川,泸州,646000;西华大学数学与计算机学院,四川,成都,610039
2. 西华大学数学与计算机学院,四川,成都,610039
基金项目:四川省青年软件创新工程项目(2007AA42)
摘    要:为了能够有效提取邮件样本集的特征及提高垃圾邮件过滤系统的性能,介绍基于N—Gram的切分算法及语言模型,在其基础上,提出了一种改进的N—Gram切分算法,给出了一种结合N—Gram语言模型的贝叶斯过滤模型。实验结果表明,提出的方法有效地提高了垃圾邮件过滤的性能。

关 键 词:邮件过滤  N—Gram  贝叶斯模型  特征选择

ON APPROACH OF SPAM FILTERING BASED ON N-GRAM
Lin Wei,Liu Rongqi,Xu Xi.ON APPROACH OF SPAM FILTERING BASED ON N-GRAM[J].Computer Applications and Software,2010,27(2):121-123.
Authors:Lin Wei  Liu Rongqi  Xu Xi
Affiliation:Department of Computer Science/a>;Sichuan Police College/a>;Luzhou 646000/a>;Sichuan/a>;China;School of Mathematics and Computer Engineering/a>;Xihua University/a>;Chengdu 610039/a>;China
Abstract:In order to extract E-mail samples' features effectively and improve the performance of spam filtering system,this paper introduces segmentation algorithm and language model based on N-Gram.Then according to that an improved N-Gram segmentation algorithm is proposed,a Bayesian filtering model integrating the N-Gram model is given as well.Experimental results show that the improved approach is effective in improving the performance of spam filtering.
Keywords:Spam filtering N-Gram Bayesian model Feature selection  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号