首页 | 本学科首页   官方微博 | 高级检索  
     

基于聚类的类别模糊邮件过滤方法
引用本文:郎加云,胡学钢.基于聚类的类别模糊邮件过滤方法[J].计算机系统应用,2010,19(9):147-150.
作者姓名:郎加云  胡学钢
作者单位:合肥工业大学,计算机与信息学院,安徽,合肥,230009
基金项目:安徽省基金课题(090412044)
摘    要:目前各种基于规则的分类方法在电子邮件过滤中起到了良好的效果,在邮件过滤器的训练中,训练集中会存在部分邮件具有邮件类别模糊的现象,如何将训练集中的此类类别界限模糊的邮件提取出来将会对邮件的分类效果有明显提高的作用。提出一种基于聚类的过滤方法,根据界限模糊邮件数据之间的共性特征,对邮件训练集进行聚类。实验表明,与单纯的进行基于规则的分类算法相比,这种方法在各项评价指标上具有优越性。

关 键 词:聚类  文本分类  垃圾邮件
收稿时间:2009/12/17 0:00:00
修稿时间:2010/1/18 0:00:00

Clustering-Based Email Filtering Method with Hazy Category
LANG Jia-Yun and HU Xue-Gang.Clustering-Based Email Filtering Method with Hazy Category[J].Computer Systems& Applications,2010,19(9):147-150.
Authors:LANG Jia-Yun and HU Xue-Gang
Affiliation:(Department of Computer and Information, Hefei University of Technology, Hefei 230009,China)
Abstract:Presently, a variety of rule-based classification methods in e-mail filtering obtain good results. In the training of e-mail filtering, the training set has the notion that some e-mail messages will be sent to the hazy category. Extracting these e-mails from training set will have a noticeable increase in the results of classification. Therefore, a clustering-based filtering method is proposed in this paper. The common features of the hazy-category email include cluster the training set. Experiments demonstrate that the method has better performance on the appraisal standard than that of a simple rule-based classification algorithm.
Keywords:clustering  text categorization  spam
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号