首页 | 本学科首页   官方微博 | 高级检索  
     

基于改进的朴素贝叶斯算法在垃圾短信过滤中的研究
引用本文:张东亮,董礼.基于改进的朴素贝叶斯算法在垃圾短信过滤中的研究[J].计算机测量与控制,2012,20(2):526-528,551.
作者姓名:张东亮  董礼
作者单位:秦皇岛职业技术学院,河北秦皇岛,066100
摘    要:研究了基于SVM算法的改进朴素贝叶斯文本分类算法及在垃圾短信过滤中的应用。针对朴素贝叶斯算法条件独立性假设、过分依赖于样本空间的分布和内在不稳定性的缺陷,造成了算法时间复杂度的增加,提出了改进的基于SVM算法的朴素贝叶斯算法垃圾短信过滤的解决方案,充分结合了朴素贝叶斯算法高效分类和SVM算法增量学习及不依赖样本空间的特点;首先利用结构风险最小化原理和非线性变换将分类问题转化为二次寻优问题,最后利用朴素贝叶斯算法过滤短信,提高分类的准确度和稳定性;仿真实验结果表明,该算法能够快速得到最优分类特征子集,有效提高了垃圾短信过滤的准确率和分类速度。

关 键 词:SVM  文本分类  朴素贝叶斯  垃圾短信

Research of SMS Spam Filtering Based on Optimized NAIVE Bayesian Algorithm
Zhang Dongliang , Dong Li.Research of SMS Spam Filtering Based on Optimized NAIVE Bayesian Algorithm[J].Computer Measurement & Control,2012,20(2):526-528,551.
Authors:Zhang Dongliang  Dong Li
Affiliation:Zhang Dongliang,Dong Li(Qinhuangdao Institute of Technology,Qinhuangdao 066100,China)
Abstract:This paper discusses improvement of native Bayesian text classification algorithms based on the SVM algorithm and applications in SMS spam filtering.For Bayesian algorithms requiring for assumptions of the conditional’s independence,over-reliance on the distribution of sample space and the inherent instability of the defect,resulting in an increase in time complexity,a SVM-based algorithm solution is proposed to improve the simple Bayesian spam messages filtering,which is combined with efficient algorithms Bayesian classification and the advantage of SVM algorithm that it can incremental learns and does not rely on the characteristics of the sample space.First make structural risk minimization principle and the classification of non-linear transform into the second optimization problem,and finally the Bayesian filters the messages,to improve the classification accuracy and stability.Simulation results show that the algorithm can quickly obtain the optimal feature subset classification,effectively improve the accuracy of spam SMS filtering and classification speed.
Keywords:SVM  text classification  Bayesian  spam messages
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号