首页 | 本学科首页   官方微博 | 高级检索  
     

一种针对同音词伪装的反垃圾短信系统设计
引用本文:胡德敏,胡金龙.一种针对同音词伪装的反垃圾短信系统设计[J].计算机工程与应用,2013,49(2):92-96.
作者姓名:胡德敏  胡金龙
作者单位:上海理工大学 光电信息与计算机工程学院,上海 200093
摘    要:近年来随着垃圾短信过滤技术的进步,垃圾短信的特征也在发生变化,其中利用同音词伪装的垃圾短信,就能轻松逃避很多过滤系统的拦截。针对这个问题,利用同音词伪装其拼音不变的特点,提出了以拼音串作为提取垃圾短信特征的关键字,从短信中提取出普通向量和伪装向量,并分别作为输入量,进行相互独立的贝叶斯过滤的方法,最后综合两次过滤的结果,判断是否为垃圾短信。实验结果表明,该方法能有效地识利用同音字伪装的垃圾短信。

关 键 词:垃圾短信  贝叶斯分类  分词  概率  提取  

System design against spam message disguised with homonym
HU Demin , Hu Jinlong.System design against spam message disguised with homonym[J].Computer Engineering and Applications,2013,49(2):92-96.
Authors:HU Demin  Hu Jinlong
Affiliation:School of Optical-Electrical and Computer Engineering, Shanghai 200093, China
Abstract:As the progress of the spam message filtering technology,characteristics of spam message are changing all the time.Of them,spam message disguised with homonym can easily escape from filtering system.Feature that homonym shares same pinyin makes it possible that by replacing Key words with pinyin it can pick up common vector and disguised vector.Making such two vectors as input of the filter system based on Bayesian respectively,it can get two independent outputs,by analyzing the outputs,the system can tell the spam message from the normal.Experimental result confirms that this system can identify spam message disguised with homonym effectively.
Keywords:spam message  Bayesian classification  words spit  possibility  extract
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号