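Spam SMS Filtering Algorithm Based on Complex Networks
Cite this article: HUANG Wen-Liang, LIU Yong, ZHONG Zhi-Qiang, SHEN Zhong-Ming. Spam SMS filtering algorithm based on complex networks[J]. Acta Automatica Sinica, 2009, 35(7): 990-996.
Authors: HUANG Wen-Liang, LIU Yong, ZHONG Zhi-Qiang, SHEN Zhong-Ming
Affiliation: 1. State Key Laboratory of Industrial Control Technology, Zhejiang University, Hangzhou 310027
Funding: Supported by the National Natural Science Foundation (60803053), the National Postdoctoral Science Foundation (20070420231), and the National Doctoral Science Foundation (20081459)
Abstract: Identifying and filtering users who send spam short messages has considerable research value and social importance. As spam messages with new forms and content emerge, traditional filtering methods based on keyword matching and on sending speed and frequency can no longer handle the problem effectively. Building on a formal representation of the SMS sending/receiving network, and taking real SMS sending and receiving records together with voice-call relationship data as an example, this paper measures and analyzes the network characteristics of the SMS sending network. It then analyzes and mines the abnormal sending and receiving patterns and behaviors of spam senders on the network, and on that basis proposes a filtering algorithm based on the degree of voice-call association and the SMS reply ratio (the NASFA algorithm). Experiments and analysis show that the algorithm identifies spam senders efficiently while effectively controlling the rate at which normal users are misidentified as spam senders.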

Keywords: complex network; scale-free network; spam SMS filtering; power law; out-in degree ratio
Received: 2008-02-29
Revised: 2008-12-16

Complex Network Based SMS Filtering Algorithm
HUANG Wen-Liang, LIU Yong, ZHONG Zhi-Qiang, SHEN Zhong-Ming. Complex Network Based SMS Filtering Algorithm[J]. Acta Automatica Sinica, 2009, 35(7): 990-996.
Authors: HUANG Wen-Liang, LIU Yong, ZHONG Zhi-Qiang, SHEN Zhong-Ming
Affiliation: 1. State Key Laboratory of Industrial Control Technology, Zhejiang University, Hangzhou 310027; 2. Institute of Cyber-Systems and Control, Zhejiang University, Hangzhou 310027; 3. China United Network Communication Corporation, Zhejiang Branch, Hangzhou 310006
Abstract: Recognizing and filtering spam short messages (SMS) is an important problem. Because the contents and formats of spam messages are diverse, ordinary filtering methods based on keyword matching and sending speed cannot tackle this problem effectively. This paper first presents a formalized representation of the SMS network. Using real short message samples, the social characteristics of the SMS network are analyzed. Further analysis and statistical work uncover the abnormal patterns of spam senders in the SMS network. An $N$-degree association spam filter algorithm (NASFA) based on these abnormal patterns is presented. Experiments and analysis show that the algorithm recognizes spam senders efficiently and that the misrecognition rate is reduced significantly.
Keywords: complex network; scale-free network; spam short messages (SMS) filter; power law; out-in degree ratio
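
This record describes the filtering signals only at a high level: the out-in degree ratio of a sender in the SMS network, how often its recipients reply, and how closely it is tied to its recipients by voice calls. The paper's exact definitions and thresholds are not given here, so the following Python sketch is not NASFA itself. It only illustrates, under stated assumptions, how such per-sender features might be computed from a log of `(sender, receiver)` pairs. The function names, the feature definitions, and every threshold parameter are hypothetical.

```python
from collections import defaultdict


def sender_features(sms_log, call_pairs):
    """Compute illustrative per-sender features (assumed definitions).

    sms_log:    iterable of (sender, receiver) tuples, one per message.
    call_pairs: set of frozenset({a, b}) for users with a voice-call relationship.
    """
    out_nbrs = defaultdict(set)  # sender   -> distinct receivers
    in_nbrs = defaultdict(set)   # receiver -> distinct senders
    for sender, receiver in sms_log:
        out_nbrs[sender].add(receiver)
        in_nbrs[receiver].add(sender)

    features = {}
    for user, receivers in out_nbrs.items():
        out_deg = len(receivers)
        in_deg = len(in_nbrs.get(user, ()))
        # Receivers who sent at least one message back to this user.
        replied = sum(1 for r in receivers if user in out_nbrs.get(r, ()))
        # Receivers who also share a voice-call relationship with this user.
        called = sum(1 for r in receivers if frozenset((user, r)) in call_pairs)
        features[user] = {
            "out_degree": out_deg,
            # Assumption: clamp the in-degree to 1 to avoid division by zero.
            "out_in_ratio": out_deg / max(in_deg, 1),
            "reply_ratio": replied / out_deg,
            "voice_ratio": called / out_deg,
        }
    return features


def flag_suspects(features, min_out_degree, max_reply_ratio, max_voice_ratio):
    """Return senders with many recipients but few replies and few call ties.

    All three thresholds are hypothetical tuning parameters.
    """
    return sorted(
        user
        for user, f in features.items()
        if f["out_degree"] >= min_out_degree
        and f["reply_ratio"] <= max_reply_ratio
        and f["voice_ratio"] <= max_voice_ratio
    )
```

Requiring a minimum out-degree before flagging keeps low-volume users out of the suspect list, in the spirit of the stated goal of limiting how often normal users are misidentified.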
This article is indexed in CNKI and other databases.