首页 | 本学科首页   官方微博 | 高级检索  
     

基于集成学习和社交关系识别骚扰诈骗不良号码的方法
引用本文:周旭莹,周宇飞,林宇俊,钱湖海,许鑫伶. 基于集成学习和社交关系识别骚扰诈骗不良号码的方法[J]. 电信工程技术与标准化, 2021, 34(12)
作者姓名:周旭莹  周宇飞  林宇俊  钱湖海  许鑫伶
作者单位:中移(杭州)信息技术有限公司,杭州 310000;中国移动通信集团有限公司,北京 100053
摘    要:本文基于中国移动网络通信数据和业务数据,探索“多数据汇聚、多技术融合”的新型数据挖掘方式。在研究手段上,首先采用XGBoost集成学习算法精准分类号码,有效区分骚扰诈骗号码、外卖快递号码和正常号码三大类号码。并构建号码间的社交特征,基于社交关系进一步提升骚扰诈骗号码精度。经验证,本文通过XGBoost集成学习算法和社交关系的融合模型,进一步提升诈骗骚扰号码的精度至80%,可广泛应用于新型不良信息治理领域,特别是助力打击电信网络新型违法犯罪治理,维护社会公共安全。

关 键 词:骚扰诈骗  外卖快递  集成学习  社交关系
收稿时间:2021-11-17
修稿时间:2021-11-22

Method for identifying bad number of harassment fraud based on integrated learning and social relationship
ZHOUXUYING,ZHOUYUFEI,LINYUJUN,QIANHUHAI and XUXINLING. Method for identifying bad number of harassment fraud based on integrated learning and social relationship[J]. Telecom Engineering Technics and Standardization, 2021, 34(12)
Authors:ZHOUXUYING  ZHOUYUFEI  LINYUJUN  QIANHUHAI  XUXINLING
Affiliation:China Mobile Hangzhou Information Technology Co,Ltd,Hangzhou,,Information security management and operation center,Beijing,China Mobile Hangzhou Information Technology Co,Ltd,Hangzhou,,China Mobile Hangzhou Information Technology Co,Ltd,Hangzhou,,China Mobile Hangzhou Information Technology Co,Ltd,Hangzhou,
Abstract:Based on the communication data and business data of ChinaMobile network, this paper explores a new data mining method of "multi-data aggregation and multi-technology integration". In terms of research means, firstly, Extreme Gradient Boosting(xgboost) integrated learning algorithm is used to accurately classify numbers and effectively distinguish three categories of numbers: harassment fraud numbers, takeout express numbers and normal numbers. And it builds the social characteristics between numbers to further improve the accuracy of harassment and fraud numbers based on social relations. After verification, this paper further improves the accuracy of fraud and harassment numbers to 80% through the xgboost integrated learning algorithm and the fusion model of social relations, which can be widely used in the field of new bad information governance, especially to help combat the new illegal and criminal governance of telecom networks and maintain social and public security.
Keywords:
本文献已被 万方数据 等数据库收录!
点击此处可从《电信工程技术与标准化》浏览原始摘要信息
点击此处可从《电信工程技术与标准化》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号