Internet bad information detection based on Bert model
Citation: Xin CAI. Internet bad information detection based on Bert model[J]. Telecommunications Science (电信科学), 2020, 36(11): 121-126.
Author: Xin CAI (蔡鑫)
Affiliation: Research Institute of China Telecom Co., Ltd., Shanghai 200122, China
Abstract: For the business scenario of detecting bad information on the internet, a detection method based on the text content of websites is discussed. Classical text analysis techniques are reviewed, with emphasis on the key technical features of the Bert model and its two different usages. The implementation scheme that applies one of these usages, feature extraction, to detect bad information on websites is described in detail and compared with the traditional TF-IDF model and the word2vec+LSTM model; the comparison confirms the effectiveness of the method.
Keywords: bad information; Bert model; text analysis; feature extraction
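
The abstract names a traditional TF-IDF model as one of the comparison baselines but gives no detail on it. As a rough illustration of what such a baseline might look like, the following is a minimal sketch in pure Python: TF-IDF vectors with a nearest-centroid classifier. It is not the author's implementation. The whitespace tokenizer, the smoothed IDF formula, the centroid classifier, all function names, and the toy documents are assumptions made for this example; real Chinese website text would need a word segmenter and a much larger labelled corpus.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: whitespace tokenization. Chinese text would need a word segmenter.
    return text.lower().split()


def normalise(vec):
    # Scale a sparse vector (dict) to unit L2 length; an all-zero vector stays empty.
    norm = math.sqrt(sum(v * v for v in vec.values()))
    return {t: v / norm for t, v in vec.items()} if norm else {}


def fit_idf(docs):
    # Smoothed inverse document frequency: log((1 + n) / (1 + df)) + 1.
    n = len(docs)
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    return {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}


def tfidf_vector(doc, idf):
    # Term frequency times IDF, restricted to the training vocabulary.
    tf = Counter(t for t in tokenize(doc) if t in idf)
    return normalise({t: c * idf[t] for t, c in tf.items()})


def fit_centroids(docs, labels, idf):
    # Average the TF-IDF vectors of each class, then re-normalise the average.
    sums = {}
    counts = Counter(labels)
    for doc, label in zip(docs, labels):
        acc = sums.setdefault(label, Counter())
        for t, v in tfidf_vector(doc, idf).items():
            acc[t] += v
    return {
        label: normalise({t: v / counts[label] for t, v in acc.items()})
        for label, acc in sums.items()
    }


def predict(doc, idf, centroids):
    # Both sides are unit vectors, so the dot product is the cosine similarity.
    vec = tfidf_vector(doc, idf)
    scores = {
        label: sum(v * centroid.get(t, 0.0) for t, v in vec.items())
        for label, centroid in centroids.items()
    }
    return max(scores, key=scores.get)


# Toy data invented for this sketch; it is not taken from the paper.
train_docs = [
    "win cash casino bonus now",
    "casino jackpot free spins bonus",
    "weather forecast rain tomorrow",
    "local news weather report today",
]
train_labels = ["bad", "bad", "normal", "normal"]

idf = fit_idf(train_docs)
centroids = fit_centroids(train_docs, train_labels, idf)

print(predict("free casino bonus", idf, centroids))    # bad
print(predict("rain forecast today", idf, centroids))  # normal
```

A feature-extraction use of Bert would replace `tfidf_vector` with sentence embeddings taken from a pretrained model and feed them to a downstream classifier; that part is omitted here because it depends on an external model the abstract does not specify.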