基于Bert模型的互联网不良信息检测 Internet bad information detection based on Bert model期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于Bert模型的互联网不良信息检测

引用本文：	蔡鑫.基于Bert模型的互联网不良信息检测[J].电信科学,2020,36(11):121-126.

作者姓名：	蔡鑫

作者单位：	中国电信股份有限公司研究院，上海 200122

摘要：	针对互联网不良信息检测这一业务场景，探讨了基于网站文本内容进行检测的方法。回顾了经典的文本分析技术，重点介绍了Bert模型的关键技术特点及其两种不同用法。详细描述了利用其中的特征提取方法，进行网站不良信息检测的具体实施方案，并且与传统的TF-IDF模型以及word2vec+LSTM模型进行了对比验证，证实了这一方法的有效性。
关键词：	不良信息 Bert模型文本分析特征提取
Internet bad information detection based on Bert model

Xin CAI.Internet bad information detection based on Bert model[J].Telecommunications Science,2020,36(11):121-126.

Authors:	Xin CAI

Affiliation:	Research Institute of China Telecom Co.,Ltd.,Shanghai 200122,China

Abstract:	In view of the business scenario of bad information detection on the internet,the method of detection based on the text content of the website was discussed .Classical text analysis techniques were reviewed.The key technical features and two different usages of Bert model were introduced.The specific implementation scheme of using the feature extraction method to detect website bad information was described in detail,and was compared with the traditional TF-IDF model and word2vec+LSTM model.The validity of this method is verified.

Keywords:	bad information Bert model text analysis feature extraction

	点击此处可从《电信科学》浏览原始摘要信息
	点击此处可从《电信科学》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏