
A Domain-Name-Based Technique for Filtering Illegal Websites

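Cite this article:	Zhang Yang, Li Jianliang, Li Zhanhuai. A Domain-Name-Based Technique for Filtering Illegal Websites [J]. Computer Engineering and Applications, 2003, 39(14): 170-172.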

Authors:	Zhang Yang, Li Jianliang, Li Zhanhuai

Affiliation:	Department of Computer Science and Engineering, Northwestern Polytechnical University, Xi'an 710072

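Abstract:	In recent years, a large number of websites containing illegal or unhealthy information have appeared on the Internet, which makes filtering such websites especially important. The usual approach classifies a website using the information recorded in its web pages. This paper proposes an N-gram-based Naive Bayes classifier that classifies websites using only their domain names. The authors apply the method to automatically identify websites containing unhealthy or illegal information, and experimental results show that it is reasonably accurate. The method has since been applied in the network firewall product of a software company.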
Keywords:	Text classification; information filtering
Article ID:	1002-8331-(2003)14-0170-03
Revision received:	February 1, 2003
Filtering Illegal Website on Domain Name

Zhang Yang, Li Jianliang, Li Zhanhuai. Filtering Illegal Website on Domain Name [J]. Computer Engineering and Applications, 2003, 39(14): 170-172.

Authors:	Zhang Yang, Li Jianliang, Li Zhanhuai

Abstract:	Nowadays, many websites on the Internet contain illegal information, and it is very important to filter them. The common approach is to classify websites according to the information in their web pages. In this paper, the authors present a Naive Bayes classifier based on an N-gram algorithm, which classifies websites according to their domain names. The results show high accuracy when the classifier is used to identify illegal websites. This technology has been implemented in a firewall product of a software company.

Keywords:	Text Classification; Information Filtering
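
The abstract names the technique (a Naive Bayes classifier over N-grams of the domain name) but not its details. The following is a minimal sketch of that general idea, not the authors' implementation. The trigram size, the `^`/`$` boundary markers, the add-one smoothing, the `blocked`/`allowed` labels, and the toy `.example` domains are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def char_ngrams(domain, n=3):
    """Return overlapping character n-grams of a lower-cased domain name."""
    text = "^" + domain.lower() + "$"  # boundary markers: an assumption
    return [text[i:i + n] for i in range(len(text) - n + 1)]


class NgramNaiveBayes:
    """Multinomial Naive Bayes over character n-grams, add-one smoothing."""

    def __init__(self, n=3):
        self.n = n
        self.gram_counts = defaultdict(Counter)  # label -> n-gram counts
        self.doc_counts = Counter()              # label -> number of domains
        self.vocab = set()                       # all n-grams seen in training

    def fit(self, domains, labels):
        for domain, label in zip(domains, labels):
            grams = char_ngrams(domain, self.n)
            self.gram_counts[label].update(grams)
            self.doc_counts[label] += 1
            self.vocab.update(grams)
        return self

    def log_scores(self, domain):
        """Return log prior + summed log likelihoods for each label."""
        total_docs = sum(self.doc_counts.values())
        vocab_size = len(self.vocab)
        scores = {}
        for label, n_docs in self.doc_counts.items():
            counts = self.gram_counts[label]
            denom = sum(counts.values()) + vocab_size
            score = math.log(n_docs / total_docs)
            for gram in char_ngrams(domain, self.n):
                # Unseen n-grams get a count of 0 and are smoothed to 1/denom.
                score += math.log((counts[gram] + 1) / denom)
            scores[label] = score
        return scores

    def predict(self, domain):
        scores = self.log_scores(domain)
        return max(scores, key=scores.get)


# Toy, made-up training data; real use would need a large labeled list.
TRAIN = [
    ("luckycasino.example", "blocked"),
    ("casinobets.example", "blocked"),
    ("betpoker.example", "blocked"),
    ("citylibrary.example", "allowed"),
    ("schoolnews.example", "allowed"),
    ("weatherreport.example", "allowed"),
]

model = NgramNaiveBayes(n=3).fit(*zip(*TRAIN))
print(model.predict("pokercasino.example"))  # -> blocked
print(model.predict("librarynews.example"))  # -> allowed
```

Working in log space avoids numeric underflow when many small probabilities are multiplied. Classifying on the domain name alone means a decision can be made before any page content is fetched, which fits the firewall setting the abstract mentions.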
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.
