首页 | 本学科首页   官方微博 | 高级检索  
     

基于集对分析的半监督ISODATA聚类
引用本文:魏小涛.基于集对分析的半监督ISODATA聚类[J].计算机工程与应用,2009,45(36):99-100.
作者姓名:魏小涛
作者单位:北京交通大学 软件学院,北京 100044
基金项目:北京市教委与北京交通大学共建项目 
摘    要:提出一个基于集对分析的半监督ISODATA聚类算法,用于网络异常检测。在三方面进行了改进:首先,算法能够直接处理字符数字混合属性的数据,并使用集对分析来计算数据记录之间的距离;其次,算法同时处理有标号和无标号的数据,并利用少量的有标号数据来指导算法的分裂过程;最后,将算法的输入参数减少到只有两个。在KDD99入侵检测数据集上的实验结果显示,该算法获得了95.62%的检测率和1.29%的误报率。

关 键 词:集对分析  网络异常检测  半监督聚类  迭代自组织数据分析方法(ISODATA)  
收稿时间:2009-3-31
修稿时间:2009-10-23  

Semi-supervised ISODATA clustering based on set pair analysis
WEI Xiao-tao.Semi-supervised ISODATA clustering based on set pair analysis[J].Computer Engineering and Applications,2009,45(36):99-100.
Authors:WEI Xiao-tao
Affiliation:Software School,Beijing Jiaotong University,Beijing 100044,China
Abstract:A semi-supervised ISODATA clustering algorithm based on the Set Pair Analysis(SPA) is proposed for network anomaly detection.This paper improves the original ISODATA algorithm mainly in three aspects.Firstly,the modified algorithm can directly process the mixed attributes of symbolic and numeric values,and employ the SPA to calculate the distance between data records.Secondly,the algorithm can process both labeled and unlabeled samples.The small portion of labeled samples is used to supervise the clustering process in the splitting stage.Thirdly,the initial parameters needed to be input into the algorithm are reduced to only two.Experimental result on the KDD 99 intrusion detection datasets shows that the algorithm has high detection rate(95.62%) while maintaining a low false positive rate(1.29%).
Keywords:set pair analysis  network anomaly detection  semi-supervised clustering  Iterative Self-Organizing Data Analysis Technique( ISODATA)
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号