首页 | 本学科首页   官方微博 | 高级检索  
     

网络入侵检测中的自动决定聚类数算法
引用本文:肖立中,邵志清,马汉华,王秀英,刘 刚.网络入侵检测中的自动决定聚类数算法[J].软件学报,2008,19(8):2140-2148.
作者姓名:肖立中  邵志清  马汉华  王秀英  刘 刚
作者单位:1. 上海应用技术学院,计算机科学与信息工程系,上海,200235
2. 华东理工大学,信息科学与工程学院,上海,200237
基金项目:Supported by the National Natural Science Foundation of China under Grant No.60373075 (国家自然科学基金); the Shanghai Education Commission Foundation for Excellent Young High Education Teacher of China under Grant No.YYY-07008 (上海高校选拔培养优秀青年教师科研专项基金); the Open Research Foundation of Shanghai Institute of Technology of China under Grant No.YJ2007-24 (上海应用技术学院引进人才科研启动项目)
摘    要:针对模糊C均值算法(fuzzy C-means algorithm,简称FCM)在入侵检测中需要预先指定聚类数的问题,提出了一种自动决定聚类数算法(fuzzy C-means and support vector machine algorithm,简称F-CMSVM).它首先用模糊C均值算法把目标数据集分为两类,然后使用带有模糊成员函数的支持向量机(support vector machihe,简称SVM)算法对结果进行评估以确定目标数据集是否可分,再迭代计算,最终得到聚类结果.支持向量机算法引入模糊C均值算法得出的隶属矩阵作为模糊成员函数,使得不同的输入样本可以得到不同的惩罚值,从而得到最优的分类超平面.该算法既不需要对训练数据集进行标记,也不需要指定聚类数,因此是一种真正的无监督算法.在对KDD CUP 1999数据集的仿真实验结果表明,该算法不仅能够得到最佳聚类数,而且对入侵有较好的检测效果.

关 键 词:模糊C均值算法  支持向量机  模糊成员函数  聚类数  入侵检测
收稿时间:2006/7/13 0:00:00
修稿时间:2007/5/24 0:00:00

An Algorithm for Automatic Clustering Number Determination in Networks Intrusion Detection
XIAO Li-Zhong,SHAO Zhi-Qing,MA Han-Hu,WANG Xiu-Ying and LIU Gang.An Algorithm for Automatic Clustering Number Determination in Networks Intrusion Detection[J].Journal of Software,2008,19(8):2140-2148.
Authors:XIAO Li-Zhong  SHAO Zhi-Qing  MA Han-Hu  WANG Xiu-Ying and LIU Gang
Abstract:To address the issue in fuzzy C-means algorithm (FCM) that clustering number has to be pre-defined,a clustering algorithm,F-CMSVM (fuzzy C-means and support vector machine algorithm),is proposed for automatic clustering number determination.Above all,the data set is classifed into two clusters by FCM.Then,support vector machine (SVM) with a fuzzy membership function is used to testify whether the data set can be classified further. Finally,the result of clusters can be obtained by repeating the computation process.Because affiliating matrix, obtained by the introduction of SVM into FCM,is defined to be the fuzzy membership function,each different input data sample can have different penalty value,and the separating hyper-plane is optimized.F-CMSVM is an unsupervised algorithm in which it is neither needed to label training data set nor specify clustering number.As shown from our simulation experiment over networks connection records from KDD CUP 1999 data set,F-CMSVM has efficient performance in clustering number optimization and intrusion detection.
Keywords:fuzzy C-means algorithm  support vector machine  fuzzy membership function  clustering number  intrusion detection
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号