首页 | 本学科首页   官方微博 | 高级检索  
     

一种混合粒子群优化模型的Web聚类方法*
引用本文:李世威,王建强.一种混合粒子群优化模型的Web聚类方法*[J].计算机应用研究,2010,27(9):3259-3262.
作者姓名:李世威  王建强
作者单位:兰州交通大学,交通运输学院,兰州,730070
基金项目:国家社会科学基金资助项目(08XTQ010);甘肃省自然科学基金资助项目(096RJZA088)
摘    要:通过分析在电子商务环境下Web挖掘的现状,考虑到Web数据的海量性和高维度性对抽取隐含的、事先未知的知识所带来的复杂性和维数灾,在普通K均值聚类、PSO聚类和K均值与PSO混合聚类算法的基础上,提出了一种将主成分分析与PSO混合聚类算法相结合的模型来对Web服务器中的日志文件进行聚类分析,将抽取的相关Web数据进行主成分分析,分析结果作为PSO混合聚类算法的输入数据,这样不仅减少了输入变量的维数,减少聚类的规模,而且保留了原始变量的主要信息,消除变量之间的多重共线性,为具有海量性、高维度性、异构性等特点的

关 键 词:主成分分析    K均值聚类    粒子群优化    混合粒子群聚类    Web聚类    维数灾

Approach of Web clustering based on hybrid particle swarm optimization model
LI Shi-wei,WANG Jian-qiang.Approach of Web clustering based on hybrid particle swarm optimization model[J].Application Research of Computers,2010,27(9):3259-3262.
Authors:LI Shi-wei  WANG Jian-qiang
Affiliation:(School of Traffic & Transportation, Lanzhou Jiaotong University, Lanzhou 730070, China)
Abstract:This paper analyzed the status of Web mining in e-commerce environment, considered the massive and high-dimensional Web data, in order to extract the implicit and unknown knowledge, brought the complexity and the curse of dimensiona-lity. Based on the K-means clustering, particle swarm optimization(PSO) clustering and hybrid PSO clustering algorithms, presented a combination model based on principal component analysis(PCA) and hybrid PSO to cluster log files in the Web servers. The interrelated Web data have been processed by principal component analysis, the results of PCA are input data for hybrid PSO clustering algorithms. It not only reduces the number of input variable and the size of clustering, but also reserve the main information of original variables and eliminates of multicollinearity between the variables; presented an effective mo-del of Web data clustering which have characters of massive, high-dimensional and heterogeneous.
Keywords:principal component analysis  K-means clustering  particle swarm optimization(PSO)  hybrid PSO clustering  Web clustering  curse of dimensionality
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号