首页 | 本学科首页   官方微博 | 高级检索  
     

基于免疫原理的不确定数据流聚类算法
引用本文:肖丹萍,叶东毅.基于免疫原理的不确定数据流聚类算法[J].模式识别与人工智能,2012,25(5):826-834.
作者姓名:肖丹萍  叶东毅
作者单位:福州大学数学与计算机科学学院福州350108
基金项目:福建省自然科学基金项目,福建省高校产学研合作科技重大项目
摘    要:提出一种基于免疫原理、对不确定数据流进行聚类的算法——IUMicro。IUMicro针对不确定数据流上元组级不确定性问题,引入动态更新以适应数据变化的免疫模型,其中包括一种有效的在线收集数据流统计信息的B细胞特征结构及其更新策略。为兼顾元组存在概率与元组间的距离两方面因素,定义概率识别半径,为每个不断到达的数据元组找到合理的候选簇。离线聚类根据免疫细胞识别区域的空间关系,进行任意形状的无监督聚类。实验结果表明,IUMicro能有效抑制噪声,具有良好的聚类质量和较快的处理速度。

关 键 词:免疫原理  不确定数据流  聚类  概率识别半径  
收稿时间:2011-06-20

Clustering Uncertain Data Streams Based on Immune Principle
XIAO Dan-Ping , YE Dong-Yi.Clustering Uncertain Data Streams Based on Immune Principle[J].Pattern Recognition and Artificial Intelligence,2012,25(5):826-834.
Authors:XIAO Dan-Ping  YE Dong-Yi
Affiliation:College of Mathematics and Computer Science,Fuzhou University,Fuzhou 350108
Abstract:An algorithm based on immune principle, named IUMicro, is proposed to cluster uncertain data streams. IUMicro applies a dynamically updated immune model to adapt to the data streams. An effective B-cell feature vector and updating strategy are used to collect statistical information of data streams on line by this model. To choose the optimal candidate cluster for each increasing tuple in the data stream, IUMicro defines a probability radius of a B-cell’s recognition zone to address both uncertainty and distance metric. The offline clustering is an arbitrary-shape unsupervised clustering based on immune B-cells’ spatial relationship between regions. The experimental results show that IUMicro effectively suppresses noise and gains better clustering quality at a high processing speed.
Keywords:Immune Principle  Uncertain Data Stream  Clustering  Probabilistic Radius of Recognition  
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《模式识别与人工智能》浏览原始摘要信息
点击此处可从《模式识别与人工智能》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号