首页 | 本学科首页   官方微博 | 高级检索  
     

基于平均互信息的混合条件属性聚类算法
引用本文:刘晋胜. 基于平均互信息的混合条件属性聚类算法[J]. 计算机科学, 2015, 42(3): 261-265
作者姓名:刘晋胜
作者单位:广东石油化工学院计算机与电子信息学院 茂名525000
基金项目:本文受广东省教育部产学研结合项目(2011A090200088),广东省茂名市科技计划项目(2012B009),广东省石化装备故障诊断重点实验室资助
摘    要:混合条件属性参数间的距离值存在较大的差异,导致仅聚合距离数量级较大、较规律的数值条件属性对象,而忽视数量级较小、混沌,但类别特征更加明显的分类条件属性对象。提出了一种基于平均互信息的聚类算法。通过熵量化参数类别特性的大小,再根据熵的平均互信息计算方法衡量数据对象间类别的相同、相异特征量,统一数值和分类条件属性参数间距离的数量级,最后通过优化迭代自适应过程得到最终聚类结果。实验结果表明,该算法具有良好的聚类质量和自适应性。

关 键 词:混合条件属性  平均互信息  聚类

Clustering with Mixed Condition Attributes Based on Average Mutual Information
LIU Jin-sheng. Clustering with Mixed Condition Attributes Based on Average Mutual Information[J]. Computer Science, 2015, 42(3): 261-265
Authors:LIU Jin-sheng
Affiliation:College of Computer and Electronic Information,Guangdong University of Petrochemical Technology,Maoming 525000,China
Abstract:There is a great difference between the distances of mixed condition attributes parameter.The numeric condition attributes object with larger and law magnitude tends to be clustered only.With small and chaos magnitude,the cate-gorical condition attributes object which has obvious category characteristics will be ignored.A clustering algorithm based on average mutual information was proposed.First,the size of parameter category characteristics is quantified through entropy.Then,the similarity and the difference between category characteristics are measured according ave-rage mutual information of entropy.The magnitude between distances of numeric and categorical condition attributes parameter is unified.At last,the final clustering result is got by optimizing iterative adaptive process.The experimental results show that the proposed algorithm was high clustering quality and good adaptability.
Keywords:Mixed condition attributes  Average mutual information  Clustering
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号