首页 | 本学科首页   官方微博 | 高级检索  
     

非共现数据的二元化加权转化算法
引用本文:姬波,叶阳东. 非共现数据的二元化加权转化算法[J]. 模式识别与人工智能, 2013, 26(6): 584-591
作者姓名:姬波  叶阳东
作者单位:郑州大学信息工程学院计算机科学与技术系郑州450001
基金项目:国家自然科学基金资助项目
摘    要:面向范畴数据的序列化信息瓶颈算法(CD-sIB)假设数据各个属性特征对二元化转化的贡献均匀,从而影响转化效果。文中提出二元化加权转化方法来反映非共现数据的特征。该方法通过突出非共现数据的代表性属性,从抑制非代表性(冗余)属性,从而获取最佳共现表示。文中提出随机分布数据的适用性和计算方法的无监督性两个非共现加权原则,并基于加权粒度概念构造二元化加权转化算法。实验结果表明,文中算法的聚类精度优于其它算法。

关 键 词:非共现数据  特征权重  信息瓶颈  面向范畴数据的序列化信息瓶颈(CD-sIB)算法  二元化转化  
收稿时间:2012-05-28

Weighting Binary Transformation Algorithm for Non Co-occurrence Data
JI Bo , YE Yang-Dong. Weighting Binary Transformation Algorithm for Non Co-occurrence Data[J]. Pattern Recognition and Artificial Intelligence, 2013, 26(6): 584-591
Authors:JI Bo    YE Yang-Dong
Affiliation:Department of Computer Science and Technology,School of Information Engineering,Zhengzhou University,Zhengzhou 450001
Abstract:The assumption that all data features are equally important in the categorical data-sequential information bottleneck(CD-sIB) lowers the transformation quality. A weighting binary transformation method is proposed to reveal the feature of non co-occurrence data by highlighting the representative features and depressing the redundancy features. Meanwhile,two weighting rules,the applicability of stochastically distributed data and the non supervision of weighting schemes,are introduced. Then,the weighted categorical data-sequential information bottleneck(WCD-sIB) algorithm is presented based on the weighting granularity concept. The experimental results show that the weighting binary transformation method generates good co-occurrence data representation,and the WCD-sIB algorithm is superior to the other algorithms.
Keywords:Non Co-occurrence Data  Feature Weighting  Information Bottleneck  Categorical Data-Sequential Information Bottleneck (CD-sIB) Algorithm  Binary Transformation
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《模式识别与人工智能》浏览原始摘要信息
点击此处可从《模式识别与人工智能》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号