
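A Rough Set Data Completion Method Based on Weighted Similarity
Cite this article: ZHAO Hong-bo, JIANG Feng, ZENG Hui-fen, GAO Hong. A Rough Set Data Completion Method Based on Weighted Similarity[J]. Computer Science (计算机科学), 2011, 38(11): 167-170, 190
Authors: ZHAO Hong-bo  JIANG Feng  ZENG Hui-fen  GAO Hong
Affiliations: 1. College of Information Science and Technology, Qingdao University of Science and Technology, Qingdao 266061
2. Jiujiang Vocational and Technical College, Jiujiang 332007
3. 91286 Army Weather Station, Qingdao 266003
Funding: Supported by the National Natural Science Foundation of China (60802042) and the Natural Science Foundation of Shandong Province (ZR2009GQ013, ZR2010FQ027)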
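Abstract: In recent years, the handling of incomplete data has attracted wide attention. Many methods for completing incomplete data have been proposed within rough set theory. These methods typically compute the similarity between an object in the decision table that has missing values and the other objects that have none, and then replace each missing value with the value taken by the most similar object. However, they share a common problem: when computing the similarity between objects in the decision table, they assume that the decision attribute depends equally on every condition attribute and that all condition attributes are equally important, so the differences among condition attributes are not taken into account. To address this problem, a notion of weighted similarity is introduced, in which the dependency of the decision attribute on each condition attribute and the significance of each condition attribute serve as the weights for computing similarity. Based on weighted similarity, a new rough set data completion algorithm, WSDCA, is proposed. Finally, WSDCA is compared with existing data completion algorithms on UCI data sets. The experimental results indicate that the proposed data completion method is effective.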

Keywords: rough sets, incomplete data, data completion, similarity, weighted similarity

Rough Set Approach to Data Completion Based on Weighted Similarity
ZHAO Hong-bo, JIANG Feng, ZENG Hui-fen, GAO Hong. Rough Set Approach to Data Completion Based on Weighted Similarity[J]. Computer Science, 2011, 38(11): 167-170, 190
Authors: ZHAO Hong-bo (1)  JIANG Feng (1)  ZENG Hui-fen (2)  GAO Hong (3)
Affiliation: 1. College of Information Science and Technology, Qingdao University of Science and Technology, Qingdao 266061, China
2. Jiujiang Vocational and Technical College, Jiujiang 332007, China
3. 91286 Army Weather Station, Qingdao 266003, China
Abstract: In recent years, much attention has been given to the treatment of incomplete data. Many methods for completing incomplete data have been proposed in rough set theory. These methods usually compute the similarities between an object that contains missing values and the other objects that do not, and use the values of the most similar object to replace the missing values. However, these methods share a common problem: they assume that the dependencies of the decision attribute on all condition attributes are the same and that the significances of all condition attributes are also the same, so they ignore the differences between condition attributes in a decision table. To solve this problem, this paper introduces a new notion of weighted similarity, which uses the dependencies of the decision attribute on the condition attributes and the significances of the condition attributes as weights when computing similarity. Based on the weighted similarity, a novel rough set data completion algorithm, WSDCA, is proposed. WSDCA is compared with existing data completion algorithms on UCI data sets, and the experimental results demonstrate the effectiveness of the proposed method for data completion.
Keywords: Rough sets, Incomplete data, Data completion, Similarity, Weighted similarity
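
The abstract does not give the WSDCA formulas, so the following is only a rough sketch of the general idea it describes: weight each condition attribute, score complete objects by weighted similarity, and copy values from the best match. Everything specific here is an assumption made for illustration, not the authors' algorithm: the function names, the use of `None` as the missing-value marker, the standard rough set dependency degree |POS(D)| / |U|, the choice to add dependency and significance to form a weight, and the equality-based similarity.

```python
from collections import defaultdict

MISSING = None  # assumed marker for a missing value


def dependency(rows, attrs, decision):
    """Standard rough set dependency degree: |POS_attrs(decision)| / |U|."""
    blocks = defaultdict(list)
    for row in rows:
        # Objects with equal values on attrs fall into the same block.
        blocks[tuple(row[a] for a in attrs)].append(row[decision])
    # A block lies in the positive region if it has a single decision value.
    consistent = sum(len(d) for d in blocks.values() if len(set(d)) == 1)
    return consistent / len(rows)


def attribute_weights(complete_rows, conditions, decision):
    """Assumed weighting: single-attribute dependency plus significance."""
    full = dependency(complete_rows, conditions, decision)
    weights = {}
    for a in conditions:
        rest = [c for c in conditions if c != a]
        # Significance: drop in dependency when attribute a is removed.
        significance = full - dependency(complete_rows, rest, decision)
        weights[a] = dependency(complete_rows, [a], decision) + significance
    return weights


def weighted_similarity(x, y, conditions, weights):
    """Sum the weights of attributes where x is known and agrees with y."""
    return sum(weights[a] for a in conditions
               if x[a] is not MISSING and x[a] == y[a])


def complete_table(rows, conditions, decision):
    """Fill missing condition values from the most similar complete object."""
    complete_rows = [r for r in rows
                     if all(r[a] is not MISSING for a in conditions)]
    if not complete_rows:
        raise ValueError("at least one complete object is required")
    weights = attribute_weights(complete_rows, conditions, decision)
    filled = []
    for row in rows:
        new_row = dict(row)
        if any(row[a] is MISSING for a in conditions):
            best = max(complete_rows,
                       key=lambda y: weighted_similarity(row, y, conditions, weights))
            for a in conditions:
                if new_row[a] is MISSING:
                    new_row[a] = best[a]
        filled.append(new_row)
    return filled
```

With uniform weights this reduces to the unweighted most-similar-object completion that the abstract criticizes; the weighting step is where the two approaches differ.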
This article is indexed in CNKI, Wanfang Data, and other databases.