Rough Set Approach to Data Completion Based on Weighted Similarity

Cite this article:	ZHAO Hong-bo, JIANG Feng, ZENG Hui-fen, GAO Hong. Rough Set Approach to Data Completion Based on Weighted Similarity[J]. Computer Science, 2011, 38(11): 167-170, 190

Authors:	ZHAO Hong-bo (1), JIANG Feng (1), ZENG Hui-fen (2), GAO Hong (3)

Affiliation:	1. College of Information Science and Technology, Qingdao University of Science and Technology, Qingdao 266061, China; 2. Jiujiang Vocational and Technical College, Jiujiang 332007, China; 3. 91286 Army Weather Station, Qingdao 266003, China

Funding:	Supported by the National Natural Science Foundation of China (60802042) and the Natural Science Foundation of Shandong Province (ZR2009GQ013, ZR2010FQ027)

Abstract:	In recent years, the treatment of incomplete data has attracted wide attention. Many methods for completing incomplete data have been proposed in rough set theory. These methods usually compute the similarity between an object that contains missing values and the other objects in the decision table that contain none, and replace the missing values with the values of the most similar object. However, they share a common problem: when computing similarities between objects in a decision table, they assume that the decision attribute depends equally on every condition attribute and that all condition attributes are equally significant, ignoring the differences between condition attributes. To address this problem, this paper introduces the notion of weighted similarity, which uses the dependency of the decision attribute on each condition attribute and the significance of each condition attribute as weights when computing similarity. Based on weighted similarity, a new rough set data completion algorithm, WSDCA, is proposed. Finally, WSDCA is compared with existing data completion algorithms on UCI data sets. The experimental results show that the proposed data completion method is effective.

Keywords:	Rough sets, Incomplete data, Data completion, Similarity, Weighted similarity
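
The weighted-similarity idea in the abstract can be illustrated with a minimal Python sketch. This is not the paper's WSDCA algorithm, whose exact formulas are not given in the abstract. The sketch assumes the standard rough-set dependency (the fraction of objects in the positive region) and the standard attribute significance (the drop in dependency when an attribute is removed). Summing the two into one weight is an assumption made here for illustration, and so are the function names, the `MISSING` marker, and the toy data.

```python
from collections import defaultdict

MISSING = None  # marker for a missing value (assumption of this sketch)

def dependency(rows, attrs, decision):
    """Rough-set dependency: share of rows whose equivalence class
    under `attrs` carries exactly one decision value."""
    classes = defaultdict(list)
    for row in rows:
        classes[tuple(row[a] for a in attrs)].append(row[decision])
    consistent = sum(len(ds) for ds in classes.values() if len(set(ds)) == 1)
    return consistent / len(rows)

def attribute_weights(rows, conds, decision):
    """Hypothetical weight: dependency of d on {a} plus significance of a."""
    full = dependency(rows, conds, decision)
    weights = {}
    for a in conds:
        rest = [c for c in conds if c != a]
        significance = full - dependency(rows, rest, decision)
        weights[a] = dependency(rows, [a], decision) + significance
    return weights

def weighted_similarity(x, y, weights):
    """Weighted share of x's known attributes on which x and y agree."""
    known = [a for a in weights if x[a] is not MISSING]
    total = sum(weights[a] for a in known)
    if total == 0:
        return 0.0
    return sum(weights[a] for a in known if x[a] == y[a]) / total

def complete(rows, conds, decision):
    """Fill each missing value from the most similar complete row."""
    donors = [r for r in rows if all(r[a] is not MISSING for a in conds)]
    if not donors:
        return [dict(r) for r in rows]  # nothing to copy values from
    weights = attribute_weights(donors, conds, decision)
    filled = []
    for row in rows:
        new = dict(row)
        if any(row[a] is MISSING for a in conds):
            best = max(donors, key=lambda y: weighted_similarity(row, y, weights))
            for a in conds:
                if new[a] is MISSING:
                    new[a] = best[a]
        filled.append(new)
    return filled

# Toy decision table: `a` determines `d`, while `b` is mostly noise.
rows = [
    {"a": 1, "b": 1, "c": "x", "d": "yes"},
    {"a": 1, "b": 0, "c": "x", "d": "yes"},
    {"a": 0, "b": 0, "c": "y", "d": "no"},
    {"a": 0, "b": 0, "c": "y", "d": "no"},
    {"a": 0, "b": 1, "c": MISSING, "d": "no"},  # object to complete
]
filled = complete(rows, ["a", "b", "c"], "d")
print(filled[-1]["c"])  # prints "y"
```

In this toy table the incomplete object agrees with the first row only on `b` and with the third row only on `a`. An unweighted count of matches would tie the two rows, whereas the weights (1.0 for `a`, 0.25 for `b`) favor the third row, so the missing value is filled with "y".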
This article is indexed in CNKI, Wanfang Data, and other databases.