首页 | 本学科首页   官方微博 | 高级检索  
     

不一致数据库上带信任标记的查询结果
引用本文:吴爱华,谈子敬,汪卫.不一致数据库上带信任标记的查询结果[J].软件学报,2012,23(5):1167-1182.
作者姓名:吴爱华  谈子敬  汪卫
作者单位:1. 上海海事大学信息工程学院,上海,201306
2. 复旦大学计算机科学技术学院,上海,200433
基金项目:上海海事大学校基金(20110042)
摘    要:不一致数据无法正确反映现实世界,其上的查询结果内含错误或矛盾,而现有的很多不一致数据查询处理相关研究都存在信息丢失的问题.AQA(annotation based query answer)针对这一问题采用信任标签在属性级别上区分一致和不一致数据,避免了信息丢失.但AQA假设记录在依赖左边属性上的分量可信,且只针对函数依赖一种约束,具有应用局限性.在综合约束(函数依赖、包含依赖和域约束)范围内、不确定属性任意的情况下扩展了AQA,重新审视了AQA的数据模型及其上的查询代数,讨论了任意约束在查询结果上的蕴含约束计算问题.实验结果表明,扩展后的AQA非连接类查询的性能和普通的SQL基夺相同,连接查询经优化后性能接近普通SQL查询,但AQA不丢失信息与部分同类研究相比有很大优势.

关 键 词:不确定数据  数据质量  一致的查询回答  完整性约束  数据清洗
收稿时间:1/2/2011 12:00:00 AM
修稿时间:2011/3/21 0:00:00

Query Answer over Inconsistent Database with Credible Annotations
WU Ai-Hu,TAN Zi-Jing and WANG Wei.Query Answer over Inconsistent Database with Credible Annotations[J].Journal of Software,2012,23(5):1167-1182.
Authors:WU Ai-Hu  TAN Zi-Jing and WANG Wei
Affiliation:1(College of Information Engineering,Shanghai Maritime University,Shanghai 201306,China) 2(School of Computer Science,Fudan University,Shanghai 200433,China)
Abstract:Inconsistent data is confusing and conflicting.Computing credible query answers over such data is significant.However,previous related works lose information.The approach of annotation based query answer(AQA) introduces confidence annotation to differ consistently and inconsistently in attribute value.Thus,a credible query answer can be computed and information loss can also be avoided.This is limited,however,in functional dependencies.This paper extends the approach to applications where multi constraints are involved,and no attribute is definitely credible.This paper redefines its representing model and query algebra,discusses the rules for calculating valid implied constraints of the above types on query result for any query algebra,proposes a cost based heuristic algorithm to repair,and annotates the initial database.The experiments show that time performance of extended AQA is almost similar to that of SQL for any query without join,and close to SQL for join queries after optimization,but it doesn’t loss information.
Keywords:uncertain data  data quality  consistent query answer  integrity constraints  data cleaning
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号