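Data Quality Evaluation Method Based on Rule Base
Cite this article: LIU Fang, LI Min, REN Hong-Min, ZHOU Zhao-Ming. Data Quality Evaluation Method Based on Rule Base[J]. Computer Systems & Applications, 2017, 26(11): 165-169
Authors: LIU Fang  LI Min  REN Hong-Min  ZHOU Zhao-Ming
Affiliations: College of Information Engineering, Shanghai Maritime University, Shanghai 201306; Qingdao West Coast New District Administrative Committee, Qingdao 266555; College of Information Engineering, Shanghai Maritime University, Shanghai 201306; Shanghai Industrial Research Institute, Shanghai 201306
Funding: Key Project of the Science and Technology Commission of Shanghai Municipality (SKY2015004)
Abstract: In the era of big data, assured data quality is a prerequisite for realizing the value of big data, and data quality evaluation is an important research topic. Building on a rule-base approach to data quality evaluation, this paper proposes an overall evaluation model comprising rules, a rule base, data quality evaluation indicators, evaluation templates, and evaluation reports. A rule evaluation template is designed that combines rules from the rule base and assigns each rule a weight according to the importance of its data quality evaluation indicator. The evaluation combines the simple ratio method with the weighted average method to compute the result and determine the data quality grade, and data visualization is used to present the result. The approach considers both the execution pass rate of each individual rule and the share of each rule within the evaluation template, so that data quality is evaluated fairly and accurately and the result is presented concisely and intuitively.

Keywords: rule base  data quality  evaluation template  data visualization
Received: 2017-02-23
Revised: 2017-03-09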

Data Quality Evaluation Method Based on Rule Base
LIU Fang, LI Min, REN Hong-Min and ZHOU Zhao-Ming. Data Quality Evaluation Method Based on Rule Base[J]. Computer Systems & Applications, 2017, 26(11): 165-169
Authors:LIU Fang  LI Min  REN Hong-Min  ZHOU Zhao-Ming
Affiliation: College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China; Qingdao West Coast New District Administrative Committee, Qingdao 266035, China; College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China; Shanghai Industrial Research Institute, Shanghai 201306, China
Abstract: In today's era of big data, data quality is the precondition for big data to deliver its value, and the evaluation of data quality is an important research topic. This paper puts forward a data quality assessment method based on a rule base and presents an overall model of data quality assessment, which includes rules, a rule base, data quality evaluation indexes, evaluation templates and evaluation reports. The paper designs a rule evaluation template that combines rules from the rule base and sets each rule's weight according to the importance of its data quality evaluation index. It adopts an evaluation method that combines the simple ratio method with the weighted average method, calculates the evaluation result, determines the grade of the data quality, and shows the result using data visualization. To assess data quality fairly and accurately, and to present the evaluation results concisely and intuitively, the paper considers not only the execution pass rate of each single rule but also the proportion of each rule in the data quality evaluation template.
Keywords:rule base  data quality  evaluation template  data visualization
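
The abstract names two ingredients, a simple ratio per rule and a weighted average across the rules of a template, but gives no formulas. The sketch below is one plausible reading of that combination, not the authors' implementation: the `RuleResult` type, the rule names, the example weights, and the grade thresholds are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class RuleResult:
    name: str      # hypothetical rule identifier
    passed: int    # records that satisfied the rule
    total: int     # records the rule was checked against
    weight: float  # assumed importance of the rule within the template

    def pass_rate(self) -> float:
        """Simple ratio: share of checked records that passed the rule."""
        if self.total == 0:
            raise ValueError(f"rule {self.name!r} checked no records")
        return self.passed / self.total


def template_score(results: Sequence[RuleResult]) -> float:
    """Weighted average of the per-rule pass rates in one template."""
    total_weight = sum(r.weight for r in results)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(r.weight * r.pass_rate() for r in results) / total_weight


def grade(score: float) -> str:
    """Map a score in [0, 1] to a grade; thresholds are assumed, not from the paper."""
    if score >= 0.9:
        return "excellent"
    if score >= 0.75:
        return "good"
    if score >= 0.6:
        return "fair"
    return "poor"


if __name__ == "__main__":
    # Hypothetical template with three rules and made-up counts.
    results = [
        RuleResult("id_not_null", passed=980, total=1000, weight=0.5),
        RuleResult("date_is_valid", passed=900, total=1000, weight=0.3),
        RuleResult("key_is_unique", passed=1000, total=1000, weight=0.2),
    ]
    score = template_score(results)
    print(f"{score:.2f} -> {grade(score)}")
```

Dividing by the sum of the weights means the weights need not be normalized to 1 beforehand, so a template can be edited without rebalancing every rule.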