首页 | 本学科首页   官方微博 | 高级检索  
     


A Data Quality in Use model for Big Data
Affiliation:1. Frances Payne Bolton School of Nursing, Case Western Reserve University, 10900 Euclid Ave, Cleveland, OH 44106, United States;2. Cleveland Clinic, 10900 Euclid Avenue, Cleveland, OH 44195, United States;1. Department of Operational Sciences, Air Force Institute of Technology, Wright-Patterson Air Force Base, OH, USA;2. Department of Management, Lipscomb University, Nashville, TN, USA;3. Department of Computer Information Systems and Business Analytics, James Madison University, Harrisonburg, VA, USA;4. Department of Marketing and Supply Chain Management, The University of Tennessee, Knoxville, TN, USA;1. OSER research team, Computer Science Department, FSTG, Cadi Ayyad University, Morocco;2. LISI Laboratory, Computer Science Department, FSSM, Cadi Ayyad University, Morocco;3. ICube Laboratory, University of Strasbourg, France;1. Lucentia Research Group, Computing Technology and Data Processing, University of Alicante, Spain;2. College of Computing and Informatics, Drexel University, Philadelphia, PA 19104, USA
Abstract:Beyond the hype of Big Data, something within business intelligence projects is indeed changing. This is mainly because Big Data is not only about data, but also about a complete conceptual and technological stack including raw and processed data, storage, ways of managing data, processing and analytics. A challenge that becomes even trickier is the management of the quality of the data in Big Data environments. More than ever before the need for assessing the Quality-in-Use gains importance since the real contribution–business value–of data can be only estimated in its context of use. Although there exists different Data Quality models for assessing the quality of regular data, none of them has been adapted to Big Data. To fill this gap, we propose the “3As Data Quality-in-Use model”, which is composed of three Data Quality characteristics for assessing the levels of Data Quality-in-Use in Big Data projects: Contextual Adequacy, Operational Adequacy and Temporal Adequacy. The model can be integrated into any sort of Big Data project, as it is independent of any pre-conditions or technologies. The paper shows the way to use the model with a working example. The model accomplishes every challenge related to Data Quality program aimed for Big Data. The main conclusion is that the model can be used as an appropriate way to obtain the Quality-in-Use levels of the input data of the Big Data analysis, and those levels can be understood as indicators of trustworthiness and soundness of the results of the Big Data analysis.
Keywords:Data Quality  Big Data  Measurement  Quality-in-Use  Model
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号