A family of experiments to validate measures for UML activity diagrams of ETL processes in data warehouses |
| |
Authors: | Lilia Muñoz, Jose-Norberto Mazón, Juan Trujillo |
| |
Affiliation: | 1. Lucentia Research Group, Department of Information Systems, Control, Evaluation and Computing Resources, Technological University of Panama, P.O. Box #0819-07289, Republic of Panama; 2. Lucentia Research Group, Department of Software and Computing Systems, University of Alicante, San Vicente del Raspeig, 03080, Spain |
| |
Abstract: | In data warehousing, Extract, Transform, and Load (ETL) processes are in charge of extracting from the data sources the data that will be contained in the data warehouse. Their design and maintenance are thus a cornerstone of any data warehouse development project. Because of their relevance, the quality of these processes should be formally assessed early in development in order to avoid populating the data warehouse with incorrect data. To this end, this paper presents a set of measures with which to evaluate the structural complexity of ETL process models at the conceptual level. The study is accompanied by the application of formal frameworks and by a family of experiments, whose aims are to validate the proposed measures theoretically and empirically, respectively. Our experiments show that these measures can help designers predict the effort associated with the maintenance tasks of ETL processes and make ETL process models more usable. Our work is based on Unified Modeling Language (UML) activity diagrams for modeling ETL processes, and on the Framework for the Modeling and Evaluation of Software Processes (FMESP) for the definition and validation of the measures. |
| |
Keywords: | |
This article is indexed in ScienceDirect and other databases. |
|