期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Reconciling requirement-driven data warehouses with data sources via multidimensional normal forms

Jose-Norberto Juan Jens 《Data & Knowledge Engineering》2007,63(3):725-751

Successful data warehouse (DW) design needs to be based upon a requirement analysis phase in order to adequately represent the information needs of DW users. Moreover, since the DW integrates the information provided by data sources, it is also crucial to take these sources into account throughout the development process to obtain a consistent reconciliation of data sources and information needs. In this paper, we start by summarizing our approach to specify user requirements for data warehouses and to obtain a conceptual multidimensional model capturing these requirements. Then, we make use of the multidimensional normal forms to define a set of Query/View/Transformation (QVT) relations to assure that the conceptual multidimensional model obtained from user requirements agrees with the available data sources that will populate the DW. Thus, we propose a hybrid approach to develop DWs, i.e., we firstly obtain the conceptual multidimensional model of the DW from user requirements and then we verify and enforce its correctness against data sources by using a set of QVT relations based on multidimensional normal forms. Finally, we provide some snapshots of the CASE tool we have used to implement our QVT relations. 相似文献

2.

Reconciling requirement-driven data warehouses with data sources via multidimensional normal forms

《Data & Knowledge Engineering》2008,64(3):725-751

Successful data warehouse (DW) design needs to be based upon a requirement analysis phase in order to adequately represent the information needs of DW users. Moreover, since the DW integrates the information provided by data sources, it is also crucial to take these sources into account throughout the development process to obtain a consistent reconciliation of data sources and information needs. In this paper, we start by summarizing our approach to specify user requirements for data warehouses and to obtain a conceptual multidimensional model capturing these requirements. Then, we make use of the multidimensional normal forms to define a set of Query/View/Transformation (QVT) relations to assure that the conceptual multidimensional model obtained from user requirements agrees with the available data sources that will populate the DW. Thus, we propose a hybrid approach to develop DWs, i.e., we firstly obtain the conceptual multidimensional model of the DW from user requirements and then we verify and enforce its correctness against data sources by using a set of QVT relations based on multidimensional normal forms. Finally, we provide some snapshots of the CASE tool we have used to implement our QVT relations. 相似文献

3.

Active XML-based Web data integration

Rashed Salem Omar Boussaïd Jérôme Darmont 《Information Systems Frontiers》2013,15(3):371-398

Today, the Web is the largest source of information worldwide. There is currently a strong trend for decision-making applications such as Data Warehousing (DW) and Business Intelligence (BI) to move onto the Web, especially in the cloud. Integrating data into DW/BI applications is a critical and time-consuming task. To make better decisions in DW/BI applications, next generation data integration poses new requirements to data integration systems, over those posed by traditional data integration. In this paper, we propose a generic, metadata-based, service-oriented, and event-driven approach for integrating Web data timely and autonomously. Beside handling data heterogeneity, distribution and interoperability, our approach satisfies near real-time requirements and realize active data integration. For this sake, we design and develop a framework that utilizes Web standards (e.g., XML and Web services) for tackling data heterogeneity, distribution and interoperability issues. Moreover, our framework utilizes Active XML (AXML) to warehouse passive data as well as services to integrate active and dynamic data on-the-fly. AXML embedded services and changes detection services ensure near real-time data integration. Furthermore, the idea of integrating Web data actively and autonomously revolves around mining events logged by the data integration environment. Therefore, we propose an incremental XML-based algorithm for mining association rules from logged events. Then, we define active rules dynamically upon mined data to automate and reactivate integration tasks. Finally, as a proof of concept, we implement a framework prototype as a Web application using open-source tools. 相似文献

4.

A requirement-driven approach to the design and evolution of data warehouses

《Information Systems》2014

Designing data warehouse (DW) systems in highly dynamic enterprise environments is not an easy task. At each moment, the multidimensional (MD) schema needs to satisfy the set of information requirements posed by the business users. At the same time, the diversity and heterogeneity of the data sources need to be considered in order to properly retrieve needed data. Frequent arrival of new business needs requires that the system is adaptable to changes. To cope with such an inevitable complexity (both at the beginning of the design process and when potential evolution events occur), in this paper we present a semi-automatic method called ORE, for creating DW designs in an iterative fashion based on a given set of information requirements. Requirements are first considered separately. For each requirement, ORE expects the set of possible MD interpretations of the source data needed for that requirement (in a form similar to an MD schema). Incrementally, ORE builds the unified MD schema that satisfies the entire set of requirements and meet some predefined quality objectives. We have implemented ORE and performed a number of experiments to study our approach. We have also conducted a limited-scale case study to investigate its usefulness to designers. 相似文献

5.

A trace metamodel proposal based on the model driven architecture framework for the traceability of user requirements in data warehouses

Alejandro Maté Juan Trujillo 《Information Systems》2012

The complexity of the data warehouse (DW) development process requires to follow a methodological approach in order to be successful. A widely accepted approach for this development is the hybrid one, in which requirements and data sources must be accommodated to a new DW model. The main problem is that we lose the relationships between requirements, elements in the multidimensional (MD) conceptual models and data sources in the process, since no traceability is explicitly specified. Therefore, this hurts requirements validation capability and increases the complexity of Extraction, Transformation and Loading processes. In this paper, we propose a novel trace metamodel for DWs and focus on the relationships between requirements and MD conceptual models. We propose a set of Query/View/Transformation rules to include traceability in DWs in an automatic way, allowing us to obtain a MD conceptual model of the DW, as well as a trace model. Therefore, we are able to trace every requirement to the MD elements, further increasing user satisfaction. Finally, we show the implementation in our Lucentia BI tool. 相似文献

6.

Adding semantic modules to improve goal-oriented analysis of data warehouses using I-star

《Journal of Systems and Software》2014

The success rate of data warehouse (DW) development is improved by performing a requirements elicitation stage in which the users’ needs are modeled. Currently, among the different proposals for modeling requirements, there is a special focus on goal-oriented models, and in particular on the i* framework. In order to adapt this framework for DW development, we previously developed a UML profile for DWs. However, as the general i* framework, the proposal lacks modularity. This has a specially negative impact for DW development, since DW requirement models tend to include a huge number of elements with crossed relationships between them. In turn, the readability of the models is decreased, harming their utility and increasing the error rate and development time. In this paper, we propose an extension of our i* profile for DWs considering the modularization of goals. We provide a set of guidelines in order to correctly apply our proposal. Furthermore, we have performed an experiment in order to assess the validity our proposal. The benefits of our proposal are an increase in the modularity and scalability of the models which, in turn, increases the error correction capability, and makes complex models easier to understand by DW developers and non expert users. 相似文献

7.

View selection for designing the global data warehouse 总被引：1，自引：0，他引：1

Dimitri Spyros Timos 《Data & Knowledge Engineering》2001,39(3):219-240

A global data warehouse (DW) integrates data from multiple distributed heterogeneous databases and other information sources. A global DW can be abstractly seen as a set of materialized views. The selection of views for materialization in a DW is an important decision in the design of a DW. Current commercial products do not provide tools for automatic DW design. We provide a general method that, given a set of select-project-join queries to be satisfied by the DW, generates sets of materialized views that satisfy all the input queries. This process is complex since ‘common subexpressions' between the queries need to be detected and exploited. Our method is then applied to solve the problem of selecting such a materialized view set that fits in the space allocated to the DW for materialization and minimizes the combined overall query evaluation and view maintenance cost. We design algorithms which are implemented and we report on their experimental evaluation. 相似文献

8.

Designing data warehouses 总被引：9，自引：0，他引：9

Dimitri Timos 《Data & Knowledge Engineering》1999,31(3):279-301

A Data Warehouse (DW) is a database that collects and stores data from multiple remote and heterogeneous information sources. When a query is posed, it is evaluated locally, without accessing the original information sources. In this paper we deal with the issue of designing a DW, in the context of the relational model, by selecting a set of views to materialize in the DW. First, we briefly present a theoretical framework for the DW design problem, which concerns the selection of a set of views that (a) fit in the space allocated to the DW, (b) answer all the queries of interest, and (c) minimize the total query evaluation and view maintenance cost. We then formalize the DW design problem as a state space search problem by taking into account multiquery optimization over the maintenance queries (i.e., queries that compute changes to the materialized views) and the use of auxiliary views for reducing the view maintenance cost. Finally, incremental algorithms and heuristics for pruning the search space are presented. 相似文献

9.

Sheaves are the canonical data structure for sensor integration

《Information Fusion》2017

A sensor integration framework should be sufficiently general to accurately represent many sensor modalities, and also be able to summarize information in a faithful way that emphasizes important, actionable information. Few approaches adequately address these two discordant requirements. The purpose of this expository paper is to explain why sheaves are the canonical data structure for sensor integration and how the mathematics of sheaves satisfies our two requirements. We outline some of the powerful inferential tools that are not available to other representational frameworks. 相似文献

10.

An engineering process for developing Secure Data Warehouses

Juan Trujillo Emilio Soler Eduardo Fernández-Medina Mario Piattini 《Information and Software Technology》2009,51(6):1033-1051

We present a new approach for the elicitation and development security requirements in the entire Data Warehouse (DWs) life cycle, which we have called a Secure Engineering process for DAta WArehouses (SEDAWA). Whilst many methods for the requirements analysis phase of the DWs have been proposed, the elicitation of security requirements as non-functional requirements has not received sufficient attention. Hence, in this paper we propose a methodology for the DW design based on Model Driven Architecture (MDA) and the standard Software Process Engineering Metamodel Specification (SPEM) from the Object Management Group (OMG). We define four phases comprising of several activities and steps, an d five disciplines which cover the whole DW design. Our methodology adapts the i¹ framework to be used under MDA and the SPEM approaches in order to elicit and develop security requirements for DWs. The benefits of our proposal are shown through an example related to the management of the pharmacies consortium business. 相似文献

11.

An Adaptive Approach to Schema Classification for Data Warehouse Modeling

下载免费PDF全文

Hong-Ding Wang Yun-Hai Tong Shao-Hua Tan Shi-Wei Tang Dong-Qing Yang and Guo-Hui Sun 《计算机科学技术学报》2007,22(2):252-260

Data warehouse （DW） modeling is a complicated task, involving both knowledge of business processes and familiarity with operational information systems structure and behavior. Existing DW modeling techniques suffer from the following major drawbacks -- data-driven approach requires high levels of expertise and neglects the requirements of end users, while demand-driven approach lacks enterprise-wide vision and is regardless of existing models of underlying operational systems. In order to make up for those shortcomings, a method of classification of schema elements for DW modeling is proposed in this paper. We first put forward the vector space models for subjects and schema elements, then present an adaptive approach with self-tuning theory to construct context vectors of subjects, and finally classify the source schema elements into different subjects of the DW automatically. Benefited from the result of the schema elements classification, designers can model and construct a DW more easily. 相似文献

12.

Architecture and quality in data warehouses: An extended repository approach

《Information Systems》1999,24(3):229-253

Most database researchers have studied data warehouses (DW) in their role as buffers of materialized views, mediating between update-intensive OLTP systems and query-intensive decision support. This neglects the organizational role of data warehousing as a means of centralized information flow control. As a consequence, a large number of quality aspects relevant for data warehousing cannot be expressed with the current DW meta models. This paper makes two contributions towards solving these problems. Firstly, we enrich the meta data about DW architectures by explicit enterprise models. Secondly, many very different mathematical techniques for measuring or optimizing certain aspects of DW quality are being developed. We adapt the Goal-Question-Metric approach from software quality management to a meta data management environment in order to link these special techniques to a generic conceptual framework of DW quality. The approach has been implemented in full on top of the ConceptBase repository system and has undergone some validation by applying it to the support of specific quality-oriented methods, tools, and application projects in data warehousing. 相似文献

13.

Multiversion join index for multiversion data warehouse

Jan Chmiel Tadeusz Morzy Robert Wrembel 《Information and Software Technology》2009,51(1):98-108

The data warehouse (DW) technology is developed in order to support the integration of external data sources (EDSs) for the purpose of advanced data analysis by On-Line Analytical Processing (OLAP) applications. Since contents and structures of integrated EDSs may evolve in time, the content and schema of a DW must evolve too in order to correctly reflect the evolution of EDSs. In order to manage a DW evolution, we developed the multiversion data warehouse (MVDW) approach. In this approach, different states of a DW are represented by the sequence of persistent DW versions that correspond either to the real world state or to a simulation scenario. Typically, OLAP applications execute star queries that join multiple fact and dimension tables. An important optimization technique for this kind of queries is based on join indexes. Since in the MVDW fact and dimension data are physically distributed among multiple DW versions, standard join indexes need extensions. In this paper we present the concept of a multiversion join index (MVJI) applicable to indexing dimension and fact tables in the MVDW. The MVJI has a two-level structure, where an upper level is used for indexing attributes and a lower level is used for indexing DW versions. The paper also presents the theoretical upper bound (pessimistic) analysis of the MVJI performance characteristic with respect to I/O operations. The analysis is followed by experimental evaluation. It shows that the MVJI increases a system performance for queries addressing multiple DW versions with exact match and range predicates. 相似文献

14.

Managing the exchange of engineering product data to support through life ship design

R.I. Whitfield A.H.B. Duffy P. York D. Vassalos P. KaklisAuthor vitae 《Computer aided design》2011,(5):516-532

An approach for managing the exchange of engineering product data between geographically distributed designers and analysts using a heterogeneous tool set for the through-life design of a ship is described. The approach was developed within a pan-European maritime project called VRShips-ROPAX 2000 that demonstrated how information technology could be integrated into the design process. This paper describes the development of a common model containing neutral ship product data through a bottom-up consideration of the requirements of the tools to be integrated, as well as a top-down consideration of the data requirements for through life design. This common model was supported within an Integrated Design Environment (IDE) that co-ordinated design activity distributed across Europe. The IDE ensured that the users were provided with the right data in the right form at the right time to do the right task, i.e., that the design activity was timely and appropriate. The strengths and weaknesses of the approach are highlighted. 相似文献

15.

User-centered requirements engineering in health information systems: a study in the hemophilia field

Teixeira L Ferreira C Santos BS 《Computer methods and programs in biomedicine》2012,106(3):160-174

The use of sophisticated information and communication technologies (ICTs) in the health care domain is a way to improve the quality of services. However, there are also hazards associated with the introduction of ICTs in this domain and a great number of projects have failed due to the lack of systematic consideration of human and other non-technology issues throughout the design or implementation process, particularly in the requirements engineering process. This paper presents the methodological approach followed in the design process of a web-based information system (WbIS) for managing the clinical information in hemophilia care, which integrates the values and practices of user-centered design (UCD) activities into the principles of software engineering, particularly in the phase of requirements engineering (RE). This process followed a paradigm that combines a grounded theory for data collection with an evolutionary design based on constant development and refinement of the generic domain model using three well-known methodological approaches: (a) object-oriented system analysis; (b) task analysis; and, (c) prototyping, in a triangulation work. This approach seems to be a good solution for the requirements engineering process in this particular case of the health care domain, since the inherent weaknesses of individual methods are reduced, and emergent requirements are easier to elicit. Moreover, the requirements triangulation matrix gives the opportunity to look across the results of all used methods and decide what requirements are critical for the system success. 相似文献

16.

Developing secure data warehouses with a UML extension

Eduardo Fernández-Medina Juan Trujillo Rodolfo Villarroel Mario Piattini 《Information Systems》2007

Data Warehouses (DWs), Multidimensional (MD) Databases, and On-Line Analytical Processing Applications are used as a very powerful mechanism for discovering crucial business information. Considering the extreme importance of the information managed by these kinds of applications, it is essential to specify security measures from the early stages of the DW design in the MD modeling process, and enforce them. In the past years, some proposals for representing main MD modeling properties at the conceptual level have been stated. Nevertheless, none of these proposals considers security issues as an important element in its model, so they do not allow us to specify confidentiality constraints to be enforced by the applications that will use these MD models. In this paper, we will discuss the specific confidentiality problems regarding DWs as well as present an extension of the Unified Modeling Language for specifying security constraints in the conceptual MD modeling, thereby allowing us to design secure DWs. One key advantage of our approach is that we accomplish the conceptual modeling of secure DWs independently of the target platform where the DW has to be implemented, allowing the implementation of the corresponding DWs on any secure commercial database management system. Finally, we will present a case study to show how a conceptual model designed with our approach can be directly implemented on top of Oracle 10g. 相似文献

17.

The application of information systems for the design and operation of flexible machining cells

SHAHIN RAHIMIFARD STEPHEN T NEWMAN 《Journal of Intelligent Manufacturing》1999,10(1):21-27

The use of information systems in manufacturing applications has dramatically changed over the last few years. The design and implementation of somewhat dated relational databases has been replaced by the generation of information models, that can be simultaneously used for the development of information systems and satisfy their integration requirements. Over the last ten years the authors have been involved in a series of research programmes focusing on the design and operation of flexible machining cells. The use of information systems has been a central theme and the enabling technology to achieve a number of novel design concepts and operational strategies for such cells. The initial research was based on the utilization of relational databases to integrate a variety of modelling and design tools. However, the additional effort required to integrate such databases to manufacturing software tools, in the form of developing file translators, information gateways and interfac es, has made the authors adopt a new approach. With this approach the information requirements are represented in a neutral format within a data model, using a formal data specification language developed by the Standards for the Exchange of Product (STEP) committee. This paper describes these changes in the design and implementation of information systems in manufacturing applications, and provides an initial view of future research requirements. 相似文献

18.

An experiment on the impact of transparency on the effectiveness of requirements documents

Yu-Cheng Tu Ewan Tempero Clark Thomborson 《Empirical Software Engineering》2016,21(3):1035-1066

Effective communication is important to successful software development, but it is difficult to achieve. We believe transparency — the visibility of information to stakeholders — is an important factor in the effectiveness of communication in software projects. We theorise that more effective communication results from more transparent requirements documents. To test our theory, we conducted an experiment. We developed an operational definition of transparency with three attributes: accessibility, understandability, and relevance. We had students and software practitioners use requirements documents of differing levels of transparency based on these attributes to answer questions. We found that participants with the more transparent document spent less time, answered more questions correctly, and were more confident about their answers, than participants with the less transparent document. The results of our experiment provide evidence that our view of transparency may help evaluate the effectiveness of documents as a form of communication. Further work is needed to reproduce our results, and to determine whether they are generalizable to other types of stakeholders and forms of communication. 相似文献

19.

PROCEDURES OF DESIGNING ENTERPRISE NETWORK COMMUNICATION SUBSYSTEMS

Adam Grzech 《控制论与系统》2013,44(5):531-545

The aim of the paper is to present some basic rules allowing comparison of data processing and data communication subsystems in a form applicable in practice. The proposed approach addresses the corporate network communication subsystems design problem. Corporate networks classification, design, evaluation, planning, monitoring, and management as well as tuning, integration, or migration tasks are usually formulated as finding a communication system that is most suitable for a given corporate data processing system. The design procedures are based on the assumption that it is possible to transform data processing subsystem features and requirements into communication subsystem characteristics and vice versa. The features and characteristics when known allow comparison of categorized requirements and available services. 相似文献

20.

基于移动网络的嵌入式远程数据终端实现

郝晓弘李桂肃瞿华《微计算机信息》2007,23(5):24-25

随着移动通讯技术的迅猛发展,利用移动通讯技术实现远程控制已经有着越来越广阔的理论和实践方面的研究价值,本文通过对现代通讯方式的讨论,提出了借助GSM/GPRS网络提供的业务实现微控制器远程通讯的实现方法,本文根据实际传输的数量和应用场合的不同,分别介绍了微控制器的短信接入方法和GPRS(通用分组无线业务)接入方法,给出了具体设计方案,并对每种方案进行了比较。相似文献