首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 391 毫秒
1.
We present some basic concepts of a modelling environment for data integration in business analytics. Main emphasis is on defining a process model for the different activities occurring in connection with data integration, which allow later on assessment of the quality of the data. The model is based on combination of knowledge and techniques from statistical metadata management and from workflow processes. The modelling concepts are presented in a problem oriented formulation. The approach is embedded into an open model framework which aims for a modelling platform for all kinds of models useful in business applications.  相似文献   

2.
Engineering application domains need database management systems to supply them with a good means of modeling,a high data access efficiency and a language interface with strong functionality.This paper presents a semantic hypergraph model based on relations,in order to express many-to-many relations among objects belonging to different semantic classes in engineering applications.A management mechanism expressed by the model and the basic data of engineering databases are managed in main memory.Especially,different objects are linked by different kinds of semantics defined by users,therefore the table swap,the record swap and some unnecessary examinations ar reduced and the access efficiency of the engineering data is increased.C language interface that includes some generic and special functionality is proposed for closer connection with application programs.  相似文献   

3.
Existing data management tools have some limitations such as restrictions to specific file systems or shortage of transparence to applications.In this paper,we present a new data management tool called AIP,which is implemented via the standard data management API,and hence it supports multiple file systems and makes data management operations transparent to applications.First,AIP provides centralized policy-based data management for controlling the placement of files in different storage tiers.Second,AIP uses differentiated collections of file states to improve the execution efficiency of data management policies,with the help of the caching mechanism of file states.Third,AIP also provides a resource arbitration mechanism for controlling the rate of initiated data management operations.Our results from representative experiments demonstrate that AIP has the ability to provide high performance,to introduce low management overhead,and to have good scalability.  相似文献   

4.
Object Identity in Database Systems   总被引:1,自引:0,他引:1       下载免费PDF全文
The concept of object identity and implementation of object identity in some systems have been explained in literature.Based on an analysis on the idea of data scheme in ANSI/X3/SPARC,this paper presents the concept of full-identity,which includes entity identity,conceptual object identity,and internal object identity,In addition,the equality of objects,which is richer and more practical,is discussed based on the full identity of objects.Therefore,the semantics and constructions of the identity for the complex objects are fully observed,and some appliactions in object management,version management,and user interface are found.Also,it could support the combination of O-O model with V-O model.  相似文献   

5.
Today, grid technology has evolved to the point where it is no longer a theory but a proven practice. It represents a viable direction for corporations to explore grid computing as an answer to their business needs within tight financial constraints. In general, grids enable the efficient sharing and management of computing resources for the purpose of performing large complex tasks. Data grid provides the data management features to enable data access, synchronization, and distribution of a grid. The main aim here is to ensure a efficient access and quality data, to improve the availability, and be able to continue delivering acceptable services. In such systems, these advantages are not yielded by means others than replication mechanisms. The effective use the replication technique involves several problems, in relation with the problem of the coherence maintenance of replicas. Our contribution consists new service for the consistency management in the data grid. This service combines between pessimistic and optimistic approaches, taking into account benefits of both approaches, to find a compromise between performance and quality. In addition, our service has been extended by a mechanism placement of replicas based on economics model.  相似文献   

6.
IP网络计费管理研究   总被引:2,自引:0,他引:2  
赵慧  侯建荣 《计算机科学》2003,30(10):152-154
Accounting Management is probably the least developed IP network management application. Now it is paid more attention than before. Accounting is relatively a unattached and typical application compare to other IP network management applications. First , the non-technical factors are described. And then, the accounting objects and accounting policy are researched and described formally. The CORBA-based accounting architecture is presented.The architecture has three layers. First layer is the Web interface used as the users' interface. Java Applet is used to pass some active data. HTTP Server and CORBAAcct Server locate in the second layer. The database server is also located in this layer. The CORBAAcct Server defined by some CORBA objects accomplishes the accounting. The third layer is the device Agent on which the raw accounting data are introduced. The client communicates with the CORBAAcct Server by IIOP and the communication between CORBAAcct Server and device Agent is based on network management orotocol.  相似文献   

7.
Uncertain data are data with uncertainty information,which exist widely in database applications.In recent years,uncertainty in data has brought challenges in almost all database management areas such as data modeling,query representation,query processing,and data mining.There is no doubt that uncertain data management has become a hot research topic in the field of data management.In this study,we explore problems in managing uncertain data,present state-of-the-art solutions,and provide future research directions in this area.The discussed uncertain data management techniques include data modeling,query processing,and data mining in uncertain data in the forms of relational,XML,graph,and stream.  相似文献   

8.
With the convergence of high-performance computing(HPC),big data and artificial intelligence(AI),the HPC community is pushing for"triple use"systems to expedite scientific discoveries.However,supporting these converged applications on HPC systems presents formidable challenges in terms of storage and data management due to the explosive growth of scientific data and the fundamental differences in I/O characteristics among HPC,big data and AI workloads.In this paper,we discuss the driving force behind the converging trend,highlight three data management challenges,and summarize our efforts in addressing these data management challenges on a typical HPC system at the parallel file system,data management middleware,and user application levels.As HPC systems are approaching the border of exascale computing,this paper sheds light on how to enable application-driven data management as a preliminary step toward the deep convergence of exascale computing ecosystems,big data,and AI.  相似文献   

9.
In this paper,the authors present the design and implementation of an Interoperable Object Platform for Multi-Databases(IOPMD).The aim of the system is to provide a uniform object view and a set of tools for object manipulation and query based on heterogeneous multiple data sources under client/server environment.The common object model is compatible with ODMG2.0 and OMG‘s CORBA,which provides main OO features such as OID,attribute ,method,inheritance,reference,etc.Three types of interfaces,namely Vface,IOQL and C API,are given to provide the database programmer with tools and functionalities for application development.Nested transactions and compensating technology are adopted in transaction manager,In discussing some key implementation techniques.Translation and mapping approaches from various schemata to a common object schema are proposed.Buffer management provides the data caching policy and consistency maintenance of cached data.Version management presents some operations based on the definitions in semantic version model,and introduces the implementation of the semantic version graph.  相似文献   

10.
HFC宽带接入网上行带宽分配策略的改进   总被引:1,自引:0,他引:1  
The request/transmit based upstream bandwidth resource allocation policy of DOCSIS introduces a trouble to the quality of the data service provided in the HFC networks.In this paper,the mechanism of the upstream data transmitting and the process of data service transmitting in the HFC networks are described in detail,and the perfor-mance of the data service in HFC networks is analyzed.An advanced upstream bandwidth resource allocation policy is proposed to improve the quality of the data service in the HFC networks.  相似文献   

11.
大型数据仓库实现技术的研究   总被引:2,自引:0,他引:2  
大型数据仓库是实现海量数据存储的有效途径,但在大型数据仓库的实现中存在很多问题。在分析问题的基础上,对大型数据仓库的实现问题提出了一定的解决策略,对其中的几个关键技术即数据立方体的有效计算、增量式更新维护、索引优化、故障恢复、模式设计和查询优化的代价模型及元数据的定义和管理等作了研究。  相似文献   

12.
李瑞旭  李扬 《微机发展》2011,(9):175-178
元数据集成是数据仓库元数据管理的一项重要内容。文章在目前元数据集成研究成果的基础上,提出了一种基于SOA架构的数据仓库元数据集成技术。该技术以Web Service技术为应用框架,以CWM为元数据模型,采用XML设计元数据封装器,实现了分布环境下数据仓库元数据的集成与重用。文章重点介绍了系统的体系构架,及CWM元数据模型的结构设计和不同Web Service方法设计和调用。最后,将该技术应用到消防工程领域的一个实际数据仓库项目中,验证了该技术的可行性、有效性、实用性。  相似文献   

13.
对拥有多个远程站点或仓库的企业的而言,如何对企业所属远程站点或仓库的物流和资金流进行统一的结算和管理是一个十分重要的问题。目前构建企业对远程站点或仓库进行统一结算和管理的方法通常是在企业总部的服务器中创建企业统一的数据库,该数据库中存储各远程站点或仓库的全部数据,各远程站点或仓库连网登录到企业总部的服务器中,企业在此基础上进行统一的结算和管理。但对于已将数据库分散建立在远程站点或仓库计算机中的情况,应如何进行企业统一的结算,应如何创建经济、可靠、便捷的远程站点或仓库的数据管理,也是企业在计算机信息化建设和管理中面临的问题。利用ADsL技术、pcAnywhere软件,通过Visual,Basic 6.0编程,就可以便捷经济地完成多个远程站点或仓库物流和资金流的统一结算和管理。  相似文献   

14.
Paulson  L.D. 《IT Professional》2000,2(4):10-14
Data quality has become a business-critical issue because it can make or break the databases and data warehouses that drive e-business. Garbage in, garbage out is not new; it started to become more important because of data warehouses and data marts as well as the Web. In recent InformationWeek surveys, IT professionals have consistently ranked data quality tasks as the leading IT challenge in the post-Y2K remediation era. What's also new is that data quality issues aren't limited to individual erroneous records: they now include how applications associate various pieces of data, and how users use or interpret data. As new data usage revealed the need to ensure data consistency throughout the enterprise, new tools have emerged to help IT reconcile data inconsistencies. Not all professionals are confident off-the-shelf solutions can flexibly meet their needs, but vendors argue that automating tasks allows quicker, cost-effective project and application deployment  相似文献   

15.
工业数据仓库设计方法及其在质量分析中的应用   总被引:2,自引:0,他引:2  
提出一种建立工业数据仓库的基本方法,并结合某大型钢铁企业的具体情况,给出一种数据仓库系统的实现方案,讨论了数据仓库在企业产品质量分析中的应用。实践证明,数据仓库可为企业的经营管理提供全面、准确的数据,可在改进产品性能,提高产品质量方面发挥重要作用。  相似文献   

16.
数据挖掘技术在证券客户关系中的应用   总被引:2,自引:2,他引:0  
叶良 《计算机仿真》2009,26(12):270-273
研究证券管理问题,客户关系管理系统(CRM)是现代经营管理科学与现代信息技术结合的科学问题.数据挖掘技术是有效地利用现有数据资源的重要手段.重点是针对数据挖掘技术在证券客户关系管理中的具体问题.运用数据仓库技术建立了客户交易行为数据仓库,并运用聚类技术完成了基于证券公司客户交易行为数据仓库的证券公司客户细分.基于数据挖掘的CRM是对传统企业管理思想的一个创新,充分体现了管理的科学性和艺术性.对企业的经营决策和客户关系管理都具有相当重要的作用和意义.  相似文献   

17.
《Information Systems》1999,24(3):229-253
Most database researchers have studied data warehouses (DW) in their role as buffers of materialized views, mediating between update-intensive OLTP systems and query-intensive decision support. This neglects the organizational role of data warehousing as a means of centralized information flow control. As a consequence, a large number of quality aspects relevant for data warehousing cannot be expressed with the current DW meta models. This paper makes two contributions towards solving these problems. Firstly, we enrich the meta data about DW architectures by explicit enterprise models. Secondly, many very different mathematical techniques for measuring or optimizing certain aspects of DW quality are being developed. We adapt the Goal-Question-Metric approach from software quality management to a meta data management environment in order to link these special techniques to a generic conceptual framework of DW quality. The approach has been implemented in full on top of the ConceptBase repository system and has undergone some validation by applying it to the support of specific quality-oriented methods, tools, and application projects in data warehousing.  相似文献   

18.
A federation of data warehouses is understood as a set of data warehouses, which can be processed as a whole in the logic level. Physically, the federation does not gather data into one place. This paper presents a formal framework for data and knowledge processing in data warehouse federations. The management system for a data warehouse federation consists of an user interface enabling presentation of user queries, a program for query decomposition and a program for integrating knowledge coming from different data warehouses as the answers to a user query. We propose a model for query decomposition process and knowledge integration. It contains also the algorithm for knowledge inconstancy processing. This kind of inconsistency often occurs since very often the knowledge extracted from different data warehouses refers to the same subject, but is not consistent.  相似文献   

19.
Web数据仓库的异步迭代查询处理方法   总被引:2,自引:0,他引:2  
何震瀛  李建中  高宏 《软件学报》2002,13(2):214-218
数据仓库信息量的飞速膨胀对数据仓库提出了巨大挑战.如何提高Web环境下数据仓库的查询效率成为数据仓库研究领域重要的研究问题.对Web数据仓库的体系结构和查询方法进行了研究和探讨.在分析几种Web数据仓库实现方法的基础上,提出了一种Web数据仓库的层次体系结构,并在此基础上提出了Web数据仓库的异步迭代查询方法.该方法充分利用了流水线并行技术,在Web数据仓库的查询处理过程中不同层次的结点以流水线方式运行,并行完成查询的处理,提高了查询效率.理论分析表明,该方法可以有效地提高Web数据仓库的查询效率.  相似文献   

20.
《Information Sciences》2007,177(11):2238-2254
Many data warehouse systems have been developed recently, yet data warehouse practice is not sufficiently sophisticated for practical usage. Most data warehouse systems have some limitations in terms of flexibility, efficiency, and scalability. In particular, the sizes of these data warehouses are forever growing and becoming overloaded with data, a scenario that leads to difficulties in data maintenance and data analysis. This research focuses on data-information integration between data cubes. This research might contribute to the resolution of two concerns: the problem of redundancy and the problem of data cubes’ independent information. This work presents a semantic cube model, which extends object-oriented technology to data warehouses and which enables users to design the generalization relationship between different cubes. In this regard, this work’s objectives are to improve the performance of query integrity and to reduce data duplication in data warehouse. To deal with the handling of increasing data volume in data warehouses, we discovered important inter-relationships that hold among data cubes, that facilitate information integration, and that prevent the loss of data semantics.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号