期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Data management scheme for parallel radiosity

Kadi Bouatouch Thierry Priol 《Computer aided design》1994,26(12):876-882

The radiosity method is a very demanding process in terms of computing and memory resources. To cope with these problems, parallel solutions have been proposed in the literature. A new parallel solution, based on the use of a shared virtual memory, is proposed. It will be shown that this concept of SVM greatly simplifies the implementation of a parallel algorithm since distributed data are managed by the operating system. This new parallel radiosity algorithm has been implemented on an iPSC/2 hypercube using the SVM. The first results obtained with this algorithm are encouraging since the calculated efficiency curve is nearly linear. 相似文献

2.

数据并行语言中的任务并行

刘振英方滨兴《计算机工程与应用》1999,35(10):43-45

数据并行虽然已经获得了广泛的应用,但是,仍然有一些应用程序不适于数据并行语言的并行模式,如树结构算法。数据并行与任务并行的结合可以很好地解决这些问题。该文主要讨论了在数据并行中引入任务并行时,遇到的共享变量、代码生成和处理器分配等问题,比较和分析了基于编译、基于语言和基于协作库的方法。相似文献

3.

数据融合的方法及应用研究 总被引：7，自引：0，他引：7

王征刘宁庄张建成《自动化与仪器仪表》2006,(4):77-80

介绍了数据融合的定义、融合层次、融合方法及其应用领域，特别是在军事上，机器人控制及图像处理上的应用。在此基础上，分析了数据融合的研究热点及发展趋势。数据融合技术是一种用途广泛的数字信号及信息处理方法，它通过对大量的数据进行处理提纯，得到一组直观有效的数据，为进一步处理和判断控制提供精确的数据依据。相似文献

4.

任务间的次并行性

下载免费PDF全文

杜建成徐融陈道蓄谢立《软件学报》1998,9(12):917-921

首先给出了任务间次并行性存在的条件,讨论了两个任务之间的通讯、通讯等待开销的计算和任务间次并行性发掘的一般过程.此外,还就代码移动和任务合并对增强并行性、消减不必要的通讯等待开销的影响作了说明. 相似文献

5.

Detecting value-based scalar dependence

Eric Stoltz Michael Wolfe 《International journal of parallel programming》1995,23(4):327-358

Precise value-based data dependence analysis for scalars is useful for advanced compiler optimizations. The new method presented here for flow and output dependence uses Factored Use and Def chains (FUD chains), our interpretation and extension of Static Single Assignment. It is precise with respect to conditional control flow and dependence vectors. Our method detects dependences which are independent with respect to arbitrary loop nesting, as well as loop-carried dependences. A loop-carried dependence is further classified as being carried from the previous iteration, with distance 1, or from any previous iteration, with direction <. This precision cannot be achieved by traditional analysis, such as dominator information or reaching definitions. To compute anti- and input dependence, we use Factored Redef-Use chains, which are related to FUD chains. We are not aware of any prior work which explicitly deals with scalar data dependence utilizing a sparse graph representation. A preliminary version of this paper appeared in theSeventh Anual Workshop on Languages and Compilers for Parallel Computing, August 1994. Supported in part by NSF Grant CCR-9113885 and a grant from Intel Corporation and the Oregon Advanced Computing Institute. 相似文献

6.

A unifying theory of control dependence and its application to arbitrary program structures

Sebastian Danicic Richard W. BarracloughMark Harman John D. HowroydÁkos Kiss Michael R. Laurence 《Theoretical computer science》2011,412(49):6809-6842

There are several similar, but not identical, definitions of control dependence in the literature. These definitions are given in terms of control flow graphs which have had extra restrictions imposed (for example, end-reachability).We define two new generalisations of non-termination insensitive and non-termination sensitive control dependence called weak and strong control-closure. These are defined for all finite directed graphs, not just control flow graphs and are hence allow control dependence to be applied to a wider class of program structures than before.We investigate all previous forms of control dependence in the literature and prove that, for the restricted graphs for which each is defined, vertex sets are closed under each if and only if they are either weakly or strongly control-closed. Low polynomial-time algorithms for producing minimal weakly and strongly control-closed sets over generalised control flow graphs are given.This paper is the first to define an underlying semantics for control dependence: we define two relations between graphs called weak and strong projections, and prove that the graph induced by a set of vertices is a weak/strong projection of the original if and only if the set is weakly/strongly control-closed. Thus, all previous forms of control dependence also satisfy our semantics. Weak and strong projections, therefore, precisely capture the essence of control dependence in our generalisations and all the previous, more restricted forms. More fundamentally, these semantics can be thought of as correctness criteria for future definitions of control dependence. 相似文献

7.

依赖距离主导的向量化方法研究*

丁丽丽韩林王冬张素平王鹏翔于海宁《计算机应用研究》2017,34(5)

向量寄存器的非满载使用方式为大量迭代次数不足的循环提供了向量化的机会,但也导致向量化的并行宽度不固定,传统的向量因子主导的依赖测试方法不再适用。本文提出了一种依赖距离主导的依赖测试方法,通过分析依赖图中所有依赖环的破环关键边所携带的依赖距离,选择其中最小的依赖距离来决定并行宽度,破除依赖环,实现基于向量寄存器非满载使用方式的向量化。实验结果表明,该方法能够有效增加循环向量化的机会和提高向量寄存器的使用率,测试用例的向量化加速比平均提高14.6%。相似文献

8.

Workflow-Based Data Parallel Applications on the EGEE Production Grid Infrastructure

Johan Montagnat Tristan Glatard Isabel Campos Plasencia Francisco Castejón Xavier Pennec Giuliano Taffoni Vladimir Voznesensky Claudio Vuerli 《Journal of Grid Computing》2008,6(4):369-383

Setting up and deploying complex applications on a Grid infrastructure is still challenging and the programming models are rapidly evolving. Efficiently exploiting Grid parallelism is often not straight forward. In this paper, we report on the techniques used for deploying applications on the EGEE production Grid through four experiments coming from completely different scientific areas: nuclear fusion, astrophysics and medical imaging. These applications have in common the need for manipulating huge amounts of data and all are computationally intensive. All the cases studied show that the deployment of data intensive applications require the development of more or less elaborated application-level workload management systems on top of the gLite middleware to efficiently exploit the EGEE Grid resources. In particular, the adoption of high level workflow management systems eases the integration of large scale applications while exploiting Grid parallelism transparently. Different approaches for scientific workflow management are discussed. The MOTEUR workflow manager strategy to efficiently deal with complex data flows is more particularly detailed. Without requiring specific application development, it leads to very significant speed-ups. 相似文献

9.

Data migration on parallel disks: Algorithms and evaluation

Leana Golubchik Samir Khuller Yoo-Ah Kim Svetlana Shargorodskaya Yung-Chun Wan 《Algorithmica》2006,45(1):137-158

Our work is motivated by the problem of managing data on storage devices, typically a set of disks. Such storage servers are used as web servers or multimedia servers, for handling high demand for data. As the system is running, to exhibit good performance, it needs to respond dynamically to changes in demand for different data items. There are known algorithms for mapping demand to a layout. When the demand changes, a new layout can be computed. In this work we study thedata migration problem, which arises when we need to change one layout to another quickly. This problem has been studied earlier where for each disk a new layout has been prescribed. However, to apply these algorithms effectively, we identify another problem that we refer to as the correspondence problem, whose solution has a significant impact on the overall solution for the data migration problem. We study algorithms for the data migration problem in more detail and identify variations of the basic algorithm that seem to improve performance in practice, even though some of the variations have poor worst-case behavior. This research was supported by the NSF Awards CCR-0113192 and EIA-0091474 as well as the Okawa Research Award. This work made use of Integrated Media Systems Center Shared Facilities supported by the National Science Foundation under Cooperative Agreement No. EEC-9529152; any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect those of the National Science Foundation. This work was done while Svetlana Shargorodskaya was at the University of Maryland. 相似文献

10.

Nonlinear matrix decompositions and an application to parallel processing

J. L. Davidson 《Journal of Mathematical Imaging and Vision》1992,1(2):169-192

This paper introduces several decomposition results for a class of nonlinear transforms calledlattice transforms. A lattice transform has a matrix representation in the context of minimax algebra, a matrix algebraic structure developed for operations research. A general matrix decomposition method is presented and is then extended to provide necessary and sufficient conditions for mapping a lattice transform to a limited-connection parallel architecture. An additional result, necessary and sufficient conditions for finding a decomposition of a block Toeplitz matrix with Toeplitz blocks, is also given. Prior to these results, no minimax matrix decompositions had been developed. 相似文献

11.

An exact data dependence testing method for quadratic expressions

Jia-Hwa Wu Chih-Ping Chu 《Information Sciences》2007,177(23):5316-5328

相似文献

12.

串行程序的依赖关系分析和向量化 总被引：1，自引：0，他引：1

曾扬《计算机学报》1993,16(2):130-142

本文提出了两种新的数据依赖关系分析方法——系数判别法和实分析方法,其中,系数判别法在GCD方法基础上,给出数组项之间精确的依赖关系,并直接给出依赖的方向.实分析方法没有目前其他分析方法所要求的下标为循环控制变量的线性函数的限制.并且,很自然地解决了Coupled subscr-ipts及隐关系的分析问题.另外,本文探讨了破除数据依赖关系的方法及向量化的问题.文中的算法均己在UNIX环境下实现. 相似文献

13.

Effectiveness of data dependence analysis

Dror E. Maydan John L. Hennessy Monica S. Lam 《International journal of parallel programming》1995,23(1):63-81

Data dependence testing is the basic step in detecting loop level parallelism in numerical programs. The problem is undecidable in the general case. Therefore, work has been concentrated on a simplified problem, affine memory disambiguation. In this simpler domain, array references and loops bounds are assumed to be linear integer functions of loop variables. Dataflow information is ignored. For this domain, we have shown that in practice the problem can be solved accurately and efficiently.⁽¹⁾ This paper studies empirically the effectiveness of this domain restriction, how many real references are affine and flow insensitive. We use Larus's llpp system⁽²⁾ to find all the data dependences dynamically. We compare these to the results given by our affine memory disambiguation system. This system is exact for all the cases we see in practice. We show that while the affine approximation is reasonable, memory disambiguation is not a sufficient approximation for data dependence analysis. We propose extensions to improve the analysis. This research was supported in part by a fellowship from AT & T Bell Laboratories and by DARPA contract N00014-87-K-0828. 相似文献

14.

Instruction-level parallel processing: History,overview, and perspective 总被引：11，自引：0，他引：11

B. Ramakrishna Rau Joseph A. Fisher 《The Journal of supercomputing》1993,7(1-2):9-50

Instruction-level parallelism (ILP) is a family of processor and compiler design techniques that speed up execution by causing individual machine operations to execute in parallel. Although ILP has appeared in the highest performance uniprocessors for the past 30 years, the 1980s saw it become a much more significant force in computer design. Several systems were built and sold commercially, which pushed ILP far beyond where it had been before, both in terms of the amount of ILP offered and in the central role ILP played in the design of the system. By the end of the decade, advanced microprocessor design at all major CPU manufacturers had incorporated ILP, and new techniques for ILP had become a popular topic at academic conferences. This article provides an overview and historical perspective of the field of ILP and its development over the past three decades. 相似文献

15.

Data dependence and program restructuring

Michael Wolfe 《The Journal of supercomputing》1991,4(4):321-344

Data dependence concepts are reviewed, concentrating on and extending previous work on direction vectors. A bit vector representation of direction vectors is discussed. Various program restructuring transformations, such as loop circulation (a form of loop interchanging), reversal, skewing, sectioning (strip mining), combing, and rotation, are discussed in terms of their effects on the execution of the program, the required dependence tests for legality, and the effects of each transformation on the dependence graph. The bit vector representation of direction vectors is used to develop simple and efficient bit vector operations for the dependence tests and to generate the modified direction vector for each transformation. Finally, a simple method to interchange complex convex loop limits is given, which is useful when several loop restructuring operations are being applied in sequence.This work was supported by NSF Grant CCR-8906909 and DARPA Grant MDA972-88-J-1004. 相似文献

16.

GUI录制回放技术在分布并行计算中的研究与应用 总被引：1，自引：0，他引：1

任涛黄永忠《计算机工程与设计》2007,28(8):1934-1936,1940

GUI录制回放技术在软件测试自动化领域有着广泛的应用,而这里主要讨论了将该技术运用在分布并行计算中.目前,很多软件其自身已经集成了很好的算法,但由于这些软件是非开源的,所以无法直接将这些算法并行化.利用GUI录制回放技术设计并实现了一套工具;使用这套工具并结合特定的分布式计算平台,并行程序开发人员能够方便快速地将这些软件并行化,并将精力放在并行算法的设计上. 相似文献

17.

维数据仓库建模技术及其在数据建模中的应用 总被引：1，自引：0，他引：1

朱卫锋费奇陈学广《计算机工程与应用》2002,38(19):201-202,211

该文首先介绍了数据建模理论与方法的应用现状,接着介绍了基于星型模式的维数据仓库建模技术,然后给出了数据建模的步骤,最后分析了三峡工程散装水泥粉煤灰调运信息系统(TGP-CFADIS)分析型数据建模的过程。相似文献

18.

动态数据交换技术及其实现方法研究 总被引：5，自引：0，他引：5

陈荦景宁《计算机工程与应用》1999,35(3):42-45

随着软件系统开放性的日益增强,数据交换技术已成为越来越重要的一个研究课题。文章介绍了系统实现中采用的数据交换技术,着重研究了基于消息传递的动态数据交换技术（ＤＤＥ）的实现机理和利用其建立数据交换应用的方法。在实际系统的数据交换中应用ＤＤＥ技术,在实践中获得了良好的效果。文中还研究了ＤＤＥ技术的扩展,即对象链接与嵌入（ＯＬＥ）的原理和实现。相似文献

19.

Correct translation of data parallel assignment onto array processors

J. P. Wray A. Stewart 《Formal Aspects of Computing》1994,6(4):417-439

相似文献

20.

并行程序性能数据采集的设计与实现

董玉宇张延园《微机发展》1996,6(4):49-52

本文以并行３ＬＦＯＲＴＲＡＮ源程序为分析对象．阐述了并行程序运行可视化技术中的关键环节──有关并行程序运行时性能数据的采集系统的设计思路及实现方案，并通过一实例进行了说明．相似文献