期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

M. R. Henzinger 《Algorithmica》1995,13(6):503-538

We present an algorithm for maintaining the biconnected components of a graph during a sequence of edge insertions and deletions. It requires linear storage and preprocessing time. The amortized running time for insertions and for deletions isO(m ^2/3 ), wherem is the number of edges in the graph. Any query of the form ‘Are the verticesu andv biconnected?’ can be answered in timeO(1). This is the first sublinear algorithm for this problem. We can also output all articulation points separating any two vertices efficiently. If the input is a plane graph, the amortized running time for insertions and deletions drops toO(√n logn) and the query time isO(log² n), wheren is the number of vertices in the graph. The best previously known solution takes timeO(n ^2/3 ) per update or query. 相似文献

2.

Lock-free parallel dynamic programming

Alex Stivala Peter J. Stuckey Maria Garcia de la Banda Manuel Hermenegildo Anthony Wirth 《Journal of Parallel and Distributed Computing》2010

We show a method for parallelizing top down dynamic programs in a straightforward way by a careful choice of a lock-free shared hash table implementation and randomization of the order in which the dynamic program computes its subproblems. This generic approach is applied to dynamic programs for knapsack, shortest paths, and RNA structure alignment, as well as to a state-of-the-art solution for minimizing the maximum number of open stacks. Experimental results are provided on three different modern multicore architectures which show that this parallelization is effective and reasonably scalable. In particular, we obtain over 10 times speedup for 32 threads on the open stacks problem. 相似文献

3.

Performing incremental dynamic analysis in parallel

Dimitrios Vamvatsikos 《Computers & Structures》2011,89(1-2):170-180

Incremental dynamic analysis has recently emerged to offer comprehensive evaluation of the seismic performance of structures using multiple nonlinear dynamic analyses under scaled ground-motion records. Being computer-intensive, it can benefit from parallel processing to accelerate its application on realistic structural models. While the task-farming master–slave paradigm seems ideal, severe load imbalances arise due to analysis non-convergence at structural instability, prompting the examination of task partitioning at the level of single records or single dynamic runs. Combined with a multi-tier master–slave processor hierarchy employing dynamic task generation and self-scheduling we achieve a flexible and efficient parallel algorithm with excellent scalability. 相似文献

4.

Fully parallel thinning with tolerance to boundary noise

Frank Y. Shih Wai-Tak Wong 《Pattern recognition》1994,27(12):1677-1695

A new fully parallel thinning algorithm is developed and evaluated in this paper to solve the noise spurs problem and preserve geometric properties efficiently. The algorithm not only prevents the excessive erosions but also lessens the creation of spurious end points for an image with boundary noise. When two input images are similar in shape but with boundary noise, our skeletons produced appear more consistent in topology as compared to those using other algorithms. Although a few additional neighbors other than 3 × 3 are considered in the deletability conditions, the smoothing procedure prior to thinning is avoided. The parallel thinning algorithm runs very fast and can be implemented in real time. Several English and Chinese characters and the difficult patterns often illustrated in the literature are also experimented to show the efficiency and consistency of our algorithm. 相似文献

5.

Fully dynamic algorithms for permutation graph coloring

《国际计算机数学杂志》2012,89(1-2):37-55

We introduce a dynamic model for maintaining permutation graph coloring. Our motivation comes from the strait type river routing problem in VLSI. This paper presents fully dynamic algorithms for the permutation graph coloring problem. These algorithms are designed to handle Insert and Delete operations and answer some queries. The aim is to provide for running times that are asymptotically more efficient than recomputation (off-line algorithms that run in 0(n logw) time, are known [5,6,10,3]). First, the algorithm A^ that runs in 0(n) uniform running time per Insert/Delete operation is presented. Second, a more sophisticated data structure leads to the algorithm A2 that runs in (9(m logw) uniform running time per Insert I Delete, where m denotes the number of chains in the decomposition. It follows from [7,4] that the running time of A2 when the points from the dynamically changing set are drawn independently from a uniform distribution on the unit square is G(yfn logn) per Insert/Delete in probability. Third, we sketch a composite algorithm A3 that switches between A± and A2 guarantees an amortized running time of (min{n,m logw)) per Insert/Delete. Finally, we outline a number of applications 相似文献

6.

Asynchronous analysis of parallel dynamic programming algorithms

Lewandowski G. Condon A. Bach E. 《Parallel and Distributed Systems, IEEE Transactions on》1996,7(4):425-438

We examine a very simple asynchronous model of parallel computation that assumes the time to compute a task is random, following some probability distribution. The goal of this model is to capture the effects of unpredictable delays on processors, due to communication delays or cache misses, for example. Using techniques from queueing theory and occupancy problems, we use this model to analyze two parallel dynamic programming algorithms. We show that this model is simple to analyze and correctly predicts which algorithm will perform better in practice. The algorithms we consider are a pipeline algorithm, where each processor i computes in order the entries of rows i, i+p, and so on, where p is the number of processors; and a diagonal algorithm, where entries along each diagonal extending from the left to the top of the table are computed in turn. It is likely that the techniques used here can be useful in the analysis of other algorithms that use barriers or pipelining techniques 相似文献

7.

Generalized core maintenance of dynamic bipartite graphs

Bai Wen Chen Yadi Wu Di Huang Zhichuan Zhou Yipeng Xu Chuan 《Data mining and knowledge discovery》2022,36(1):209-239

Data Mining and Knowledge Discovery - k-core is important in many graph mining applications, such as community detection and clique finding. As one generalized concept of k-core,... 相似文献

8.

Verifying parallel programs with dynamic communication structures

Tayssir Touili Mohamed Faouzi Atig 《Theoretical computer science》2010,411(38-39):3460-3468

We address the verification problem of networks of communicating pushdown systems modeling communicating parallel programs with procedure calls. Processes in such networks can read the control state of the other processes according to a given communication structure (specifying the observability rights between processes). The reachability problem of such models is undecidable in general. First, we define a class of networks that effectively preserves recognizability (hence, its reachability problem is decidable). Then, we consider networks where the communication structure can change dynamically during the execution according to a phase graph. The reachability problem for these dynamic networks being undecidable in general, we define a subclass for which it becomes decidable. Then, we consider reachability when the switches in the communication structures are bounded. We show that this problem is undecidable even for one switch. We define a natural class of models for which this problem is decidable. This class can be used in the definition of an efficient semi-decision procedure for the analysis of the general model of dynamic networks. Our techniques allowed to find bugs in two versions of a Windows NT Bluetooth driver. 相似文献

9.

多路并行动态测试虚拟仪器的设计

陈栋《微计算机信息》2009,25(13)

动态测试仪器是大型复杂结构动态性能测试的必需仪器,它的功能较多,实现难度较大.本文重点论述应用虚拟仪器技术来设计多路并行动态测试虚拟仪器,然后对动态测试仪器的性能进行测试,并将测试分析的结果与动态信号分析仪HP35670的结果进行比较.结果表明,该仪器具有多通道、成本低、测试精度高等优点. 相似文献

10.

Performance of parallel computations with dynamic processor allocation

Saeed Iqbal Graham F. Carey 《Engineering with Computers》2008,24(2):135-143

In parallel adaptive mesh refinement (AMR) computations the problem size can vary significantly during a simulation. The goal here is to explore the performance implications of dynamically varying the number of processors proportional to the problem size during simulation. An emulator has been developed to assess the effects of this approach on parallel communication, parallel runtime and resource consumption. The computation and communication models used in the emulator are described in detail. Results using the emulator with different AMR strategies are described for a test case. Results show for the test case, varying the number of processors, on average, reduces the total parallel communications overhead from 16 to 19% and improves parallel runtime time from 4 to 8%. These results also show that on average resource utilization improves more than 37%. 相似文献

11.

一种有效的并行数据库动态负载平衡连接算法 总被引：1，自引：0，他引：1

关心欧增桂王玲《计算机工程与应用》2007,43(12):150-154

在基于Shared-nothing结构的并行数据库中,负载平衡一直是影响查询处理性能的重要因素。在数据库中频繁使用的连接操作会因为各种因素导致的负载倾斜和额外的通讯开销而降低数据库的整体性能。提出了一种基于RCMD分布方法的动态负载平衡连接算法,能够在连接操作的执行过程中动态调整各个结点的负载。理论分析和实验结果证明提出的算法能够有效地平衡负载,提高并行数据库的执行效率。相似文献

12.

Fully dynamic metric access methods based on hyperplane partitioning

Gonzalo Navarro Roberto Uribe-Paredes 《Information Systems》2011

Metric access methods based on hyperplane partitioning have the advantage, compared to the ball partitioning-based ones, that regions do not overlap. The price is less flexibility for controlling the tree shape, especially in the dynamic scenario, that is, upon insertions and deletions of objects. In this paper we introduce a technique called ghost hyperplanes, which enables fully dynamic data structures based on hyperplane partitioning. We apply the technique to Brin's GNAT static index, obtaining a dynamic variant called EGNAT, which in addition we adapt to secondary memory. We show experimentally that the EGNAT is competitive with the M-tree, the baseline for this scenario. We also apply the ghost hyperplane technique to Voronoi trees, obtaining a competitive dynamic structure for main memory. 相似文献

13.

Degeneracy conditions of the dynamic model of parallel robots

Sébastien Briot Georges Pagis Nicolas Bouton Philippe Martinet 《Multibody System Dynamics》2016,37(4):371-412

Despite their well-known advantages in terms of higher intrinsic rigidity, larger payload-to-weight ratio, and higher velocity and acceleration capacities, parallel robots have drawbacks. Among them, the most important one is surely the presence of singularities in the workspace, which divide the workspace into different aspects (each aspect corresponding to one or more assembly modes) and near which the performance is considerably reduced.In order to increase the reachable workspace of parallel robots, a promising solution consists in the definition of optimal trajectories passing through the singularities to change either the leg working modes or the robot assembly modes. Previous works on the field have shown that it is possible to define optimal trajectories that allow the passing through the robot type 2 singularities. Such trajectories must respect a physical criterion that can be obtained through the analysis of the degeneracy conditions of the parallel robot inverse dynamic model.However, the mentioned works were not complete: they lacked a degeneracy condition of the parallel robot inverse dynamic model, which is not due to type 2 singularity anymore, but to a serial singularity. Crossing a serial singularity is appealing as in that case we can change the robot leg working mode and then potentially access to other workspace zones. This absence is due to the fact that the authors used a reduced dynamic model, which was not taking into account all link dynamic parameters.The present paper aims to fill this gap by providing a complete study of the degeneracy conditions of the parallel robot dynamic model and by demonstrating that it is possible to cross the type 2, but also serial singularity, by defining trajectories that respect some given criteria obtained from the analysis of the degeneracy of the robot dynamic model. It also aims to demonstrate that the serial singularities have impacts on the robot effort transmission, which is a point that is usually bypassed in the literature. All theoretical developments are validated through simulations and experiments. 相似文献

14.

Identification of dynamic comprehension processes during largescale maintenance

Von Mayrhauser A. Vans A.M. 《IEEE transactions on pattern analysis and machine intelligence》1996,22(6):424-437

We present results of observing professional maintenance engineers working with industrial code at actual maintenance tasks. Protocol analysis is used to explore how code understanding might differ for small versus large scale code. The experiment confirms that cognition processes work at all levels of abstraction simultaneously as programmers build a mental model of the code. Analysis focused on dynamic properties and processes of code understanding. Cognition processes emerged at three levels of aggregation representing lower and higher level strategies of understanding. They show differences in what triggers them and how they achieve their goals. Results are useful for defining information which maintenance engineers need for their work and for documentation and development standards 相似文献

15.

A virtual bus architecture for dynamic parallel processing

Lee K.C. 《Parallel and Distributed Systems, IEEE Transactions on》1993,4(2):121-130

To support parallel processing of data-intensive applications, the interconnection network of a parallel/distributed machine must provide high end-to-end communication bandwidth and handle the bursty and concentrated communication patterns generated by dynamic load balancing and data collection operations. A large-scale interconnection network architecture called a virtual bus is proposed. The virtual bus can scale to terabits-per-second end-to-end communication bandwidth with low queuing delay for nonuniform traffic. A terabit virtual bus architecture can be efficiently implemented for less than 5% of the total cost of an eight-thousand-node system. In addition, the virtual bus has an open system parallel interface that is flexible enough to support up to gigabytes per second data transfer rates, different grades of services, and broadcast operation. Such flexibility makes the virtual bus a plausible open system communication backbone for a broad range of applications 相似文献

16.

Improving matrix-based dynamic programming on massively parallel accelerators

《Information Systems》2017

Dynamic programming techniques are well-established and employed by various practical algorithms, including the edit-distance algorithm or the dynamic time warping algorithm. These algorithms usually operate in an iteration-based manner where new values are computed from values of the previous iteration. The data dependencies enforce synchronization which limits possibilities for internal parallel processing. In this paper, we investigate parallel approaches to processing matrix-based dynamic programming algorithms on modern multicore CPUs, Intel Xeon Phi accelerators, and general purpose GPUs. We address both the problem of computing a single distance on large inputs and the problem of computing a number of distances of smaller inputs simultaneously (e.g., when a similarity query is being resolved). Our proposed solutions yielded significant improvements in performance and achieved speedup of two orders of magnitude when compared to the serial baseline. 相似文献

17.

Optimization techniques for parallel force-decomposition algorithm in molecular dynamic simulations

Ji Wu Shu Bing Wang Jin Zhao Wang 《Computer Physics Communications》2003,154(2):121-130

The efficiency and scalability of traditional parallel force-decomposition (FD) algorithms are not good because of high communication cost which is introduced when skew-symmetric character of force matrix is applied. This paper proposed a new parallel algorithm called UTFBD (Under Triangle Force Block Decomposition), which is based on a new efficient force matrix decomposition strategy. This strategy decomposes only the under triangle force matrix and greatly reduces parallel communication cost, e.g., the communication cost of UTFBD algorithm is only one third of Taylor's FD algorithm. UTFBD algorithm is implemented on Cluster system and applied to solve a physical nucleation problem with 500,000 particles. Numerical results are analyzed and compared among three algorithms, namely, FRI, Taylor's FD and UTFBD. The efficiency of UTFBD on 105 processors is 41.3%, and the efficiencies of FRI and Taylor's FD on 100 processors are 4.3 and 35.2%, respectively. In another words, the efficiency of UTFBD on about 100 processors is 37.0 and 6.1% higher than that of FRI and Taylor's FD, respectively. Results show that UTFBD can increase the efficiency of parallel MD (Molecular Dynamics) simulation to a higher degree and has a better scalability. 相似文献

18.

The dynamic behaviour of parallel programs under process migration

Rosemary Candlin Joseph Phillips 《Concurrency and Computation》1995,7(7):591-613

We have studied the interaction between process-based parallel programs whose characteristics change in various ways at run time and the operation of load-balancing, as implemented by process migration. In order to do this, we propose a simple performance model, whose parameters represent features of the program's execution such as the frequency and regularity of the changes in computational characteristics, and conduct a series of experiments involving simulated executions of synthetic programs with controlled parameter values. From these we can deduce the relative importance of the parameters from the point of view of their influence on performance. We can explain our observations in terms of a simplified stochastic model that relates local changes in load to global behaviour. We show that the dynamics of load-balancing can be represented approximately by a first-order difference equation, and that the distributed process migration algorithm is consistent with a behaviour on the global scale which can be regarded as that of a traditional feedback controller. 相似文献

19.

Application of distributed parallel computing for dynamic visual cryptography

Raimondas Čiegis Vadimas Starikovičius Natalija Tumanova Minvydas Ragulskis 《The Journal of supercomputing》2016,72(11):4204-4220

相似文献

20.

Fully dynamic all pairs shortest paths with real edge weights

《Journal of Computer and System Sciences》2006,72(5):813-837

相似文献