首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The use of parallel systems is no longer limited to dedicated clusters as multicore chips are more and more appearing in embedded applications. To meet power, performance and cost targets these systems need to be adaptive. The reconfiguration features of recent FPGAs make new approaches for this type of parallel computing possible: Dynamic reconfiguration at runtime offers an important step to adaptive behavior of systems-on-chip (SoCs). This article analyzes the challenges of such an adaptive SoC. It is shown that many of the requirements for an adaptive FPGA-realization are met by the SDVM, the scalable dataflow-driven virtual machine which has been successfully implemented and tested on a cluster of workstations. The SDVM has evolved to a virtualization layer for multicore-FPGAs, now called SDVM R . This virtualization layer allows a transparent runtime-reconfiguration of the underlying hardware to adapt to the changing system environment. Results for a basic application for both systems are presented.  相似文献   

2.
优先队列广泛地使用在许多并行算法中(例如,多处理机调度和某些组合优化算法)。在这些算法中,共享优先队列的存取冲突限制了加速比的提高。本文提出一种链表优先队列的并行插入和删除方法,具有较小并行开销和较大的并行度,并且保证和串行存取算法的优先顺序完全一致,即删除操作返回已经插入和正在插入的所有元素中的最佳元素。同时,我们还介绍了目前性能最好的堆的并行插入和删除算法,并对准和链表结构并行插入和删除算法的性能和适用范围进行了比较,进一步提出了散列结构的优先队列。在ENCORE Multimax520多处理机上的实验结果验证了我们的理论分析结果:使用链表结构的并行分枝限界算法性能上可获得很大提高。  相似文献   

3.
We present simple randomized algorithms for parallel priority queues on distributed memory machines. Inserting O(n) elements or deleting the O(n) out ofmsmallest elements usingnprocessors requires O(Tcoll+ log(m/n)) amortized time with high probability whereTcollbounds the time for performing prefix sums and randomized routing. The memory requirement is bounded by (m/n)(1 +o(1)) + O(logn) whp. These bounds are an improvement over the best previously known algorithms for many interconnection networks and even matches the speed of the best known PRAM algorithms. Generalizations for accessing theknsmallest elements are even more efficient. A portable implementation using MPI demonstrates that our approach is already useful for medium scale parallelism. Two parallel selection algorithms for randomly placed data are a spin-off. One runs in time O(Tcoll) with high probability, beating a lower bound for the worst case. The other requires only a single reduction operation.  相似文献   

4.
传统自旋锁具有无序竞争的特点,本文对自旋锁机制进行了研究,通过引入优先级和等待次数阈值提出可调优先级自旋锁,保证高优先级任务尽可能多地获得锁,低优先级任务经过一段等待时间后调整优先级、增加获取锁的机会,实现了可调优先级自旋锁并通过实验进行了验证.测试结果表明,可调优先级自旋锁既能够减少传统自旋锁的时间开销,又能保证高优先级处理器核锁申请较快得到响应,验证了可调优先级自旋锁在多核系统中是可用的.  相似文献   

5.
分布式主动实时数据库基于多优先级队列的优先级分派   总被引:1,自引:0,他引:1  
事务的分布性和事务触发的不可预测性使得分布式主动实时数据库系统中的事务调度变得较为复杂.该文给出了立即和推迟两种模式下执行的被触发事务的截止期确定方法和事务紧急度计算方法,并结合事务的应用语又提出了一种基于多优先级队列的优先级分派策略,有利于主动事务和被触发事务的顺利提交.  相似文献   

6.
基于业务的多优先级队列区别服务方案   总被引:5,自引:0,他引:5  
缓冲队列的管理和调度是有关互连网服务质量的重要话题。鉴于时延是QoS(QualityofService)的一个重要指标,论文提出并详细讨论了一种基于EDF(EarliestDeadlineFirst)的优先级队列PQBEDF(priorityqueuebasedonEDF),针对不同的业务类型采用各自的PQBEDF,这样可以使得“区别”更加合理,且便于各队列采用不同的控制策略。配以异步时分机制,可以对每一类业务的信元保证一个最小的服务速率。  相似文献   

7.
一种基于流量预测的多优先级队列缓存管理算法   总被引:1,自引:0,他引:1  
刘殿兴  薛建生 《计算机工程》2006,32(10):116-118
在引入优先级因素的流量预测模型基础上,提出了一种多优先级队列的缓存管理算法,该算法通过固定时刻的缓存预分配,结合时划间隔中的缓存竞争,使缓存的分配更加合理而有预见性,仿真实输表明它能够明最降低高优先级数据包的丢包率,同时兼顾低优先级数据包的转发,并且不产生额外的延迟,表明该算法有很好的应用价值。  相似文献   

8.
We present techniques to parallelize membership tests for Deterministic Finite Automata (DFAs). Our method searches arbitrary regular expressions by matching multiple bytes in parallel using speculation. We partition the input string into chunks, match chunks in parallel, and combine the matching results. Our parallel matching algorithm exploits structural DFA properties to minimize the speculative overhead. Unlike previous approaches, our speculation is failure-free, i.e., (1) sequential semantics are maintained, and (2) speed-downs are avoided altogether. On architectures with a SIMD gather-operation for indexed memory loads, our matching operation is fully vectorized. The proposed load-balancing scheme uses an off-line profiling step to determine the matching capacity of each participating processor. Based on matching capacities, DFA matches are load-balanced on inhomogeneous parallel architectures such as cloud computing environments. We evaluated our speculative DFA membership test for a representative set of benchmarks from the Perl-compatible Regular Expression (PCRE) library and the PROSITE protein database. Evaluation was conducted on a 4 CPU (40 cores) shared-memory node of the Intel Academic Program Manycore Testing Lab (Intel MTL), on the Intel AVX2 SDE simulator for 8-way fully vectorized SIMD execution, and on a 20-node (288 cores) cluster on the Amazon EC2 computing cloud. Obtained speedups are on the order of $\mathcal O \left( 1+\frac{|P|-1}{|Q|\cdot \gamma }\right) $ , where $|P|$ denotes the number of processors or SIMD units, $|Q|$ denotes the number of DFA states, and $0<\gamma \le 1$ represents a statically computed DFA property. For all observed cases, we found that $0.02<\gamma <0.47$ . Actual speedups range from 2.3 $\times $ to 38.8 $\times $ for up to 512 DFA states for PCRE, and between 1.3 $\times $ and 19.9 $\times $ for up to 1,288 DFA states for PROSITE on a 40-core MTL node. Speedups on the EC2 computing cloud range from 5.0 $\times $ to 65.8 $\times $ for PCRE, and from 5.0 $\times $ to 138.5 $\times $ for PROSITE. Speedups of our C-based DFA matcher over the Perl-based ScanProsite scan tool range from 559.3 $\times $ to 15079.7 $\times $ on a 40-core MTL node. We show the scalability of our approach for input-sizes of up to 10 GB.  相似文献   

9.
10.
In recent years, inspired by the emerging Web services standard and peer-to-peer technology, a new federated service providing (FSP) system paradigm has attracted increasing research interests. Many existing systems have either explicitly or implicitly followed this paradigm. Instead of exchanging files, peers in FSP systems share their computation resources in order to offer domain-specific services. In this paper, we focused on the coordination problem of how to self-organize the service group structures in response to the varying service demand. We presented our solution in the form of a coordination mechanism, which includes a labor-market model, a recruiting protocol, and a policy-driven decision architecture. Peers make their service providing decisions based on their local policies, which can be added, removed, or modified by users. A general methodology is introduced in this paper to facilitate policy design. Specifically, a heuristic inspired by the extremal optimization technique is utilized to handle potential inconsistencies among policies. A stimulus-response mechanism was further applied to make the decision process adjustable. Experiments under five application scenarios verified our ideas and demonstrated the effectiveness of our coordination mechanism.  相似文献   

11.
一个组织间松散耦合跨组织工作流的仿真模型   总被引:5,自引:0,他引:5  
程绍武  徐晓飞  王刚  李全龙 《软件学报》2006,17(12):2461-2470
为了解决松散耦合跨组织工作流的仿真建模问题,以染色Petri网为理论基础,将颜色集、染色函数、资源库所、等待库所、忙库所、开始变迁、结束变迁、角色、组织、时间函数、资源函数和变迁函数引入到IOWF(inter-organizational workflow),提出了染色多维跨组织工作流网CMD/IOWF(colored multi-dimension/IOWF).基于CMD/IOWF,讨论了组织间松散耦合跨组织工作流中多个不同项目工作流实例通过资源共享和活动同步相互耦合以及与仿真相关的资源约束和时间的建模问题.通过定义输入/输出、状态、事件、时间推进函数和状态转移函数,给出了一个组织间松散耦合的多个跨组织工作流的仿真模型.基于该模型的仿真分析可以求解组织间松散耦合跨组织工作流的关键性能指标,包括:对应项目工作流实例的平均执行时间、平均执行成本及其关于组织的分布,各项目的资源利用率.最后,以一个实例验证了提出模型的有效性.  相似文献   

12.
多核处理器已经成为主流,并且被广泛应用于嵌入式设备中.在操作系统如何有效支持多核处理器方面的研究中,目前国内外大多基于常见的紧耦合共享存储架构的多核处理器,而对一些特殊存储架构的多核处理器研究并不多.本文针对内存受限的多级存储架构的多核处理器,提出一种单代码多数据的嵌入式多核操作系统模型.实验表明,该模型应用在具有多级存储架构的八核DSP上,比AMP模型减少约80%的代码空间开销;与SMP模型相比,与实时性紧密相关的时间开销减少约10倍.  相似文献   

13.
Automatic parallelization has thus far not been successful at extracting scalable parallelism from general programs. An aggressive automatic thread extraction framework, coupled with natural extensions to the sequential programming model that allow for a range of legal outcomes rather than forcing programmers to define a single legal program outcome, will let programmers achieve the performance of parallel programming via the simpler sequential model.  相似文献   

14.
张杰  陈相宁  徐超永 《计算机工程》2012,38(20):272-274
已有优先级反转抑制方法的通用性较差,优先级天花板值难于选取.为此,提出一种改进的优先级反转抑制模型.分析现有解决优先级反转方法存在的缺陷,通过为各种稀缺资源单独建立服务队列,将其排除在优先级调度队列的排队模型之外,从而在单处理核的嵌入式系统中形成多服务员处理模型,以降低优先级反转的发生概率.仿真结果表明,该模型的优先级反转概率约为1‰.  相似文献   

15.
一个基于证券市场的计算网格环境下的资源分配模型   总被引:5,自引:0,他引:5  
计算网格(Computational Grid)下对资源进行有效管理和调度是十分具有挑战性的问题.论文中提出了一个证券市场模型来实现计算网格环境下的资源分配.在此模式中,资源作为一种证券商品被自由买卖,用户可以方便快捷地实现对系统资源的存取,模拟实验表明证券市场模型是实现计算网格环境下资源分配的一个有效手段.  相似文献   

16.
Sugawara  Toshiharu  Lesser  Victor 《Machine Learning》1998,33(2-3):129-153
Coordination is an essential technique in cooperative, distributed multiagent systems. However, sophisticated coordination strategies are not always cost-effective in all problem-solving situations. This paper presents a learning method to identify what information will improve coordination in specific problem-solving situations. Learning is accomplished by recording and analyzing traces of inferences after problem solving. The analysis identifies situations where inappropriate coordination strategies caused redundant activities, or the lack of timely execution of important activities, thus degrading system performance. To remedy this problem, situation-specific control rules are created which acquire additional nonlocal information about activities in the agent networks and then select another plan or another scheduling strategy. Examples from a real distributed problem-solving application involving diagnosis of a local area network are described.  相似文献   

17.
实时系统中任务优先级的综合决策模型   总被引:2,自引:0,他引:2  
实时系统设计中,如何准确而合理地确定每一个任务的优先级是能否保证系统实时性要求的关键问题之一。本文基于模糊集合理论提出了一种任务优先级的综合决策模型,并通过实例进行了说明。  相似文献   

18.
基于信誉构建信任机制是解决虚拟计算环境[1]中行为可信问题的重要途径。现有信誉模型对不良行为的动态适应能力和对信誉信息的有效聚合能力不足,缺乏鼓励节点积极参与诚实合作的有效机制。本文提出一种基于惩罚激励机制的信誉管理模型PERep,根据自主元素行为特征和信誉状态区分交易行为的合作与偏离,并对偏离行为进行惩罚;给出了PERep的分布式实现。实验表明,PERep能准确地区分诚实交易和恶意行为,有效提高自主元素诚实交易的积极性并减少恶意行为的危害。  相似文献   

19.
An algorithm for storing information redundantly on the nodes of a broadcast network is proposed. A voting technique is used to increase reliability. Since multiple votes are cast only when copies of a data item disagree, the algorithm has the property that communication overhead is minimal. In addition, nodes storing erroneous copies are automatically resynchronized. A Markov analysis is performed which relates parameters of the algorithm to the mean time to failure.  相似文献   

20.
DTM:一种面向网络计算的动态信任管理模型   总被引:4,自引:0,他引:4  
在开放的互联网中,信任关系的建立是实现跨自治域资源共享与协同的前提.然而,网络计算环境的分布自治特性,使得各类复杂系统中应用不同的信任管理机制,容易导致信任定义的不一致问题,特别是很多系统为支持多域协作,直接假设实体间信任具有传递特性,而在模型中缺乏该性质成立的条件;此外,网络计算环境中的动态演化特性,使得驱动实体间协作的信任关系随需而变,而现有信任管理模型仅仅关注于系统功能结构,缺乏对这种动态性的描述.文中提出了一种动态信任管理模型DTM,基于信念公式形式化定义了主体间的信任公式,并将信任的传递特性(信任链)解释为模型的一条性质.在该模型中,针对信任关系的动态特征,以时间为参量刻画主体公式集,以事件为触发条件刻画主体间信任的变化,并基于正则事件序列描述信任管理的资源授权过程,可刻画主体间信任的建立过程.最终,设计、实现了一个信任管理系统CROWN-TM,并进行了初步实验分析.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号