首页 | 本学科首页   官方微博 | 高级检索  
     

一种CMP结构上的事务存储编程模型设计
引用本文:陈嘉,安虹,刘圆,王莉. 一种CMP结构上的事务存储编程模型设计[J]. 计算机仿真, 2007, 24(6): 81-85
作者姓名:陈嘉  安虹  刘圆  王莉
作者单位:中国科学技术大学,计算机科学技术系,安徽,合肥,230027;中国科学技术大学,计算机科学技术系,安徽,合肥,230027;中国科学院计算技术研究所计算机系统结构重点实验室,北京,100080
基金项目:国家自然科学基金 , 安徽省自然科学基金 , Intel高等教育项目 , 中国科学院重点实验室基金
摘    要:多核结构上采用由用户显式制导的并行程序设计模型,使用锁和同步变量来实现同步.事务存储模型能够解决由锁机制带来的一系列问题,提高程序的并发性.介绍了在文中提出的一种基于事务存储模型的多核结构(Transactional-Memory based Chip Multiple-Superscaler,TMCMS)上的并行编程模型,以及针对循环程序的执行模型;以FFT程序为例具体介绍了循环结构的并行化方法和编译转换过程.在初步的实验中,将处理单元从1增加到16个时,在所设计的编程模型的支持下,IPC(Instruction Per Cycle)有接近线性的增长,说明该并行编程模型能够充分发掘程序中潜在的细粒度线程级并行性,同时保持并行程序设计的简单性.

关 键 词:多核芯片结构  并行程序设计模型  事务存储
文章编号:1006-9348(2007)06-0081-05
修稿时间:2006-05-252006-05-26

A Programming Model for Chip Multi-Processor Based on Transactional Memory
CHEN Jia,AN Hong,LIU Yuan,WANG Li. A Programming Model for Chip Multi-Processor Based on Transactional Memory[J]. Computer Simulation, 2007, 24(6): 81-85
Authors:CHEN Jia  AN Hong  LIU Yuan  WANG Li
Affiliation:1. Department of Computer Science and Technology, University of Science and Technology of China, Hefei Anhui 230027, China; 2. Key Laboratory of Computer System and Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080,China
Abstract:Traditional parallel programming models have many limitations when applied to multi-core, or programmers have to explicitly identify synchronization on Multiprocessor with locks and synchronous variables. Transactional Memory can solve problems brought by locks and improve parallelism. This paper introduces a programming model on TMCMS (Transactional-Memory based Chip Multiple-Superscaler) which is a CMP architecture based on Transactional-Memory model, and its execution model for loop-based program. FFT program is used as an example to illustrate the method of loop structure paralleling and to describe compiler-directed translation in detail. The performance of the model is evaluated by varying the processor unit size from 1 to 16, and the IPC (instruction per cycle) increases nearly linearly with the number of processor unit. The preliminary experiment results indicate that this programming model can sufficiently exploit fine-grained parallelism in the program while maintaining the simplicity of parallel programming.
Keywords:Chip multiprocessor architecture   Parallel programming model    Transactional memory
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号