首页 | 本学科首页   官方微博 | 高级检索  
     

龙芯2号处理器设计和性能分析
引用本文:胡伟武,张福新,李祖松.龙芯2号处理器设计和性能分析[J].计算机研究与发展,2006,43(6):959-966.
作者姓名:胡伟武  张福新  李祖松
作者单位:中国科学院计算技术研究所计算机系统结构重点实验室,北京,100080
基金项目:国家自然科学基金;高比容电子铝箔的研究开发与应用项目;国家重点基础研究发展计划(973计划);中国科学院基础研究项目;中国科学院知识创新工程项目
摘    要:介绍龙芯2号处理器设计及其性能测试结果.龙芯2号采用四发射超标量超流水结构。片内一级指令和数据高速缓存各64KB,片外二级高速缓存最多可达8MB.为了充分发挥流水线的效率,龙芯2号实现了先进的转移猜测、寄存器重命名、动态调度等乱序执行技术以及非阻塞的Cache访问和load Speculation等动态存储访问机制.龙芯2号处理器采用0.18gm的CMOS工艺实现,在正常电压下的最高工作频率为500MHz,500MHz时的实测功耗为3~5W.龙芯2号单精度峰值浮点运算速度为20亿a/秒,双精度浮点运算速度为10亿a/秒,SPECCPU2000的实测性能是龙芯1号的8~10倍,综合性能已经达到PentiumⅢ的水平.目前芯片样机能流畅运行完整的64位中文Linux操作系统,全功能的Mozilla浏览器、多媒体播放器和OpenOffice办公套件,可以满足绝大多数桌面应用的要求.

关 键 词:超标量流水线  乱序执行  转移猜测  寄存器重命名  动态调度  非阻塞的Cache  load指令猜测执行  性能分析
收稿时间:07 11 2005 12:00AM
修稿时间:2005-07-112006-01-04

Design and Performance Analysis of the Godson-2 Processor
Hu Weiwu,Zhang Fuxin,Li Zusong.Design and Performance Analysis of the Godson-2 Processor[J].Journal of Computer Research and Development,2006,43(6):959-966.
Authors:Hu Weiwu  Zhang Fuxin  Li Zusong
Affiliation:Key Laboratory of Computer System and Architecture, Institute of Computing Technology, Chinese Academy of Sciences,Beijing 100080
Abstract:In this paper, the design and the result of performance analysis of the Godson-2 processor are presented. The Godson-2 implements a 4-way superscalar pipelined architecture, contains two 64KB L1 caches for instruction and data, and supports up to 8MB off-chip L2 cache. To improve the pipeline efficiency, The Godson-2 utilizes out-of-order executing technologies such as advanced branch prediction unit, register renaming and dynamic scheduler, and dynamic memory access mechanism like non-blocking cache and load speculation. The Godson-2 is implemented on 0.18um CMOS technology, with a maximum frequency of 500MHz under normal voltage and consumes 3-5 watts power under that frequency. The Godson-2 can perform one billion double-precision floating-point operations per second (two billion for single-precision), and the overall performance is comparable to Intel Pentium III with similar frequency. Presently a full Linux distribution (Debian) is running well on the Godson-2 prototype machines, including important desktop applications such as Mozilla web browsers, media players and OpenOffice.
Keywords:superscalar pipeline  out-of-order execution  branch prediction  register renaming  dynamical scheduling  non blocking cache  load speculation  performance analysis
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号