期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

偶尔有空的时候，我总是喜欢在一些大牌厂商里面跑来跑去，因为可以听到一些难得而且有趣的消息。像不久前我又十分偶然的听说ｎＶＩＤＩＡ的Ｇｅｆａｒｅｅ２ＭＸ２００图形芯片快要停产了。如果这个消息是真的，那么由Ｇｅｆｏｒｃｅ２ＭＸ２００抽身离开而挪出来的市场份额将是十分吸引人的。在与ＭＸ２００性能相当的图形芯片中，除了ＡＴＩ的ＲａｄｅｏｎＶＥ之外还有ＳＩＳ３１５，但就目前来说，ＳＩＳ３１５完全无法与ＲａｄｅｏｎＶＥ相争。ＲａｄｅｏｎＶＥ会否凭借这个机会一统低端显卡市场？或者。不久前，Ｕ－ＮＩＫＡ… 相似文献

8.

高性能微处理器的EISA支持系统

郑飞《电脑学习》1993,(1):44-48

相似文献

9.

PC工业标准的发展和ATX标准介绍

《个人电脑》2004,10(4):186-187

在PC发展初期，IBM PC击败苹果PC的主要秘诀就在于标准化的部件，通用性的结构和广泛的兼容性，包括微处理器、内存模组、主机板、硬盘、I/O接口以及键盘、鼠标、机箱都有自己的逻辑和物理标准，只要遵循共有的标准，不同制造商制造的零部件都可以毫无障碍组合在一起。相似文献

10.

正在崛起的低功耗微处理器技术

刘鑫《工业控制计算机》2002,15(11):13-15

本详细讨论了X86及其兼容架构微处理器的低功耗设计技术，探讨了该技术和产品的现状和发展趋势，预见了低功耗微处理器的广泛应用前景。相似文献

11.

Microarchitecture of the Godson-2 Processor 总被引：23，自引：3，他引：23

下载免费PDF全文

Wei-WuHu Fu-XinZhang Zu-SongLi 《计算机科学技术学报》2005,20(2):0-0

The Godson project is the first attempt to design high performance general-purpose microprocessors in China. This paper introduces the microarchitecture of the Godson-2 processor which is a 64-bit, 4-issue, out-of-order execution RISC processor that implements the 64-bit MlPS-like instruction set. The adoption of the aggressive out-of-order execution techniques (such as register mapping, branch prediction, and dynamic scheduling) and cache techniques (such as non-blocking cache, load speculation, dynamic memory disambiguation) helps the Godson-2 processor to achieve high performance even at not so high frequency. The Godson-2 processor has been physically implemented on a 6-metal 0.18μm CMOS technology based on the automatic placing and routing flow with the help of some crafted library cells and macros. The area of the chip is 6,700 micrometers by 6,200 micrometers and the clock cycle at typical corner is 2.3ns. 相似文献

12.

SMA:前瞻性多线程体系结构 总被引：3，自引：1，他引：3

肖刚周兴铭徐明邓鹍《计算机学报》1999,22(6):582-590

提出了一种新的ＩＬＰ处理器体系结构－前瞻性多线程体系的结构,简称ＳＭＡ．它结合了前瞻性执行机制和多线程执行机制,以整个线程为长步进行前瞻性执行,多个线程并行执行并且共享处理器硬件资源,这样,处理器既通过组合每个线程的指令窗口形成一个大的动态指令窗口,开发出程序中更大的ＩＬＰ,又利用多线程执行机制屏蔽各种长延迟操作,达到较高的资源利用率;介绍了ＳＭＡ执行模型,并讨论了ＳＭＡ处理器的实现和其中的关键技相似文献

13.

龙芯2号处理器设计和性能分析 总被引：16，自引：4，他引：16

胡伟武张福新李祖松《计算机研究与发展》2006,43(6):959-966

介绍龙芯2号处理器设计及其性能测试结果．龙芯2号采用四发射超标量超流水结构。片内一级指令和数据高速缓存各64KB，片外二级高速缓存最多可达8MB．为了充分发挥流水线的效率，龙芯2号实现了先进的转移猜测、寄存器重命名、动态调度等乱序执行技术以及非阻塞的Cache访问和load Speculation等动态存储访问机制．龙芯2号处理器采用0．18gm的CMOS工艺实现，在正常电压下的最高工作频率为500MHz，500MHz时的实测功耗为3～5W．龙芯2号单精度峰值浮点运算速度为20亿a／秒，双精度浮点运算速度为10亿a／秒，SPECCPU2000的实测性能是龙芯1号的8～10倍，综合性能已经达到PentiumⅢ的水平．目前芯片样机能流畅运行完整的64位中文Linux操作系统，全功能的Mozilla浏览器、多媒体播放器和OpenOffice办公套件，可以满足绝大多数桌面应用的要求．相似文献

14.

一种基于活跃周期的低端口数低能耗寄存器堆设计

赵雨来李险峰佟冬孙含欣陈杰程旭《计算机学报》2008,31(2):299-308

多端口寄存器堆有助于挖掘指令级和线程级并行性,但同时带来面积、能耗和访问时间的压力.文章面向超标量和SMT处理器,给出了一种方法,即通过增加一个小的活跃值堆(Active Value File,AVF)选择性地保存处于活跃周期(从产生到最后一次使用之间)的物理寄存器值.AVF结构可分担主寄存器堆的访问压力并降低端口数目,实现简单且具有写过滤的特点.在获得较大幅度能耗降低的同时不影响时钟频率且IPC损失较小. 相似文献

15.

Recovery requirements of branch prediction storage structures in the presence of mispredicted-path execution 总被引：1，自引：0，他引：1

Stéphan Jourdan Jared Stark Tse-Hao Hsing Yale N. Patt 《International journal of parallel programming》1997,25(5):363-383

Execution along mispredicted paths may or may not affect the accuracy of subsequent branch predictions if recovery mechanisms are not provided to undo the erroneous information that is acquired by the branch prediction storage structures. In this paper, we study four elements of the Two-Level Branch Predictor: the Branch Target Buffer (BTB), the Branch History Register (BHR), the Pattern History Tables (PHTs), and the Return Address Stack (RAS). For each we determine whether a recovery mechanism is needed, and, if so, show how to design a cost-effective one. Using five benchmarks from the SPECint92 suite, we show that there is no need to provide recovery mechanisms for the BTB and the PHTs, but that performance is degraded by an average of 30% if recovery mechanisms are not provided for the BHR and RAS. 相似文献

16.

一种精确的分支预测微处理器模型 总被引：3，自引：0，他引：3

陈跃跃周兴铭《计算机研究与发展》2003,40(5):741-745

在当今深流水宽发射的微处理器中，为实现高性能，精确的分支预测是不可缺少的关键技术．分支预测失效将浪费大量的时钟周期，无法发挥乱序执行的效能．宽发射微处理器的有效性能同时还依赖指令窗口的大小和指令预取宽度．提出了一种新的更精确的支持分支预测和分支误预测周期损失的微处理器模型．根据指令的执行带宽为指令窗口中可用指令数的平方根统计规律，给出了一个更为精确的描述微处理器取指带宽、分支预测精度、分支误预测周期损失、指令窗口大小和IPC之间关系的算法，并讨论了这些参数的综合权衡以及这些参数对程序IPC的影响．由此可以确定依赖多个微处理器参数的取指带宽阈值和微处理器中几个关键参数的选取．相似文献

17.

A novel architecture for ahead branch prediction

Wenbing JIN Feng SHI Qiugui SONG Yang ZHANG 《Frontiers of Computer Science》2013,7(6):914-923

In theory, branch predictors with more complicated algorithms and larger data structures provide more accurate predictions. Unfortunately, overly large structures and excessively complicated algorithms cannot be implemented because of their long access delay. To date, many strategies have been proposed to balance delay with accuracy, but none has completely solved the issue. The architecture for ahead branch prediction (A²BP) separates traditional predictors into two parts. First is a small table located at the front-end of the pipeline, which makes the prediction brief enough even for some aggressive processors. Second, operations on complicated algorithms and large data structures for accurate predictions are all moved to the back-end of the pipeline. An effective mechanism is introduced for ahead branch prediction in the back-end and small table update in the front. To substantially improve prediction accuracy, an indirect branch prediction algorithm based on branch history and target path (BHTP) is implemented in A²BP. Experiments with the standard performance evaluation corporation (SPEC) benchmarks on gem5/SimpleScalar simulators demonstrate that A²BP improves average performance by 2.92% compared with a commonly used branch target buffer-based predictor. In addition, indirect branch misses with the BHTP algorithm are reduced by an average of 28.98% compared with the traditional algorithm. 相似文献

18.

Worst Case Execution Time Analysis for a Processor with Branch Prediction 总被引：4，自引：0，他引：4

Colin Antoine Puaut Isabelle 《Real-Time Systems》2000,18(2-3):249-274

The fundamental requirement for hard real-time systems is that task deadlines be never missed. As a consequence, knowing tasks worst case execution times (WCET) is crucial for such systems. Taking into account modern architectural features makes it possible to determine tighter WCET bounds than with program analysis that ignores such features. While effects of caches and pipelines on WCET analysis have been extensively studied, to our knowledge the effect of the branch prediction on WCET evaluation has not been studied yet. This paper describes a method for statically bounding the number of timing penalties due to erroneous branch predictions. The proposed method is based on static program analysis and branch target buffer modelling. It consists in collecting information on branch target buffer evolution by considering all possible execution paths of a program. Collected information can then be used to classify control transfer instructions so that their worst case branching cost can be estimated and incorporated into the program WCET. A method is also given to tightly predict the WCET of loops whose number of iterations depend on counter variables of outer loops. Experimental results show that the timing penalty due to wrong branch predictions estimated by the proposed technique is close to the real one, which demonstrates the practical applicability of our method. 相似文献

19.

Modeling Control Speculation for Timing Analysis

Li Xianfeng Mitra Tulika Roychoudhury Abhik 《Real-Time Systems》2005,29(1):27-58

The schedulability analysis of real-time embedded systems requires worst case execution time (WCET) analysis for the individual tasks. Bounding WCET involves not only language-level program path analysis, but also modeling the performance impact of complex micro-architectural features present in modern processors. In this paper, we statically analyze the execution time of embedded software on processors with speculative execution. The speculation of conditional branch outcomes (branch prediction) significantly improves a program's execution time. Thus, accurate modeling of control speculation is important for calculating tight WCET estimates. We present a parameterized framework to model the different branch prediction schemes. We further consider the complex interaction between speculative execution and instruction cache performance, that is, the fact that speculatively executed blocks can generate additional cache hits/misses. We extend our modeling to capture this effect of branch prediction on cache performance. Starting with the control flow graph of a program, our technique uses integer linear programming to estimate the program's WCET. The accuracy of our method is demonstrated by tight estimates obtained on realistic benchmarks. 相似文献

20.

嵌入式微处理器分支预测的设计与实现

陈海民李峥王瑞蛟《计算机应用》2011,31(7):2004-2007

针对五级流水线嵌入式微处理器的特定应用环境,对分支预测技术进行了深入研究,提出了一种新的分支预测方案。该方案兼容带缓存设计,通过扩展指令总线,在取指段提前对分支指令跳转方向和目标地址进行预测,保存可能执行而未执行的指令和地址指针以备分支预测失效时得以恢复,减少了预测失效的代价,同时保证了指令流的正确执行。研究表明,该方案硬件开销小,预测效率高,预测失效代价低。相似文献