首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于频率的多核共享Cache替换算法
引用本文:方娟, 李成艳, 王帅, 姚治成. 一种基于频率的多核共享Cache替换算法[J]. 电子与信息学报, 2014, 36(5): 1229-1234. doi: 10.3724/SP.J.1146.2013.01030
作者姓名:方娟  李成艳  王帅  姚治成
作者单位:北京工业大学计算机学院;
基金项目:国家自然科学基金(61202076);北京市教委科技计划面上项目(KM201210005022)资助课题
摘    要:LRU替换算法在单核处理器中得到了广泛应用,而多核环境大都采用多核共享最后一级Cache(LLC)的策略,随着LLC容量和相联度的增加以及多核应用的工作集增大,LRU替换算法和理论最优替换算法之间的差距越来越大。该文提出了一种平均划分下基于频率的多核共享Cache替换算法(ALRU-F)。该算法将当前所需要的部分工作集保留在Cache内,逐出无用块,同时还提出了块粒度动态划分下基于频率的替换算法(BLRU-F)。该文提出的ALRU-F算法相比传统的LRU算法缺失率降低了26.59%, CPU每一时钟周期内所执行的指令数IPC(Instruction Per Clock)则提升了13.59%。在此基础上提出的块粒度动态划分下,基于频率的BLUR-F算法相比较传统的LRU算法性能提高更大,缺失率降低了33.72%,而IPC 则提升了16.59%。提出的两种算法在性能提升的同时,并没有明显地增加能耗。

关 键 词:多核处理器   共享Cache   划分   替换算法
收稿时间:2013-07-16
修稿时间:2013-11-07

A Frequency Based Cache Replacement Algorithm with Partition of CMPs
Fang Juan, Li Cheng-Yan, Wang Shuai, Yao Zhi-Cheng. A Frequency Based Cache Replacement Algorithm with Partition of CMPs[J]. Journal of Electronics & Information Technology, 2014, 36(5): 1229-1234. doi: 10.3724/SP.J.1146.2013.01030
Authors:Fang Juan  Li Cheng-Yan  Wang Shuai  Yao Zhi-Cheng
Abstract:LRU has been widely used in single-core processor, while Chip Multi-Processors (CMP) employ a large Last-Level Cache (LLC) which is shared among the multiple cores. With the increasement of the LLC capacity and associativity, and the grows of working set of multicores applications, the performance gap between the LRU and the theoretical optimal replacement algorithms gets wider and wider. This paper proposes an Average partition LRU algorithm based on Frequency (ALRU-F). The algorithm has maintained the working set at Cache and drive out the ignore block. Also, a Cache line stealing strategy is proposed to realize a Block partition LRU replacement algorithm based on Frequency (BLRU-F). The result of experiments shows that comparing to the traditional LRU algorithm, the proposed ALRU-F algorithm reduces the miss rate by 26.59%, and improves the Instruction Per Clock (IPC) by 13.59 % with little change of power consumption. Comparing to the traditional LRU and BLRU-F algorithms, the proposed algorithm reduces the Cache miss rate by 33.72% and improves the IPC by 16.59%.
Keywords:Chip Multi-Processors (CMP)  Shared Cache  Partition  Replacement algorithm
本文献已被 CNKI 等数据库收录!
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号