16 similar documents found (search time: 406 ms)
1.
2.
3.
4.
5.
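Imagine is a stream-architecture chip designed for media processing, and whether it is suited to scientific computing is currently a hotly debated question internationally. This paper studies the efficient mapping of a typical scientific computing kernel, matrix-vector multiplication, onto the Imagine processor. It develops four different versions of the stream program and proposes optimization methods. Experimental results show that the optimized matrix-vector multiplication program can exploit the potential of the Imagine processor.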
6.
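Stanford University's Imagine stream processor offers very high computing power, and how to apply this architecture to scientific computing is a current research focus. Iterative methods for solving systems of linear equations are widely used across engineering and scientific computing; they are compute-intensive and highly parallel, which makes them well suited to the computational model of stream processors. This paper describes the mapping of Jacobi and Seidel iterations onto a stream processor for coefficient matrices of different sizes and densities. Experimental results show that the iterative algorithms exploit Imagine's computing power efficiently and achieve high speedups.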
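To illustrate why Jacobi iteration is called highly parallel, the following is a minimal plain-Python sketch, not the stream program from the paper. The function name `jacobi`, the fixed iteration count, and the assumption of a strictly diagonally dominant matrix are all illustrative choices rather than details from the abstract.

```python
def jacobi(a, b, iterations=50):
    """Approximate the solution of a x = b with Jacobi iteration.

    Assumes (for this sketch) that `a` is a square, strictly diagonally
    dominant matrix given as a list of rows, which is a sufficient
    condition for the iteration to converge.
    """
    n = len(b)
    x = [0.0] * n
    for _ in range(iterations):
        # Each component of the new iterate reads only the previous iterate,
        # so the n updates are independent and could run in parallel.
        x = [
            (b[i] - sum(a[i][j] * x[j] for j in range(n) if j != i)) / a[i][i]
            for i in range(n)
        ]
    return x
```

For example, `jacobi([[4.0, 1.0], [2.0, 5.0]], [6.0, 12.0])` converges towards the exact solution `x = [1, 2]`. A Seidel-style variant would instead reuse already-updated components within a sweep, which introduces dependences between the updates.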
7.
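Based on the Imagine architecture, this paper proposes a locality optimization method for scientific programs that aims to improve the bandwidth utilization of stream programs while preserving Imagine's high computing power. The key technique is to exploit the architecture's strengths through computation and data transformations of loops. Experiments on four typical scientific programs show that the optimization effectively increases compute intensity and reduces index streams, thereby improving program locality.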
8.
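As computing systems in the Internet environment keep growing in scale, the reliability of distributed stream architectures faces serious challenges. Building on N-modular redundancy fault tolerance, this paper proposes TREFT, a multi-replica proactive fault-tolerance technique for distributed stream architectures that targets soft errors and uses three program replicas for efficient error detection and correction. Experimental results on a prototype distributed stream architecture system show that the technique effectively improves system reliability at a low cost, adding 10.77% fault-tolerance overhead on average.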
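The general idea of detecting and correcting an error with three replicas can be sketched as a majority vote over replica outputs. This is a generic illustration of modular redundancy, not the TREFT mechanism itself; the function name `vote` and its return shape are hypothetical.

```python
from collections import Counter


def vote(results):
    """Majority-vote over the outputs of redundant replicas.

    Returns (agreed_value, faulty_indices). Raises ValueError when no value
    is produced by a strict majority of replicas. Outputs must be hashable.
    """
    value, count = Counter(results).most_common(1)[0]
    if count * 2 <= len(results):
        raise ValueError("no majority among replicas")
    # Replicas that disagree with the majority are flagged as faulty.
    faulty = [i for i, r in enumerate(results) if r != value]
    return value, faulty
```

With three replicas, a single corrupted output is both detected (its index is reported) and corrected (the majority value is returned), while three mutually different outputs are reported as uncorrectable.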
9.
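Stream architectures have attracted considerable attention among emerging architectures. Given the wide variety of existing stream architectures, this paper proposes an abstract model of stream architecture. This work is significant for the portability of applications across multiple stream architectures.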
10.
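Given the excellent performance of the Imagine stream processor in media processing, we map the time-consuming motion search algorithm of H.264 video encoding onto Imagine, using stream processing to fully exploit its parallelism and improve encoding performance. Encoding of QCIF images reaches 372 fps, a clear speedup. This demonstrates that H.264 is suitable for implementation on Imagine.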
11.
HAN Di 《数字社区&智能家居》2008,(15)
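The Internet is changing our lives: more and more people watch television programs online, which gives them a wider choice of channels. Against this background, P2P-based streaming media players keep emerging. This paper focuses on P2P streaming media technology, including streaming transport protocols and real-time delivery, and closes with a brief introduction to the implementation architecture of streaming media playback.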
12.
Haitao Wei Mingkang Qin Weiwei Zhang Junqing Yu Dongrui Fan Guang R. Gao 《Journal of Parallel and Distributed Computing》2013
Tiled multi-core architectures have become an important kind of multi-core design because of their good scalability and low power consumption. Stream programming has been productively applied to a number of important application domains and provides an attractive way to exploit parallelism. However, the architectural characteristics of tiled multi-cores (large numbers of cores, a memory hierarchy, and exposed communication between tiles) present a performance challenge for stream programs. In this paper, we present StreamTMC, an efficient stream compilation framework that optimizes the execution of stream applications for the tiled multi-core. The framework is composed of three optimization phases. First, a software pipelining schedule is constructed to exploit the parallelism. Second, a hybrid SPM and cache buffer allocation algorithm and a data copy elimination mechanism are proposed to improve the efficiency of data access. Last, a communication-aware mapping is proposed to reduce network communication and synchronization overhead. We implement the StreamTMC compiler on Godson-T, a 64-core tiled architecture, and conduct an experimental study to verify its effectiveness. The experimental results indicate that StreamTMC achieves an average 58% improvement over the performance before optimization.
13.
Moustafa A. Hammad Walid G. Aref Ahmed K. Elmagarmid 《The VLDB Journal The International Journal on Very Large Data Bases》2008,17(3):469-488
This paper introduces a class of join algorithms, termed W-join, for joining multiple infinite data streams. W-join addresses
the infinite nature of the data streams by joining stream data items that lie within a sliding window and that match a certain
join condition. In addition to its general applicability in stream query processing, W-join can be used to track the motion
of a moving object or detect the propagation of clouds of hazardous material or pollution spills over time in a sensor network
environment. We describe two new algorithms for W-join and address variations and local/global optimizations related to specifying
the nature of the window constraints to fulfill the posed queries. The performance of the proposed algorithms is studied experimentally
in a prototype stream database system, using synthetic data streams and real time-series data. Tradeoffs of the proposed algorithms
and their advantages and disadvantages are highlighted, given variations in the aggregate arrival rates of the input data
streams and the desired response times per query.
This is an extended version of the paper published in the Proceedings of the 15th International Conference on Scientific and
Statistical Database Management, SSDBM 2003, Boston, U.S.A., pp. 75–84.
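The core idea of joining stream items that lie within a sliding window can be sketched as follows. This is a simplified two-stream, equality-key illustration, not either of the W-join algorithms from the paper (which handle multiple streams and more general window constraints); the function name `window_join` and the tuple layout are assumptions made for the example.

```python
from collections import deque


def window_join(events, window):
    """Join two streams on equal keys within a sliding time window.

    `events` is an iterable of (timestamp, stream_id, key) tuples in
    non-decreasing timestamp order, with stream_id 0 or 1. Yields
    (key, ts_stream0, ts_stream1) for every pair of tuples from different
    streams whose keys match and whose timestamps differ by at most `window`.
    """
    buffers = (deque(), deque())  # recent (timestamp, key) tuples per stream
    for ts, sid, key in events:
        # Expire tuples that can no longer match this or any later arrival.
        for buf in buffers:
            while buf and ts - buf[0][0] > window:
                buf.popleft()
        # Probe the other stream's window for matching keys.
        for other_ts, other_key in buffers[1 - sid]:
            if other_key == key:
                yield (key, ts, other_ts) if sid == 0 else (key, other_ts, ts)
        buffers[sid].append((ts, key))
```

Because expired tuples are dropped on every arrival, the buffers hold only the tuples inside the current window, which is what makes a join over unbounded streams feasible.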
14.
Khailany B. Dally W.J. Kapasi U.J. Mattson P. Namkoong J. Owens J.D. Towles B. Chang A. Rixner S. 《Micro, IEEE》2001,21(2):35-46
The power-efficient Imagine stream processor achieves performance densities comparable to those of special-purpose embedded processors. Executing programs mapped to streams and kernels, a single Imagine processor is expected to have a peak performance of 20 GFLOPS and sustain 18.3 GOPS on MPEG-2 encoding.
15.
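A local buffering server is used to improve stream transmission quality. Streaming media is applied in an open network-based English teaching system to provide QoS management, solve the audio/video stream buffering problem, and provide mechanisms that support streaming media QoS in a networked environment. Experimental results show that the stream architecture achieves good streaming media playback in the network teaching environment and guarantees the QoS of audio and video streams. The architecture also enables effective management and control of streams, thereby ensuring the transmission quality of multimedia courseware.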