共查询到20条相似文献,搜索用时 78 毫秒
1.
2.
3.
4.
基于Altera FPGA的软硬件协同仿真 总被引:3,自引:0,他引:3
简要介绍了软硬件协同仿真技术,指出了在大规模FPGA开发中软硬件协同仿真的重要性和必要性,给出基于A1tera FPGA的门级软硬件协同仿真实例。 相似文献
5.
随着集成电路工艺的飞速发展,传统的设计方法已不能满足设计高集成度的复杂数字系统的要求。软硬件协同设计成为嵌入式系统设计的新方法。SystemC是一种兼容C++的系统建模语言,它同时支持RTL级、行为级和系统级描述,使其成为软硬件协同设计平台的基础。 相似文献
6.
7.
文章介绍了软硬件协同验证方法学及其验证流程。在软件方面,采用了一套完整的软件编译调试仿真工具链,它包括处理器的仿真虚拟原型和基本的汇编、链接、调试器;在硬件方面,对软件调试好的应用程序进行RTL仿真、综合,并最终在SoC设计的硬件映像加速器(FPGA)上实现并验证。 相似文献
8.
软硬件协同设计的目标结构包括一个CPU和多个ASIC,它们通过一条总线进行通信.本文介绍一种用于多目标、多模式系统综合的协同设计的新方法.各个工作模式具有不同的运行概率.在满足设计约束的条件下,我们应用遗传算法对系统的速度和功耗两个目标进行优化.遗传算法是全局算法,它能避免陷入局部最小. 相似文献
9.
本文首先介绍了SOC软硬件协同验证方法及其平台SeamlessCVE的工作原理和流程,进而在此基础上搭建一个高效的USBA-device验证平台,以实现USB高速A-device软硬件的并行设计,并详细介绍了USBA-device的工作原理和验证流程。 相似文献
10.
基于JTAG的SoC软硬件协同验证平台设计 总被引:1,自引:1,他引:1
基于JTAG接口,提出了一种以FPGA为基础的SoC软硬件协同验证平台.在验证平台的硬件基础上,开发了调试验证软件,能够完成SRAM的读写、CF卡的读写、串口的收发、程序的下载、及程序复位等功能.利用验证平台的软硬件完成了SoC的IP模块的调试验证及操作系统μClinux的调试验证.实践表明,该验证平台有益于SoC的设计和调试,降低SoC应用系统的开发成本. 相似文献
11.
为了能够充分、快速验证USB2.0主控器的功能,设计了一个软硬件协同仿真平台。其中,CPU模型部分采用一种高效的SystemC模型,而不使用基于指令集的复杂CPU模型。测试用例采用抽象层次更高的C语言编写,通过调用仿真平台对外提供的API完成激励生成与响应检查。结果表明,该方式能够有效降低对仿真资源的占用,减少仿真时间;同时使软件人员能在IP的硬件验证阶段就能完成软件的设计测试工作,缩短软硬件接口整合时间,加快开发进度。 相似文献
12.
为了提高产品的验证覆盖率和产品的首次成功率,验证工程师越来越多的使用固件、硬件诊断程序和其它软件部分作为实际嵌入式处理器的SoC验证的激励,以保证RTL设计与最终设计实现的的应用环境相同,并覆盖更为复杂的场景,但该RTL验证环境对软件调试的可视性比较有限。Mentor公司的Questa Codelink提供了独特的软硬件协同验证的技术可以让验证人员同时看到软件的执行情况和与软件同步的硬件波形,其回放模式减少了仿真等待的时间,可以快速追踪并定位到程序出错的地方。Codelink也提供了多核调试的技术,可同时看到软件在不同处理器的执行情况,极大地提高了多核验证的效率。 相似文献
13.
14.
An embedded system is called a multi-mode embedded system if it performs multiple applications by dynamically reconfiguring the system functionality. Further, the
embedded system is called a multi-mode multi-task embedded system if it additionally supports multiple tasks to be executed in a mode. In this paper, we address an important
HW/SW partitioning problem, that is, HW/SW partitioning of multi-mode multi-task embedded applications with timing constraints
of tasks. The objective of the optimization problem is to find a minimal total system cost of allocation/mapping of processing
resources to functional modules in tasks together with a schedule that satisfies the timing constraints. The key success of
solving the problem is closely related to the degree of the amount of utilization of the potential parallelism among the executions
of modules. However, due to an inherently excessively large search space of the parallelism, and to make the task of schedulability
analysis easy, the prior HW/SW partitioning methods have not been able to fully exploit the potential parallel execution of
modules. To overcome the limitation, we propose a set of comprehensive HW/SW partitioning techniques which solve the three
subproblems of the partitioning problem simultaneously: (1) allocation of processing resources, (2) mapping the processing resources to the modules in tasks, and (3) determining
an execution schedule of modules. Specifically, based on a precise measurement on the parallel execution and schedulability
of modules, we develop a stepwise refinement partitioning technique for single-mode multi-task applications, which aims to solve the subproblems 1, 2 and 3 effectively in an integrated fashion. The proposed
techniques is then extended to solve the HW/SW partitioning problem of multi-mode multi-task applications (i.e., to find a globally optimized allocation/mapping of processing resources with feasible execution schedule of modules). From experiments with
a set of real-life applications, it is shown that the proposed techniques are able to reduce the implementation cost by 19.0
and 17.0% for single- and multi-mode multi-task applications over that by the conventional method, respectively. 相似文献
15.
Ahmed Patrice Fahmi Patrice Nouri Herve 《AEUE-International Journal of Electronics and Communications》2007,61(9):605-620
In this paper, we present an efficient HW/SW codesign architecture for H.263 video encoder and its FPGA implementation. Each module of the encoder is investigated to find which approach between HW and SW is better to achieve real-time processing speed as well as flexibility. The hardware portions include the Discrete Cosine Transform (DCT), inverse DCT (IDCT), quantization (Q) and inverse quantization (IQ). Remaining parts were realized in software executed by the NIOS II softcore processor. This paper also introduces efficient design methods for HW and SW modules. In hardware, an efficient architecture for the 2-D DCT/IDCT is suggested to reduce the chip size. A NIOS II Custom instruction logic is used to implement Q/IQ. Software optimization technique is also explored by using the fast block-matching algorithm for motion estimation (ME). The whole design is described in VHDL language, verified in simulations and implemented in Stratix II EP2S60 FPGA. Finally, the encoder has been tested on the Altera NIOS II development board and can work up to 120 MHz. Implementation results show that when HW/SW codesign is used, a 15.8-16.5 times improvement in coding speed is obtained compared to the software based solution. 相似文献
16.
17.
对系统级设计中的硬软件分割问题建立数学模型.将系统级设计工具提供的性能分析功能与模拟退火算法相结合,设计了一种软件导引的模拟退火算法。该实现中将嵌入式系统性能指标(任务的执行时间)作为约束,将实现代价以及功耗作为硬软件分割方法优化的目标。在保证系统的设计目标(任务执行时间)满足要求的基础上。通过选择系统中各模块实现代价和功耗较小的实现方法.进而优化整个系统的实现代价和功耗.彻底改变了以往航天应用的嵌入式系统设计中依赖人工经验进行硬软件划分下现状。 相似文献
18.
19.
This paper presents a new methodology of automatic RTL code generation from coarse-grain dataflow specification for fast HW/SW
cosynthesis. A node in a coarse-grain dataflow specification represents a functional block such as FIR and DCT and an arc
may deliver multiple data samples per block invocation, which complicates the problem and distinguishes it from behavioral
synthesis problem. Given optimized HW library blocks for dataflow nodes, we aim to generate the RTL codes for the entire hardware
system including glue logics such as buffer and MUX, and the central controller. In the proposed design methodology, a dataflow
graph can be mapped to various hardware structures by changing the resource allocation and schedule information. It simplifies
the management of the area/performance tradeoff in hardware design and widens the design space of hardware implementation
of a dataflow graph. We also support Fractional Rate Dataflow (FRDF) specification for more efficient hardware implementation.
To overcome the additional hardware area overhead in the synthesized architecture, we propose two techniques reducing buffer
overhead. Through experiments with some real examples, the usefulness of the proposed technique is demonstrated.
相似文献
Soonhoi Ha (Corresponding author)Email: |