期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

赵衍娟张艳关博《传感器世界》2010,16(9):26-28

介绍了一种基于S3C2440硬件平台和嵌八式Linux操作系统的远程视频传输系统的总体设计方案,阐述了系统的总体结构和各部分的实现并给出关键功能的软件买现方法。在系统中的视频处理采用MPEG-4压缩算法。该系统实现了802．11无线局域网的视频传输,与传统的视频监控系统比较,该方案具有体积小、成本低、稳定可靠等优点。相似文献

2.

孪生网络跟踪算法并行计算结构研究

卢金仪唐维伟徐文辉颜露新钟胜邹旭《测控技术》2021,40(3):39-45

基于嵌入式平台的复杂背景目标跟踪技术在智能视频监控设备、无人机跟踪等领域有重要作用.卷积神经网络在跟踪问题上有准确率高、鲁棒性强的优点,但基于卷积特征的算法计算复杂度高,受嵌入式平台面积和功耗的限制,实时性难以满足嵌入式平台应用场景的需求.针对基于卷积特征的跟踪算法计算复杂度高、存储参数量大的难题,率先提出一种利用FPGA实现基于卷积神经网络的复杂背景目标跟踪硬件加速架构.该方法通过利用KL相对熵对目标跟踪算法Siamese-FC进行定点量化,设计了基于通道并行的卷积层加速架构.实验结果表明,定点量化后跟踪算法相比于原算法的平均精度损失不超过4.57％,FPGA部署后前向推理耗时仅为CPU的16.15％,功耗仅为CPU的13.7％. 相似文献

3.

基于ZYNQ的优化Adaboost人脸检测

下载免费PDF全文

高树静王程龙董廷坤《计算机工程与应用》2020,56(6):201-206

针对目前大多数嵌入式人脸检测系统实时性差的问题,通过优化的人脸检测算法和软硬件协同处理方式达到加速人脸检测的目的。基于ZYNQ SoC架构下,利用YCbCr肤色空间算法在FPGA部分加速提取肤色区域,利用优化的Adaboost算法与Phash算法在双核ARM中完成人脸检测与追踪,输出检测到的人脸。实验表明,提出的优化人脸检测算法相比传统的Adaboost人脸检测算法更具实时性,并且通过合理的软硬件协同处理也可以加快人脸检测速率,同时减少系统硬件资源消耗量从而降低成本。相似文献

4.

Approaching the accuracy–cost conflict in embedded classification system design

Ulf Jensen Patrick Kugler Matthias Ring Bjoern M. Eskofier 《Pattern Analysis & Applications》2016,19(3):839-855

Smart embedded systems often run sophisticated pattern recognition algorithms and are found in many areas like automotive, sports and medicine. The developer of such a system is often confronted with the accuracy–cost conflict as the resulting system should be as accurate as possible while being able to run on resource constraint hardware. This article introduces a method to support the solution of this design conflict with accuracy–cost reports. These reports compare classification systems regarding their classification rate (accuracy) and the mathematical operations and parameters of the working phase (cost). Our method is used to deduce the specific cost of various popular pattern recognition algorithms and to derive the overall cost of a classification system. We also show how our analysis can be used to estimate the computational cost for specific hardware architectures. A software toolbox to create accuracy–cost reports was implemented to facilitate the automatic classification system comparison with the presented methodology. The software is available for download and as supplementary material. We performed different experiments on synthetic and real-world data to underline the value of this analysis. Accurate and computationally cheap classification systems were easily identified. We were even able to find a better implementation candidate in an existing embedded classification problem. This work is the first step towards a comprehensive support tool for the design of embedded classification systems. 相似文献

5.

Algorithmic aspects for multiple-choice hardware/software partitioning

Jigang Wu Qiqiang Sun Thambipillai Srikanthan 《Computers & Operations Research》2012

Hardware–software partitioning (HW/SW) divides an application into software and hardware. It is one of the crucial steps in embedded system design. For a given task, hardware with different areas may provide different execution speeds due to the potential of parallel execution in hardware implementation. Thus, one task may have multiple-choice in hardware implementation according to the available hardware areas. Existing HW/SW partitioning approaches typically consider only a single implementation manner in hardware, overlooking the multiple-choice of hardware implementations. This paper presents a computing model to cater for the HW/SW partitioning problems with the multiple-choice implementation in hardware. An efficient heuristic algorithm is proposed to rapidly generate approximate solution, that is further refined by a tabu search algorithm also customized in this paper. Moreover, a dynamic programming algorithm is proposed for the exact solution of the relatively small problems. Extensive simulation results show that the approximate solutions are very close to the exact ones, and they can be refined by tabu search to the solutions with the error no more than 1.5% for all cases considered in this paper. 相似文献

6.

Design and implementation of embedded computer vision systems based on particle filters

Sankalita Saha Neal K. Bambha Shuvra S. Bhattacharyya 《Computer Vision and Image Understanding》2010,114(11):1203-1214

Particle filtering methods are gradually attaining significant importance in a variety of embedded computer vision applications. For example, in smart camera systems, object tracking is a very important application and particle filter based tracking algorithms have shown promising results with robust tracking performance. However, most particle filters involve vast amount of computational complexity, thereby intensifying the challenges faced in their real-time, embedded implementation. Many of these applications share common characteristics, and the same system design can be reused by identifying and varying key system parameters and varying them appropriately. In this paper, we present a System-on-Chip (SoC) architecture involving both hardware and software components for a class of particle filters. The framework uses parameterization to enable fast and efficient reuse of the architecture with minimal re-design effort for a wide range of particle filtering applications as well as implementation platforms. 相似文献

7.

基于DSP的嵌入式目标跟踪系统

时旭东施华君陆国强《计算机系统应用》2019,28(11):87-95

近年,由Henriques等人提出的核化相关滤波算法（KCF算法）在算法规模、复杂度、性能等方面表现优越.本文以KCF算法为核心,提出并设计了一种基于DSP的目标跟踪系统.硬件方面,本文设计实现了一套完整独立的硬件平台;软件方面,本文提出一系列针对DSP的算法优化方法,使优化后的KCF算法能够满足重要的工程指标要求.结果表明,系统在工程环境中表现良好,跟踪角速度可达20度/秒,平均帧率25 fps,跟踪准确率较高,为计算机视觉领域内的各类算法的嵌入式应用提供参考. 相似文献

8.

An FPGA-based architecture for embedded systems performance acceleration applied to Optimum-Path Forest classifier

《Microprocessors and Microsystems》2017

Classification techniques development constitutes a foundation for machine learning evolution, which has become a major part of the current mainstream of Artificial Intelligence research lines. However, the computational cost associated with these techniques limits their use in resource constrained embedded platforms. As the classification task is often combined with other high computational cost functions, efficient performance of the main modules is fundamental requirements to achieve hard real-time speed for the whole system. Graph-based machine learning techniques offer a powerful framework for building classifiers. Optimum-Path Forest (OPF) is a graph-based classifier presenting the interesting ability to provide nonlinear classes separation surfaces. This work proposes a SoC/FPGA based design and implementation of an architecture for embedded applications, presenting a hardware converted algorithm for an OPF classifier. Comparison of the achieved results with an embedded processor software implementation shows accelerations of the OPF classification from 2.18 to 9 times, which permits to expect real-time performance to embedded applications. 相似文献

9.

Harris角点结合金字塔光流法的目标跟踪算法设计研究

下载免费PDF全文

徐里萍耿斌李小龙赵丽《计算机测量与控制》2018,26(5):162-165

针对现存很多跟踪算法在速度和准确度方面很难满足嵌入式跟踪开发的需要,提出一种基于Harris角点和金字塔光流法的快速跟踪算法,并详细给出了DSP-FPGA的硬件设计。首先,使用Harris角点提取目标角点特征;然后,使用金字塔光流法为后续视频帧匹配角点;最后,基于角点的质心跟踪算法用于匹配目标的重心,确定目标的位置,重心跟踪算法可以较好地抵消由于旋转或扭曲带来的形变问题。在硬件实现过程中,FPGA方便电路设计,使用硬件描述程序语言实现硬件算法、逻辑控制和外部接口,DSP则运行目标跟踪算法。实验结果验证了本文硬件实现算法的有效性,相比于AVT21开发板的质心跟踪算法、相位相关跟踪算法和金字塔相关性跟踪算法相比,本文算法在平均重叠和平均中心误差方面具有一定优势,在720p的视频流上可以满足25fps。相似文献

10.

一种基于监测的嵌入式系统设计技术 总被引：6，自引：0，他引：6

吴百锋彭澄廉孙晓光《计算机学报》2003,26(12):1728-1733

提出一种嵌入式系统软硬件协同设计方法，它以数据流图为系统模型对嵌入式系统的功能和性能需求进行描述，并通过一种特定的实现结构，使得设计者可以借助快速样机平台和事件驱动式监测技术来精确测定目标系统对系统模型的实现状况，从而使得软硬件协同设计过程特别是系统优化和性能验证能在精确、可靠的测试数据基础上进行．同目前通常使用的以软硬件部件性能估算为基础的软硬件协同设计方法相比，这种以测试为基础的设计技术更能确保设计结果的合理．相似文献

11.

Statistical skin color detection method without color transformation for real-time surveillance systems

Yen-Hsiang Chen Kai-Ti Hu Shanq-Jang Ruan 《Engineering Applications of Artificial Intelligence》2012,25(7):1331-1337

Skin color is the significant information for many emerging applications in surveillance systems. However, the common skin color models usually need to perform color space transformation. This is not suitable for direct hardware implementation. This paper develops a statistical skin color model using the default RGB color space, which is especially suitable to implement on hardware for image processing applications. Moreover, an efficient face detection system is also proposed with our skin color model for hardware implementation. Compared with other skin color models, the proposed model produces the highest detection rate. Furthermore, the extended face detection system also significantly decreases the computational cost of the hardware implementation based on our skin color model. Experimental results demonstrate that our proposed detection system can be easily implemented on a field-programmable gate array (FPGA), where only 3202 logic cells is occupied with the high detection rate. 相似文献

12.

FPGA-based architecture for the real-time computation of 2-D convolution with large kernel size

F. Javier Toledo-Moreo J. Javier Martínez-Alvarez Javier Garrigós-Guerrero J. Manuel Ferrández-Vicente 《Journal of Systems Architecture》2012,58(8):277-285

Bidimensional convolution is a low-level processing algorithm of interest in many areas, but its high computational cost constrains the size of the kernels, especially in real-time embedded systems. This paper presents a hardware architecture for the FPGA-based implementation of 2-D convolution with medium–large kernels. It is a multiplierless solution based on Distributed Arithmetic implemented using general purpose resources in FPGAs. Our proposal is modular and coefficient independent, so it remains fully flexible and customizable for any application. The architecture design includes a control unit to manage efficiently the operations at the borders of the input array. Results in terms of occupied resources and timing are reported for different configurations. We compare these results with other approaches in the state of the art to validate our approach. 相似文献

13.

K-means clustering algorithm for multimedia applications with flexible HW/SW co-design

《Journal of Systems Architecture》2013,59(3):155-164

In this paper, we report a hardware/software (HW/SW) co-designed K-means clustering algorithm with high flexibility and high performance for machine learning, pattern recognition and multimedia applications. The contributions of this work can be attributed to two aspects. The first is the hardware architecture for nearest neighbor searching, which is used to overcome the main computational cost of a K-means clustering algorithm. The second aspect is the high flexibility for different applications which comes from not only the software but also the hardware. High flexibility with respect to the number of training data samples, the dimensionality of each sample vector, the number of clusters, and the target application, is one of the major shortcomings of dedicated hardware implementations for the K-means algorithm. In particular, the HW/SW K-means algorithm is extendable to embedded systems and mobile devices. We benchmark our multi-purpose K-means system against the application of handwritten digit recognition, face recognition and image segmentation to demonstrate its excellent performance, high flexibility, fast clustering speed, short recognition time, good recognition rate and versatile functionality. 相似文献

14.

一种用于无线传感器网络的模块化设计方法 总被引：2，自引：0，他引：2

高超张頔罗嵘《电子技术应用》2009,35(5)

针对无线传感器网络应用多样化的特点,建立了基于ZigBee技术的无线传感器网络节点与网关节点的模块化软、硬件设计方案。该硬件方案具有模块化与集成度高的特点,软件方案基于嵌入式操作系统进行多种功能的模块化设计,具有良好扩展性以及可维护性。实现了一种基于单芯片平台的传感器节点与ARM平台的网关节点,讨论了当前与未来适用的各种嵌入式设计关键技术。相似文献

15.

基于Bootloader的可靠嵌入式软件远程更新机制 总被引：6，自引：0，他引：6

王恒王颋王泉李勇《微计算机信息》2007,23(20):57-59

嵌入式软件的远程自动更新技术能够显著的降低嵌入式系统的维护成本,而更新过程的可靠性直接影响着远程更新的质量.本文针对基于bootloader的嵌入式系统,提出了一种高可靠的嵌入式软件远程自动更新机制,并以采用ARM微处理器、嵌入式Linux操作系统和无线网络接口的嵌入式平台为例给出了更新机制的软硬件实现方案.最后在实际系统中对更新机制的性能进行了测试.测试结果表明,本更新机制具有良好的抗干扰能力,能有效地提高嵌入式软件远程更新的可靠性. 相似文献

16.

基于改进型高斯混合模型的嵌入式实时运动检测系统

下载免费PDF全文

陈龙虎尚岩峰梅林刘云淮汤志伟《计算机测量与控制》2014,22(12)

混合高斯模型由于其计算量大,算法结构复杂,难以在嵌入式系统中实现运动物体的实时检测,为解决此问题,文中提出了一种基于改进型混合高斯模型的实时运动检测方案,对混合高斯模型进行简化和结构调整,同时进行了C语言层面和CPU层级的优化,使其更合适于嵌入式平台,并详细分析了DM6446平台的软硬件设计,介绍了该算法在DM6446平台上的实现过程;实验结果表明:该系统能够有效克服外界环境变化带来的干扰,能够实时检测,可以实现多目标跟踪。相似文献

17.

嵌入式系统GUI调色板查找改进算法 总被引：1，自引：0，他引：1

杨军高小鹏龙翔《计算机工程与应用》2005,41(33):34-35,50

通过分析硬件调色板的基本工作原理和嵌入式系统GUI图形引擎中调色板查找算法的实现,提出了一种应用于硬件调色板的嵌入式系统GUI中,基于软件Cache技术的改进调色板查找算法,极大地提高了嵌入式系统GUI图形引擎的效率。相似文献

18.

Hardware approach to tool path computation for STEP-NC enabled CNC: A case study of turning operations

S. Cuenca Author VitaeA. Jimeno-MorenillaAuthor Vitae A. Martínez Author VitaeR. Maestre Author Vitae 《Computers in Industry》2011,62(5):509-518

相似文献

19.

FPGA implementation of full-search vector quantization based on partial distance search

Wen-Jyi Hwang Wen-Kang Wei Yao-Jung Yeh 《Microprocessors and Microsystems》2007,31(8):516-528

This paper presents a novel algorithm for field programmable gate array (FPGA) realization of vector quantizer (VQ) encoders using partial distance search (PDS). In most applications, the PDS is adopted as a software approach for attaining moderate codeword search acceleration. In this paper, a novel PDS algorithm well suited for hardware realization is proposed. The algorithm employs subspace search, bitplane reduction, and multiple-coefficient accumulation techniques for the effective reduction of the area complexity and computation latency. Concurrent encoding of different input vectors for further computation acceleration is also allowed by the employment of multiple-module PDS. The proposed implementation has been embedded in a softcore CPU for physical performance measurement. Experimental results show that the implementation provides a cost-effective solution to the FPGA realization of VQ encoding systems where both high throughput and high fidelity are desired. 相似文献

20.

Design and hardware implementation of a stereo-matching system based on dynamic programming

J.A. Kalomiros J. Lygouras 《Microprocessors and Microsystems》2011,35(5):496-509

A new real-time stereo system is presented based on a hardware implementation of an efficient Dynamic Programming algorithm. A simple state-machine calculates the cost-matrix along the diagonal of the 2-D disparity space for each epipolar pair of image scan-lines. Minimum transition costs are stored in embedded RAM and are used to backtrack disparities at clock rate. All calculations are within a pre-determined slice of the cost plane, representing the useful disparity range. The system is designed as a VHDL library component and is implemented as a SoC in a medium-capacity Field Programmable Gate Array chip. It can process stereo-pairs in full VGA resolution at a rate of 25 Mpixels/s and produces 8-bit dense disparity maps within a range of disparities up to 65 pixels. The design is evaluated comparing to ground truth and in terms of resource usage. It is also compared to a software implementation of the Dynamic Programming algorithm and to other FPGA-based stereo systems. 相似文献