首页 | 官方网站   微博 | 高级检索  
     

基于嵌入式多核DSP 系统的并行粒子滤波目标跟踪
引用本文:田立,周付根,孟偲.基于嵌入式多核DSP 系统的并行粒子滤波目标跟踪[J].红外与激光工程,2014,43(7):2354-2361.
作者姓名:田立  周付根  孟偲
作者单位:1.北京航空航天大学宇航学院图像处理中心,北京 100191
基金项目:国家863计划(2011AA0641,2013AA041201-7)
摘    要:目标跟踪中的伺服系统需要极低的跟踪延时,由于粒子滤波跟踪算法固有的庞大计算量使得目标跟踪的精度大受影响。提出了一种粒子滤波跟踪算法在多核DSP 系统中的快速实现方法。首先,利用DSP 片上的包加速器来降低以太网相机的采集延时以及CPU 占用率,CPU 占用率从31%降低到10%;其次,通过手动操作高速缓存的刷新和实效,解决了多核同时共享图像数据带来的存储器一致性问题,多个核能通过高速缓存快速获取图像数据;最后,通过在多核核心上设置代理任务的方法,建立了一种多核并行计算的机制。粒子滤波算法中计算复杂度高的运算阶段被分配到多个核心上同时运算,实现了算法的低延时。实验结果显示8 核加速比达到7 倍以上,优于开放多处理标准 OpenMP 的并行优化效果。

关 键 词:粒子滤波    目标跟踪    多核DSP    并行计算
收稿时间:2013-11-16

Parallel particle filter object tracking based on embedded multicore DSP systems
Affiliation:1.Image Processing Center,School of Astronautics,Beihang University,Beijing 100191,China
Abstract:The object tracking servo system requires a low delay from an object moving to starting of rotations while the inherent computational complexity of PF (Particle Filter) affects the tracking precision. In this paper, a multicore DSP parallel implementation strategy for particle filter object tracking was proposed. Firstly, the PA module on chip was used to reduce the GigE image capturing delay and the CPU occupancy. The CPU load was considerably reduced from 31% to 10%. Secondly, by manually FLUSH after writing and INVALID before reading, the memory consistency problem was addressed and cacheable shared image data can be accessed at high efficiency. Finally, a mechanism of parallel computing on multi-core processor was introduced by adding proxy task. The computational intensive stages of particle filter were dispatched to 8 cores to eliminate system delay. Experimental results show that the tracking response time was decreased and algorithmic speedup runs up to 7 and exceeds OpenMP.
Keywords:
本文献已被 CNKI 等数据库收录!
点击此处可从《红外与激光工程》浏览原始摘要信息
点击此处可从《红外与激光工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号