首页 | 本学科首页   官方微博 | 高级检索  
     

基于单目视频和无监督学习的轻轨定位方法
引用本文:姚萌, 贾克斌, 萧允治. 基于单目视频和无监督学习的轻轨定位方法[J]. 电子与信息学报, 2018, 40(9): 2127-2134. doi: 10.11999/JEIT171017
作者姓名:姚萌  贾克斌  萧允治
作者单位:1.北京工业大学信息学部 北京 100124;;2.香港理工大学电子讯息工程系 香港;;3.先进信息网络北京实验室 北京 100124;;4.未来网络科技高精尖创新中心 北京 100124
基金项目:国家自然科学基金面上项目(61672064),北京市自然科学基金重点项目(KZ201610005007)
摘    要:基于视觉信息的场景识别定位模块被广泛应用于车辆安全系统。针对目前场景逐帧匹配算法训练数据量大、匹配处理计算复杂度高以及跟踪精度低导致难以实际应用的问题,该文提出一种新的基于局部关键区域与关键帧的场景识别方法,在保证匹配精度的同时满足系统实时性的要求。首先,该方法仅使用单目摄像机捕获的单一序列作为参考序列,采用无监督方式提取序列的显著性区域作为关键区域,并计算关键区域中低相关性的二值化特征,提高了场景匹配的精确度并大幅减少了实时场景匹配过程中特征生成与匹配的计算复杂度。其次,该方法以显著性分数为依据提取参考序列中的关键帧,缩小了跟踪模块的检索范围并提高了检索效率。该文使用香港轻轨系统数据集以及公开测试数据集进行方法测试。实验结果表明,该文方法在实现快速匹配的同时,其匹配正确率较基于全局特征匹配方法SeqSLAM提高了9.8%。

关 键 词:视觉定位   关键区域   关键帧   二值化特征
收稿时间:2017-10-31
修稿时间:2018-05-21

Learning-based Localization with Monocular Camera for Light-rail System
Meng YAO, Kebin JIA, Wanchi SIU. Learning-based Localization with Monocular Camera for Light-rail System[J]. Journal of Electronics & Information Technology, 2018, 40(9): 2127-2134. doi: 10.11999/JEIT171017
Authors:Meng YAO  Kebin JIA  Wanchi SIU
Affiliation:1. Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China;;2. Department of Electronic and Information Engineering, Hong Kong Polytechnic University, Hong Kong, China;;3. Beijing Laboratory of Advanced Information Networks, Beijing 100124, China;;4. Advanced Innovation Center for Future Internet Technology, Beijing 100124, China
Abstract:The visual-based scene recognition and localization module is widely used in vehicle safety system. This paper proposes a new method of scene recognition based on local key region and key frame, which is based on the problem of large amount of training data, large matching complexity and low tracking precision. The proposed method meets the real-time requirements with high accuracy. First, the method uses the unsupervised method to extract the significant regions of the single reference sequence captured by the monocular camera as the key regions. The binary features with low correlation in key regions are also extracted to improve the scene matching accuracy and reduce the computational complexity of feature generation and matching. Secondly, key frames in the reference sequence are extracted based on the discrimination score to reduce the retrieval range of the tracking module and improve the efficiency. Practical field tests are done on real data of the light railway system in Hong Kong and the open test data set in Nordland. The experimental results show that the proposed method achieves fast matching and the precision is 9.8% higher than SeqSLAM which is based on global feature.
Keywords:Visual-based localization  Key region  Key frame  Binary feature
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号