首页 | 本学科首页   官方微博 | 高级检索  
     

目标检测数据集半自动生成技术研究
引用本文:孙晓璇,张磊,李健. 目标检测数据集半自动生成技术研究[J]. 计算机系统应用, 2019, 28(10): 8-14
作者姓名:孙晓璇  张磊  李健
作者单位:中国科学院 计算机网络信息中心, 北京 100190;中国科学院大学 计算机与控制学院, 北京 100049;中国科学院 计算机网络信息中心,北京,100190
基金项目:中国科学院科技服务网络计划(KFJ-STS-QYZD-058)
摘    要:目标检测广泛使用于计算机视觉领域.在不同的场景中,我们需要使用不同的数据集训练模型.但是,人工生成数据集标签非常耗时.本文提出一种半自动的方法生成数据集标签,然后按照图像相似度设置的阈值自动筛选,最后保留符合要求的图像和对应的标签作为最终的数据集.实验表明,该方法可以提高数据集生成标签的速度,同时确保了准确率.

关 键 词:YOLOv3  SSD  差异值哈希  半自动生成训练集
收稿时间:2019-03-25
修稿时间:2019-04-18

Research on Semi-Automatic Generation Technology of Object Detection Datasets
SUN Xiao-Xuan,ZHANG Lei and LI Jian. Research on Semi-Automatic Generation Technology of Object Detection Datasets[J]. Computer Systems& Applications, 2019, 28(10): 8-14
Authors:SUN Xiao-Xuan  ZHANG Lei  LI Jian
Affiliation:Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China;School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing 100049, China,Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China and Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China
Abstract:Object detection is widely used in the field of computer vision. In different occasions, we need to use different training set to train the model. However, manually generating label is very time consuming. This study proposed a semi-automatic method to generate labels for dataset, then automatically filter them according to the threshold set by image similarity, lastly retain the required images and corresponding labels as the final dataset. Experiments show that the method can both improve the speed and ensure accuracy rate of generating labels for dataset.
Keywords:YOLOv3  SSD  difference Hash  semi-auto produce training set
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号