首页 | 本学科首页   官方微博 | 高级检索  
     

结合视觉Transformer和CNN的道路裂缝检测方法
引用本文:代少升,刘科生,余自安.结合视觉Transformer和CNN的道路裂缝检测方法[J].半导体光电,2024,45(2):252-260.
作者姓名:代少升  刘科生  余自安
作者单位:重庆邮电大学 通信与信息工程学院, 重庆 400065;昆明云内动力股份有限公司, 昆明 650200
摘    要:提出了一种结合视觉Transformer和CNN的道路裂缝检测方法。利用CNN来捕获局部的细节信息,同时利用视觉Transformer来捕获全局特征。通过设计的Fusion特征融合模块将两者提取的特征有机地结合在一起,从而解决了单独使用CNN或视觉Transformer方法存在的局限。最终将结果传递至交互式解码器,生成道路裂缝的检测结果。实验结果表明,无论是在公开的数据集上还是在自建的数据集上,相较于单独使用CNN或视觉Transformer的方法,所提出的方法在道路裂缝检测任务中有更好的效果。

关 键 词:道路裂缝检测  视觉Transformer和CNN  动态加权交叉特征融合
收稿时间:2023/11/10 0:00:00

Road Crack Detection Method Combining A Visual Transformer and CNN
DAI Shaosheng,LIU Kesheng,YU Zian.Road Crack Detection Method Combining A Visual Transformer and CNN[J].Semiconductor Optoelectronics,2024,45(2):252-260.
Authors:DAI Shaosheng  LIU Kesheng  YU Zian
Affiliation:School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, CHN; Kunming Yunnei Power Co., Ltd., Kunming 650200, CHN
Abstract:This study introduces an integrated approach for road crack detection that harnesses the strengths of both visual transformers and CNN. A CNN is employed to capture fine-grained details, and a visual transformer is fully utilized to capture global characteristics. A feature fusion module is then designed to seamlessly merge the extracted features from both methods, thereby addressing the limitations of using CNN or visual transformer methods separately. Finally, the results are fed into an interactive decoder to produce accurate road crack detection results. Experimental results demonstrate that, whether on a publicly available or self-constructed dataset, the proposed method demonstrates an improvement in performance as compared with using CNN or visual transformer methods separately for road crack detection tasks.
Keywords:road crack detection  visual transformer and CNN  dynamic weighted cross feature fusion
点击此处可从《半导体光电》浏览原始摘要信息
点击此处可从《半导体光电》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号