无线电工程

2024, 07, v.54 1602-1613

基于YOLOv5s微光环境下的多模态识别网络

吴学礼^1,2 赵俊棋¹ 刘雨涵¹ 甄然^1,2

1.河北科技大学电气工程学院 2.河北省生产过程自动化工程技术研究中心

基金项目(Foundation): 国家自然科学基金(62003129); 河北省重点研发计划项目(19250801D)~~

邮箱(Email):

DOI:

发布时间： 2023-09-22

出版时间： 2023-09-22

网络发布时间： 2023-09-22

移动端阅读

321	3	313
下载次数	被引频次	阅读次数

引用本文下载本文

PDF

引用导出

GB/T 7714-2015 MLA APA Refworks EndNote NoteExpress NoteFirst

摘要全文参考文献出版信息相关文章

摘要：

近些年来，在目标检测以及图像分割等领域涌现了许多先进的算法。在能见度较差的微光场景下如夜晚、大雾天气等场景中，视频图像具有像素声高、对比度低、无彩色信息等特点，算法的检测性能受到明显限制。与目前主流的RGB相机相比，毫米波雷达对上述复杂环境具有一定的免疫能力，可以在不利条件下辅助RGB相机进行目标检测工作。以单阶段目标检测器中实时性较高的YOLOv5s为基础，结合毫米波雷达的特性，提出了用于微光环境下目标检测的多模态识别网络。与现有的传感器融合方法相比，多模态识别网络有几个关键优势。系统以基于学习的方式融合了2种模态，只需要少量新场景的标记图像和雷达数据，因为其可以充分利用已经开源的大型图像数据集进行大批量的训练。这一突出特性使新系统能够适应高度复杂的现实环境。由于使用了高度计算效率的融合方法，系统是非常轻量级的，因此适用于各个复杂场景下的实时应用。为了评估系统的性能，制作了一个小批量的雷达和摄像机融合数据集，包含普通光照和不同强度微光光照条件下的多模态数据。实验结果表明，微光场景下多模态识别网络的平均精度达到76.6%,相比Faster R-CNN算法和YOLOv7算法，全类平均精度(mean Average Precision, mAP)提高了16.8%和9.3%,且误检、漏检率低，达到了在微光环境下完成目标检测任务的要求。

关键词： 多模态识别网络; 毫米波雷达; 目标检测; YOLOv5s; 微光场景;

Abstract：

In recent years, many advanced algorithms have emerged in the fields of object detection and image segmentation. However, in low light scenarios with poor visibility, such as night and foggy weather, video images have the characteristics of high pixel noise, low contrast and no color information, which significantly limits the detection performance of the algorithm. Meanwhile, compared with mainstream RGB cameras, the millimeter wave radar has certain immunity to the complex environment mentioned above, and can assist RGB cameras in object detection under adverse conditions. Therefore, based on YOLOv5s, which has high real-time performance in single stage object detectors, and combined with the characteristics of millimeter wave radar, a multimodal recognition network is proposed for object detection in low light environments. Compared with existing sensor fusion methods, the proposed multimodal recognition network has several key advantages. Firstly, although this system integrates two modalities in a learning based manner, it only requires a small amount of labeled images and radar data for new scenes, as it can fully utilize large open source image datasets for large-scale training. This outstanding feature enables the new system to adapt to highly complex real-world environments. Secondly, due to the use of highly computationally efficient fusion methods, this system is very lightweight and therefore suitable for real-time applications in various complex scenarios. In order to evaluate the performance of this system, a small batch of radar and camera fusion dataset is produced, which includes multimodal data under normal lighting and different intensities of low light illumination.The experimental results show that the average accuracy of the multimodal recognition network in low light environments reaches 76. 6%.Compared with Faster R-CNN algorithm and YOLOv7 algorithm,the mean Average Precision(mAP)has improved by 16. 8% and 9. 3%,and the false detection and missed detection rates are low,meeting the requirements for object recognition in low light environment.

KeyWords： multimodal recognition network; millimeter wave radar; object detection; YOLOv5s; low light scene;

如需获取全文，请访问cnki.net

参考文献

[1] GIRSHICK R.Fast R-CNN[C]//2015 IEEE International Conference on Computer Vision(ICCV).Santiago:IEEE,2015:1440-1448.

[2] REDMON J,DIVVALA S,GIRSHICK R,et al.You Only Look Once:Unified,Real-time Object Detection[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition(CVPR).Las Vegas:IEEE,2016:779-788.

[3] CHEN M J,LAN Z X,DUAN Z X,et al.HDS-YOLOv5:An Improved Safety Harness Hook Detection Algorithm Based on YOLOv5s[J].Mathematical Biosciences and Engineering,2023,20(8):15476-15495.

[4] 甄然，刘颖，孟凡华，等.基于YOLOv5的轻量化目标检测算法[J].无线电工程，2023,53(6):1242-1250.

[5] 邱天衡，王玲，王鹏，等.基于改进YOLOv5的目标检测算法研究[J].计算机工程与应用，2022,58(13):63-73.

[6] KRIZHEVSKY A,SUTSKEVER I,HINTON G E.ImageNet Classification with Deep Convolutional Neural Networks[J].Communications of the ACM,2017,60(6):84-90.

[7] LIN T Y,MAIRE M,BELONGIE S,et al.Microsoft COCO:Common Objects in Context[C]//Computer Vision-ECCV 2014.Zurich:Springer,2014:740-755.

[8] CAESAR H,BANKITI V,LANG A H,et al.nuScenes:A Multimodal Dataset for Autonomous Driving[EB/OL].(2019-03-26)[2023-08-01].https://arixiv.org/abs/1903.11027.

[9] CHO H,SEO Y W,KUMAR B V K V,et al.A Multi-sensor Fusion System for Moving Object Detection and Tracking in Urban Driving Environments[C]//2014 IEEE International Conference on Robotics and Automation(ICRA).Hong Kong:IEEE,2014:1836-1843.

[10] WANG X,XU L H,SUN H B,et al.On-road Vehicle Detection and Tracking Using MMW Radar and Monovision Fusion[J].IEEE Transactions on Intelligent Transportation Systems,2016,17(7):2075-2084.

[11] CHADWICK S,NEWMAN P.Radar as a Teacher:Weakly Supervised Vehicle Detection Using Radar Labels[C]//2020 IEEE International Conference on Robotics and Automation(ICRA).Paris:IEEE,2020:222-228.

[12] FENG D,SCHüTZ C H,ROSENBAUM L,et al.Deep Multi-modal Object Detection and Semantic Segmentation for Autonomous Driving:Datasets,Methods,and Challenges[J].IEEE Transactions on Intelligent Transportation Systems,2021,22(3):1341-1360.

[13] 黄志忠.基于STFT的汽车雷达抗干扰处理方法[J].无线电工程，2021,51(4):313-317.

[14] 朱玉刚.基于深度学习的复杂环境下多目标检测方法研究[D].杭州：杭州电子科技大学，2019.

[15] 吴贻杰，贾浩男.基于多传感器信息融合的障碍物感知追踪系统设计[J].无线电工程，2023,53(5):1068-1077.

[16] 陈映谦.基于信息融合的夜间行驶车辆检测研究[D].广州：广东工业大学，2022.

[17] LI Z M,PENG C,YU G,et al.Light-head R-CNN:In Defense of Two-stage Object Detector[EB/OL].(2017-11-20)[2023-07-20].https://arxiv.org/abs/1711.07264.

[18] HE K M,GKIOXARI G,DOLLáR P,et al.Mask R-CNN[C]//2017 IEEE International Conference on Computer Vision (ICCV).Venice:IEEE,2017:2980-2988.

[19] 梁晨晨，田金鹏，宋春林.基于雷达和摄像头传感器融合的辅助驾驶目标检测算法[J].信息技术与信息化，2021(12):5-9.

[20] CHADWICK C,MADDERN W,NEWMAN P.Distant Vehicle Detection Using Radar and Vision[C]//2019 International Conference on Robotics and Automation(ICRA).Montreal:IEEE,2019:8311-8317.

[21] ESTER M,KRIEGEL H P,SANDER J,et al.A Density Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise[C]//Proceedings of the Second International Conference on Knowledge Discovery and Data Mining.Portland:AAAI,1996:226-231.

[22] 贾文博.基于雷达与视觉融合的车辆检测方法研究[D].大连：大连理工大学，2021.

[23] KUHN H W.The Hungarian Method for the Assignment Problem[J].Naval Research Logist,2010,52(1):7-21.

[24] LIN T Y,GOYAL P,GIRSHICK R,et al.Focal Loss for Dense Object Detection[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2020,42(2):318-327.

基本信息:

中图分类号:TN957.52;TP183

引用信息:

[1]吴学礼,赵俊棋,刘雨涵,等.基于YOLOv5s微光环境下的多模态识别网络[J].无线电工程,2024,54(07):1602-1613.

基金信息:

国家自然科学基金(62003129); 河北省重点研发计划项目(19250801D)~~

发布时间：

2023-09-22

出版时间：

2023-09-22

网络发布时间：

2023-09-22

请选择需要下载的pdf数据

无线电工程

使用微信“扫一扫”功能。
将此内容分享给您的微信好友或者朋友圈

请选择需要下载的pdf数据

无线电工程

使用微信“扫一扫”功能。将此内容分享给您的微信好友或者朋友圈

使用微信“扫一扫”功能。
将此内容分享给您的微信好友或者朋友圈