| 2,842 | 12 | 143 |
| 下载次数 | 被引频次 | 阅读次数 |
针对目前城市背景烟火检测方法存在检测精度不高,易出现误检、漏检和耗时长等问题,提出一种基于YOLOv8s改进的烟火检测算法。引入加权双向特征金字塔(Bi-directional Feature Pyramid Network, BiFPN)增强特征融合,在BiFPN的基础上融合P2特征层提升小目标检测能力,同时添加基于跨空间学习的高效多尺度注意力(Efficient Multi-scale Attention, EMA)模块,突出目标特征同时抑制背景环境的干扰;为了有效利用特征图的语义信息,引入轻量级通用上采样算子——Content-Aware ReAssembly of Features(CARAFE);基于多尺度卷积注意力(Multi-scale Convolutional Attention, MCA)模块设计了一个轻量化的检测头并提升了检测精度;引入分组卷积空间金字塔池化SPPFCSPC_Group模块,在扩大感受野的同时具有更好的特征提取能力。实验结果表明,改进的YOLOv8s算法在基准模型的基础上计算量减少了25%、参数量减少了37.6%、模型权重大小减少了33.2%,平均精度均值(mean Average Precision, mAP)提升了3.4%,基本满足烟火检测的需求。
Abstract:To solve the problems of low detection accuracy, prone to false detection, missing detection, and long processing time in current urban background smoke and fire methods, a smoke and fire detection algorithm based on improved YOLOv8s is proposed.The weighted Bi-directional Feature Pyramid Network(BiFPN) is introduced to enhance feature fusion.Building upon BiFPN,the P2 feature layer is fused to improve the detection ability for small targets.Additionally, an Efficient Multi-scale Attention(EMA) module based on cross-space learning is incorporated to highlight target features while suppressing background interference.To effectively utilize semantic information in feature maps, the lightweight universal upsampling operator Content-Aware ReAssembly of Features(CARAFE) is introduced.Furthermore, a lightweight detection head is designed based on the Multi-scale Convolutional Attention(MCA) module for improving detection accuracy.Finally, the SPPFCSPC_Group module, based on grouped convolution spatial pyramid pooling, is introduced to enlarge the receptive field and enhance feature extraction capability.Experimental results demonstrate that the improved YOLOv8s algorithm reduces computation by 25%,parameters by 37.6%,and model weight size by 33.2% compared to the baseline model, while increasing mean Average Precision(mAP) by 3.4%,thereby meeting the requirements for smoke and fire detection.
[1] 李继超,郭聖煜,孔刘林,等.施工现场火焰检测和预警机器人设计及应用[J].中国安全科学学报,2021,31(4):141-146.
[2] 张驰,孟庆浩,井涛.基于改进 GMM 和多特征融合的视频火焰检测算法[J].激光与光电子学进展,2021,58(4):136-145.
[3] KIM Y H,KIM A,JEONG H Y.RGB Color Model Based the Fire Detection Algorithm in Video Sequences on Wireless Sensor Network[J].International Journal of Distributed Sensor Networks,2014,10(4):285-292.
[4] WU S X,ZHANG L B.Using Popular Object Detection Methods for Real Time Forest Fire Detection[C]//2018 11th International Symposium on Computational Intelligence and Design (ISCID).Hangzhou:IEEE,2018:280-284.
[5] REN S Q,HE K M,GIRSHICK R,et al.Faster R-CNN:Towards Real-time Object Detection with Region Proposal Networks[C]//Proceedings of the 28th International Conference on Neural Information Processing Systems.Montreal:MIT Press,2015:91-99.
[6] REDMON J,DIVVALA S,GIRSHICK R,et al.You Only Look Once:Unified,Real-time Object Detection[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition.Las Vegas:IEEE,2016:779-788.
[7] REDMON J,FARHADI A.YOLO9000:Better,Faster,Stronger[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition.Honolulu:IEEE,2017:7263-7271.
[8] REDMON J,FARHADI A.YOLOv3:An Incremental Improvement[EB/OL].(2018-04-08) [2024-01-20].https://arxiv.org/abs/1804.02767.
[9] BOCHKOVSKIY A,WANG C Y,LIAO H Y M.YOLOv4:Optimal Speed and Accuracy of Object Detection[EB/OL].(2020-04-23)[2024-01-20].https://arxiv.org/pdf/2004.10934.
[10] LIU W,ANGUELOV D,ERHAN D,et al.SSD:Single Shot MultiBox Detector[C]//Proceedings of the 14th European Conference on Computer Vision.Amsterdam:Springer,2016:21-37.
[11] ZHANG Q X,LIN G H,ZHANG Y M,et al.Wildland Forest Fire Smoke Detection Based on Faster R-CNN Using Synthetic Smoke Images[J].Procedia Engineering,2018,211(1):441-446.
[12] 罗小权,潘善亮.改进 YOLOV3 的火灾检测方法[J].计算机工程与应用,2020,56(17):187-196.
[13] 王一旭,肖小玲,王鹏飞,等.改进 YOLOv5s的小目标烟雾火焰检测算法[J].计算机工程与应用,2023,59(1):72-81.
[14] LIU S,QI L,QIN H F,et al.Path Aggregation Network for Instance Segmentation[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Salt Lake City:IEEE,2018:8759-8768.
[15] JIE H,LI S,SAMUEL A,et al.Squeeze-and-Excitation Networks[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2020,42(8):2011-2023.
[16] WOO S,PARK J,LE J Y,et al.CBAM:Convolutional Block Attention Module[C]//15th European Conference on Computer Vision.Munich:Springer,2018:3-19.
[17] HOU Q B,ZHOU D Q,FENG J S.Coordinate Attention for Efficient Mobile Network Design[C]//2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Nashville:IEEE,2021:13708-13717.
[18] OUYANG D,HE S,ZHANG G Z,et al.Efficient Multi-scale Attention Module with Cross-spatial Learning[EB/OL].(2023-05-23)[2024-01-20].https://arxiv.org/abs/2305.13563.
[19] LI T G,ZHANG Y Z,LI Q Q,et al.AB-DLM:An Improved Deep Learning Model Based on Attention Mechanism and BifFPN for Driver Distraction Behavior Detection[J].IEEE Access,2022,10:83138-83151.
[20] WANG J Q,CHEN K,XU R,et al.CARAFE:Content-aware ReAssembly of FEatures[C]//2019 IEEE/CVF International Conference on Computer Vision.Seoul:IEEE,2019:3007-3016.
基本信息:
中图分类号:TP183;TP391.41;X932
引用信息:
[1]于泳波,袁栋梁,孙振,等.基于YOLOv8s的城市背景烟火检测算法[J].无线电工程,2024,54(11):2566-2575.
基金信息:
山东省泰山学者项目(tshw201502042); 山东省重大创新工程(2017CXGC0607)~~
2024-03-12
2024-03-12
2024-03-12