| 818 | 5 | 322 |
| 下载次数 | 被引频次 | 阅读次数 |
利用深度学习实现遥感影像耕地区域自动化检测,取代人工解译,能有效提升耕地面积统计效率。针对目前存在分割目标尺度大且连续导致分割区域存在欠分割现象,边界区域情况复杂导致边缘分割困难等问题,提出了语义分割算法——Swin Transformer, TransFuse and U-Net (SF-Unet)。为强化网络不同层次特征提取和信息融合能力,提升边缘分割性能,使用U-Net网络替代TransFuse网络中的ResNet50模块;将Vision Transformer (ViT)替换为改进后的Swin Transformer网络,解决大区域的欠分割问题;通过注意力机制构建的Fusion融合模块将2个网络输出特征进行融合,增强模型对目标的语义表示,提高分割的精度。实验表明,SF-Unet语义分割网络在Gaofen Image Dataset (GID)数据集上的交并比(Intersection over Union, IoU)达到了90.57%,分别比U-Net和TransFuse网络提升了6.48%和6.09%,明显提升了耕地遥感影像分割的准确性。
Abstract:Using deep learning to realize automatic detection of cultivated land areas in remote sensing images, replacing manual interpretation, can effectively improve the statistical efficiency of cultivated land areas. Aiming at the current problems that the segmentation target scale is large and continuous, which leads to under-segmentation in the segmented area, and the complexity of the boundary area makes edge segmentation difficult, the Swin Transformer, TransFuse and U-Net(SF-Unet) semantic segmentation algorithm is proposed. In order to strengthen the feature extraction and information fusion capabilities of different levels of the network and improve the edge segmentation performance, the U-Net network is used to replace the ResNet50 module in the TransFuse network; the Vision Transformer(ViT) is replaced by improved Swin Transformernetwork to solve the problem of under-segmentation of large areas; the Fusion module constructed by the attention mechanism fuses the output features of the two networks to enhance the semantic representation of the model to the target and improve the accuracy of segmentation. Experiments show that the Intersection over Union(IoU) ratio of the SF-Unet semantic segmentation network on the Gaofen Image Dataset(GID) has reached 90.57%, which is 6.48% and 6.09% higher than U-Net and TransFuse networks, respectively. The accuracy of cultivated land remote sensing image segmentation is significantly improved.
[1] 赵少华,刘思含,毛学军,等.新时期我国环保领域卫星遥感技术的应用与发展 [J].无线电工程,2017,47 (3):1-7.
[2] 黄春波,佃袁勇,周志翔,等.基于时间序列统计特性的森林变化监测[J].遥感学报,2015,19(4):657-668.
[3] OTSU N.A Threshold Selection Method from Gray-level Histograms[J].IEEE Transactions on Systems,Man,and Cybernetics,1979,9(1):62-66.
[4] 丁海勇,王雨轩,毛宇琼,等.基于动态阈值区域分裂合并算法的高分辨率遥感图像分割研究 [J].测绘通报,2016 (8):145-146.
[5] 张爱华,余胜生,周敬利.一种基于边缘检测的局部阈值分割算法 [J].小型微型计算机系统,2003,24(4):661-663.
[6] 南柄飞,穆志纯.基于 SLIC0 融合纹理信息的超像素分割方法 [J].仪器仪表学报,2014,35 (3):527-534.
[7] LECUN Y,BOTTOU L,BENGIO Y,et al.Gradient-based Learning Applied to Document Recognition[J].Procee-dings of the IEEE,1998,86(11):2278-2324.
[8] LONG J,SHELHAMER E,DARRELL T.Fully Convolutional Networks for Semantic Segmentation[C]//2015 IEEE Conference on Computer Vision and Pattern Recognition.Boston:IEEE,2015:3431-3440.
[9] RONNEBERGER O,FISCHER P,BROX T.U-Net:Con-volutional Networks for Biomedical Image Segmentation[C]//18th International Conference.Munich:Springer,2015:234-241.
[10] VASWANI A,SHAZEER N,PARMAR N,et al.Attention Is All You Need[EB/OL].(2017-12-06)[2023-10-20].https://arxiv.org/pdf/1706.03762v5.pdf.
[11] DOSOVITSKIY A,BEYER L,KOLESNIKOV A,et al.An Image Is Worth 16×16 Words:Transformers for Image Recognition at Scale[EB/OL].(2020-10-22)[2023-10-15].https://arxiv.org/abs/2010.11929.
[12] LIU Z,LIN Y T,CAO Y,et al.Swin Transformer:Hierarchical Vision Transformer Using Shifted Windows[C]//2021 IEEE/CVF International Conference on Computer Vision.Montreal:IEEE,2021:9992-10002.
[13] ZHANG Y D,LIU H Y,HU Q Y.Transfuse:Fusing Transformers and CNNs for Medical Image Segmentation[C]// 24th International Conference.Strasbourg:Springer,2021:14-24.
[14] HE K M,ZHANG X Y,REN S Q,et al.Deep Residual Learning for Image Recognition[C]//2016 IEEE Confe-rence on Computer Vision and Pattern Recognition.Las Vegas:IEEE,2016:770-778.
[15] 周泽昊.基于Swin-Spec Transformer的胆管癌显微高光谱图像分割研究[D].上海:华东师范大学,2022.
[16] 王金祥,付立军,尹鹏滨,等.基于CNN与Transformer的医学图像分割[J].计算机系统应用,2023,32(4):141-148.
[17] 王军,高放,省海先,等.基于Swin Transformer生成对抗网络的图像生成算法[J].小型微型计算机系统,2024,45(1):241-248.
[18] OKTAY O,SCHLEMPER J,LE FOLGOC L,et al.Attention U-Net:Learning Where to Look for the Pancreas[EB/OL].(2018-04-11)[2023-11-20].https://arxiv.org/abs/1804.03999.
[19] LEE C Y,XIE S N,GALLAGHER P,et al.Deeply-supervised Nets[C]//Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics.[S.l.]:PMLR,2015:562-570.
[20] WEI J,WANG S H,HUANG Q M.F3Net:Fusion,Feedback and Focus for Salient Object Detection[C]//34th AAAI Conference on Artificial Intelligence.[S.l.]:AAAI,2020:12321-12328.
[21] 秦传波,宋子玉,曾军英,等.联合多尺度和注意力-残差的深度监督乳腺癌分割 [J].光学精密工程,2021,29(4):877-895.
基本信息:
中图分类号:TP751;S127
引用信息:
[1]秦伦明,凌雪海,邹钰洁,等.基于SF-Unet的高分辨率耕地遥感影像分割[J].无线电工程,2024,54(05):1197-1204.
基金信息:
国家自然科学基金面上项目(62073024)~~
2024-02-28
2024-02-28
2024-02-28