基于改进YOLOv8的遥感图像目标检测算法

金梅, 王泓沣, 张立国, 张琦, 袁煜淋

计量学报 ›› 2025, Vol. 46 ›› Issue (11) : 1622-1630.

PDF(6926 KB)
PDF(6926 KB)
计量学报 ›› 2025, Vol. 46 ›› Issue (11) : 1622-1630. DOI: 10.3969/j.issn.1000-1158.2025.11.10
光学计量

基于改进YOLOv8的遥感图像目标检测算法

作者信息 +

Remote Sensing Image Target Detection Algorithm Based on Improved YOLOv8

Author information +
文章历史 +

摘要

针对遥感图像背景复杂、尺度多变等特点导致目标检测精度低的问题,提出一种基于改进YOLOv8的遥感图像目标检测算法。首先,基于YOLOv8网络框架,在骨干网络部分引入全局注意力机制,使模型更关注于重要区域,降低无用背景对模型的干扰;其次,采用一种明智的交并比作为新的边界框回归方式,减小低质量示例对模型的影响,提高模型的边界框回归水平;最后,在特征融合网络部分和头网络部分分别引入高效多尺度注意力模块和动态检测头模块,全面提升模型的尺度适应能力。实验结果表明,所提算法在公开遥感数据集DIOR和RSOD上均取得较高的检测精度,阈值为0.5的平均精度均值分别达到了72.0%和94.6%,并且改进后的算法具有较强的鲁棒性,当输入数据集改变时,仍能保持较好的检测性能。

Abstract

Aiming at the problems of low detection accuracy caused by complex background and variable scale of remote sensing images, an improved YOLOv8 based remote sensing image target detection algorithm is proposed. First, based on the YOLOv8 network framework, the global attention mechanism is introduced in the backbone network to make the model pay more attention to important areas and reduce the interference of useless background on the model. Secondly, a wise crossover ratio is adopted as a new bounding box regression method to reduce the influence of low-quality examples on the model and improve the bounding box regression level of the model. Finally, an efficient multi-scale attention module and a dynamic detection head module are introduced into the feature fusion network and the head network respectively to improve the scale adaptability of the model. The experimental results show that the proposed algorithm achieves high detection accuracy on both DIOR and RSOD, and the average accuracy with a threshold value of 0.5 reaches 72.0% and 94.6% respectively. Moreover, the improved algorithm has strong robustness and can still maintain good detection performance when the input data set changes.

关键词

计算机视觉 / 目标检测 / 深度学习 / 遥感图像 / YOLOv8 / 动态检测头

Key words

computer vision / target detection / deep learning / remote sensing image / YOLOv8 / dynamic head

引用本文

导出引用
金梅, 王泓沣, 张立国, . 基于改进YOLOv8的遥感图像目标检测算法[J]. 计量学报. 2025, 46(11): 1622-1630 https://doi.org/10.3969/j.issn.1000-1158.2025.11.10
JIN Mei, WANG Hongfeng, ZHANG Liguo, et al. Remote Sensing Image Target Detection Algorithm Based on Improved YOLOv8[J]. Acta Metrologica Sinica. 2025, 46(11): 1622-1630 https://doi.org/10.3969/j.issn.1000-1158.2025.11.10
中图分类号: TB96    TB973   

参考文献

[1]
金梅,任婷婷,张立国,等. 改进的轻量级行人目标检测算法[J]. 计量学报202445(2): 186-193.
JIN M REN T T ZHANG L G, et al. Improved lightweight pedestrian target detection algorithm[J]. Acta Metrologica Sinica202445(2): 186-193.
[2]
张立国,赵嘉士,金梅,等. 改进YOLOX在近岸船舶检测中的应用[J]. 计量学报202445(1): 30-37.
ZHANG L G ZHAO J S JIN M, et al. Application of improved YOLOX in inshore ship inspection[J]. Acta Metrologica Sinica202445(1): 30-37.
[3]
陈卫东,谢晓东,岑强,等. 基于改进FCOS的水下目标检测算法[J]. 计量学报202344(11): 1659-1666.
CHEN W D XIE X D CEN Q, et al. Underwater target detection algorithm based on improved FCOS[J]. Acta Metrologica Sinica202344(11): 1659-1666.
[4]
张立国,蒋轶轩,田广军. 基于多尺度融合方法的无人机对地车辆目标检测算法研究[J]. 计量学报202142(11): 1436-1442.
ZHANG L G JIANG Y X TIAN G J. Research on unmanned aerial vehicle to ground vehicle target detection algorithm based on multiscale fusion method[J]. Acta Metrologica Sinica202142(11): 1436-1442.
[5]
QI Q Q ZHAO J H LIN L, et al. Combined multi‑level context aggregation and attention mechanism method for photovoltaic panel extraction from high resolution remote sensing images[J]. International Journal of Remote Sensing202445(11): 3560-3576.
[6]
LI H B TIAN E ZHANG W M, et al. Improving remote sensing object detection by using feature extraction and rotational equivariant attention[J]. International Journal of Remote Sensing202445(11): 3789-3806.
[7]
SHEN H L ZHONG T JIA Y F, et al. Remote sensing image dehazing using generative adversarial network with texture and color space enhancement[J]. Scientific reports202414(1): 12382-12382.
[8]
WANG J J MA A L CHEN Z H, et al. EarthVQANet: multi‑task visual question answering for remote sensing image understanding[J]. ISPRS Journal of Photogrammetry and Remote Sensing2024212:422-439.
[9]
SUN Z D ZHONG Y F WANG X Y, et al. Identifying cropland non‑agriculturalization with high representational consistency from bi‑temporal high‑resolution remote sensing images: From benchmark datasets to real‑world application[J]. ISPRS Journal of Photogrammetry and Remote Sensing2024212: 454-474.
[10]
CHEN H H WANG K M. Fusing DCN and BBAV for remote sensing image object detection[J]. International Journal of Cognitive Informatics and Natural Intelligence202417(1): 1-16.
[11]
GONG X L LIU D Q. SGMFNet: a remote sensing image object detection network based on spatial global attention and multi‑scale feature fusion[J]. Remote Sensing Letters202415(5): 466-477.
[12]
DONG C JIANG X K. An intelligent detection method for optical remote sensing images based on improved YOLOv7[J]. Computers, Materials and Continua202377(3): 3015-3036.
[13]
YANG H B QIU S. A Novel Dynamic Contextual Feature Fusion Model for Small Object Detection in Satellite Remote‑Sensing Images[J]. Information202415(4): 230.
[14]
LIU P Y XING W W MA Y X. MCR‑YOLO model for underwater target detection based on multi‑color spatial features[J]. Optoelectronics Letters202420(5): 313-320.
[15]
MUKESH C SONU R VIMAL K. Deep learning‑based framework for the observation of real‑time melt pool and detection of anomaly in wire‑arc additive manufacturing[J]. Materials and Manufacturing Processes202439(6): 761-777.
[16]
YAN P LIU Y, LYU L, et al. AIOD‑YOLO: an algorithm for object detection in low‑altitude aerial images[J]. Journal of Electronic Imaging202433(1): 3023-3023.
[17]
GHIASI G LIN T Y PANG R, et al. Le, NAS‑FPN: learning scalable feature pyramid architecture for object detection[C]//2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Long Beach, CA, USA, 2019: 7029-7038.
[18]
YANG J F FU X Y HU Y W, et al. PanNet: a deep network architecture for pan‑sharpening[C]// 2017 IEEE International Conference on Computer Vision (ICCV). Venice, Italy, 2017:1753-1761.
[19]
LIU Y C SHAO Z R HOFFMANN N. Global attention mechanism: retain information to enhance channel‑spatial interactions[J/OL].
[20]
TONG Z J CHEN Y H XU Z W, et al. Wise‑IoU: bounding box regression loss with dynamic focusing mechanism[J/OL].
[21]
OUYANG D L HE S ZHANG G Z, et al. Efficient multi‑scale attention module with cross‑spatial learning[C]//2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Rhodes Island, Greece, 2023.
[22]
DAI X Y CHEN Y P XIAO B, et al. Dynamic Head: unifying object detection heads with attentions[C]// 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Nashville, TN, USA, 2021:7369-7378.
[23]
LI K WAN G CHENG G, et al. Object detection in optical remote sensing images: A survey and a new benchmark[J]. ISPRS Journal of Photogrammetry and Remote Sensing2020159:296-307.
[24]
XIAO Z F LIU Q TANG G F, et al. Elliptic fourier transformation‑based histograms of oriented gradients for rotationally invariant object detection in remote‑sensing images[J]. International Journal of Remote Sensing201536(2): 618-644.

基金

国家重点研发计划(2020YFB1711001)
河北省军民融合产业发展专项(2018B190)

PDF(6926 KB)

Accesses

Citation

Detail

段落导航
相关文章

/