水下目标的多尺度上下文感知检测模型

王峻韬; 郑红; 陆元军; 徐贤; 吴丽娟

doi:10.14135/j.cnki.1006-3080.20250803001

水下目标的多尺度上下文感知检测模型

Multi-scale Context-Aware Detection model for Underwater Target

摘要

摘要: 针对传统模型无法有效处理水下复杂环境噪声、目标尺度变化大、且无法平衡模型大小和精度的问题，本文提出了MSCA-UODA(Multi-scale Context-Aware Underwater Object Detection Algorithm)模型，其设计了一种上下文增强下采样模块CEADown(Context Enhanced ADown)，能够在降低模型参数量的同时有效捕获上下文信息，减少下采样过程中水下环境噪声的影响；同时提出一种基于双路径部分连接的多尺度特征提取模块CSP-MSPF(Cross Stage Partial-Multi-Scale Partial Feature)，并使用单头注意力机制(Single-Head Self-Attention, SHSA)来改进C2PSA，提高了模型的多尺度特征提取能力。实验表明，相较于基准模型MSCA-UODA在数据集URPC2020和DUO上的mAP50提升了2.0%和1.1%，参数量下降了12.01%，且综合性能优于目前主流的目标检测模型。

Abstract: To address the limitations of traditional models in handling complex underwater environmental noise, large variations in target scale, and the trade-off between model size and accuracy, the MSCA-UODA (Multi-scale Context-Aware Underwater Object Detection Algorithm) was proposed. The model includes a context-enhanced downsampling module, CEADown (Context-Enhanced ADown), which effectively reduces model parameters, captures contextual information efficiently, and mitigates underwater environmental noise. Additionally, it introduces a multi-scale feature extraction module based on dual-path partial connection, named CSP-MSPF (Cross Stage Partial-Multi-scale Partial Feature), and incorporates the SHSA (Single-Head Self-Attention) mechanism to enhance the C2PSA module, thereby improving the model's multi-scale feature extraction capability. Experimental results show that on the URPC2020 and DUO datasets, MSCA-UODA improved mAP50 by 2.0% and 1.1%, respectively, compared to the baseline model, while reducing the number of parameters by 12.01%. Its overall performance surpassed that of current mainstream object detection models.

HTML全文

参考文献(25)

施引文献

资源附件(0)