高级检索

    水下目标的多尺度上下文感知检测模型

    Multi-scale Context-Aware Detection model for Underwater Target

    • 摘要: 针对传统模型无法有效处理水下复杂环境噪声、目标尺度变化大、且无法平衡模型大小和精度的问题,本文提出了MSCA-UODA(Multi-scale Context-Aware Underwater Object Detection Algorithm)模型,其设计了一种上下文增强下采样模块CEADown(Context Enhanced ADown),能够在降低模型参数量的同时有效捕获上下文信息,减少下采样过程中水下环境噪声的影响;同时提出一种基于双路径部分连接的多尺度特征提取模块CSP-MSPF(Cross Stage Partial-Multi-Scale Partial Feature),并使用单头注意力机制(Single-Head Self-Attention, SHSA)来改进C2PSA,提高了模型的多尺度特征提取能力。实验表明,相较于基准模型MSCA-UODA在数据集URPC2020和DUO上的mAP50提升了2.0%和1.1%,参数量下降了12.01%,且综合性能优于目前主流的目标检测模型。

       

      Abstract: To address the limitations of traditional models in handling complex underwater environmental noise, large variations in target scale, and the trade-off between model size and accuracy, the MSCA-UODA (Multi-scale Context-Aware Underwater Object Detection Algorithm) was proposed. The model includes a context-enhanced downsampling module, CEADown (Context-Enhanced ADown), which effectively reduces model parameters, captures contextual information efficiently, and mitigates underwater environmental noise. Additionally, it introduces a multi-scale feature extraction module based on dual-path partial connection, named CSP-MSPF (Cross Stage Partial-Multi-scale Partial Feature), and incorporates the SHSA (Single-Head Self-Attention) mechanism to enhance the C2PSA module, thereby improving the model's multi-scale feature extraction capability. Experimental results show that on the URPC2020 and DUO datasets, MSCA-UODA improved mAP50 by 2.0% and 1.1%, respectively, compared to the baseline model, while reducing the number of parameters by 12.01%. Its overall performance surpassed that of current mainstream object detection models.

       

    /

    返回文章
    返回