高级检索

    水下目标的多尺度上下文感知检测模型

    Multi-scale Context-Aware Detection model for Underwater Target

    • 摘要: 针对传统模型无法有效处理水下复杂环境噪声、目标尺度变化大、且无法平衡模型大小和精度的问题,提出了MSCA-UODA(Multi-scale Context-Aware Underwater Object Detection Algorithm)算法模型。本文设计了一种上下文增强下采样模块CEADown(Context Enhanced ADown),能够在降低模型参数量的同时有效捕获上下文信息,减少下采样过程中水下环境噪声的影响。本模型还提出一种基于双路径部分连接的多尺度特征提取模块CSP-MSPF(Cross Stage Partial - Multi-Scale Partial Feature),并使用单头注意力机制(Single-Head Self-Attention,SHSA)来改进C2PSA,提高了模型的多尺度特征提取能力。通过实验,MSCA-UODA在数据集URPC2020和DUO上相较于基准模型在mAP50提升了2.0%和1.1%,参数量下降了12.01%,且综合性能优于目前主流的目标检测模型。

       

      Abstract: To address the issues that traditional models cannot effectively handle the complex underwater environmental noise, the target scale varies greatly, and the inability to balance model size and accuracy, MSCA-UODA(Multi-scale Context-Aware Underwater Object Detection Algorithm) model was proposed. The model designs a context-enhanced downsampling module, CEADown (Context Enhanced ADown), which can effectively reduce the model parameters, capture context information efficiently and reduce underwater environmental noise. This model also proposes a multi-scale feature extraction module based on dual-path partial connection, named CSP-MSPF(Cross Stage Partial-multi-scale Partial Feature), and uses SHSA (Single-Head Self-Attention) mechanism to improve C2PSA, enhancing the multi-scale feature extraction capability of the model. Through experiments, MSCA-UODA improved by 2.0% and 1.1% respectively on mAP50 compared with baseline model on the datasets URPC2020 and DUO. The number of parameters decreased by 12.01%, and its comprehensive performance was superior to the current mainstream object detection models.

       

    /

    返回文章
    返回