高级检索

    MBM-Mamba:基于Mamba的多分支结构尘肺病筛查及分期模型

    MBM-Mamba: Multi-Branch Model Based on Mamba for Pneumoconiosis Screening and Staging

    • 摘要: 现有的基于卷积神经网络(Convolutional Neural Network,CNN)的辅助诊断方法在尘肺病筛查与分期任务中难以达到理想精度。本文提出了一种面向胸部X光图像尘肺病筛查与分期的多分支结构模型,即MBM-Mamba,并在二维选择性扫描模块(2D-Selective-Scan,SS2D)框架下提出了新的六向扫描策略,借助对角扫描以线性时间复杂度显式捕获了二维局部依赖关系,然后通过在CNN中整合先验信息构建了细节增强模块(Detail Enhancement Module,Dem),从而形成了局部特征提取模块(CNN-Mamba),以显著提升细微病灶信息的表达能力。另外,MBM-Mamba模型在多头自注意力机制基础上设计了全局特征提取模块,有效增强了全局上下文捕捉能力。MBM-Mamba模型利用多路残差整合了上述局部和全局两个特征提取模块,实现了跨结构特征的同步分层融合,从而能更好地理解肺部病灶的整体分布与纤维化程度。在1760张真实匿名患者的胸部X光片上进行验证,MBM-Mamba模型准确率达到0.786,F1分数为0.790,这两项指标均优于现有模型。

       

      Abstract: Existing auxiliary diagnosis methods based on Convolutional Neural Network (CNN) are difficult to achieve ideal accuracy in pneumoconiosis screening and staging tasks. This paper proposes a multi-branch structure model for pneumoconiosis screening and staging based on chest X-ray images, namely MBM-Mamba, and proposes a new six-direction scanning strategy under the framework of 2D-Selective-Scan (SS2D). The diagonal scanning is used to explicitly capture the two-dimensional local dependency with linear time complexity. Then, a Detail Enhancement Module (Dem) is constructed by integrating prior information into CNN, thus forming a local feature extraction module (CNN-Mamba), which significantly improves the expression ability of subtle lesion information.In addition, the MBM-Mamba model designs a global feature extraction module based on the multi-head self-attention mechanism, which effectively enhances the ability to capture global context. The MBM-Mamba model integrates the local and global feature extraction modules by using multi-path residuals, and realizes the synchronous hierarchical fusion of cross-structural features, so that the model can better understand the overall distribution of lung lesions and the degree of fibrosis. Verified on 1760 chest X-ray images of real anonymous patients, the accuracy of the MBM-Mamba model reaches 0.786, and the F1 score is 0.790. Both indicators are better than the existing models.

       

    /

    返回文章
    返回