高级检索

    基于区域上下文感知增强网络的图像情感迁移

    Regional Context Perception enhanced Network for Image Emotion Transfer

    • 摘要: 为了实现准确有效的图像情感迁移,提出情感建模关系引导的区域上下文感知增强网络,在外部情感知识的指导下联合多种损失,实现准确的图像诱发情感迁移。在该网络中,提出一个新颖的区域上下文感知块。其通过多种自注意提取图像中不同感受野的上下文特征,并将其通过交叉注意自适应地融合,更全面地整合图像信息。在此基础上通过残差连接恢复深度特征融合中损失的信息,更准确地保留图像的内容。同时,提出一种新颖的情感轮引导模块,该模块基于情感轮中的情感分布,使用联合损失引导模型准确地迁移图像情感。为了准确有效地评估模型迁移图像情感的能力,提出情感迁移综合度量,综合情感类别、情感极性以及情感在情感轮上的位置多角度地评估图像情感迁移的效果,并基于4个风格不同且广泛使用的情感数据集构建一个新的数据集FATE。在FATE上进行的大量实验充分验证了提出方法的有效性,并优于其他对比的方法。

       

      Abstract: To achieve accurate and effective image emotion transfer, an Emotional Modeling Relationship guided Regional Context Perception enhanced Network is proposed (EMR-RCPN). It is guided by external emotion knowledge and jointly optimized through multiple losses to accurately transfer the emotion of image. In this network, we introduce a novel Regional Context Perception Block (RCPB) to enhance the Twins-SVT encoder. It extracts context features of different receptive fields in the image through Locally-grouped Self Attention, Axis-wise Self Attention, and Neighbor Self Attention. Then it fuses them adaptively through cross attention to comprehensively integrate image information. Furthermore, details lost in the fusion of deep features are restored through residual connections to more accurately preserve the image content. Additionally, we propose a novel Emotion-wheel Guided Module (EGM). It uses the emotion distribution in the emotion wheel to guide the model in accurately transferring image emotions. To accurately and effectively evaluate the model ability to transfer image emotion, we innovatively propose the Emotion Transfer Comprehensive Metric (ETCM). It evaluates the effect of image emotion transfer from multiple perspectives, including emotion categories, emotion polarities, and positions of emotions in the emotion wheel. To evaluate the model effectiveness more effectively, we construct a new emotion transfer dataset called FATE. It is based on four widely used and stylistically different emotion datasets: FI, Twitter-LDL, Emotion6, and Artphoto. Extensive experiments on FATE demonstrate the effectiveness of the proposed method, which outperforms other comparative methods.

       

    /

    返回文章
    返回