基于空域和频域的动态不可见后门样本生成

刘继顺; 耿沛霖; 洪维; 张雪芹

doi:10.14135/j.cnki.1006-3080.20260212002

基于空域和频域的动态不可见后门样本生成

Dynamic Invisible Backdoor Sample Generation based on Spatial-Spectral Domain

摘要

摘要: 深度学习技术的快速发展使其在多领域成效显著，但后门攻击的频发暴露了深度神经网络的脆弱性。针对后门样本触发器易暴露的问题，为了实现高质量的模拟攻击测试以提升模型安全性，本文提出动态不可见后门样本生成框架S²D-DIBA（Spatial-Spectral Domain Dynamic Invisible Backdoor Attack），从空域与频域协同提升样本隐蔽性。空域设计基于Attention U-Net的生成器，借注意力机制聚焦图像关键区域，结合SampleNet实现可微分采样，完成像素级优化以生成样本专属隐蔽触发器。频域通过离散余弦变换转换中毒图像，设计频域相似性损失缩小与干净样本高频分布差异。两组公共数据集实验表明，该算法性能优于现有方法，较次优方案L1范数降低50倍以上，攻击成功率维持99.9%以上，在有效性与隐蔽性上均表现优异。

Abstract: The rapid development of deep learning technologies has enabled deep neural networks to achieve remarkable success in various fields. However, in light of the increasing prevalence of backdoor attacks, deep neural networks have shown significant vulnerability in such novel scenarios. To address the issue that triggers in current backdoor samples are easily exposed during testing, this study enhances the concealment of samples from both the spatial and frequency domains, proposing a dynamic invisible backdoor sample generation framework called S²D-DIBA (Spatial-Spectral Domain Dynamic Invisible Backdoor Attack). In the spatial domain, a generator network based on Attention U-Net is designed, which uses an attention mechanism to focus on key regions of the image to generate a probabilistic modification matrix. A multilayer perceptron network, SampleNet, is employed to simulate a differentiable sampling process, thereby performing pixel-level optimization of key regions to generate specific and concealed spatial triggers for each clean image. In the frequency domain, both clean images and those poisoned with spatial triggers are transformed into the frequency space via discrete cosine transform. By designing a frequency domain similarity loss, the distribution difference between poisoned and clean samples in the high-frequency components is minimized, further enhancing the sample stealthiness. Experiments on two public datasets demonstrate that the proposed algorithm outperforms existing state-of-the-art methods, reducing the L1 norm by more than 50 times compared to the second-best approach while maintaining an attack success rate above 99.9%, exhibiting excellent performance in both attack effectiveness and concealment.

HTML全文

参考文献(30)

施引文献

资源附件(0)