Skip to main navigation Skip to search Skip to main content

Adversarial Transferability in Deep Denoising Models: Theoretical Insights and Robustness Enhancement via Out-of-Distribution Typical Set Sampling

  • School of Mathematics, Harbin Institute of Technology
  • Capital Normal University

Research output: Contribution to journalArticlepeer-review

Abstract

Deep learning-based image denoising models demonstrate remarkable performance, but their lack of robustness analysis remains a significant concern. A major issue is that these models are sus-ceptible to adversarial attacks, where small, carefully crafted perturbations to input data can cause them to fail. Surprisingly, perturbations specifically crafted for one model can easily transfer across various models, including convolutional neural networks, transformers, unfolding models, and plug-and-play models, leading to failures in those models as well. Such high adversarial transferability is not observed in classification models. We analyze the possible underlying reasons behind the high adversarial transferability through a series of hypotheses and validation experiments. By character-izing the manifolds of Gaussian noise and adversarial perturbations using the concept of a typical set and the asymptotic equipartition property, we prove that adversarial samples deviate slightly from the typical set of the original input distribution, causing the models to fail. Based on these insights, we propose a novel adversarial defense method: the out-of-distribution typical set sampling (TSS) training strategy. TSS training strategy not only significantly enhances the model's robustness but also marginally improves denoising performance compared to the original model.

Original languageEnglish
Pages (from-to)1788-1827
Number of pages40
JournalSIAM Journal on Imaging Sciences
Volume18
Issue number3
DOIs
StatePublished - 2025
Externally publishedYes

Keywords

  • adversarial attack
  • image denoising
  • robustness
  • transferability
  • typical set

Fingerprint

Dive into the research topics of 'Adversarial Transferability in Deep Denoising Models: Theoretical Insights and Robustness Enhancement via Out-of-Distribution Typical Set Sampling'. Together they form a unique fingerprint.

Cite this