Skip to main navigation Skip to search Skip to main content

Towards effective and efficient adversarial defense with diffusion models for robust visual tracking

  • Long Xu
  • , Peng Gao*
  • , Wen Jia Tang
  • , Fei Wang
  • , Ru Yue Yuan
  • *Corresponding author for this work
  • Qufu Normal University
  • School of Integrated Circuits, Harbin Institute of Technology Shenzhen

Research output: Contribution to journalArticlepeer-review

Abstract

Although deep learning-based visual tracking methods have made significant progress, they exhibit vulnerabilities when facing carefully designed adversarial attacks, which can lead to a sharp decline in tracking performance. To address this issue, this paper proposes for the first time a novel adversarial defense method based on denoise diffusion probabilistic models, termed DiffDf, aimed at effectively improving the robustness of existing visual tracking methods against adversarial attacks. DiffDf establishes a multi-scale defense mechanism by combining pixel-level reconstruction loss, semantic consistency loss, and structural similarity loss, effectively suppressing adversarial perturbations through a gradual denoising process. Extensive experimental results on several mainstream datasets show that the DiffDf method demonstrates excellent generalization performance for trackers with different architectures, significantly improving various evaluation metrics while achieving real-time inference speeds of over 30 FPS, showcasing outstanding defense performance and efficiency. Codes are available at https://github.com/pgao-lab/DiffDf.

Original languageEnglish
Article number103384
JournalInformation Fusion
Volume124
DOIs
StatePublished - Dec 2025
Externally publishedYes

Keywords

  • Adversarial attack
  • Adversarial defense
  • Diffusion model
  • Visual tracking

Fingerprint

Dive into the research topics of 'Towards effective and efficient adversarial defense with diffusion models for robust visual tracking'. Together they form a unique fingerprint.

Cite this