Skip to main navigation Skip to search Skip to main content

Detecting and Mitigating Backdoor Attacks with Dynamic and Invisible Triggers

  • Zhibin Zheng
  • , Zhongyun Hua*
  • , Leo Yu Zhang
  • *Corresponding author for this work
  • School of Computer Science and Technology, Harbin Institute of Technology
  • Harbin Institute of Technology Shenzhen
  • Deakin University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

When a deep learning-based model is attacked by backdoor attacks, it behaves normally for clean inputs, whereas outputs unexpected results for inputs with specific triggers. This causes serious threats to deep learning-based applications. Many backdoor detection methods have been proposed to address these threats. However, these defenses can only work on the backdoored models attacked by static trigger(s). Recently, some backdoor attacks with dynamic and invisible triggers have been developed, and existing detection methods cannot defend against these attacks. To address this new threat, in this paper, we propose a new defense mechanism that can detect and mitigate backdoor attacks with dynamic and invisible triggers. We reverse engineer generators that transform clean images into backdoor images for each label. The generated images by the generator can help to detect the existence of a backdoor and further remove it. To the best of our knowledge, our work is the first work to defend against backdoor attacks with dynamic and invisible triggers. Experiments on multiple datasets show that the proposed method can effectively detect and mitigate the backdoor with dynamic and invisible triggers in deep learning-based models.

Original languageEnglish
Title of host publicationNeural Information Processing - 29th International Conference, ICONIP 2022, Proceedings
EditorsMohammad Tanveer, Sonali Agarwal, Seiichi Ozawa, Asif Ekbal, Adam Jatowt
PublisherSpringer Science and Business Media Deutschland GmbH
Pages216-227
Number of pages12
ISBN (Print)9783031301100
DOIs
StatePublished - 2023
Externally publishedYes
Event29th International Conference on Neural Information Processing, ICONIP 2022 - Virtual, Online
Duration: 22 Nov 202226 Nov 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13625 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference29th International Conference on Neural Information Processing, ICONIP 2022
CityVirtual, Online
Period22/11/2226/11/22

Keywords

  • AI security
  • Backdoor attack
  • Backdoor detection

Fingerprint

Dive into the research topics of 'Detecting and Mitigating Backdoor Attacks with Dynamic and Invisible Triggers'. Together they form a unique fingerprint.

Cite this