Skip to main navigation Skip to search Skip to main content

SFBDA: A Semantic-Decoupled Data Augmentation Framework for Infrared Few-Shot Object Detection on UAVs

  • Zhenhai Weng
  • , Weijie He
  • , Jianfeng Lv
  • , Dong Zhou*
  • , Zhongliang Yu*
  • *Corresponding author for this work
  • Chongqing University
  • Lanzhou University
  • Chinese University of Hong Kong

Research output: Contribution to journalArticlepeer-review

Abstract

Few-shot object detection (FSOD) is a critical frontier in computer vision research. However, the task of an infrared (IR) FSOD presents significant technical challenges, primarily due to the following: 1) few annotated training samples and 2) low-texture nature of thermal imaging. To address these issues, we propose a semantic-guided foreground-background decoupling augmentation (SFBDA) framework. This method includes an instance-level foreground separation (ILFS) module that utilizes the segment anything model (SAM) to separate the objects, as well as a semantic-constrained background generation network that employs adversarial learning to synthesize contextually compatible backgrounds. To address the insufficiency of scenario diversity in existing uncrewed aerial vehicle (UAV)-based IR object detection datasets, we introduce multiscene IR UAV object detection (MSIR-UAVDET), a novel multiscene IR UAV detection benchmark. This dataset encompasses 16 object categories across diverse environments (terrestrial, maritime, and aerial). To validate the efficacy of the proposed data augmentation methodology, we integrated our approach with existing FSOD frameworks, and comparative experiments were conducted to benchmark our method with existing data augmentation methods. The code and dataset can be publicly available at: https://github.com/Sea814/SFBDA.git.

Original languageEnglish
Article number7002205
JournalIEEE Geoscience and Remote Sensing Letters
Volume22
DOIs
StatePublished - 2025

Keywords

  • Data augmentation
  • few-shot object detection (FSOD)
  • infrared (IR)

Fingerprint

Dive into the research topics of 'SFBDA: A Semantic-Decoupled Data Augmentation Framework for Infrared Few-Shot Object Detection on UAVs'. Together they form a unique fingerprint.

Cite this