Skip to main navigation Skip to search Skip to main content

Self-Supervised Visual Tracking via Image Synthesis and Domain Adversarial Learning

  • Gu Geng
  • , Sida Zhou*
  • , Jianing Tang
  • , Xinming Zhang
  • , Qiao Liu
  • , Di Yuan*
  • *Corresponding author for this work
  • Xidian University
  • Yunnan Minzu University
  • Harbin Institute of Technology
  • Chongqing Normal University

Research output: Contribution to journalArticlepeer-review

Abstract

With the widespread use of sensors in applications such as autonomous driving and intelligent security, stable and efficient target tracking from diverse sensor data has become increasingly important. Self-supervised visual tracking has attracted increasing attention due to its potential to eliminate reliance on costly manual annotations; however, existing methods often train on incomplete object representations, resulting in inaccurate localization during inference. In addition, current methods typically struggle when applied to deep networks. To address these limitations, we propose a novel self-supervised tracking framework based on image synthesis and domain adversarial learning. We first construct a large-scale database of real-world target objects, then synthesize training video pairs by randomly inserting these targets into background frames while applying geometric and appearance transformations to simulate realistic variations. To reduce domain shift introduced by synthetic content, we incorporate a domain classification branch after feature extraction and adopt domain adversarial training to encourage feature alignment between real and synthetic domains. Experimental results on five standard tracking benchmarks demonstrate that our method significantly enhances tracking accuracy compared to existing self-supervised approaches without introducing any additional labeling cost. The proposed framework not only ensures complete target coverage during training but also shows strong scalability to deeper network architectures, offering a practical and effective solution for real-world tracking applications.

Original languageEnglish
Article number4621
JournalSensors
Volume25
Issue number15
DOIs
StatePublished - Aug 2025
Externally publishedYes

Keywords

  • domain adversarial learning
  • image synthesis
  • object tracking
  • self-supervised

Fingerprint

Dive into the research topics of 'Self-Supervised Visual Tracking via Image Synthesis and Domain Adversarial Learning'. Together they form a unique fingerprint.

Cite this