Skip to main navigation Skip to search Skip to main content

Open-vocabulary object detection via debiased curriculum self-training

  • Hanlue Zhang
  • , Dayan Guan
  • , Xiangrui Ke
  • , Abdulmotaleb El Saddik
  • , Shijian Lu*
  • *Corresponding author for this work
  • Mohamed Bin Zayed University of Artificial Intelligence
  • University of Ottawa
  • Nanyang Technological University

Research output: Contribution to journalArticlepeer-review

Abstract

Open-vocabulary object detection aims to train a detector capable of recognizing various novel classes. Most existing studies exploit image-level weak supervision to generate pseudo object boxes for novel class training. However, the generated pseudo boxes are often noisy and biased towards base classes, leading to sub-optimal open-vocabulary detectors. We propose DCS, a novel Debiased Curriculum Self-Training technique that generates pseudo object boxes progressively and adaptively for training accurate open-vocabulary detectors. DCS consists of two complementary designs, namely, progressive pseudo-label filtering (PPF) and adaptive pseudo-label selection (APS). Specifically, PPF discards confident but mismatched detection progressively at the early training stage when the trained detector is biased towards the base classes, APS instead fuses class-aware and class-agnostic pseudo labels by prioritizing class-aware pseudo labels at the late training stage when the detector can better recognize novel classes. Without bells and whistles, DCS achieves superior detection performance over two open-vocabulary detection benchmarks.

Original languageEnglish
Article number124762
JournalExpert Systems with Applications
Volume255
DOIs
StatePublished - 1 Dec 2024

Keywords

  • Curriculum learning
  • Object detection
  • Open-vocabulary recognition
  • Pseudo-label filtering
  • Self-training

Fingerprint

Dive into the research topics of 'Open-vocabulary object detection via debiased curriculum self-training'. Together they form a unique fingerprint.

Cite this