Skip to main navigation Skip to search Skip to main content

Towards Learning Multi-Domain Crowd Counting

  • Zhaoyi Yan
  • , Pengyu Li
  • , Biao Wang
  • , Dongwei Ren*
  • , Wangmeng Zuo
  • *Corresponding author for this work
  • School of Computer Science and Technology, Harbin Institute of Technology
  • Alibaba Group Holding Ltd.

Research output: Contribution to journalArticlepeer-review

Abstract

Recently, deep learning-based crowd counting methods have achieved promising performance on test data with the same distribution as training set, while performance degradation usually occurs when testing on other or unseen domains. Due to the variations in scene contexts, crowd densities and head scales, it is a very challenging issue to tackle multi-domain crowd counting using one deep model. In this work, we propose a domain-guided channel attention network (DCANet) towards learning multi-domain crowd counting. In particular, our DCANet consists of feature extraction module, channel attention-guided multi-dilation (CAMD) module and density map prediction module. Given a testing image from a certain domain, channel attention is adopted to guide the extraction of domain-specific feature representation, and thus our DCANet can adaptively handle images from multiple domains. We further propose two domain-guided learning strategies, i.e., dataset-level domain kernel (DDK) supervision and image-level domain kernel (IDK) supervision, by which channel attention in CAMD can be explicitly optimized to emphasize the channels corresponding to the domain of an input image. Furthermore, IDK can be adaptively updated when training DCANet, thereby improving the generalization ability to unseen scenes. Experimental results on benchmark datasets show that our DCANet performs favorably for handling multi-domain datasets using one single model. Moreover, our IDK training strategy can be applied to boost state-of-the-art methods on single domain dataset.

Original languageEnglish
Pages (from-to)6544-6557
Number of pages14
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume33
Issue number11
DOIs
StatePublished - 1 Nov 2023
Externally publishedYes

Keywords

  • Crowd counting
  • multi-domain learning

Fingerprint

Dive into the research topics of 'Towards Learning Multi-Domain Crowd Counting'. Together they form a unique fingerprint.

Cite this