Skip to main navigation Skip to search Skip to main content

Crowd Counting by Using Top-k Relations: A Mixed Ground-Truth CNN Framework

  • Harbin Institute of Technology Shenzhen

Research output: Contribution to journalArticlepeer-review

Abstract

Crowd counting has important applications in the environments of smart cities, such as intelligent surveillance. In this paper, we propose a novel convolutional neural network (CNN) framework for crowd counting with mixed ground-truth, called top- k relation-based network (TKRNet). Specifically, the estimated density maps generated in a coarse-to-fine manner are treated as coarse locations for crowds so as to assist our TKRNet to regress the scattered point-annotated ground truth. Moreover, an adaptive top- k relation module (ATRM) is proposed to enhance feature representations by leveraging the top- k dependencies between the pixels with an adaptive filtering mechanism. Specifically, we first compute the similarity between two pixels so as to select the top- k relations for each position. Then, a weight normalization operation with an adaptive filtering mechanism is proposed to make the ATRM adaptively eliminate the influence from the low correlation positions in the top- k relations. Finally, a weight attention mechanism is introduced to make the ATRM pay more attention to the positions with high weights in the top- k relations. Extensive experimental results demonstrate the effectiveness of our proposed TKRNet on several public datasets in comparison to state-of-the-art methods.

Original languageEnglish
Pages (from-to)307-316
Number of pages10
JournalIEEE Transactions on Consumer Electronics
Volume68
Issue number3
DOIs
StatePublished - 1 Aug 2022
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 11 - Sustainable Cities and Communities
    SDG 11 Sustainable Cities and Communities

Keywords

  • Crowd counting
  • mixed ground truth
  • self-attention
  • top-k relations

Fingerprint

Dive into the research topics of 'Crowd Counting by Using Top-k Relations: A Mixed Ground-Truth CNN Framework'. Together they form a unique fingerprint.

Cite this