Skip to main navigation Skip to search Skip to main content

Pixel-REfocused Navigated Tri-margin for Semi-Supervised Action Detection

  • Wenxuan Liu
  • , Shilei Zhao
  • , Xiyu Han
  • , Aoyu Yi
  • , Kui Jiang
  • , Zheng Wang
  • , Xian Zhong*
  • *Corresponding author for this work
  • Wuhan University of Technology
  • Wuhan University
  • School of Computer Science and Technology, Harbin Institute of Technology
  • Nanyang Technological University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper identifies a novel issue, termed pixel activation-uncertainty, in semi-supervised action detection, which highlights the difficulty in distinguishing between action and background boundaries due to active motion. To address this, we propose an effective pipeline called Pixel-Refocused Navigated Tri-margin (PRENT), which adaptively leverages class-explicit knowledge. PRENT emphasizes maintaining region consistency by updating pseudo-label selection with each training epoch, ensuring continuous improvement. We introduce a class-explicit tri-margin as a soft solution to manage uncertain boundaries within latent buffer regions. This technique refines the buffer zone based on the unique characteristics of each category, thereby addressing the varying challenges in localizing actions and backgrounds. Experimental results on various benchmarks and training settings demonstrate the superiority of our method compared to state-of-the-art methods.

Original languageEnglish
Title of host publicationHCMA 2024 - Proceedings of the 5th International Workshop on Human-centric Multimedia Analysis, Co-Located with
Subtitle of host publicationMM 2024
PublisherAssociation for Computing Machinery, Inc
Pages23-31
Number of pages9
ISBN (Electronic)9798400711923
DOIs
StatePublished - 28 Oct 2024
Externally publishedYes
Event5th International Workshop on Human-centric Multimedia Analysis, HCMA 2024 - Melbourne, Australia
Duration: 28 Oct 20241 Nov 2024

Publication series

NameHCMA 2024 - Proceedings of the 5th International Workshop on Human-centric Multimedia Analysis, Co-Located with: MM 2024

Conference

Conference5th International Workshop on Human-centric Multimedia Analysis, HCMA 2024
Country/TerritoryAustralia
CityMelbourne
Period28/10/241/11/24

Keywords

  • class-explicit tri-margin
  • pixel activation-uncertainty
  • region consistency
  • semi-supervised action detection

Fingerprint

Dive into the research topics of 'Pixel-REfocused Navigated Tri-margin for Semi-Supervised Action Detection'. Together they form a unique fingerprint.

Cite this