Skip to main navigation Skip to search Skip to main content

A Two-Stage Framework for Real-Time Action Recognition in Human-Robot Confrontation

  • Zhihui Wang
  • , Yong Dai
  • , Yangkexin An
  • , Bingyin Ren*
  • *Corresponding author for this work
  • School of Mechatronics Engineering, Harbin Institute of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The balance between high accuracy and low latency in multi-target action recognition for Human-Robot confrontation has been a key research focus. To address the challenge, a novel two-stage Top-Down approach is proposed for perceptual feature extraction and behavioral understanding separately. At the target perception stage, YOLOv8 is utilized to extract multi-target bounding boxes and feature maps. Hue-Saturation-Value (HSV) based color tracking is employed to achieve real-time positioning and identity discrimination of antagonistic individuals. For behavior understanding, Real-Time Multi-person Coordinate Classification (RTMCC) is introduced as the detection head for multi-target pose estimation, enabling the extraction of high-quality topological information of key skeleton points. Hierarchical Multi-Scale Graph Convolutional Network (HMS-GCN) is designed to enhance action recognition. Inference accuracy of the proposed integrated algorithmic framework is enhanced while maintaining the speed of 30+ fps, significantly improving robustness in confrontation scenarios.

Original languageEnglish
Title of host publicationICCCR 2025 - 2025 5th International Conference on Computer, Control and Robotics
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages558-562
Number of pages5
ISBN (Electronic)9798331543549
DOIs
StatePublished - 2025
Externally publishedYes
Event5th International Conference on Computer, Control and Robotics, ICCCR 2025 - Hangzhou, China
Duration: 16 May 202518 May 2025

Publication series

NameICCCR 2025 - 2025 5th International Conference on Computer, Control and Robotics

Conference

Conference5th International Conference on Computer, Control and Robotics, ICCCR 2025
Country/TerritoryChina
CityHangzhou
Period16/05/2518/05/25

Keywords

  • GCN
  • YOLOv8
  • action recognition
  • pose estimation

Fingerprint

Dive into the research topics of 'A Two-Stage Framework for Real-Time Action Recognition in Human-Robot Confrontation'. Together they form a unique fingerprint.

Cite this