Skip to main navigation Skip to search Skip to main content

Realtime Interpersonal Human Synchrony Detection Based on Action Segmentation

  • Bowen Chen
  • , Jiamin Zhang
  • , Zuode Liu
  • , Ruihan Lin
  • , Weihong Ren
  • , Luodi Yu
  • , Honghai Liu*
  • *Corresponding author for this work
  • Harbin Institute of Technology Shenzhen
  • South China Normal University
  • Guangzhou University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

IS (Interpersonal Synchrony), where the follower (participant) tries to behave the same action along with the raiser (human or metronome), is an essential social interaction skill. The evaluation of interpersonal synchronization is valuable for early autism screening. However, the research on IS evaluation is limited, and the current approaches usually evaluate the IS task with “motion energy” that is calculated by imprecise corner detection of the participant, which is not robust in an uncontrollable clinical environment. Moreover, these approaches need to manually mark the start and the end anchor of the specified action segment, which is labor-intensive. In this paper, we construct a realtime action segmentation model to automatically recognize the human-wise action class frame by frame. A simple yet efficient backbone is utilized to classify action class straightly instead of extracting the motion features (e.g. optical flow) with high computational complexity. Specifically, given an action video, a sliding window stacks frames in a fixed window size to feed a Resnet-like action classification branch (ACB) to classify the current action label. To further improve the accuracy of action boundary and eliminate the over-segmentation noises, we incorporate a boundary prediction branch (BPB), cooperating with majority-voting strategy, to refine the action classification generated by ACB. Then we can calculate the IS overlap easily by comparing two action timelines belonging to raiser and follower. To evaluate the proposed model, we collect 200K annotated images belonging to 40 subjects who perform 2 tasks (nod and clap) in 2 conditions (interpersonal and human-metronome). The experiment results demonstrate that our model achieves 87.1% accuracy at 200 FPS and can locate the start and end of action precisely in realtime.

Original languageEnglish
Title of host publicationIntelligent Robotics and Applications - 15th International Conference, ICIRA 2022, Proceedings
EditorsHonghai Liu, Weihong Ren, Zhouping Yin, Lianqing Liu, Li Jiang, Guoying Gu, Xinyu Wu
PublisherSpringer Science and Business Media Deutschland GmbH
Pages331-340
Number of pages10
ISBN (Print)9783031138430
DOIs
StatePublished - 2022
Externally publishedYes
Event15th International Conference on Intelligent Robotics and Applications, ICIRA 2022 - Harbin, China
Duration: 1 Aug 20223 Aug 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13455 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference15th International Conference on Intelligent Robotics and Applications, ICIRA 2022
Country/TerritoryChina
CityHarbin
Period1/08/223/08/22

Keywords

  • Action segmentation
  • Autism disorders
  • Interpersonal synchrony

Fingerprint

Dive into the research topics of 'Realtime Interpersonal Human Synchrony Detection Based on Action Segmentation'. Together they form a unique fingerprint.

Cite this