Skip to main navigation Skip to search Skip to main content

The Impact of Synchronized Visual and Auditory Attention on Human Perception

  • Lichuan Jiang
  • , Jiani Zhong
  • , Muqing Jian
  • , Xuanzhuo Liu
  • , Siqi Cai*
  • , Haizhou Li
  • *Corresponding author for this work
  • The Chinese University of Hong Kong, Shenzhen
  • Technical University of Munich
  • University of Bremen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The cocktail party problem shows the remarkable human ability to selectively attend to and recognize one source of auditory input in a noisy environment. However, individuals may struggle to identify a speaker’s voice when they are unfamiliar with the speakers and don’t have a clear visual focus, which results in less visual information. This raises the question: How can visual information aid in extracting information from a speaker’s voice? This study explores how synchronized visual and auditory attention impact human perception in scenarios involving two speakers. Using Tobii Glasses 3 to track participants’ eye movements and pupil diameters, combined with questionnaire responses, we explore how these factors influence speech comprehension. Our results demonstrate that participants achieve higher accuracy in speech comprehension when they focus their gaze on the speaker they are listening to, compared to scenarios where visual attention is divided between speakers or where they rely solely on auditory cues. These findings highlight the effectiveness of synchronizing visual and auditory attention in improving the acquisition and processing of information.

Original languageEnglish
Title of host publicationSocial Robotics - 16th International Conference, ICSR + InnoBiz 2024, Proceedings
EditorsHaizhou Li, Jian Zhu, Tanja Schultz, Yalei Bi, Hongsheng He, Jun Ma, Siqi Cai, Shuzhi Sam Ge, Wanyue Jiang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages41-50
Number of pages10
ISBN (Print)9789819611508
DOIs
StatePublished - 2025
Externally publishedYes
Event16th International Conference on Social Robotics, ICSR + InnoBiz 2024 - Shenzhen, China
Duration: 25 Sep 202428 Sep 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15170 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th International Conference on Social Robotics, ICSR + InnoBiz 2024
Country/TerritoryChina
CityShenzhen
Period25/09/2428/09/24

Keywords

  • Audiovisual Attention
  • Human Perception
  • Multi-modal
  • Pupil Diameter
  • Speech Comprehension

Fingerprint

Dive into the research topics of 'The Impact of Synchronized Visual and Auditory Attention on Human Perception'. Together they form a unique fingerprint.

Cite this