Skip to main navigation Skip to search Skip to main content

Local-Global Cross-Fusion Transformer Network for Facial Expression Recognition

  • Yicheng Liu
  • , Zecheng Li
  • , Yanbo Zhang
  • , Jie Wen*
  • *Corresponding author for this work
  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Facial Expression Recognition (FER) has received increasing attention in the computer vision community. For FER, there are two challenging issues among the facial images: large inter-class similarity and small intra-class discrepancy. To address these challenges and obtain a better performance, we propose a Local-Global Cross-Fusion Transformer network in this paper. Specifically, the method seeks to obtain a more discriminative facial representation by sufficiently considering the local features of multiple local regions of the face and global face features. In order to extract the critical local area features of the face, a local feature decomposition module based on facial landmarks is designed. In addition, a local-global cross-fusion Transformer is designed to enhance the synergistic correlation between local features and global features using the cross-attention mechanism, which can maximize the focus on key regions while considering the connection information among local regions. Extensive experiments conducted on three mainstream expression recognition datasets, RAF-DB, FERPlus, and AffectNet, show that the method outperforms many existing expression recognition methods and can significantly improve the accuracy of expression recognition.

Original languageEnglish
Title of host publicationWeb and Big Data - 7th International Joint Conference, APWeb-WAIM 2023, Proceedings
EditorsXiangyu Song, Ruyi Feng, Yunliang Chen, Jianxin Li, Geyong Min
PublisherSpringer Science and Business Media Deutschland GmbH
Pages254-269
Number of pages16
ISBN (Print)9789819723898
DOIs
StatePublished - 2024
Externally publishedYes
Event7th Asia-Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint Conference on Web and Big Data, APWeb-WAIM 2023 - Wuhan, China
Duration: 6 Oct 20238 Oct 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14332 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference7th Asia-Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint Conference on Web and Big Data, APWeb-WAIM 2023
Country/TerritoryChina
CityWuhan
Period6/10/238/10/23

Keywords

  • cross-attention mechanism
  • facial expression recognition
  • facial landmark
  • local and global facial features

Fingerprint

Dive into the research topics of 'Local-Global Cross-Fusion Transformer Network for Facial Expression Recognition'. Together they form a unique fingerprint.

Cite this