Skip to main navigation Skip to search Skip to main content

A graph convolution-based heterogeneous fusion network for multimodal sentiment analysis

  • Tong Zhao
  • , Junjie Peng*
  • , Yansong Huang
  • , Lan Wang
  • , Huiran Zhang
  • , Zesu Cai
  • *Corresponding author for this work
  • Shanghai University
  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Multimodal sentiment analysis leverages various modalities, including text, audio, and video, to determine human sentiment tendencies, which holds significance in fields such as intention understanding and opinion analysis. However, there are two critical challenges in multimodal sentiment analysis: one is how to effectively extract and integrate information from various modalities, which is important for reducing the heterogeneity gap among modalities; the other is how to overcome the problem of information forgetting while modelling long sequences, which leads to significant information loss and adversely affect the fusion performance of modalities. Based on the above issues, this paper proposes a multimodal heterogeneity fusion network based on graph convolutional neural networks (HFNGC). A shared convolutional aggregation mechanism is used to overcome the semantic gap among modalities and reduce the noise effect caused by modality heterogeneity. In addition, the model applies Dynamic Routing to convert modality features into graph structures. By learning semantic information in the graph representation space, our model can improve the capability of remote-dependent learning. Furthermore, the model integrates complementary information among modalities and explores the intra- and inter-modal interactions during the modality fusion stage. To validate the effectiveness of our model, we conduct experiments on two benchmark datasets. The experimental results demonstrate that our method outperforms the existing methods, exhibiting strong generalisation capability and high competitiveness.

Original languageEnglish
Pages (from-to)30455-30468
Number of pages14
JournalApplied Intelligence
Volume53
Issue number24
DOIs
StatePublished - Dec 2023
Externally publishedYes

Keywords

  • Graph convolution
  • Heterogeneity
  • Information fusion
  • Sentiment analysis

Fingerprint

Dive into the research topics of 'A graph convolution-based heterogeneous fusion network for multimodal sentiment analysis'. Together they form a unique fingerprint.

Cite this