Skip to main navigation Skip to search Skip to main content

Multimodal Learning toward Recommendation

  • National University of Singapore
  • Hong Kong Generative AI Research and Development Center Limited
  • Harbin Institute of Technology Shenzhen

Research output: Book/ReportBookpeer-review

Abstract

This book presents an in-depth exploration of multimodal learning toward recommendation, along with a comprehensive survey of the most important research topics and state-of-the-art methods in this area. First, it presents a semantic-guided feature distillation method which employs a teacher-student framework to robustly extract effective recommendation-oriented features from generic multimodal features. Next, it introduces a novel multimodal attentive metric learning method to model user diverse preferences for various items. Then it proposes a disentangled multimodal representation learning recommendation model, which can capture users’ fine-grained attention to different modalities on each factor in user preference modeling. Furthermore, a meta-learning-based multimodal fusion framework is developed to model the various relationships among multimodal information. Building on the success of disentangled representation learning, it further proposes an attribute-driven disentangled representation learning method, which uses attributes to guide the disentanglement process in order to improve the interpretability and controllability of conventional recommendation methods. Finally, the book concludes with future research directions in multimodal learning toward recommendation. The book is suitable for graduate students and researchers who are interested in multimodal learning and recommender systems. The multimodal learning methods presented are also applicable to other retrieval or sorting related research areas, like image retrieval, moment localization, and visual question answering.

Original languageEnglish
PublisherSpringer Nature
Number of pages152
ISBN (Electronic)9783031831881
ISBN (Print)9783031831874
DOIs
StatePublished - 1 Jan 2025
Externally publishedYes

Fingerprint

Dive into the research topics of 'Multimodal Learning toward Recommendation'. Together they form a unique fingerprint.

Cite this