Skip to main navigation Skip to search Skip to main content

Beyond the Flat Sequence: Hierarchical and Preference-Aware Generative Recommendations

  • Zerui Chen
  • , Heng Chang
  • , Tianying Liu
  • , Chuantian Zhou
  • , Yi Cao
  • , Jiandong Ding
  • , Ming Liu*
  • , Bing Qin
  • *Corresponding author for this work
  • Harbin Institute of Technology
  • Huawei Technologies Co., Ltd.
  • Beijing University of Posts and Telecommunications

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Generative Recommenders (GRs), exemplified by the Hierarchical Sequential Transduction Unit (HSTU), have emerged as a powerful paradigm for modeling long user interaction sequences. However, we observe that their "flat-sequence"assumption overlooks the rich, intrinsic structure of user behavior. This leads to two key limitations: a failure to capture the temporal hierarchy of session-based engagement, and computational inefficiency, as dense attention introduces significant noise that obscures true preference signals within semantically sparse histories, which deteriorates the quality of the learned representations. To this end, we propose a novel framework named HPGR (Hierarchical and Preference-aware Generative Recommender), built upon a two-stage paradigm that injects these crucial structural priors into the model to handle the drawback. Specifically, HPGR comprises two synergistic stages. First, a structure-aware pre-training stage employs a session-based Masked Item Modeling (MIM) objective to learn a hierarchically-informed and semantically rich item representation space. Second, a preference-aware fine-tuning stage leverages these powerful representations to implement a Preference-Guided Sparse Attention mechanism, which dynamically constrains computation to only the most relevant historical items, enhancing both efficiency and signal-to-noise ratio. Empirical experiments on a large-scale proprietary industrial dataset from APPGallery and an online A/B test verify that HPGR achieves state-of-the-art performance over multiple strong baselines, including HSTU and MTGR.

Original languageEnglish
Title of host publicationWWW 2026 - Proceedings of the ACM Web Conference 2026
PublisherAssociation for Computing Machinery, Inc
Pages7999-8007
Number of pages9
ISBN (Electronic)9798400723070
DOIs
StatePublished - 12 Apr 2026
Event35th ACM Web Conference, WWW 2026 - Dubai, United Arab Emirates
Duration: 29 Jun 20263 Jul 2026

Publication series

NameWWW 2026 - Proceedings of the ACM Web Conference 2026

Conference

Conference35th ACM Web Conference, WWW 2026
Country/TerritoryUnited Arab Emirates
CityDubai
Period29/06/263/07/26

Keywords

  • attention mechanism
  • generative recommendation

Fingerprint

Dive into the research topics of 'Beyond the Flat Sequence: Hierarchical and Preference-Aware Generative Recommendations'. Together they form a unique fingerprint.

Cite this