Skip to main navigation Skip to search Skip to main content

Learning priority-aware controllable poster layout generation

  • Fuxiang Yang
  • , Wendi Hou
  • , Lei Fan
  • , Tonghua Su*
  • , Lingxiao He
  • , Chengzhou Li
  • , Meng Wang
  • , Qianlong Xie
  • , Xingxing Wang
  • , Donglin Di
  • , Xun Yang
  • *Corresponding author for this work
  • Harbin Institute of Technology
  • Meituan
  • University of New South Wales
  • University of Science and Technology of China

Research output: Contribution to journalArticlepeer-review

Abstract

Automated graphic layout generation is vital for scalable and personalized multimedia design. Existing approaches often overlook intra-element relationships within layouts and inter-modal dependencies between layout components and visual content. To this end, we introduce a novel priority-aware coarse-to-fine framework that enables both automated and controllable layout synthesis. Our method utilizes an Optimal Transport matcher to align layout elements with corresponding image regions according to their inferred priorities–yielding a structurally coherent yet coarse initial arrangement. This preliminary layout then serves as a strong structural prior for a flow-based generator that refines the composition into an aesthetically pleasing final design. To enable this process automatically, we introduce a Dual-Path Ranker that leverages large language models to assess textual element importance while employing vision models to detect salient visual regions. Extensive experiments on the CGL and PKU poster datasets demonstrate that our approach not only produces high-quality layouts but also provides enhanced adaptability and personalization compared to previous methods.

Original languageEnglish
Article number113497
JournalPattern Recognition
Volume179
DOIs
StatePublished - Nov 2026

Keywords

  • Flow matching
  • Optimal transport
  • Poster layout generation

Fingerprint

Dive into the research topics of 'Learning priority-aware controllable poster layout generation'. Together they form a unique fingerprint.

Cite this