Skip to main navigation Skip to search Skip to main content

Boundary-Aware Semantic Bird-Eye-View Map Generation Based on Conditional Diffusion Models

  • Shuang Gao
  • , Qiang Wang
  • , Yuxiang Sun*
  • *Corresponding author for this work
  • Hong Kong Polytechnic University
  • Harbin Institute of Technology
  • City University of Hong Kong

Research output: Contribution to journalArticlepeer-review

Abstract

Semantic bird-eye-view (BEV) map is an efficient data representation for environment perception in autonomous driving. In real driving scenarios, the collected sensory data usually exhibit class imbalance. For example, road layouts are often the majority classes and road objects are the minority. Such imbalanced data could lead to inferior performance in BEV map generation, particularly for minority objects due to insufficient learning samples. This work attempts to mitigate this issue from the perspective of network and loss function design. To this end, a diffusion-guided semantic BEV map generation network with a boundary-aware loss is proposed. The network learns the underlying distribution of the data, including the relationship between majority and minority classes. The boundary-aware loss increases weighting for minority classes during training, making the network focus on these classes. Experimental results on a public dataset demonstrate our superiority over the state-of-the-art methods, and our effectiveness in addressing the class imbalance issue.

Original languageEnglish
Pages (from-to)10188-10198
Number of pages11
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume35
Issue number10
DOIs
StatePublished - 2025

Keywords

  • Semantic BEV map
  • autonomous driving
  • class imbalance
  • semantic scene understanding

Fingerprint

Dive into the research topics of 'Boundary-Aware Semantic Bird-Eye-View Map Generation Based on Conditional Diffusion Models'. Together they form a unique fingerprint.

Cite this