Skip to main navigation Skip to search Skip to main content

Seq-BEV: Semantic Bird-Eye-View Map Generation in Full View Using Sequential Images for Autonomous Driving

  • Shuang Gao*
  • , Qiang Wang
  • , Yuxiang Sun
  • *Corresponding author for this work
  • Harbin Institute of Technology
  • Hong Kong Polytechnic University
  • City University of Hong Kong

Research output: Contribution to journalArticlepeer-review

Abstract

Semantic Bird-Eye-View (BEV) map is a straightforward data representation for environment perception. It can be used for downstream tasks, such as motion planning and trajectory prediction. However, taking as input a front-view image from a single camera, most existing methods can only provide V-shaped semantic BEV maps, which limits the field-of-view for the BEV maps. To provide a solution to this problem, we propose a novel end-to-end network to generate semantic BEV maps in full view by taking as input the equidistant sequential images. Specifically, we design a self-adapted sequence fusion module to fuse the features from different images in a distance sequence. In addition, a road-aware view transformation module is introduced to wrap the front-view feature map into BEV based on an attention mechanism. We also create a dataset with semantic labels in full BEV from the public nuScenes data. The experimental results demonstrate the effectiveness of our design and the superiority over the state-of-the-art methods.

Original languageEnglish
Pages (from-to)11454-11465
Number of pages12
JournalIEEE Transactions on Intelligent Transportation Systems
Volume26
Issue number8
DOIs
StatePublished - 2025

Keywords

  • Semantic BEV maps
  • autonomous driving
  • sequential images
  • view transformation

Fingerprint

Dive into the research topics of 'Seq-BEV: Semantic Bird-Eye-View Map Generation in Full View Using Sequential Images for Autonomous Driving'. Together they form a unique fingerprint.

Cite this