Skip to main navigation Skip to search Skip to main content

Motion Planner with Fixed-Horizon Constrained Reinforcement Learning for Complex Autonomous Driving Scenarios

  • Ke Lin
  • , Yanjie Li*
  • , Shiyu Chen
  • , Duantengchuan Li
  • , Xinyu Wu
  • *Corresponding author for this work
  • Harbin Institute of Technology Shenzhen
  • Wuhan University
  • Shenzhen Institute of Advanced Technology
  • Guangdong-Hong Kong-Macao Joint Laboratory of Human-Machine Intelligence-Synergy Systems

Research output: Contribution to journalArticlepeer-review

Abstract

In autonomous driving, behavioral decision-making and trajectory planning remain huge challenges due to the large amount of uncertainty in environments and complex interaction relationships between the ego vehicle and other traffic participants. In this paper, we propose a novel fixed-horizon constrained reinforcement learning (RL) framework to solve decision-making and planning problems. Firstly, to introduce lane-level global navigation information into the lane state representation and avoid constant lane changes, we propose the constrained A-star algorithm, which can get the optimal path without constant lane changes. The optimality of the algorithm is also theoretically guaranteed. Then, to balance safety, comfort, and goal completion (reaching targets), we construct the planning problem as a constrained RL problem, in which the reward function is designed for goal completion, and two fixed-horizon constraints are developed for safety and comfort, respectively. Subsequently, a motion planning policy network (planner) with vectorized input is constructed. Finally, a dual ascent optimization method is proposed to train the planner network. With the advantage of being able to fully explore in the environment, the agent can learn an efficient decision-making and planning policy. In addition, benefiting from modeling the safety and comfort of the ego vehicle as constraints, the learned policy can guarantee the safety of the ego vehicle and achieve a good balance between goal completion and comfort. Experiments demonstrate that the proposed algorithm can achieve superior performance than existing rule-based, imitation learning-based, and typical RL-based methods.

Original languageEnglish
Pages (from-to)1577-1588
Number of pages12
JournalIEEE Transactions on Intelligent Vehicles
Volume9
Issue number1
DOIs
StatePublished - 1 Jan 2024
Externally publishedYes

Keywords

  • Autonomous driving
  • constrained reinforcement learning
  • fixed-horizon constraint
  • policy learning
  • trajectory planning

Fingerprint

Dive into the research topics of 'Motion Planner with Fixed-Horizon Constrained Reinforcement Learning for Complex Autonomous Driving Scenarios'. Together they form a unique fingerprint.

Cite this