TY - GEN
T1 - Decision Making for Autonomous Driving Via Multimodal Transformer and Deep Reinforcement Learning∗
AU - Fu, Wen
AU - Li, Yanjie
AU - Ye, Zhaohui
AU - Liu, Qi
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - On the basis of environmental information processed by the sensing module, the decision module in automatic driving integrates environmental and vehicle information to make the autonomous vehicle produce safe and reasonable driving behavior. Considering the complexity and variability of the driving environment of autonomous vehicles, researchers have begun to apply deep reinforcement learning (DRL) in the study of autonomous driving control strategies in recent years. In this paper, we apply an algorithm framework combining multimodal transformer and DRL to solve the autonomous driving decision problem in complex scenarios. We use ResNet and transformer to extract the features of LiDAR point cloud and image. We use Deep Deterministic Policy Gradient (DDPG) algorithm to complete the subsequent autonomous driving decision-making task. And we use information bottleneck to improve the sampling efficiency of RL. We use CARLA simulator to evaluate our approach. The results show that our approach allows the agent to learn better driving strategies.
AB - On the basis of environmental information processed by the sensing module, the decision module in automatic driving integrates environmental and vehicle information to make the autonomous vehicle produce safe and reasonable driving behavior. Considering the complexity and variability of the driving environment of autonomous vehicles, researchers have begun to apply deep reinforcement learning (DRL) in the study of autonomous driving control strategies in recent years. In this paper, we apply an algorithm framework combining multimodal transformer and DRL to solve the autonomous driving decision problem in complex scenarios. We use ResNet and transformer to extract the features of LiDAR point cloud and image. We use Deep Deterministic Policy Gradient (DDPG) algorithm to complete the subsequent autonomous driving decision-making task. And we use information bottleneck to improve the sampling efficiency of RL. We use CARLA simulator to evaluate our approach. The results show that our approach allows the agent to learn better driving strategies.
UR - https://www.scopus.com/pages/publications/85138688995
U2 - 10.1109/RCAR54675.2022.9872180
DO - 10.1109/RCAR54675.2022.9872180
M3 - 会议稿件
AN - SCOPUS:85138688995
T3 - 2022 IEEE International Conference on Real-Time Computing and Robotics, RCAR 2022
SP - 481
EP - 486
BT - 2022 IEEE International Conference on Real-Time Computing and Robotics, RCAR 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE International Conference on Real-Time Computing and Robotics, RCAR 2022
Y2 - 17 July 2022 through 22 July 2022
ER -