Skip to main navigation Skip to search Skip to main content

An Improved Dyna-Q Algorithm for Mobile Robot Path Planning in Unknown Dynamic Environment

  • Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

This article deals with the problem of mobile robot path planning in an unknown environment that contains both static and dynamic obstacles, utilizing a reinforcement learning approach. We propose an improved Dyna- {Q} algorithm, which incorporates heuristic search strategies, simulated annealing mechanism, and reactive navigation principle into {Q} -learning based on the Dyna architecture. A novel action-selection strategy combining varepsilon -greedy policy with the cooling schedule control is presented, which, together with the heuristic reward function and heuristic actions, can tackle the exploration-exploitation dilemma and enhance the performance of global searching, convergence property, and learning efficiency for path planning. The proposed method is superior to the classical {Q} -learning and Dyna- {Q} algorithms in an unknown static environment, and it is successfully applied to an uncertain environment with multiple dynamic obstacles in simulations. Further, practical experiments are conducted by integrating MATLAB and robot operating system (ROS) on a physical robot platform, and the mobile robot manages to find a collision-free path, thus fulfilling autonomous navigation tasks in the real world.

Original languageEnglish
Pages (from-to)4415-4425
Number of pages11
JournalIEEE Transactions on Systems, Man, and Cybernetics: Systems
Volume52
Issue number7
DOIs
StatePublished - 1 Jul 2022

Keywords

  • Dyna-Q
  • Dynamic environment
  • mobile robot
  • path planning
  • reinforcement learning (RL)

Fingerprint

Dive into the research topics of 'An Improved Dyna-Q Algorithm for Mobile Robot Path Planning in Unknown Dynamic Environment'. Together they form a unique fingerprint.

Cite this