Skip to main navigation Skip to search Skip to main content

Learning to Switch Gait Randomly and Continuously of Bipedal Robots via Stage-Wise Reward Shaping

  • Chiyao Li
  • , Shilong Sun*
  • , Haodong Huang
  • , Yuanpeng Wang
  • , Wenfu Xu
  • *Corresponding author for this work
  • School of Robotics and Advanced Manufacture, Harbin Institute of Technology Shenzhen
  • Guangdong Provincial Key Laboratory of Intelligent Morphing Mechanisms and Adaptive Robotics
  • Key University Laboratory of Mechanism & Machine Theory and Intelligent Unmanned Systems of Guangdong

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Bipedal robots capable of continuously and randomly switching gaits acrossvarious terrains are of great significance. This paper proposes a stage-wise reward shaping method that enables a single policy to perform random switching between jumping and walking locomotion modes. Dedicated reward functions are designed for the jumping and walking locomotion, respectively, and during training, the locomotion mode is randomly switched at fixed intervals. For the complex jumping task, we further divide it into four consecutive sub-stages, each shaped with a different reward function. In the walking mode, omnidirectional velocity command tracking is used to train the robot, enabling omnidirectional locomotion at various speeds. In addition, a long short-term memory (LSTM) network is integrated into the Actor-Critic framework to help the model utilize historical information and make better decisions. Simulation results demonstrate that the proposed method enables the bipedal robot to switch between jumping and walking gaits smoothly and naturally in arbitrary order, allowing it to traverse complex terrains by switching gait modes.

Original languageEnglish
Title of host publication2025 IEEE International Conference on Robotics and Biomimetics, ROBIO 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1964-1970
Number of pages7
ISBN (Electronic)9798331557478
DOIs
StatePublished - 2025
Externally publishedYes
Event2025 IEEE International Conference on Robotics and Biomimetics, IEEE ROBIO 2025 - Chengdu, China
Duration: 3 Dec 20257 Dec 2025

Conference

Conference2025 IEEE International Conference on Robotics and Biomimetics, IEEE ROBIO 2025
Country/TerritoryChina
CityChengdu
Period3/12/257/12/25

Fingerprint

Dive into the research topics of 'Learning to Switch Gait Randomly and Continuously of Bipedal Robots via Stage-Wise Reward Shaping'. Together they form a unique fingerprint.

Cite this