Skip to main navigation Skip to search Skip to main content

Reinforcement Learning control for Free-Floating Space Manipulator: Truncated Quantiles Critics

  • Heng Yang
  • , Xuebo Yang*
  • , Jialu Li
  • , Meiling Hu
  • *Corresponding author for this work
  • Harbin Institute of Technology

Research output: Contribution to journalConference articlepeer-review

Abstract

With the increasing complexity of space missions, the free-floating space manipulator (FFSM) systems have become indispensable for various space applications. This paper proposes a reinforcement learning (RL)-based control framework leveraging the truncated quantile critics (TQC) algorithm to simultaneously achieve: (i) precise end-effector (EE) position and orientation tracking, (ii) effective base spacecraft drift suppression, and (iii) collision-free operation maintenance. Additionally, we introduce a novel time differential reward function for controller training to enhance multi-objective learning efficiency. The proposed framework is validated through numerical simulations and Monte Carlo Experiments using a space station's manipulator system, with comparisons against two baseline RL approaches demonstrating its superior performance.

Original languageEnglish
Pages (from-to)1255-1260
Number of pages6
JournalIFAC-PapersOnLine
Volume59
Issue number20
DOIs
StatePublished - 1 Aug 2025
Event23th IFAC Symposium on Automatic Control in Aerospace, ACA 2025 - Harbin, China
Duration: 2 Aug 20256 Aug 2025

Keywords

  • base drift rejection
  • collision avoidance
  • free-floating space manipulator
  • trajectory optimization

Fingerprint

Dive into the research topics of 'Reinforcement Learning control for Free-Floating Space Manipulator: Truncated Quantiles Critics'. Together they form a unique fingerprint.

Cite this