Skip to main navigation Skip to search Skip to main content

Curriculum reinforcement learning with measurable task representation learning

  • Yongyan Wen
  • , Siyuan Li*
  • , Mingjian Fu
  • , Yiqin Yang
  • , Xun Wang
  • , Peng Liu
  • *Corresponding author for this work
  • Harbin Institute of Technology
  • Fuzhou University
  • Chinese Academy of Sciences
  • CASIC

Research output: Contribution to journalArticlepeer-review

Abstract

In curriculum reinforcement learning (CRL), an agent incrementally accumulates knowledge over a sequence of tasks (i.e., a curriculum), and the learning process is aimed at using the accumulated knowledge to finally solve a challenging target task. While early CRL works focus on sequencing candidate tasks, recent research explores automatic curriculum generation. Among the rich CRL literature, the interpolation-based CRL paradigm is a main body, which automatically generates intermediate tasks by interpolating between the initial task distribution and the target task distribution in task space with meaningful distance metrics (i.e., can measure the task similarity). However, in challenging navigation tasks, the non-Euclidean context (task) space invalidates this assumption. To achieve automatic curriculum generation in complex task, we propose a novel automatic curriculum generation approach based on measurable task representation learning. To better measure the similarity, we propose to transform the task space to a latent space. Through a variational autoencoder structure that encodes the reward and the state transitions, we achieve a latent task representation with a task similarity measurement property, and two close task embeddings correspond to two similar tasks in terms of rewards and state transitions. Based on the learned task representation, we further develop an automatic curriculum generation scheme, which can effectively generate new tasks more and more similar to the target task. We evaluate our method in a variety of challenging navigation tasks, and the experiment results indicate that the proposed approach surpasses state-of-the-art CRL approaches based on interpolation and generative adversarial networks.

Original languageEnglish
Article number109019
JournalNeural Networks
Volume202
DOIs
StatePublished - Oct 2026

Keywords

  • Curriculum reinforcement learning
  • Representation learning

Fingerprint

Dive into the research topics of 'Curriculum reinforcement learning with measurable task representation learning'. Together they form a unique fingerprint.

Cite this