Skip to main navigation Skip to search Skip to main content

An Imitative Reinforcement Learning Framework for Pursuit-Lock-Launch Missions

  • Siyuan Li*
  • , Rongchang Zuo
  • , Bofei Liu
  • , Yaoyu He
  • , Peng Liu
  • , Yingnan Zhao
  • *Corresponding author for this work
  • Faculty of Computing, Harbin Institute of Technology
  • Harbin Institute of Technology
  • Tsinghua University
  • Harbin Engineering University

Research output: Contribution to journalArticlepeer-review

Abstract

Unmanned combat aerial vehicle (UCAV) within-visual-range (WVR) engagement, referring to a fight between two or more UCAVs at close quarters, plays a decisive role on the aerial battlefields. With the development of artificial intelligence, WVR engagement progressively advances toward intelligent and autonomous modes. However, autonomous WVR engagement policy learning is hindered by challenges such as weak exploration capabilities, low learning efficiency, and unrealistic simulated environments. To overcome these challenges, we propose a novel imitative reinforcement learning framework, which efficiently leverages expert data while enabling autonomous exploration. The proposed framework not only enhances learning efficiency through expert imitation but also ensures adaptability to dynamic environments via autonomous exploration with reinforcement learning. Therefore, the proposed framework can learn a successful policy of “pursuit-lock-launch” for UCAVs. To support data-driven learning, we establish an environment based on the Harfang3D sandbox. The extensive experimental results indicate that the proposed framework excels in this multistage task and significantly outperforms state-of-the-art reinforcement learning and imitation learning methods. Thanks to the ability of imitating experts and autonomous exploration, our framework can quickly learn the critical knowledge in complex aerial combat tasks, achieving up to a 100% success rate and demonstrating excellent robustness.

Original languageEnglish
Article number6
JournalACM Transactions on Autonomous and Adaptive Systems
Volume21
Issue number1
DOIs
StatePublished - 10 Mar 2026
Externally publishedYes

Keywords

  • Air Combat
  • Imitation Learning
  • Reinforcement Learning
  • WVR Engagement

Fingerprint

Dive into the research topics of 'An Imitative Reinforcement Learning Framework for Pursuit-Lock-Launch Missions'. Together they form a unique fingerprint.

Cite this