Skip to main navigation Skip to search Skip to main content

Perception-Aware-Based UAV Trajectory Planner via Generative Adversarial Self-Imitation Learning From Demonstrations

  • Hanxuan Zhang
  • , Ju Huo*
  • , Yulong Huang*
  • , Jiajun Cheng
  • , Xiaofeng Li
  • *Corresponding author for this work
  • School of Electrical Engineering and Automation, Harbin Institute of Technology
  • National Key Laboratory of Complex System Control and Intelligent Agent Cooperation
  • Harbin Engineering University

Research output: Contribution to journalArticlepeer-review

Abstract

The use of unmanned aerial vehicles (UAVs) for Internet of Things applications, like intelligent monitoring and search, is increasingly becoming a popular research focus globally. While various optimization algorithms exist to plan UAV flight paths, they frequently compromise the quality of the planning path to decrease planning time. In view of the above problems, a perception-aware-based UAV trajectory planner via generative adversarial self-imitation learning from demonstration is proposed. First, a progressively growing discriminator is devised to prevent the policy network from being overpowered in early training stages, avoiding potential training failures. Second, the issue of homogenized strategic patterns among optimized expert trajectories is solved by incorporating successful trajectories from the policy network into the expert buffer, which thereby enhances the network's generalization capabilities. Third, to address the challenges of skewed distribution and considerable performance variation among the strategies learned by the policy network during training, a class-level instance-balancing expert buffer is introduced. Finally, the yaw angle of the UAV in real time during flight is obtained by using the analytical solution of the position trajectory and yaw angle and the position trajectory output from the policy network. Experiments confirm our proposed method achieves comparable flight costs and success rates to those of the reference expert method, while the planning time is reduced. The proposed method is also shown to be well adapted to dynamic environments and obstacle trajectories, which are not involved in training. Additionally, the ablation studies highlight the individual contributions of each component within the proposed method.

Original languageEnglish
Pages (from-to)3248-3260
Number of pages13
JournalIEEE Internet of Things Journal
Volume12
Issue number3
DOIs
StatePublished - 2025
Externally publishedYes

Keywords

  • Class-level instance-balancing
  • generative adversarial self-imitation learning from demonstration (GASILFD)
  • perception-aware-based trajectory planning
  • progressively growing discriminator
  • unmanned aerial vehicle (UAV)

Fingerprint

Dive into the research topics of 'Perception-Aware-Based UAV Trajectory Planner via Generative Adversarial Self-Imitation Learning From Demonstrations'. Together they form a unique fingerprint.

Cite this