Skip to main navigation Skip to search Skip to main content

Deep Reinforcement Learning Based Trajectory Design and Resource Allocation for UAV-Assisted Communications

  • Chiya Zhang
  • , Zhukun Li*
  • , Chunlong He*
  • , Kezhi Wang
  • , Cunhua Pan
  • *Corresponding author for this work
  • Harbin Institute of Technology Shenzhen
  • Shenzhen University
  • Brunel University London
  • Southeast University, Nanjing

Research output: Contribution to journalArticlepeer-review

Abstract

In this letter, we investigate the Unmanned Aerial Vehicles (UAVs)-assisted communications in three dimensional (3-D) environment, where one UAV is deployed to serve multiple user equipments (UEs). The locations and quality of service (QoS) requirement of the UEs are varying and the flying time of the UAV is unknown which depends on the battery of the UAVs. To address the issue, a proximal policy optimization 2 (PPO2)-based deep reinforcement learning (DRL) algorithm is proposed, which can control the UAV in an online manner. Specifically, it can allow the UAV to adjust its speed, direction and altitude so as to minimize the serving time of the UAV while satisfying the QoS requirement of the UEs. Simulation results are provided to demonstrate the effectiveness of the proposed framework.

Original languageEnglish
Pages (from-to)2398-2402
Number of pages5
JournalIEEE Communications Letters
Volume27
Issue number9
DOIs
StatePublished - 1 Sep 2023
Externally publishedYes

Keywords

  • 3-D trajectory design
  • Unmanned aerial vehicles
  • deep reinforcement learning
  • uncertain flight time

Fingerprint

Dive into the research topics of 'Deep Reinforcement Learning Based Trajectory Design and Resource Allocation for UAV-Assisted Communications'. Together they form a unique fingerprint.

Cite this