Skip to main navigation Skip to search Skip to main content

Maximum Entropy Deep Reinforcement Learning Based Power Allocation for NOMA Maritime Network

  • Jiayi He
  • , Yakai Zhang
  • , Zhiyong Liu*
  • *Corresponding author for this work
  • Harbin Institute of Technology Weihai
  • Shandong Provincial Key Laboratory of Marine Electronic Information and Intelligent Unmanned Systems
  • Key Laboratory of Cross-Domain Synergy and Comprehensive Support for Unmanned Marine Systems

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In order to address the challenges of high propagation delays and limited service capabilities in maritime satellite communications, unmanned aerial vehicles have been proposed as an airborne backhaul solution to enhance communications between satellites and maritime base stations. The non-orthogonal multiple access (NOMA) framework can solve the user sparsity problem in maritime networks. In this paper, a deep reinforcement learning algorithm is used to solve the nonconvex power allocation problem under NOMA. In order to mitigate the risk of overestimation of Q values and local optimal convergence of Deep Q Network (DQN) algorithm, we propose an algorithm called Soft Agent Critical Ocean Satellite Communication Power Allocation (SAC-OSCPA) based on the idea of maximum entropy and compare it with the traditional DQN algorithm. The main goal of this research is to maximize network throughput in scenarios with randomly distributed users. Simulation results show that the average system throughput is improved by 13.18% with the SAC-OSCPA algorithm, and the average throughput of the worst performing user is significantly improved by 41.59%. These results demonstrate the efficacy of the proposed algorithm in optimizing the communication performance of maritime satellite networks.

Original languageEnglish
Title of host publicationWireless and Satellite Systems - 14th EAI International Conference, WiSATS 2024, Proceedings
EditorsHsiao-Hwa Chen, Weixiao Meng
PublisherSpringer Science and Business Media Deutschland GmbH
Pages215-229
Number of pages15
ISBN (Print)9783031862021
DOIs
StatePublished - 2025
Externally publishedYes
Event14th EAI International Conference on Wireless and Satellite Systems, WiSATS 2024 - Harbin, China
Duration: 23 Aug 202425 Aug 2024

Publication series

NameLecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST
Volume606 LNICST
ISSN (Print)1867-8211
ISSN (Electronic)1867-822X

Conference

Conference14th EAI International Conference on Wireless and Satellite Systems, WiSATS 2024
Country/TerritoryChina
CityHarbin
Period23/08/2425/08/24

Keywords

  • deep reinforcement learning
  • maritime network
  • non-orthogonal multiple access (NOMA)
  • power allocation

Fingerprint

Dive into the research topics of 'Maximum Entropy Deep Reinforcement Learning Based Power Allocation for NOMA Maritime Network'. Together they form a unique fingerprint.

Cite this