TY - GEN
T1 - Secure Resource Allocation for NOMA PLC Networks with Multi-agent DRL
AU - Pu, Honghong
AU - Liu, Xiaosheng
AU - Zhang, Ruifang
AU - Sun, Zhifu
AU - Xu, Dainguo
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - This paper investigates the intelligent downlink secure transmission in a non-orthogonal multiple access (NOMA) network over power line channels, where each NOMA pair consists of one distant and nearby terminal user (UE) and the distant UE could wiretap the nearby UE. In order to maximize the security sum rate of all nearby UEs while guaranteeing the targeted data rate requirements of all UEs, a joint secure subchannel assignment and power allocation problem is first established. To deal with it, a deep reinforcement learning (DRL) scheme is then proposed. In specific, this scheme uses a deep Q-network (DQN) to learn the optimal decision policy for each NOMA pair by combining the compressed local observation and an aggregation information from other NOMA pairs as the input. All the input dimensions of the DQNs, the local compression networks and the central aggregation networks are independent of the PLC network size. In another word, this proposed scheme is scalable, which ca be easily applied in the practical system where the network size is dynamically changing. Simulation results verify that the effectiveness of this proposed scheme compared to benchmark scheme.
AB - This paper investigates the intelligent downlink secure transmission in a non-orthogonal multiple access (NOMA) network over power line channels, where each NOMA pair consists of one distant and nearby terminal user (UE) and the distant UE could wiretap the nearby UE. In order to maximize the security sum rate of all nearby UEs while guaranteeing the targeted data rate requirements of all UEs, a joint secure subchannel assignment and power allocation problem is first established. To deal with it, a deep reinforcement learning (DRL) scheme is then proposed. In specific, this scheme uses a deep Q-network (DQN) to learn the optimal decision policy for each NOMA pair by combining the compressed local observation and an aggregation information from other NOMA pairs as the input. All the input dimensions of the DQNs, the local compression networks and the central aggregation networks are independent of the PLC network size. In another word, this proposed scheme is scalable, which ca be easily applied in the practical system where the network size is dynamically changing. Simulation results verify that the effectiveness of this proposed scheme compared to benchmark scheme.
KW - deep reinforcement learning
KW - non-orthogonal multiple access
KW - power line communication
KW - secure resource allocation
UR - https://www.scopus.com/pages/publications/85124381118
U2 - 10.1109/ICCT52962.2021.9657961
DO - 10.1109/ICCT52962.2021.9657961
M3 - 会议稿件
AN - SCOPUS:85124381118
T3 - International Conference on Communication Technology Proceedings, ICCT
SP - 959
EP - 964
BT - 2021 IEEE 21st International Conference on Communication Technology, ICCT 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 21st IEEE International Conference on Communication Technology, ICCT 2021
Y2 - 13 October 2021 through 16 October 2021
ER -