TY - GEN
T1 - Reinforcement-Learning-Based 2D Flow Control for Logistics Systems
AU - Yin, Mingrui
AU - Cai, Chenxin
AU - Liu, Jie
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.
PY - 2024
Y1 - 2024
N2 - To address large-scale challenges in current logistics systems, we present an logistics system employing a multi-agent framework grounded on an actuator network named Omniveyor. This design ensures real-time responsiveness and scalability in flow operations. Under the premise of centralized control, we employ Reinforcement Learning (RL) to efficiently control omni-wheel conveyors. Different from traditional path planning, our approach considers the entire platform as an agent, using the package state for observation. Proximal Policy Optimization (PPO) proves to be an effective algorithm for platform-wide control planning. Experimental results demonstrate the system’s capability to accurately deliver packages. Therefore, the RL algorithm allows the omni-wheel platform to autonomously learn and optimize package paths, circumventing the need for traditional controls or path planning methods.
AB - To address large-scale challenges in current logistics systems, we present an logistics system employing a multi-agent framework grounded on an actuator network named Omniveyor. This design ensures real-time responsiveness and scalability in flow operations. Under the premise of centralized control, we employ Reinforcement Learning (RL) to efficiently control omni-wheel conveyors. Different from traditional path planning, our approach considers the entire platform as an agent, using the package state for observation. Proximal Policy Optimization (PPO) proves to be an effective algorithm for platform-wide control planning. Experimental results demonstrate the system’s capability to accurately deliver packages. Therefore, the RL algorithm allows the omni-wheel platform to autonomously learn and optimize package paths, circumventing the need for traditional controls or path planning methods.
KW - Centralized control system
KW - Logistics system
KW - Reinforcement Learning
UR - https://www.scopus.com/pages/publications/85187716894
U2 - 10.1007/978-981-97-1010-2_19
DO - 10.1007/978-981-97-1010-2_19
M3 - 会议稿件
AN - SCOPUS:85187716894
SN - 9789819710096
T3 - Communications in Computer and Information Science
SP - 257
EP - 270
BT - Wireless Sensor Networks - 17th China Conference, CWSN 2023, Proceedings
A2 - Wang, Lei
A2 - Qiu, Tie
A2 - Lin, Chi
A2 - Wang, Xinbing
PB - Springer Science and Business Media Deutschland GmbH
T2 - 17th China Conference on Wireless Sensor Networks, CWSN 2023
Y2 - 13 October 2023 through 15 October 2023
ER -