Skip to main navigation Skip to search Skip to main content

Transformer-based multi-agent reinforcement learning for flexible job shop scheduling with AGVs

  • School of Mechatronics Engineering, Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Integrated scheduling in flexible job shops with AGV transportation constraints presents a complex optimization challenge due to the tight coupling between operation sequencing and material handling. Existing reinforcement learning methods, typically relying on recurrent or fully connected architectures, often struggle to capture global spatio-temporal dependencies and suffer from sequential computational bottlenecks. To address these limitations, this study proposes MA-Trans, a hierarchical deep reinforcement learning framework based on Proximal Policy Optimization and Transformer encoders. By designing a cooperative dual-agent architecture, the model jointly optimizes job scheduling and AGV dispatching, employing self-attention mechanisms to efficiently encode long-range interactions among manufacturing resources. Experimental results demonstrate that MA-Trans significantly outperforms state-of-the-art baselines, including ResGAT and LSTM-Ptr, in both makespan minimization and AGV load balancing. Beyond solution quality, the proposed Transformer-based architecture reduces training time by approximately 23.8% compared to recurrent baselines while maintaining low-latency inference of around 2.75 ms. Moreover, the model exhibits robust zero-shot generalization capabilities, adapting to varying problem scales and AGV configurations without retraining, thereby highlighting its potential for real-time smart manufacturing applications.

Original languageEnglish
Article number114899
JournalApplied Soft Computing
Volume193
DOIs
StatePublished - May 2026
Externally publishedYes

Keywords

  • Automated guided vehicle (AGV)
  • Deep reinforcement learning
  • Flexible job shop scheduling problem
  • Multi-agent reinforcement learning
  • Transformer

Fingerprint

Dive into the research topics of 'Transformer-based multi-agent reinforcement learning for flexible job shop scheduling with AGVs'. Together they form a unique fingerprint.

Cite this