Skip to main navigation Skip to search Skip to main content

Two-Stage Optimization of PCBA Placement Route Schedule Based on Deep Reinforcement Learning

  • Baoqing Yin
  • , Xianqiang Yang*
  • , Zhengkai Li
  • , Xinghu Yu
  • , Hao Sun
  • , Jianbin Qiu
  • , Juan J. Rodriguez-Andina*
  • *Corresponding author for this work
  • Harbin Institute of Technology
  • Ningbo University of Technology
  • Ningbo Institute of Intelligent Equipment Technology Company Ltd
  • Research Center of Intelligent Control and Systems
  • Yitang Intelligent Technol Co Ltd
  • University of Vigo

Research output: Contribution to journalArticlepeer-review

Abstract

In printed circuit board assembly (PCBA), placement route schedule (PRS) significantly affects assembly efficiency of the beam head placement machine. The PRS is typically solved by decomposing it into placement point assignment problem (PPAP) and beam heads sequencing problem (BHSP). This article first proposes a deep reinforcement learning framework to tackle PPAP, which is a key determinant of overall process quality. Then, to mitigate the impact of placement position and angle on assembly efficiency, a dynamic programming-based beam head sequencing algorithm is introduced to solve BHSP. Since component types and placement point assignment states vary across different pick-and-place cycles, a dynamic combinatorial mask encoding method is proposed to effectively extract feature information between placement points. Inspired by the beam head placement process, a decoder that combines gated recurrent units and an attention mechanism is finally introduced, which fully utilizes historical node information to predict the next node. Experimental results demonstrate that the proposed method reduces PCBA routing distance by an average of 4.62%, outperforming other State-of-the-Art approaches.

Original languageEnglish
Pages (from-to)275-285
Number of pages11
JournalIEEE Transactions on Industrial Informatics
Volume22
Issue number1
DOIs
StatePublished - Jan 2026

Keywords

  • Attention mechanism
  • deep reinforcement learning (DRL)
  • dynamic combinatorial mask
  • gated recurrent units (GRUs)
  • surface mounting route optimization

Fingerprint

Dive into the research topics of 'Two-Stage Optimization of PCBA Placement Route Schedule Based on Deep Reinforcement Learning'. Together they form a unique fingerprint.

Cite this