组合动作空间深度强化学习的人群疏散引导方法

Translated title of the contribution: Crowd evacuation guidance based on combined action-space deep reinforcement learning
Yiran Xue, Rui Wu*, Jiafeng Liu
*Corresponding author for this work
Harbin Institute of Technology

Research output: Contribution to journal › Article › peer-review

Abstract

Crowd evacuation guidance systems are of great significance for protecting lives and reducing personal and property losses during disasters in buildings. Existing crowd evacuation guidance systems require the manual design of models and input parameters, which imposes a heavy workload and introduces potential errors. An end-to-end intelligent evacuation guidance method based on deep reinforcement learning is proposed, and an interactive simulation environment based on the social force model is designed. Taking only scene images as input, the agent automatically learns a scene model and explores path-planning strategies through trial-and-error interaction with the simulation environment, then directly outputs dynamic signage information, thereby guiding crowd evacuation efficiently. To address the "curse of dimensionality" that the deep Q network (DQN) algorithm suffers in crowd evacuation because of its high-dimensional action space and complex network structure, a combined action-space DQN algorithm is proposed. The algorithm groups the output-layer nodes of the Q network by action dimension, which significantly reduces network complexity and improves the practicality of the system in complex scenes with multiple guidance signs. Experiments in different simulation scenes show that the proposed method achieves shorter evacuation times than the static guidance method and is on par with the manually designed model method. These results indicate that the proposed method can effectively guide the crowd, improve evacuation efficiency, and reduce the workload and human error associated with manually designed models.
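
The grouping of output-layer nodes by action dimension can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how many options each guidance sign has, how the groups are trained, or how per-group values are combined, so the function names, the example sign counts, and the per-group greedy selection below are all assumptions made for illustration. The sketch only compares the output-layer size of a flat joint-action Q network with a grouped one, and shows one plausible way to read a joint action off a grouped output vector.

```python
from math import prod
from typing import List, Sequence


def flat_output_size(options_per_sign: Sequence[int]) -> int:
    """Output nodes needed when every joint sign configuration is one action."""
    return prod(options_per_sign)


def grouped_output_size(options_per_sign: Sequence[int]) -> int:
    """Output nodes needed when each sign (action dimension) has its own group."""
    return sum(options_per_sign)


def split_into_groups(
    q_values: Sequence[float], options_per_sign: Sequence[int]
) -> List[List[float]]:
    """Slice a grouped output vector into one list of Q-values per sign."""
    if len(q_values) != sum(options_per_sign):
        raise ValueError("q_values length must equal the sum of options_per_sign")
    groups, start = [], 0
    for n_options in options_per_sign:
        groups.append(list(q_values[start:start + n_options]))
        start += n_options
    return groups


def greedy_joint_action(
    q_values: Sequence[float], options_per_sign: Sequence[int]
) -> List[int]:
    """Pick the highest-valued option inside each group (assumed selection rule)."""
    return [
        max(range(len(group)), key=group.__getitem__)
        for group in split_into_groups(q_values, options_per_sign)
    ]


if __name__ == "__main__":
    # Hypothetical scene: 6 dynamic signs, each with 3 possible displays.
    signs = [3] * 6
    print(flat_output_size(signs))     # 729 joint actions
    print(grouped_output_size(signs))  # 18 grouped output nodes

    # Hypothetical grouped output for 3 signs with 3, 2 and 3 options.
    q = [0.1, 0.7, 0.2, 0.9, 0.4, 0.3, 0.3, 0.8]
    print(greedy_joint_action(q, [3, 2, 3]))  # [1, 0, 2]
```

Under these assumptions the flat output layer grows as the product of the per-sign option counts, while the grouped layer grows only as their sum, which is the kind of reduction in network complexity the abstract describes for scenes with multiple guidance signs.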

Original language: Chinese (Traditional)
Pages (from-to): 29-38
Number of pages: 10
Journal: Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology
Volume: 53
Issue number: 8
State: Published - 30 Aug 2021
