Skip to main navigation Skip to search Skip to main content

Sequential Experimental Design for Complex Industrial Systems: An Efficient Proximal Policy Optimization Approach

  • Harbin Institute of Technology
  • National Key Laboratory of Complex System Control and Intelligent Agent Cooperation

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper proposes a reinforcement learning (RL) framework to optimize sequential experimental design for complex industrial systems. The framework addresses efficient identification of unknown parameters in complex industrial systems under budget constraints. A deep neural network approximates the action-value function, while Proximal Policy Optimization (PPO) with a clipped surrogate objective optimizes the policy, represented by a separate deep neural network. The policy maps the posterior over unknown parameters to an optimal experimental input. Evaluated on a one-dimensional thermal conduction problem, the method strategically positions a temperature sensor to estimate unknown parameters of an instantaneous heat source. Simulation results demonstrate that the policy effectively guides experiments towards informative measurements, concentrating the posterior around true parameter values. Compared to a genetic algorithm, the RL approach achieves superior performance, highlighting its potential for optimizing sequential experimental design in complex, resource-constrained scenarios.

Original languageEnglish
Title of host publication2025 IEEE 23rd International Conference on Industrial Informatics, INDIN 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331511210
DOIs
StatePublished - 2025
Event23rd International Conference on Industrial Informatics, INDIN 2025 - KunMing, China
Duration: 12 Jul 202515 Jul 2025

Publication series

NameIEEE International Conference on Industrial Informatics (INDIN)
ISSN (Print)1935-4576

Conference

Conference23rd International Conference on Industrial Informatics, INDIN 2025
Country/TerritoryChina
CityKunMing
Period12/07/2515/07/25

Keywords

  • Bayesian theory
  • Design of experiment
  • Proximal Policy Optimization

Fingerprint

Dive into the research topics of 'Sequential Experimental Design for Complex Industrial Systems: An Efficient Proximal Policy Optimization Approach'. Together they form a unique fingerprint.

Cite this