TY - GEN
T1 - Sequential Experimental Design for Complex Industrial Systems
T2 - 23rd International Conference on Industrial Informatics, INDIN 2025
AU - Wang, Yuchen
AU - Yang, Baoqing
AU - Ma, Jie
AU - Shu, Qing
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - This paper proposes a reinforcement learning (RL) framework to optimize sequential experimental design for complex industrial systems. The framework addresses efficient identification of unknown parameters in complex industrial systems under budget constraints. A deep neural network approximates the action-value function, while Proximal Policy Optimization (PPO) with a clipped surrogate objective optimizes the policy, represented by a separate deep neural network. The policy maps the posterior over unknown parameters to an optimal experimental input. Evaluated on a one-dimensional thermal conduction problem, the method strategically positions a temperature sensor to estimate unknown parameters of an instantaneous heat source. Simulation results demonstrate that the policy effectively guides experiments towards informative measurements, concentrating the posterior around true parameter values. Compared to a genetic algorithm, the RL approach achieves superior performance, highlighting its potential for optimizing sequential experimental design in complex, resource-constrained scenarios.
AB - This paper proposes a reinforcement learning (RL) framework to optimize sequential experimental design for complex industrial systems. The framework addresses efficient identification of unknown parameters in complex industrial systems under budget constraints. A deep neural network approximates the action-value function, while Proximal Policy Optimization (PPO) with a clipped surrogate objective optimizes the policy, represented by a separate deep neural network. The policy maps the posterior over unknown parameters to an optimal experimental input. Evaluated on a one-dimensional thermal conduction problem, the method strategically positions a temperature sensor to estimate unknown parameters of an instantaneous heat source. Simulation results demonstrate that the policy effectively guides experiments towards informative measurements, concentrating the posterior around true parameter values. Compared to a genetic algorithm, the RL approach achieves superior performance, highlighting its potential for optimizing sequential experimental design in complex, resource-constrained scenarios.
KW - Bayesian theory
KW - Design of experiment
KW - Proximal Policy Optimization
UR - https://www.scopus.com/pages/publications/105032662052
U2 - 10.1109/INDIN64977.2025.11278937
DO - 10.1109/INDIN64977.2025.11278937
M3 - 会议稿件
AN - SCOPUS:105032662052
T3 - IEEE International Conference on Industrial Informatics (INDIN)
BT - 2025 IEEE 23rd International Conference on Industrial Informatics, INDIN 2025
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 12 July 2025 through 15 July 2025
ER -