TY - GEN
T1 - Research on Anomaly Data Preprocessing Technology for Deep Learning Soft Sensor Models Facing Missing Data and Fault Data
AU - Xue, Jian
AU - Feng, Lei
AU - Hu, Wenlong
AU - Li, Yuanzi
AU - Zheng, Wenbin
AU - Liu, Bing
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Deep learning-based soft sensor technology plays an important role in industrial process monitoring, however anomalous data such as sensor faults and missing values can severely compromise the predictive performance and reliability of the models. Most existing approaches address only a single type of anomaly, making it difficult to cope with the complex scenarios of multiple coexisting anomalies in real industrial environments. To overcome this limitation, this paper proposes a unified two-stage data preprocessing strategy that integrates anomaly detection and isolation with data reconstruction. In the first stage, a parallel Long Short-Term Memory (LSTM)Residual Network (ResNet) architecture is employed for fault detection and isolation to identify and separate abnormal data. In the second stage, an improved masked autoencoder model is applied to reconstruct the data at the detected anomaly positions, thereby leveraging the complementary strengths of fault detection and data reconstruction across different anomaly magnitudes. Experimental results on the Tennessee Eastman Process dataset demonstrate that the proposed method achieves an R2 of 0. 9761, a MAPE of 0. 0636%, and an RMSE of 3.6915 in reconstructing anomalous data caused by both sensor faults and missing values.
AB - Deep learning-based soft sensor technology plays an important role in industrial process monitoring, however anomalous data such as sensor faults and missing values can severely compromise the predictive performance and reliability of the models. Most existing approaches address only a single type of anomaly, making it difficult to cope with the complex scenarios of multiple coexisting anomalies in real industrial environments. To overcome this limitation, this paper proposes a unified two-stage data preprocessing strategy that integrates anomaly detection and isolation with data reconstruction. In the first stage, a parallel Long Short-Term Memory (LSTM)Residual Network (ResNet) architecture is employed for fault detection and isolation to identify and separate abnormal data. In the second stage, an improved masked autoencoder model is applied to reconstruct the data at the detected anomaly positions, thereby leveraging the complementary strengths of fault detection and data reconstruction across different anomaly magnitudes. Experimental results on the Tennessee Eastman Process dataset demonstrate that the proposed method achieves an R2 of 0. 9761, a MAPE of 0. 0636%, and an RMSE of 3.6915 in reconstructing anomalous data caused by both sensor faults and missing values.
KW - Data reconstruction
KW - Fault detection
KW - Masked autoencoder
KW - Soft sensor
UR - https://www.scopus.com/pages/publications/105034853798
U2 - 10.1109/ICSMD67131.2025.11365519
DO - 10.1109/ICSMD67131.2025.11365519
M3 - 会议稿件
AN - SCOPUS:105034853798
T3 - ICSMD 2025 - International Conference on Sensing, Measurement and Data Analytics in the Era of Artificial Intelligence
BT - ICSMD 2025 - International Conference on Sensing, Measurement and Data Analytics in the Era of Artificial Intelligence
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 6th International Conference on Sensing, Measurement and Data Analytics in the Era of Artificial Intelligence, ICSMD 2025
Y2 - 21 November 2025 through 23 November 2025
ER -