Skip to main navigation Skip to search Skip to main content

Novel training algorithms for long short-term memory neural network

  • Xiaodong Li
  • , Changjun Yu*
  • , Fulin Su
  • , Taifan Quan
  • , Xuguang Yang
  • *Corresponding author for this work
  • Harbin Institute of Technology
  • Harbin Institute of Technology Weihai

Research output: Contribution to journalArticlepeer-review

Abstract

More recently, due to the enormous potential of long short-term memory (LSTM) neural network in various fields, some efficient training algorithms have been developed, including the extended Kalman filter (EKF)-based training algorithm and particle filter (PF)-based training algorithm. However, it should be noted that if the system is highly non-linear, the linearisation employed in the EKF may cause instability. Moreover, the PF usually suffers from the particle degeneracy. Therefore, the PF-based training algorithm may only find a poor local optimum. To solve these problems, an unscented Kalman filter (UKF)-based training algorithm is proposed. The UKF employs a deterministic sampling method; hence, there is no linearisation in it and it does not have the degeneracy problem. Moreover, the computational complexity of the UKF is the same order as that of the EKF. To further reduce the computational complexity, the authors propose a minimum norm UKF (MN-UKF) to obtain a good trade-off between performance and complexity. To the best of the authors’ knowledge, this is the first reported solution to this problem. Simulations using both benchmark synthetic signal and real-world signal illustrate the potential of the algorithms developed.

Original languageEnglish
Pages (from-to)304-308
Number of pages5
JournalIET Signal Processing
Volume13
Issue number3
DOIs
StatePublished - 2019

Fingerprint

Dive into the research topics of 'Novel training algorithms for long short-term memory neural network'. Together they form a unique fingerprint.

Cite this