Skip to main navigation Skip to search Skip to main content

Anomalous Sound Detection Using Self-Attention-Based Frequency Pattern Analysis of Machine Sounds

  • Hejing Zhang
  • , Jian Guan*
  • , Qiaoxi Zhu
  • , Feiyang Xiao
  • , Youde Liu
  • *Corresponding author for this work
  • Harbin Engineering University
  • University of Technology Sydney
  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Contribution to journalConference articlepeer-review

Abstract

Different machines can exhibit diverse frequency patterns in their emitted sound. This feature has been recently explored in anomaly sound detection and reached state-of-the-art performance. However, existing methods rely on the manual or empirical determination of the frequency filter by observing the effective frequency range in the training data, which may be impractical for general application. This paper proposes an anomalous sound detection method using self-attention-based frequency pattern analysis and spectral-temporal information fusion. Our experiments demonstrate that the self-attention module automatically and adaptively analyses the effective frequencies of a machine sound and enhances that information in the spectral feature representation. With spectral-temporal information fusion, the obtained audio feature eventually improves the anomaly detection performance on the DCASE 2020 Challenge Task 2 dataset.

Original languageEnglish
Pages (from-to)336-340
Number of pages5
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume2023-August
DOIs
StatePublished - 2023
Externally publishedYes
Event24th Annual conference of the International Speech Communication Association, Interspeech 2023 - Dublin, Ireland
Duration: 20 Aug 202324 Aug 2023

Keywords

  • Anomalous sound detection
  • feature representation
  • frequency pattern analysis
  • self-attention

Fingerprint

Dive into the research topics of 'Anomalous Sound Detection Using Self-Attention-Based Frequency Pattern Analysis of Machine Sounds'. Together they form a unique fingerprint.

Cite this