Skip to main navigation Skip to search Skip to main content

Audio-Visual Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking

  • Yang Liu*
  • , Volkan Kilic
  • , Jian Guan
  • , Wenwu Wang
  • *Corresponding author for this work
  • University of Surrey
  • Izmir Katip Celebi University
  • Harbin Engineering University

Research output: Contribution to journalArticlepeer-review

Abstract

Sequential Monte Carlo probability hypothesis density (SMC-PHD) filtering is a popular method used recently for audio-visual (AV) multi-speaker tracking. However, due to the weight degeneracy problem, the posterior distribution can be represented poorly by the estimated probability, when only a few particles are present around the peak of the likelihood density function. To address this issue, we propose a new framework where particle flow (PF) is used to migrate particles smoothly from the prior to the posterior probability density. We consider both zero and non-zero diffusion particle flows (ZPF/NPF), and developed two new algorithms, AV-ZPF-SMC-PHD and AV-NPF-SMC-PHD, where the speaker states from the previous frames are also considered for particle relocation. The proposed algorithms are compared systematically with several baseline tracking methods using the AV16.3, AVDIAR and CLEAR datasets, and are shown to offer improved tracking accuracy and average effective sample size (ESS).

Original languageEnglish
Article number8811627
Pages (from-to)934-948
Number of pages15
JournalIEEE Transactions on Multimedia
Volume22
Issue number4
DOIs
StatePublished - Apr 2020
Externally publishedYes

Keywords

  • Audio-Visual Tracking
  • Optimal Proposal Distribution
  • PHD filter
  • Particle Flow
  • Sequential Monte Carlo

Fingerprint

Dive into the research topics of 'Audio-Visual Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking'. Together they form a unique fingerprint.

Cite this