Skip to main navigation Skip to search Skip to main content

Improving Voice Activity Detection via weighting likelihood and dimension reduction

  • Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

The performance of the traditional Voice Activity Detection (VAD) algorithms declines sharply in lower Signal-to-Noise Ratio (SNR) environments. In this paper, a feature weighting likelihood method is proposed for noise-robust VAD. The contribution of dynamic features to likelihood score can be increased via the method, which improves consequently the noise robustness of VAD. Divergence based dimension reduction method is proposed for saving computation, which reduces these feature dimensions with smaller divergence value at the cost of degrading the performance a little. Experimental results on Aurora II database show that the detection performance in noise environments can remarkably be improved by the proposed method when the model trained in clean data is used to detect speech endpoints. Using weighting likelihood on the dimension-reduced features obtains comparable, even better, performance compared to original full-dimensional feature.

Original languageEnglish
Pages (from-to)330-336
Number of pages7
JournalJournal of Electronics
Volume25
Issue number3
DOIs
StatePublished - May 2008

Keywords

  • Dimension reduction
  • Divergence
  • Noise robustness
  • Voice Activity Detection (VAD)
  • Weighting likelihood

Fingerprint

Dive into the research topics of 'Improving Voice Activity Detection via weighting likelihood and dimension reduction'. Together they form a unique fingerprint.

Cite this