Skip to main navigation Skip to search Skip to main content

Gaussian mixture model compensation method using shift factor for speaker recognition

  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

The performance of GMM-based text-independent speaker recognition systems declines rapidly when the training data is reduced. A model compensation method is proposed to address the problem. Since there is a shift between each target GMM-based model and the UBM (Universal Background Model), a low-dimensional affine space is fined, named shift space, and the shift for each model with sufficient training data is transformed to the shift factor in this space. When the training data of the target speaker is insufficient, firstly, the coordinate of the shift factor is learned from the GMM mixtures of insensitive to the amount of training data, and then it is adopted to compensate other GMM mixtures. Using the proposed method, a relative reduction of 7% in EER (equal error rate) is obtained comparing with the baseline system.

Original languageEnglish
Pages (from-to)658-664
Number of pages7
JournalShengxue Xuebao/Acta Acustica
Volume36
Issue number6
StatePublished - Nov 2011
Externally publishedYes

Fingerprint

Dive into the research topics of 'Gaussian mixture model compensation method using shift factor for speaker recognition'. Together they form a unique fingerprint.

Cite this