Abstract
The performance of GMM-based text-independent speaker recognition systems declines rapidly when the training data is reduced. A model compensation method is proposed to address the problem. Since there is a shift between each target GMM-based model and the UBM (Universal Background Model), a low-dimensional affine space is fined, named shift space, and the shift for each model with sufficient training data is transformed to the shift factor in this space. When the training data of the target speaker is insufficient, firstly, the coordinate of the shift factor is learned from the GMM mixtures of insensitive to the amount of training data, and then it is adopted to compensate other GMM mixtures. Using the proposed method, a relative reduction of 7% in EER (equal error rate) is obtained comparing with the baseline system.
| Original language | English |
|---|---|
| Pages (from-to) | 658-664 |
| Number of pages | 7 |
| Journal | Shengxue Xuebao/Acta Acustica |
| Volume | 36 |
| Issue number | 6 |
| State | Published - Nov 2011 |
| Externally published | Yes |
Fingerprint
Dive into the research topics of 'Gaussian mixture model compensation method using shift factor for speaker recognition'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver