Skip to main navigation Skip to search Skip to main content

Confirmation Based Self-Learning algorithm in LVCSR's semi-supervised incremental learning

  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Contribution to journalConference articlepeer-review

Abstract

A new data selection algorithm is proposed in this paper for semi-supervised incremental learning of large vocabulary continuous speech recognition (LVCSR) system named Confirmation Based Self-Learning (CBSL). The CBSL algorithm first selects sentence level training corpus via the calculation of confidence measure, and then introduces the confirmation criterion to select word level corpus avoiding further calculation of confidence measure. It is proved that the proposed algorithm can improve the performance of acoustic model training with the highest raise by 4.42% of the recognition correctness rate in comparison with traditional single level and double level confidence measure based data selection algorithms. Besides, considering the characteristic of the distribution of high and low confidence measure data, both kinds of data are used for system training and a 1.41% increase of correctness rate is achieved by adding low confidence measure data.

Original languageEnglish
Pages (from-to)754-759
Number of pages6
JournalProcedia Engineering
Volume29
DOIs
StatePublished - 2012
Externally publishedYes
Event2012 International Workshop on Information and Electronics Engineering, IWIEE 2012 - Harbin, China
Duration: 10 Mar 201211 Mar 2012

Keywords

  • Confirmation Based Self-Learning
  • Data selection
  • High and low confidence measure
  • Semi-supervised incremental learning

Fingerprint

Dive into the research topics of 'Confirmation Based Self-Learning algorithm in LVCSR's semi-supervised incremental learning'. Together they form a unique fingerprint.

Cite this