Abstract
Solutions for two important problems for the deployment of noise-robust large vocabulary automatic speech recognizers using the missing data paradigm are presented. irst problem is the generation of missing data masks. We propose and evaluate a method based on vector quantization and harmonicity that successfully exploits the characteristics of speech while requiring only weak assumptions on the noise. A second problem that is addressed is computational efficiency. We advocate the usage of PROSPECT features and the L-cluster-Mbest method for Gaussian selection. In total, a speed up of a factor of about 6 can be achieved with these methods.
Keywords: Large vocabulary continuous speech recognition, missing feature, vector quantization, harmonicity.
About this chapter
Cite this chapter as:
Yujun Wang, Maarten Van Segbroeck, Hugo Van hamme ;Robust Large Vocabulary Continuous Speech Recognition Based on Missing Feature Techniques, Recent Advances in Robust Speech Recognition Technology (2011) 1: 141. https://doi.org/10.2174/978160805172411101010141
DOI https://doi.org/10.2174/978160805172411101010141 |
Publisher Name Bentham Science Publisher |