

This paper presents the results on whispered speech recognition using gammatone filterbank cepstral coefficients for speaker dependent mode. The isolated words used for this experiment are taken from the Whi-Spe database. Whispered speech recognition is based on dynamic time warping and hidden Markov models methods. The experiments are focused on the following modes: normal speech, whispered speech and their combinations (normal/whispered and whispered/normal). The results demonstrated an important improvement in recognition after application of cepstral mean subtraction, especially in mixed train/test scenarios. © 2017, Pleiades Publishing, Inc.
| Engineering controlled terms: | Filter banksHidden Markov modelsMarkov processesSpeech |
|---|---|
| Engineering uncontrolled terms | Cepstral coefficientsCepstral mean subtractionGammatone filterbankIsolated wordsON dynamicsSpeaker dependentsWhispered speech |
| Engineering main heading: | Speech recognition |
| Funding sponsor | Funding number | Acronym |
|---|---|---|
| Ministarstvo Prosvete, Nauke i Tehnološkog Razvoja | 32032,178027,OI-178027,TR-32032 | MPNTR |
ACKNOWLEDGMENTS This research was financed in part by the Ministry of Education, Science and Technological Development of the Republic of Serbia within the projects OI-178027 and TR-32032.
Marković, B.; Telecommunication Department, School of Electrical Engineering, University of Belgrade, Belgrade, Serbia;
© Copyright 2017 Elsevier B.V., All rights reserved.