Stressed speech recognition using a warped frequency scale

D. Gharavian; S. M. Ahadi

doi:10.1587/elex.5.187

LETTER

Keywords: prosody, stress, speech recognition, frequency warping, formants

2008 Volume 5 Issue 6 Pages 187-191

Abstract

Emotion-initiated gestures in human speech communication improve speech understanding. However, they are a source of difficulty for automatic speech recognizers. In this paper, we use the orderly changes that stress produces in the second formant to introduce a warping function that can be applied to the mel frequency scale during the calculation of MFCC parameters. We show that this approach improves stressed speech recognition results. Furthermore, using the second formant frequency as an extra element of the feature vector leads to further improvements in recognizer performance.
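The abstract does not give the form of the warping function, so the following is only a minimal sketch of the general idea: warping the frequency axis before placing the mel filterbank edges used in MFCC computation. The piecewise-linear warp, the function names, and the parameters `alpha` and `f_break` are assumptions made for illustration (a common warp of the kind used in vocal tract length normalization), not the function derived in the paper.

```python
import numpy as np


def hz_to_mel(f_hz):
    """Convert frequency in Hz to the mel scale (common 2595*log10 form)."""
    return 2595.0 * np.log10(1.0 + np.asarray(f_hz, dtype=float) / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (np.asarray(mel, dtype=float) / 2595.0) - 1.0)


def piecewise_linear_warp(f_hz, alpha, f_break, f_nyquist):
    """Hypothetical stand-in warp, NOT the paper's function.

    Frequencies up to f_break are scaled by alpha; above f_break a second
    linear segment is used so that f_nyquist maps onto itself.
    """
    f_hz = np.asarray(f_hz, dtype=float)
    low = alpha * f_hz
    slope = (f_nyquist - alpha * f_break) / (f_nyquist - f_break)
    high = alpha * f_break + slope * (f_hz - f_break)
    return np.where(f_hz <= f_break, low, high)


def warped_mel_edges(n_filters, sample_rate, alpha, f_break):
    """Edge frequencies (Hz) of a mel filterbank after warping.

    Returns n_filters + 2 points: the lower edge, the filter centers,
    and the upper edge, spaced uniformly in mel and then warped in Hz.
    """
    f_nyquist = sample_rate / 2.0
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(f_nyquist), n_filters + 2)
    edges_hz = mel_to_hz(mel_points)
    return piecewise_linear_warp(edges_hz, alpha, f_break, f_nyquist)


if __name__ == "__main__":
    # alpha and f_break are illustrative values only.
    edges = warped_mel_edges(n_filters=24, sample_rate=16000, alpha=1.1, f_break=3000.0)
    print(edges.round(1))
```

With `alpha = 1.0` the warp is the identity and the usual mel filterbank edges are recovered. In a setup like the one the abstract describes, the warp parameters would presumably be chosen from the observed stress-induced shift of the second formant rather than fixed by hand as they are here.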
