[PDF][PDF] Autoregressive Modeling based Feature Extraction for Aurora3 DSR Task
P Motlıcek - 2003 - Citeseer
2003•Citeseer
Techniques for analysis of speech, that use autoregressive (allpole) modeling approaches,
are presented here and compared to generally known Mel-frequency cepstrum based
feature extraction. In the paper, first, we focus on several possible applications of modeling
speech power spectra that increase the performance of ASR system mainly in case of large
mismatch between training and testing data. Then, the attention is payed to the different
types of features that can be extracted from allpole model to reduce the overall word error …
are presented here and compared to generally known Mel-frequency cepstrum based
feature extraction. In the paper, first, we focus on several possible applications of modeling
speech power spectra that increase the performance of ASR system mainly in case of large
mismatch between training and testing data. Then, the attention is payed to the different
types of features that can be extracted from allpole model to reduce the overall word error …
Abstract
Techniques for analysis of speech, that use autoregressive (allpole) modeling approaches, are presented here and compared to generally known Mel-frequency cepstrum based feature extraction. In the paper, first, we focus on several possible applications of modeling speech power spectra that increase the performance of ASR system mainly in case of large mismatch between training and testing data. Then, the attention is payed to the different types of features that can be extracted from allpole model to reduce the overall word error rate. The results show that generally used cepstrum based features, which can be easily extracted from all-pole model, are not the most suitable parameters for ASR, where the input speech is corrupted by different types of real noises. Very good recognition performances were achieved eg, with discrete or selective all-pole modeling based approaches, or with decorrelated line spectral frequencies. The feature extraction techniques were tested on SpeechDat-Car databases used for front-end evaluation of advanced distributed speech recognition (DSR) systems.
Citeseer
Showing the best result for this search. See all results