Combined Speech Recognition and Speaker Verification over the Fixed and Mobile Telephone Networks

A. Kounoudes, A. Antonakoudi, V. Kekatos, and P. Peleties (Cyprus)


Speaker verification, Text validation, Hidden Markov Models, Biometrics.


A double-digit, text-dependent speaker verification and text validation system is presented for use in telephone services. The system uses concatenated phoneme Hidden Markov Models (HMMs) for both speech recognition and user authentication, and operates in a sound-prompted mode. The HMMs are tested with Perceptual Linear Prediction (PLP) and Mel Frequency Cepstral Coefficient (MFCC) features, as well as with Cepstral Mean Subtraction (CMS), to assess their effect on recognition performance. The paper also studies how speech recognition and speaker verification performance are affected by the length of the training data, the number of embedded re-estimations and Gaussian mixtures used in training the HMMs, the use of world models, bootstrapping, and user-dependent thresholds.
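
As a rough illustration of two of the techniques named above, the sketch below shows Cepstral Mean Subtraction over one utterance and an accept/reject decision that compares a speaker model score against a world model score using a per-user threshold. The abstract does not specify the scoring rule, so the frame-normalized log-likelihood ratio used here is an assumption (a common choice, not necessarily the authors' method), and the function names `cepstral_mean_subtraction` and `verify` are hypothetical.

```python
from typing import List, Sequence


def cepstral_mean_subtraction(frames: Sequence[Sequence[float]]) -> List[List[float]]:
    """Subtract the per-utterance mean of each cepstral coefficient.

    `frames` is a list of feature vectors (e.g. MFCC or PLP cepstra),
    one per analysis frame. Removing the mean reduces stationary
    channel effects such as those of a telephone line.
    """
    if not frames:
        return []
    n = len(frames)
    dim = len(frames[0])
    means = [sum(f[d] for f in frames) / n for d in range(dim)]
    return [[f[d] - means[d] for d in range(dim)] for f in frames]


def verify(speaker_loglik: float, world_loglik: float,
           num_frames: int, threshold: float) -> bool:
    """Accept the claimed identity if the normalized score passes the threshold.

    Assumption: the score is the log-likelihood ratio between the
    claimed speaker's model and the world model, divided by the
    number of frames. `threshold` is the user-dependent threshold.
    """
    score = (speaker_loglik - world_loglik) / num_frames
    return score >= threshold


# Toy usage with made-up numbers (not results from the paper).
normalized = cepstral_mean_subtraction([[1.0, 2.0], [3.0, 6.0]])
accepted = verify(speaker_loglik=-100.0, world_loglik=-120.0,
                  num_frames=10, threshold=1.5)
```

In a real system the two log-likelihoods would come from aligning the utterance against the concatenated phoneme HMMs of the claimed speaker and of the world model; here they are simply passed in as numbers.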
