Description: Abstract: This thesis mainly studied the basic principle of speech recognition, to analyze the composition of the speech recognition system, including the preprocessing, feature extraction, to set up the module library, identify several most matches. Pretreatment, including speech sampling, pre-emphasis, add window (hamming window), endpoint detection Feature extraction of MFCC MEL frequency cepstrum coefficient.
The voice system USES a dynamic time scale (DTW) algorithm, the research object is the speaker-dependent speech recognition, and realized in MATLAB platform.To carry out the follow-up study, the first to use the recording in a computer system to record the audio files of Arabic Numbers 0-9, and translated into . Wav format file.
To Search:
File list (Check if you may need any files):
程序.docx