MARC View

000			02783nam a22001937a 4500
003			OSt
005			20220107122808.0
008			160226b xxu\|\|\|\|\| \|\|\|\| 00\| 0 eng d
040			_c
100			_aMaya Moneykumar (93514008) _98206
245			_aSyllable based word identification for malayalam speech using machine learning
502			_bMaster of Philosophy in Computer Science _c2014-2015 _dINT _eElizabeth Sherly
520			_aThis thesis aims at discussing the development of an isolated word identification system for Malayalam using Machine Learning techniques. This work examines how Artificial Neural Network (ANN) and Hidden Markov Model (HMM) can benefit a medium size vocabulary, speaker independent isolated word level recognition system. The goal of this work is to design an ANN based word recognition system and evaluate its accuracy in different modes namely words within the vocabulary as well as out of vocabulary. The recognition accuracy is also tested for speaker dependent as well speaker independent modes. The system was then compared with a similar system performance based on HMM. The work aims at syllable based word identification where each and every utterance will be segmented into corresponding syllables which are in turn trained by the system. Currently, most speech recognition systems are based on Hidden Markov Model (HMM) which is a statistical framework that supports both acoustic and temporal modeling. In this work, the system is trained using syllables segmented from the utterances where a new approach is made to do the syllable segmentation efficiently, based on the energy measure, formant frequencies and zero crossing rate. These segmented syllables are then trained using HMM and ANN to compare the recognition accuracy. To compare the two systems, we have kept similar, the train and test data and also the extracted features. The comparison includes the overall system performance and different test accuracy rates for both the models. The system is trained using multiple utterances of 80 different words by 9 different speakers, 6 male and 3 female, where the feature extraction was done using MFCC, the most powerful feature extraction technique. In this work, the speech recognition engine is built using HTK and WEKA. The work also attempts to evaluate the improvement in recognition accuracy of the system based on ANN by training and testing with additional parameters. The system proved successful in identifying the utterances of out of vocabulary words, which indeed is a notable step in the area of speech recognition.
650			_aCOMPUTING METHODOLOGIES _98207
650			_aARTIFICIAL INTELLIGENCE _98208
650			_aNATURAL LANGUAGE PROCESSING _98209
650			_aSPEECH RECOGNITION _98210
942			_2ddc _cPR
999			_c4910 _d4910