Please use this identifier to cite or link to this item: http://localhost:8081/xmlui/handle/123456789/11288
Full metadata record
DC FieldValueLanguage
dc.contributor.authorB, Ravi.-
dc.date.accessioned2014-11-26T07:55:30Z-
dc.date.available2014-11-26T07:55:30Z-
dc.date.issued2006-
dc.identifierM.Techen_US
dc.identifier.urihttp://hdl.handle.net/123456789/11288-
dc.guideAnand, R. S.-
dc.description.abstractSpeech recognition has been a fascinating and interesting topic for researchers for many years. Speech is a natural communication medium with machines especially in the environment where keyboard input is awkward or impossible. Automatic Speech Recognition (ASR) has made great progress for European languages. During the last years great progress in the field has been made, mainly due to many years of research and the availability of high performance systems and algorithms. Speech recognition is a process that converts an acoustic signal to a set of words. In India almost three-fourth of the population lives in rural areas and most of this population is unfamiliar with computers and English. Speaker dependent continuous speech recognition of Hindi language will enable people to interact with computers in their own language and without the use of a keyboard. Speech recognition is used by several different categories of users. People who have difficulty in using their hands to type, professionals, and people with learning disabilities are the main users. A speech recognizer system comprised of two distinct blocks, a Feature Extractor and a Recognizer. The Feature Extractor block uses a Mel-frequency cepstral analysis, which translates the incoming continuous speech in to a feature vectors. Once the features are extracted, they are used for matching with the features of the stored words. The network output is the words to be recognized. For this purpose recognizer block uses neural network. In the course of developing this system, we explored two different ways to use neural networks for audio modeling: prediction and classification. We found that predictive networks yield poor results because of a lack of discrimination, but classification networks gave excellent results. Finally, this thesis reports how we optimized the accuracy of our system with many natural techniques, such as expanding the input window size, normalizing the inputs, increasing the number of hidden units, converting the network's output activations to log likelihoods, optimizing the learning rate schedule by automatic search, backpropagating error from word level outputs, and using gender dependent networks.en_US
dc.language.isoenen_US
dc.subjectELECTRICAL ENGINEERINGen_US
dc.subjectSPEAKER DEPENDENTen_US
dc.subjectCONTINUOUS SPEECH RECOGNITIONen_US
dc.subjectHINDI LANGUAGEen_US
dc.titleSPEAKER DEPENDENT CONTINUO- US SPEECH RECOGNITION OF HINDI LANGUAGEen_US
dc.typeM.Tech Dessertationen_US
dc.accession.numberG12783en_US
Appears in Collections:MASTERS' THESES (Electrical Engg)

Files in This Item:
File Description SizeFormat 
EEDG12783.pdf6.75 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.