Please use this identifier to cite or link to this item: http://localhost:8081/jspui/handle/123456789/20635
Full metadata record
DC FieldValueLanguage
dc.contributor.authorBhimrao, Chunarkar Snehit-
dc.date.accessioned2026-05-04T12:21:50Z-
dc.date.available2026-05-04T12:21:50Z-
dc.date.issued2021-06-
dc.identifier.urihttp://localhost:8081/jspui/handle/123456789/20635-
dc.guideTripathy, Manojen_US
dc.description.abstractWith multiple language spoken in the world by the different group of people, it is possible that we may encounter with such mixed language speech to hear, especially during data share in an audio-video format. Such blended audio is not clearly understandable to the audience. Hence it is needed to be processed with some separation method. Mixed Language separation is the task of separating a particular language speech from a mixed language speech audio. Deep neural network (DNN) based speech enhancement algorithm shows good results in the non-stationary environment and can successfully outperform conventional methods which are based on unsupervised algorithms. All the information in the acoustic signal is too cumbersome to deal with, hence speech signal is illustrated by predetermined number of components of the signal which named as feature of the speech signal. Features like Mel Frequency Cepstrum Coefficient (MFCC), Power Spectrum and Relative Spectral Transformed Perceptual Linear Prediction coefficient (RASTA-PLP) are extracted from the mixed language speech, which then used as the input to the DNN. For training target, the Short-Time Fourier Transform (STFT) Spectral Mask is considered. Short-Time Objective Intelligibility (STOI) and Perceptual Evaluation of Speech Quality (PESQ) scores are used to compare the Intelligibility and Quality of the separated language speech signal processed by DNN. Language separated speech using trained DNN model has been found to achieve improve Intelligibility and Quality.en_US
dc.language.isoenen_US
dc.publisherIIT Roorkeeen_US
dc.subjectDeep Neural Network, Adam optimizer, Language Separation, Speech Features, Short-Time Objective Intelligibility, Perceptual Evaluation of Speech Quality.en_US
dc.titleMixed Language Separation Using DNNen_US
dc.typeDissertationsen_US
Appears in Collections:MASTERS' THESES (Electrical Engg)

Files in This Item:
File Description SizeFormat 
19528006_CHUNARKAR SNEHIT BHIMRAO.pdf2.79 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.