Mixed Language Separation Using DNN

Bhimrao, Chunarkar Snehit

Please use this identifier to cite or link to this item: http://localhost:8081/jspui/handle/123456789/20635

Full metadata record

DC Field	Value	Language
dc.contributor.author	Bhimrao, Chunarkar Snehit	-
dc.date.accessioned	2026-05-04T12:21:50Z	-
dc.date.available	2026-05-04T12:21:50Z	-
dc.date.issued	2021-06	-
dc.identifier.uri	http://localhost:8081/jspui/handle/123456789/20635	-
dc.guide	Tripathy, Manoj	en_US
dc.description.abstract	With multiple language spoken in the world by the different group of people, it is possible that we may encounter with such mixed language speech to hear, especially during data share in an audio-video format. Such blended audio is not clearly understandable to the audience. Hence it is needed to be processed with some separation method. Mixed Language separation is the task of separating a particular language speech from a mixed language speech audio. Deep neural network (DNN) based speech enhancement algorithm shows good results in the non-stationary environment and can successfully outperform conventional methods which are based on unsupervised algorithms. All the information in the acoustic signal is too cumbersome to deal with, hence speech signal is illustrated by predetermined number of components of the signal which named as feature of the speech signal. Features like Mel Frequency Cepstrum Coefficient (MFCC), Power Spectrum and Relative Spectral Transformed Perceptual Linear Prediction coefficient (RASTA-PLP) are extracted from the mixed language speech, which then used as the input to the DNN. For training target, the Short-Time Fourier Transform (STFT) Spectral Mask is considered. Short-Time Objective Intelligibility (STOI) and Perceptual Evaluation of Speech Quality (PESQ) scores are used to compare the Intelligibility and Quality of the separated language speech signal processed by DNN. Language separated speech using trained DNN model has been found to achieve improve Intelligibility and Quality.	en_US
dc.language.iso	en	en_US
dc.publisher	IIT Roorkee	en_US
dc.subject	Deep Neural Network, Adam optimizer, Language Separation, Speech Features, Short-Time Objective Intelligibility, Perceptual Evaluation of Speech Quality.	en_US
dc.title	Mixed Language Separation Using DNN	en_US
dc.type	Dissertations	en_US
Appears in Collections:	MASTERS' THESES (Electrical Engg)

Files in This Item:

File	Description	Size	Format
19528006_CHUNARKAR SNEHIT BHIMRAO.pdf		2.79 MB	Adobe PDF	View/Open

Show simple item record