National Repository of Dissertations in Serbia
    • English
    • Српски
    • Српски (Serbia)
  • English 
    • English
    • Serbian (Cyrilic)
    • Serbian (Latin)
  • Login
View Item 
  •   NaRDuS home
  • Универзитет Сингидунум
  • Студије при универзитету
  • View Item
  •   NaRDuS home
  • Универзитет Сингидунум
  • Студије при универзитету
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Speech Recognition in noisy environment using Deep Learning Neural Network

Thumbnail
2017
Disertacija (1.733Mb)
Izveštaj komisije (4.044Mb)
Author
Nasef, Ashrf Ali Abraheem
Mentor
Marjanović-Jakovljević, Marina
Committee members
Veinović, Mladen
Kovačević, Branko
Metadata
Show full item record
Abstract
Recent researches in the field of automatic speaker recognition have shown that methods based on deep learning neural networks provide better performance than other statistical classifiers. On the other hand, these methods usually require adjustment of a significant number of parameters. The goal of this thesis is to show that selecting appropriate value of parameters can significantly improve speaker recognition performance of methods based on deep learning neural networks. The reported study introduces an approach to automatic speaker recognition based on deep neural networks and the stochastic gradient descent algorithm. It particularly focuses on three parameters of the stochastic gradient descent algorithm: the learning rate, and the hidden and input layer dropout rates. Additional attention was devoted to the research question of speaker recognition under noisy conditions. Thus, two experiments were conducted in the scope of this thesis. The first experiment w...as intended to demonstrate that the optimization of the observed parameters of the stochastic gradient descent algorithm can improve speaker recognition performance under no presence of noise. This experiment was conducted in two phases. In the first phase, the recognition rate is observed when the hidden layer dropout rate and the learning rate are varied, while the input layer dropout rate was constant. In the second phase of this experiment, the recognition rate is observed when the input layers dropout rate and learning rate are varied, while the hidden layer dropout rate was constant. The second experiment was intended to show that the optimization of the observed parameters of the stochastic gradient descent algorithm can improve speaker recognition performance even under noisy conditions. Thus, different noise levels were artificially applied on the original speech signal.

Faculty:
Универзитет Сингидунум, Студије при универзитету
Date:
06-12-2017
[ Google Scholar ]
Handle
https://hdl.handle.net/21.15107/rcub_nardus_9085
URI
https://nardus.mpn.gov.rs/handle/123456789/9085
https://singipedia.singidunum.ac.rs/izdanje/42831-speech-recognition-in-noisy-environment-using-deep-learning-neural-network

DSpace software copyright © 2002-2015  DuraSpace
About NaRDus | Contact us

OpenAIRERCUBRODOSTEMPUS
 

 

Browse

All of DSpaceUniversities & FacultiesAuthorsMentorCommittee membersSubjectsThis CollectionAuthorsMentorCommittee membersSubjects

DSpace software copyright © 2002-2015  DuraSpace
About NaRDus | Contact us

OpenAIRERCUBRODOSTEMPUS