Razvoj matematičkog modela trajanja glasova u automatskoj sintezi govora na srpskom jeziku
The Development of Phone Duration Model in Speech Synthesis in theSerbian Language
Doktorand
Sovilj-Nikić, SandraMentor
Delić, VladoČlanovi komisije
Hadžić, OlgaBajić, Dragana
Jovičić, Slobodan
Marković, Maja
Delić, Vlado
Metapodaci
Prikaz svih podataka o disertacijiSažetak
U okviru ove disertacije razvijeno je više različitih modela trajanja glasova u srpskom jeziku primenom odgovarajućih metoda automatskog učenja. Izvršena je objektivna evaluacija razvijenih modela i njihovo međusobno poređenje na osnovu kvantitativnih pokazatelja kao što su RMSE(engl. root-mean-squared error), MAE (engl. mean absolute error) i CC (engl. correlation coefficient). Takođe je izvršeno poređenje modela za srpski jezik sa performansama modela razvijenih za druge jezike, pri čemu je uočeno da su performanse modela razvijenih u ovoj disertaciji uporedljive ili čak prevazilaze performanse modela koji su razvijeni za druge jezike.
In this dissertation several different phone duration models of the Serbain language using appropriate machine learning algorithms were developed. The objective evaluation of the models obtained and their mutual comparison based on quantitative measures such as RMSE (root-mean-squared error), MAE (mean absolute error) and CC (correlation coefficient) were performed. The comparison of the models developed for the Serbian language with the performances of the models developed for other languages is also carried out. It was observed that the performances of the models developed in this dissertation are comparable or even outperform the performances of the models that have been developed for other languages.