Using Pre-trained Models for Phoneme Representation in Czech Speech Synthesis

dc.contributor.author	Vladař, Lukáš
dc.date.accessioned	2026-03-27T19:05:48Z
dc.date.available	2026-03-27T19:05:48Z
dc.date.issued	2025
dc.date.updated	2026-03-27T19:05:48Z
dc.description.abstract	Text-to-speech (TTS) systems, i.e., systems producing artificial speech, represent an importanttopic in the field of artificial intelligence. Modern approaches based on neural networksreach very good results, almost comparable to real human speech.Nguyen et al. (2023) argue that including a large-scale pre-trained model for phonemerepresentation in a neural TTS system can further improve the final synthetic speech. We usedtheir pre-trained model called XPhoneBERT to investigate whether it can also enhance the qualityof speech synthesis in the Czech language.	en
dc.format	2
dc.identifier.isbn	978-80-261-1302-7
dc.identifier.obd	43948781
dc.identifier.orcid	Vladař, Lukáš 0009-0009-8047-7303
dc.identifier.uri	http://hdl.handle.net/11025/67457
dc.language.iso	en
dc.project.ID	SGS-2025-011
dc.publisher	Západočeská univerzita v Plzni
dc.relation.ispartofseries	Studentská vědecká konference FAV 2025
dc.subject	phoneme representation	en
dc.subject	Czech speech	en
dc.subject	synthesis	en
dc.title	Using Pre-trained Models for Phoneme Representation in Czech Speech Synthesis	en
dc.type	Stať ve sborníku (O)
dc.type	STAŤ VE SBORNÍKU
dc.type.status	Čestné prohlášení
local.files.count	1	*
local.files.size	519416	*
local.has.files	yes	*

Files

Showing 1 - 1 out of 1 results

Showing 1 - 1 out of 1 results