Using Pre-trained Models for Phoneme Representation in Czech Speech Synthesis

dc.contributor.authorVladař, Lukáš
dc.date.accessioned2026-03-27T19:05:48Z
dc.date.available2026-03-27T19:05:48Z
dc.date.issued2025
dc.date.updated2026-03-27T19:05:48Z
dc.description.abstractText-to-speech (TTS) systems, i.e., systems producing artificial speech, represent an importanttopic in the field of artificial intelligence. Modern approaches based on neural networksreach very good results, almost comparable to real human speech.Nguyen et al. (2023) argue that including a large-scale pre-trained model for phonemerepresentation in a neural TTS system can further improve the final synthetic speech. We usedtheir pre-trained model called XPhoneBERT to investigate whether it can also enhance the qualityof speech synthesis in the Czech language.en
dc.format2
dc.identifier.isbn978-80-261-1302-7
dc.identifier.obd43948781
dc.identifier.orcidVladař, Lukáš 0009-0009-8047-7303
dc.identifier.urihttp://hdl.handle.net/11025/67457
dc.language.isoen
dc.project.IDSGS-2025-011
dc.publisherZápadočeská univerzita v Plzni
dc.relation.ispartofseriesStudentská vědecká konference FAV 2025
dc.subjectphoneme representationen
dc.subjectCzech speechen
dc.subjectsynthesisen
dc.titleUsing Pre-trained Models for Phoneme Representation in Czech Speech Synthesisen
dc.typeStať ve sborníku (O)
dc.typeSTAŤ VE SBORNÍKU
dc.type.statusČestné prohlášení
local.files.count1*
local.files.size519416*
local.has.filesyes*

Files

Original bundle
Showing 1 - 1 out of 1 results
No Thumbnail Available
Name:
2P-2026_Priloha1c_zprava za projekt_43948781.pdf
Size:
507.24 KB
Format:
Adobe Portable Document Format
License bundle
Showing 1 - 1 out of 1 results
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: