T5G2P: Text-to-Text Transfer Transformer Based Grapheme-to-Phoneme Conversion

dc.contributor.author: Řezáčková, Markéta
dc.contributor.author: Tihelka, Daniel
dc.contributor.author: Matoušek, Jindřich
dc.date.accessioned: 2025-06-20T08:46:28Z
dc.date.available: 2025-06-20T08:46:28Z
dc.date.issued: 2024
dc.date.updated: 2025-06-20T08:46:28Z
dc.description.abstract: The present paper explores the use of several deep neural network architectures to carry out grapheme-to-phoneme (G2P) conversion, aiming to find a universal and language-independent approach to the task. The models explored are trained on whole sentences in order to automatically capture cross-word context (such as voicedness assimilation) if it exists in the given language. Four different languages, English, Czech, Russian, and German, were chosen due to their different nature and requirements for the G2P task. Ultimately, the Text-to-Text Transfer Transformer (T5) based model achieved very high conversion accuracy on all the tested languages. It also exceeded the accuracy reached by a similar system when trained on the public LibriSpeech database. [en]
dc.format: 11
dc.identifier.document-number: 001283673700010
dc.identifier.doi: 10.1109/TASLP.2024.3426332
dc.identifier.issn: 2329-9290
dc.identifier.obd: 43943437
dc.identifier.orcid: Řezáčková, Markéta 0000-0002-6194-7826
dc.identifier.orcid: Tihelka, Daniel 0000-0002-3149-2330
dc.identifier.orcid: Matoušek, Jindřich 0000-0002-7408-7730
dc.identifier.uri: http://hdl.handle.net/11025/61004
dc.language.iso: en
dc.project.ID: SGS-2022-017
dc.project.ID: GA22-27800S
dc.relation.ispartofseries: IEEE/ACM Transactions on Audio, Speech, and Language Processing
dc.rights.access: C
dc.subject: CNN [en]
dc.subject: Czech [en]
dc.subject: English [en]
dc.subject: G2P [en]
dc.subject: German [en]
dc.subject: phonetic transcription [en]
dc.subject: RNN [en]
dc.subject: Russian [en]
dc.subject: T5 [en]
dc.title: T5G2P: Text-to-Text Transfer Transformer Based Grapheme-to-Phoneme Conversion [en]
dc.type: Article in WoS database (Jimp)
dc.type: ARTICLE
dc.type.status: Published Version
local.files.count: 1 [*]
local.files.size: 992882 [*]
local.has.files: yes [*]
local.identifier.eid: 2-s2.0-85198311174

Files

Original bundle
Name: T5G2P_Text-to-Text_Transfer_Transformer_Based_Grapheme-to-Phoneme_Conversion.pdf
Size: 969.61 KB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission
