T5G2P: Text-to-Text Transfer Transformer Based Grapheme-to-Phoneme Conversion
| dc.contributor.author | Řezáčková, Markéta | |
| dc.contributor.author | Tihelka, Daniel | |
| dc.contributor.author | Matoušek, Jindřich | |
| dc.date.accessioned | 2025-06-20T08:46:28Z | |
| dc.date.available | 2025-06-20T08:46:28Z | |
| dc.date.issued | 2024 | |
| dc.date.updated | 2025-06-20T08:46:28Z | |
| dc.description.abstract | The present paper explores the use of several deep neural network architectures to carry out a grapheme-to-phoneme (G2P) conversion, aiming to find a universal and language-independent approach to the task. The models explored are trained on whole sentences in order to automatically capture cross-word context (such as voicedness assimilation) if it exists in the given language. Four different languages, English, Czech, Russian, and German, were chosen due to their different nature and requirements for the G2P task. Ultimately, the Text-to-Text Transfer Transformer (T5) based model achieved very high conversion accuracy on all the tested languages. Also, it exceeded the accuracy reached by a similar system, when trained on a public LibriSpeech database. | en |
| dc.format | 11 | |
| dc.identifier.document-number | 001283673700010 | |
| dc.identifier.doi | 10.1109/TASLP.2024.3426332 | |
| dc.identifier.issn | 2329-9290 | |
| dc.identifier.obd | 43943437 | |
| dc.identifier.orcid | Řezáčková, Markéta 0000-0002-6194-7826 | |
| dc.identifier.orcid | Tihelka, Daniel 0000-0002-3149-2330 | |
| dc.identifier.orcid | Matoušek, Jindřich 0000-0002-7408-7730 | |
| dc.identifier.uri | http://hdl.handle.net/11025/61004 | |
| dc.language.iso | en | |
| dc.project.ID | SGS-2022-017 | |
| dc.project.ID | GA22-27800S | |
| dc.relation.ispartofseries | IEEE/ACM Transactions on Audio, Speech, and Language Processing | |
| dc.rights.access | C | |
| dc.subject | CNN | en |
| dc.subject | Czech | en |
| dc.subject | English | en |
| dc.subject | G2P | en |
| dc.subject | German | en |
| dc.subject | phonetic transcription | en |
| dc.subject | RNN | en |
| dc.subject | Russian | en |
| dc.subject | T5 | en |
| dc.title | T5G2P: Text-to-Text Transfer Transformer Based Grapheme-to-Phoneme Conversion | en |
| dc.type | Article in WoS database (Jimp) | |
| dc.type | ARTICLE | |
| dc.type.status | Published Version | |
| local.files.count | 1 | * |
| local.files.size | 992882 | * |
| local.has.files | yes | * |
| local.identifier.eid | 2-s2.0-85198311174 | |
Files
Original bundle
- Name: T5G2P_Text-to-Text_Transfer_Transformer_Based_Grapheme-to-Phoneme_Conversion.pdf
- Size: 969.61 KB
- Format: Adobe Portable Document Format
License bundle
- Name: license.txt
- Size: 1.71 KB
- Format: Item-specific license agreed upon to submission