VITS, Tacotron or FastSpeech? Challenging some of the most popular synthesizers
| dc.contributor.author | Matoušek, Jindřich | |
| dc.contributor.author | Tihelka, Daniel | |
| dc.contributor.author | Tihelková, Alice | |
| dc.date.accessioned | 2025-06-20T08:55:19Z | |
| dc.date.available | 2025-06-20T08:55:19Z | |
| dc.date.issued | 2023 | |
| dc.date.updated | 2025-06-20T08:55:19Z | |
| dc.description.abstract | The paper presents a comparative study of three neural speech synthesizers, namely VITS, Tacotron$2$ and FastSpeech$2$, which belong among the most popular TTS systems nowadays. Due to their varying nature, they have been tested from several points of view, analysing not only the overall quality of the synthesized speech, but also the capability of processing either orthographic or phonetic inputs. The analysis has been carried out on two English and one Czech voices. | en |
| dc.format | 14 | |
| dc.identifier.doi | 10.1007/978-3-031-47665-5_26 | |
| dc.identifier.isbn | 978-3-031-47664-8 | |
| dc.identifier.issn | 0302-9743 | |
| dc.identifier.obd | 43940621 | |
| dc.identifier.orcid | Matoušek, Jindřich 0000-0002-7408-7730 | |
| dc.identifier.orcid | Tihelka, Daniel 0000-0002-3149-2330 | |
| dc.identifier.orcid | Tihelková, Alice 0000-0002-3367-4525 | |
| dc.identifier.uri | http://hdl.handle.net/11025/61570 | |
| dc.language.iso | en | |
| dc.project.ID | SGS-2022-017 | |
| dc.project.ID | GA22-27800S | |
| dc.project.ID | 90140 | |
| dc.project.ID | 90104 | |
| dc.publisher | Springer | |
| dc.relation.ispartofseries | 7th Asian Conference on Pattern Recognition (ACPR 2023) | |
| dc.subject | text-to-speech synthesis, VITS, FastSpeech2, Tacotron2 | en |
| dc.title | VITS, Tacotron or FastSpeech? Challenging some of the most popular synthesizers | en |
| dc.type | Stať ve sborníku (D) | |
| dc.type | STAŤ VE SBORNÍKU | |
| dc.type.status | Published Version | |
| local.files.count | 1 | * |
| local.files.size | 309766 | * |
| local.has.files | yes | * |
| local.identifier.eid | 2-s2.0-85177460227 |
Files
Original bundle
1 - 1 out of 1 results
No Thumbnail Available
- Name:
- Matousek_Tihelka_Tihelkova_VITS_Tacotron_or_FastSpeech_Challenging_Some_of_the_Most_Popular_Synthesizers_2023.pdf
- Size:
- 302.51 KB
- Format:
- Adobe Portable Document Format
License bundle
1 - 1 out of 1 results
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: