Effects of Training Strategies and the Amount of Speech Data on the Quality of Speech Synthesis
| dc.contributor.author | Vladař, Lukáš | |
| dc.contributor.author | Matoušek, Jindřich | |
| dc.date.accessioned | 2025-06-20T08:35:57Z | |
| dc.date.available | 2025-06-20T08:35:57Z | |
| dc.date.issued | 2024 | |
| dc.date.updated | 2025-06-20T08:35:57Z | |
| dc.description.abstract | During the development of a speech synthesizer, we often face a lack of training data. This paper examines how the amount of data used to train a speech synthesizer affects the quality of the resulting synthetic speech. To this end, we trained multiple VITS synthesizers on different amounts of training data and compared them using listening tests and the mel-cepstral distortion (MCD) objective measure. Furthermore, we compared three training strategies: training a speech synthesizer from scratch, fine-tuning a single-speaker model, and fine-tuning a multi-speaker model. | en |
| dc.format | 11 | |
| dc.identifier.document-number | 001307848400009 | |
| dc.identifier.doi | 10.1007/978-3-031-70566-3_9 | |
| dc.identifier.isbn | 978-3-031-70565-6 | |
| dc.identifier.issn | 0302-9743 | |
| dc.identifier.obd | 43944160 | |
| dc.identifier.orcid | Vladař, Lukáš 0009-0009-8047-7303 | |
| dc.identifier.orcid | Matoušek, Jindřich 0000-0002-7408-7730 | |
| dc.identifier.uri | http://hdl.handle.net/11025/60332 | |
| dc.language.iso | en | |
| dc.project.ID | SGS-2022-017 | |
| dc.project.ID | GA22-27800S | |
| dc.publisher | Springer International Publishing | |
| dc.relation.ispartofseries | 27th International Conference on Text, Speech, and Dialogue, TSD 2024 | |
| dc.subject | fine-tuning | en |
| dc.subject | speech synthesis | en |
| dc.subject | training data | en |
| dc.subject | VITS | en |
| dc.title | Effects of Training Strategies and the Amount of Speech Data on the Quality of Speech Synthesis | en |
| dc.type | Proceedings paper (D) | |
| dc.type | PROCEEDINGS PAPER | |
| dc.type.status | Published Version | |
| local.files.count | 1 | * |
| local.files.size | 418303 | * |
| local.has.files | yes | * |
| local.identifier.eid | 2-s2.0-85204408635 | |
Files
Original bundle
- Name: 978-3-031-70566-3_9.pdf
- Size: 408.5 KB
- Format: Adobe Portable Document Format
License bundle
- Name: license.txt
- Size: 1.71 KB
- Format: Item-specific license agreed upon to submission