Effects of Training Strategies and the Amount of Speech Data on the Quality of Speech Synthesis

dc.contributor.author: Vladař, Lukáš
dc.contributor.author: Matoušek, Jindřich
dc.date.accessioned: 2025-06-20T08:35:57Z
dc.date.available: 2025-06-20T08:35:57Z
dc.date.issued: 2024
dc.date.updated: 2025-06-20T08:35:57Z
dc.description.abstract: During the development of a speech synthesizer, we often face a lack of training data. This paper describes how the amount of data used to train a speech synthesizer affects the quality of the final synthetic speech. To answer this question, we trained multiple VITS synthesizers using different amounts of training data and we compared them using listening tests and the MCD objective measure. Furthermore, we compared three training strategies: training a speech synthesizer from scratch, fine-tuning a single-speaker model and fine-tuning a multi-speaker model. [en]
dc.format: 11
dc.identifier.document-number: 001307848400009
dc.identifier.doi: 10.1007/978-3-031-70566-3_9
dc.identifier.isbn: 978-3-031-70565-6
dc.identifier.issn: 0302-9743
dc.identifier.obd: 43944160
dc.identifier.orcid: Vladař, Lukáš 0009-0009-8047-7303
dc.identifier.orcid: Matoušek, Jindřich 0000-0002-7408-7730
dc.identifier.uri: http://hdl.handle.net/11025/60332
dc.language.iso: en
dc.project.ID: SGS-2022-017
dc.project.ID: GA22-27800S
dc.publisher: Springer International Publishing
dc.relation.ispartofseries: 27th International Conference on Text, Speech, and Dialogue, TSD 2024
dc.subject: fine-tuning [en]
dc.subject: speech synthesis [en]
dc.subject: training data [en]
dc.subject: VITS [en]
dc.title: Effects of Training Strategies and the Amount of Speech Data on the Quality of Speech Synthesis [en]
dc.type: Paper in proceedings (D)
dc.type: PAPER IN PROCEEDINGS
dc.type.status: Published Version
local.files.count: 1 [*]
local.files.size: 418303 [*]
local.has.files: yes [*]
local.identifier.eid: 2-s2.0-85204408635

Files

Original bundle
Name: 978-3-031-70566-3_9.pdf
Size: 408.5 KB
Format: Adobe Portable Document Format

License bundle
Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission