Speaker-Dependent BiLSTM-Based Phrasing
| dc.contributor.author | Řezáčková, Markéta | |
| dc.contributor.author | Tihelka, Daniel | |
| dc.date.accessioned | 2021-02-22T11:00:20Z | |
| dc.date.available | 2021-02-22T11:00:20Z | |
| dc.date.issued | 2020 | |
| dc.description.abstract-translated | Phrase boundary detection is an important part of text-to-speech systems since it ensures more natural speech synthesis outputs. However, phrasing is ambiguous and varies with the speaker and the speaking style. This paper therefore focuses on speaker-dependent phrasing for speech synthesis, using a neural network model with a speaker code. We also describe the results of a listening test focused on incorrectly detected breaks, because it turned out that some of these apparent mistakes are in fact acceptable. | en |
| dc.format | 8 pp. | en |
| dc.format.mimetype | application/pdf | |
| dc.identifier.citation | ŘEZÁČKOVÁ, M., TIHELKA, D. Speaker-Dependent BiLSTM-Based Phrasing. In: Text, Speech, and Dialogue: 23rd International Conference, TSD 2020, Brno, Czech Republic, September 8-11, 2020, Proceedings. Cham: Springer, 2020. pp. 340-347. ISBN 978-3-030-58322-4, ISSN 0302-9743. | cs |
| dc.identifier.doi | 10.1007/978-3-030-58323-1_37 | |
| dc.identifier.isbn | 978-3-030-58322-4 | |
| dc.identifier.issn | 0302-9743 | |
| dc.identifier.obd | 43930360 | |
| dc.identifier.uri | 2-s2.0-85091141874 | |
| dc.identifier.uri | http://hdl.handle.net/11025/42717 | |
| dc.language.iso | en | en |
| dc.project.ID | GA19-19324S/Plně trénovatelná syntéza české řeči z textu s využitím hlubokých neuronových sítí | cs |
| dc.project.ID | SGS-2019-027/Inteligentní metody strojového vnímání a porozumění 4 | cs |
| dc.publisher | Springer | en |
| dc.relation.ispartofseries | Text, Speech, and Dialogue 23rd International Conference, TSD 2020, Brno, Czech Republic, September 8-11, 2020, Proceedings | en |
| dc.rights | Full text is not accessible. | en |
| dc.rights | © Springer | en |
| dc.rights.access | closedAccess | en |
| dc.subject.translated | Phrase boundary detection, Text-to-speech system, Neural network, Speaker-dependent phrasing | en |
| dc.title | Speaker-Dependent BiLSTM-Based Phrasing | en |
| dc.type | conference paper | en |
| dc.type | conferenceObject | en |
| dc.type.status | Peer-reviewed | en |
| dc.type.version | publishedVersion | en |