A Comparison of Convolutional Neural Networks for Glottal Closure Instant Detection from Raw Speech

dc.contributor.author: Matoušek, Jindřich
dc.contributor.author: Tihelka, Daniel
dc.date.accessioned: 2022-03-28T10:00:29Z
dc.date.available: 2022-03-28T10:00:29Z
dc.date.issued: 2021
dc.description.abstract-translated: In this paper, we continue to investigate the use of machine learning for the automatic detection of glottal closure instants (GCIs) from raw speech. We compare several deep one-dimensional convolutional neural network architectures on the same data and show that the InceptionV3 model yields the best results on the test set. On publicly available databases, the proposed 1D InceptionV3 outperforms XGBoost, a non-deep machine learning model, as well as other traditional GCI detection algorithms. [en]
dc.format: 5 p. [cs]
dc.format.mimetype: application/pdf
dc.identifier.citation: MATOUŠEK, J., TIHELKA, D. A Comparison of Convolutional Neural Networks for Glottal Closure Instant Detection from Raw Speech. In 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021). New York: IEEE, 2021. pp. 6938-6942. ISBN: 978-1-72817-605-5, ISSN: 1520-6149 [cs]
dc.identifier.document-number: 704288407043
dc.identifier.doi: 10.1109/ICASSP39728.2021.9413675
dc.identifier.isbn: 978-1-72817-605-5
dc.identifier.issn: 1520-6149
dc.identifier.obd: 43933805
dc.identifier.uri: 2-s2.0-85115048595
dc.identifier.uri: http://hdl.handle.net/11025/47270
dc.language.iso: en [en]
dc.project.ID: GA19-19324S/Fully trainable synthesis of Czech speech from text using deep neural networks [cs]
dc.project.ID: 90140/Large research infrastructure_(J) - e-INFRA CZ [cs]
dc.publisher: IEEE [en]
dc.relation.ispartofseries: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021) [en]
dc.rights: The full text is accessible within the university to logged-in users. [cs]
dc.rights: © IEEE [en]
dc.rights.access: restrictedAccess [en]
dc.subject.translated: glottal closure instant (GCI) [en]
dc.subject.translated: detection [en]
dc.subject.translated: deep learning [en]
dc.subject.translated: convolutional neural network [en]
dc.title: A Comparison of Convolutional Neural Networks for Glottal Closure Instant Detection from Raw Speech [en]
dc.type: conference paper [cs]
dc.type: ConferenceObject [en]
dc.type.status: Peer-reviewed [en]
dc.type.version: publishedVersion [en]

Files

Original bundle
Name: Matousek_A_Comparison_of_Convolutional_Neural_Networks_ICASSP2021.pdf
Size: 1.76 MB
Format: Adobe Portable Document Format