Transformer-based Speech Recognition Models for Oral History Archives in English, German, and Czech

dc.contributor.authorLehečka, Jan
dc.contributor.authorŠvec, Jan
dc.contributor.authorPsutka, Josef
dc.contributor.authorIrcing, Pavel
dc.date.accessioned2025-06-20T08:43:51Z
dc.date.available2025-06-20T08:43:51Z
dc.date.issued2023
dc.date.updated2025-06-20T08:43:51Z
dc.description.abstractThis paper is a step forward in our effort to make vast oral history archives more accessible to the public and researchers by breaking down the decoding barriers between the knowledge encoded in the spoken testimonies and users who want to search for the information of their interest. We present new Transformer-based monolingual models suitable for speech recognition of oral history archives in English, German, and Czech. Our experiments show that although the all-purpose speech recognition systems have recently made tremendous progress, the transcription of oral history archives is still a challenging task for them; our tailored models significantly outperformed larger public multilingual models and scored new state-of-the-art results on all tested datasets. Due to the 2-phase fine-tuning process, our models are robust and can be used for oral history archives of various domains. We publicly release our models within a public speech recognition service.en
dc.format5
dc.identifier.doi10.21437/Interspeech.2023-872
dc.identifier.isbnneuvedeno
dc.identifier.issn2308-457X
dc.identifier.obd43940687
dc.identifier.orcidLehečka, Jan 0000-0002-3889-8069
dc.identifier.orcidŠvec, Jan 0000-0001-8362-5927
dc.identifier.orcidPsutka, Josef 0000-0003-4761-1645
dc.identifier.orcidIrcing, Pavel 0000-0001-6967-1687
dc.identifier.urihttp://hdl.handle.net/11025/60794
dc.language.isoen
dc.project.ID90254
dc.project.IDGA22-27800S
dc.publisherInternational Speech Communication Association
dc.relation.ispartofseriesINTERSPEECH 2023
dc.subjectspeech recognitionen
dc.subjectoral history archivesen
dc.titleTransformer-based Speech Recognition Models for Oral History Archives in English, German, and Czechen
dc.typeStať ve sborníku (D)
dc.typeSTAŤ VE SBORNÍKU
dc.type.statusPublished Version
local.files.count1*
local.files.size213427*
local.has.filesyes*
local.identifier.eid2-s2.0-85171577630

Files

Original bundle
Showing 1 - 1 out of 1 results
No Thumbnail Available
Name:
Lehecka_Svec_PsutkaJV_Ircing_interspeech_2023.pdf
Size:
208.42 KB
Format:
Adobe Portable Document Format
License bundle
Showing 1 - 1 out of 1 results
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: