Transformer-based Speech Recognition Models for Oral History Archives in English, German, and Czech

Lehečka, Jan

Transformer-based Speech Recognition Models for Oral History Archives in English, German, and Czech

dc.contributor.author	Lehečka, Jan
dc.contributor.author	Švec, Jan
dc.contributor.author	Psutka, Josef
dc.contributor.author	Ircing, Pavel
dc.date.accessioned	2025-06-20T08:43:51Z
dc.date.available	2025-06-20T08:43:51Z
dc.date.issued	2023
dc.date.updated	2025-06-20T08:43:51Z
dc.description.abstract	This paper is a step forward in our effort to make vast oral history archives more accessible to the public and researchers by breaking down the decoding barriers between the knowledge encoded in the spoken testimonies and users who want to search for the information of their interest. We present new Transformer-based monolingual models suitable for speech recognition of oral history archives in English, German, and Czech. Our experiments show that although the all-purpose speech recognition systems have recently made tremendous progress, the transcription of oral history archives is still a challenging task for them; our tailored models significantly outperformed larger public multilingual models and scored new state-of-the-art results on all tested datasets. Due to the 2-phase fine-tuning process, our models are robust and can be used for oral history archives of various domains. We publicly release our models within a public speech recognition service.	en
dc.format	5
dc.identifier.doi	10.21437/Interspeech.2023-872
dc.identifier.isbn	neuvedeno
dc.identifier.issn	2308-457X
dc.identifier.obd	43940687
dc.identifier.orcid	Lehečka, Jan 0000-0002-3889-8069
dc.identifier.orcid	Švec, Jan 0000-0001-8362-5927
dc.identifier.orcid	Psutka, Josef 0000-0003-4761-1645
dc.identifier.orcid	Ircing, Pavel 0000-0001-6967-1687
dc.identifier.uri	http://hdl.handle.net/11025/60794
dc.language.iso	en
dc.project.ID	90254
dc.project.ID	GA22-27800S
dc.publisher	International Speech Communication Association
dc.relation.ispartofseries	INTERSPEECH 2023
dc.subject	speech recognition	en
dc.subject	oral history archives	en
dc.title	Transformer-based Speech Recognition Models for Oral History Archives in English, German, and Czech	en
dc.type	Stať ve sborníku (D)
dc.type	STAŤ VE SBORNÍKU
dc.type.status	Published Version
local.files.count	1	*
local.files.size	213427	*
local.has.files	yes	*
local.identifier.eid	2-s2.0-85171577630

Files

Original bundle

Showing 1 - 1 out of 1 results

Name:: Lehecka_Svec_PsutkaJV_Ircing_interspeech_2023.pdf
Size:: 208.42 KB
Format:: Adobe Portable Document Format

Download

License bundle

Showing 1 - 1 out of 1 results

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Conference Papers (KKY)