One Model is Not Enough: Ensembles for Isolated Sign Language Recognition

Hrúz, Marek

One Model is Not Enough: Ensembles for Isolated Sign Language Recognition

dc.contributor.author	Hrúz, Marek
dc.contributor.author	Gruber, Ivan
dc.contributor.author	Kanis, Jakub
dc.contributor.author	Boháček, Matyáš
dc.contributor.author	Hlaváč, Miroslav
dc.contributor.author	Krňoul, Zdeněk
dc.date.accessioned	2023-03-06T11:00:26Z
dc.date.available	2023-03-06T11:00:26Z
dc.date.issued	2022
dc.description.abstract-translated	In this paper, we dive into sign language recognition, focusing on the recognition of isolated signs. The task is defined as a classification problem, where a sequence of frames (i.e., images) is recognized as one of the given sign language glosses. We analyze two appearance-based approaches, I3D and TimeSformer, and one pose-based approach, SPOTER. The appearance-based approaches are trained on a few different data modalities, whereas the performance of SPOTER is evaluated on different types of preprocessing. All the methods are tested on two publicly available datasets: AUTSL and WLASL300. We experiment with ensemble techniques to achieve new state-of-the-art results of 73.84% accuracy on the WLASL300 dataset by using the CMA-ES optimization method to find the best ensemble weight parameters. Furthermore, we present an ensembling technique based on the Transformer model, which we call Neural Ensembler.	en
dc.format	17 s.	cs
dc.format.mimetype	application/pdf
dc.identifier.citation	HRÚZ, M. GRUBER, I. KANIS, J. BOHÁČEK, M. HLAVÁČ, M. KRŇOUL, Z. One Model is Not Enough: Ensembles for Isolated Sign Language Recognition. SENSORS, 2022, roč. 22, č. 13, s. nestránkováno. ISSN: 1424-8220	cs
dc.identifier.document-number	824167200001
dc.identifier.doi	10.3390/s22135043
dc.identifier.issn	1424-8220
dc.identifier.obd	43937108
dc.identifier.uri	2-s2.0-85133217387
dc.identifier.uri	http://hdl.handle.net/11025/51652
dc.language.iso	en	en
dc.project.ID	TN01000024/Národní centrum kompetence - Kybernetika a umělá inteligence	cs
dc.project.ID	90042/Velká výzkumná infrastruktura povinnost (J) - CESNET II	cs
dc.project.ID	EF15_003/0000466/Umělá inteligence a uvažování	cs
dc.publisher	MDPI	en
dc.relation.ispartofseries	SENSORS	en
dc.rights	© authors	en
dc.rights.access	openAccess	en
dc.subject.translated	sign language recognition	en
dc.subject.translated	CNN	en
dc.subject.translated	Transformer	en
dc.subject.translated	ensemble	en
dc.title	One Model is Not Enough: Ensembles for Isolated Sign Language Recognition	en
dc.type	článek	cs
dc.type	article	en
dc.type.status	Peer-reviewed	en
dc.type.version	publishedVersion	en

Files

Original bundle

Showing 1 - 1 out of 1 results

Name:: sensors-22-05043-v3-2.pdf
Size:: 1.09 MB
Format:: Adobe Portable Document Format

Download

Collections

OBD
Articles (KKY)
Articles (NTIS)