Findings of the Shared Task on Multilingual Coreference Resolution

Žabokrtský, Zdeněk

dc.contributor.author	Žabokrtský, Zdeněk
dc.contributor.author	Konopík, Miloslav
dc.contributor.author	Nedoluzhko, Anna
dc.contributor.author	Novák, Michal
dc.contributor.author	Ogrodniczuk, Maciej
dc.contributor.author	Popel, Martin
dc.contributor.author	Pražák, Ondřej
dc.contributor.author	Sido, Jakub
dc.contributor.author	Zeman, Daniel
dc.contributor.author	Zhu, Yilun
dc.date.accessioned	2025-06-20T08:16:20Z
dc.date.available	2025-06-20T08:16:20Z
dc.date.issued	2022
dc.date.updated	2025-06-20T08:16:19Z
dc.description.abstract	This paper presents an overview of the shared task on multilingual coreference resolution associated with the CRAC 2022 workshop. Shared task participants were asked to develop trainable systems capable of identifying mentions and clustering them according to identity coreference. The public edition of CorefUD 1.0, which contains 13 datasets for 10 languages, was used as the source of training and evaluation data. The CoNLL score used in previous coreference-oriented shared tasks served as the main evaluation metric. Eight coreference prediction systems were submitted by five participating teams; in addition, the organizers provided a competitive Transformer-based baseline system at the beginning of the shared task. The winning system outperformed the baseline by 12 percentage points (in terms of the CoNLL scores averaged across all datasets for individual languages).	en
dc.description.abstract	Tento článek představuje přehled otevřené úlohy týkající se vícejazyčného hledání koreferencí spojené s workshopem CRAC 2022. Účastníci měli vyvinout systémy schopné identifikovat entity a shlukovat je podle identity koreference. Veřejné vydání CorefUD 1.0, které obsahuje 13 datasetů pro 10 jazyků, bylo použito jako zdroj trénovacích dat. Jako evaluační metriku jsme použili CoNLL skóre používané v dřívějších úlohách na koreference. Bylo odevzdáno 8 systémů z pěti různých týmů; dále byl vytvořen základní systém založený na architektuře Transformer, který poskytli organizátoři na začátku úlohy. Vítězný systém překonal základní systém o 12 procentních bodů CoNLL skóre zprůměrovaného přes všechny datové sady.	cz
dc.format	18
dc.identifier.isbn	not stated
dc.identifier.issn	2951-2093
dc.identifier.obd	43936918
dc.identifier.orcid	Konopík, Miloslav 0000-0001-7397-1658
dc.identifier.orcid	Pražák, Ondřej 0000-0001-5445-7792
dc.identifier.orcid	Sido, Jakub 0000-0002-7709-7512
dc.identifier.uri	http://hdl.handle.net/11025/59289
dc.language.iso	en
dc.project.ID	SGS-2022-016
dc.publisher	Association for Computational Linguistics
dc.relation.ispartofseries	CRAC 2022 Shared Task on Multilingual Coreference Resolution
dc.subject	Coreference resolution, shared task, multilingual dataset, semantics	en
dc.subject	Hledání koreferencí, vícejazyčná datová sada, otevřená úloha, zpracování sémantiky textu	cz
dc.title	Findings of the Shared Task on Multilingual Coreference Resolution	en
dc.title	Poznatky z otevřené úlohy vícejazyčného hledání koreferencí	cz
dc.type	Stať ve sborníku (O)
dc.type	STAŤ VE SBORNÍKU
dc.type.status	Published Version
local.files.count	1	*
local.files.size	359062	*
local.has.files	yes	*

Files

Original bundle

Name: Pražák a kol. 2022.crac-mcr.1.pdf
Size: 350.65 KB
Format: Adobe Portable Document Format

Collections

Conference Papers (KIV)