Diarization Based on Identification with X-Vectors

Zajíc, Zbyněk

dc.contributor.author	Zajíc, Zbyněk
dc.contributor.author	Psutka, Josef
dc.contributor.author	Müller, Luděk
dc.date.accessioned	2021-02-22T11:00:22Z
dc.date.available	2021-02-22T11:00:22Z
dc.date.issued	2020
dc.description.abstract	V tomto článku popisujeme diarizaci mono telefonních dat z Jazykové poradny Ústavu pro jazyk český. Náš navrhovaný přístup k diarizaci využívá informace o identitě jednoho z účastníků hovoru. V klasickém přístupu k diarizaci nahrazujeme shlukování x-vektorů identifikací řečníka.	cs
dc.description.abstract-translated	In this paper, we describe the diarization of mono-channel telephone recordings from the Language Consulting Center, which provides a Czech language consultancy service. Our proposed approach to diarization uses the known identity of one speaker (the language counsellor), acquired from the text transcription at the beginning of the conversation. In state-of-the-art diarization based on x-vector clustering, we replace the clustering step with the identification of each segment of the recording against the counsellor's identity x-vector and the general x-vector model that represents the client. Without the resegmentation step, our proposed diarization can be used as an online approach. Because of the uniqueness of our data, we compare our results with the Kaldi diarization as the baseline system.	en
dc.format	12 s.	cs
dc.format.mimetype	application/pdf
dc.identifier.citation	ZAJÍC, Z., PSUTKA, J., MÜLLER, L. Diarization Based on Identification with X-Vectors. In: Speech and Computer, 22nd International Conference, SPECOM 2020, St. Petersburg, Russia, October 7-9, 2020, Proceedings. Cham: Springer, 2020. s. 667-678. ISBN 978-3-030-60275-8, ISSN 0302-9743.	cs
dc.identifier.doi	10.1007/978-3-030-60276-5_64
dc.identifier.isbn	978-3-030-60275-8
dc.identifier.issn	0302-9743
dc.identifier.obd	43930813
dc.identifier.uri	2-s2.0-85092921730
dc.identifier.uri	http://hdl.handle.net/11025/42726
dc.language.iso	en	en
dc.project.ID	LM2018101/LINDAT/CLARIAH-CZ – Digitální výzkumná infrastruktura pro jazykové technologie, umění a humanitní vědy	cs
dc.project.ID	90101/Velká výzkumná infrastruktura povinnost (J) - LINDAT/CLARIAH-CZ	cs
dc.project.ID	DG16P02B009/Zpřístupnění dotazů jazykové poradny v lingvisticky strukturované databázi	cs
dc.project.ID	LM2015042/E-infrastruktura CESNET	cs
dc.project.ID	90042/Velká výzkumná infrastruktura povinnost (J) - CESNET II	cs
dc.publisher	Springer	en
dc.relation.ispartofseries	Speech and Computer, 22nd International Conference, SPECOM 2020, St. Petersburg, Russia, October 7-9, 2020, Proceedings	en
dc.rights	Plný text není přístupný.	cs
dc.rights	© Springer	en
dc.rights.access	closedAccess	en
dc.subject	diarizace, identifikace, x-vektor, automatické rozpoznávání řeči	cs
dc.subject.translated	Diarization, Identification, X-vector, Automatic speech recognition	en
dc.title	Diarization Based on Identification with X-Vectors	en
dc.title.alternative	Diarizace založená na identifikaci pomocí x-vektorů	cs
dc.type	konferenční příspěvek	cs
dc.type	conferenceObject	en
dc.type.status	Peer-reviewed	en
dc.type.version	publishedVersion	en

Collections

OBD
Preprints (KKY)
Preprints (NTIS)
