Speaker-clustered acoustic models evaluated on GPU for on-line subtitling of parliament meetings

Psutka, Josef V.

Speaker-clustered acoustic models evaluated on GPU for on-line subtitling of parliament meetings

dc.contributor.author	Psutka, Josef V.
dc.contributor.author	Vaněk, Jan
dc.contributor.author	Psutka, Josef
dc.date.accessioned	2016-01-06T13:29:22Z
dc.date.available	2016-01-06T13:29:22Z
dc.date.issued	2011
dc.description.abstract-translated	This paper describes the effort with building speaker-clustered acoustic models as a part of the real-time LVCSR system that is used more than one year by the Czech TV for automatic subtitling of parliament meetings broadcasted on the channel ČT24. Speaker-clustered acoustic models are more acoustically homogeneous and therefore give better recognition performance than single gender-independent model or even gender-dependent models. Frequent changes of speakers and a direct connection of the LVCSR system to the audio channel require an automatic switching/fusion of models as quickly as possible. An important part of the solution is real time likelihood evaluations of all clustered acoustic models, taking advantage of a fast GPU(Graphic Processing Unit). The proposed method achieved a WER reduction to the baseline gender-independent model over 2.34% relatively with more than 2M Gaussian mixtures evaluated in real-time.	en
dc.format	7 s.	cs
dc.format.mimetype	application/pdf
dc.identifier.citation	PSUTKA, Josef V.; VANĚK, Jan; PSUTKA, Josef. Speaker-clustered acoustic models evaluated on GPU for on-line subtitling of parliament meetings. In: Text, speech and dialogue. Berlin: Springer, 2011, p. 284-290. (Lectures notes in computer science; 6836). ISBN 978-3-642-23537-5.	en
dc.identifier.doi	10.1007/978-3-642-23538-2_36
dc.identifier.isbn	978-3-642-23537-5
dc.identifier.uri	http://www.kky.zcu.cz/cs/publications/JosefVPsutka_2011_Speaker-clustered
dc.identifier.uri	http://hdl.handle.net/11025/17133
dc.language.iso	en	en
dc.publisher	Springer	en
dc.relation.ispartofseries	Lecture notes in computer science; 6836	en
dc.rights	© Josef V. Psutka - Jan Vaněk - Josef Psutka	cs
dc.rights.access	openAccess	en
dc.subject	akustické modelování	cs
dc.subject	GPU	cs
dc.subject.translated	acoustic models	en
dc.subject.translated	GPU	en
dc.title	Speaker-clustered acoustic models evaluated on GPU for on-line subtitling of parliament meetings	en
dc.type	článek	cs
dc.type	article	en
dc.type.status	Peer-reviewed	en
dc.type.version	publishedVersion	en

Files

Original bundle

Showing 1 - 1 out of 1 results

Name:: JosefVPsutka_2011_Speaker-clustered.pdf
Size:: 158.08 KB
Format:: Adobe Portable Document Format
Description:: Plný text

Download

License bundle

Showing 1 - 1 out of 1 results

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Articles (KKY)
Articles (KIV)