Training of speaker-clustered discriminative acoustic models for use in real-time recognizers

Vaněk, Jan

Training of speaker-clustered discriminative acoustic models for use in real-time recognizers

dc.contributor.author	Vaněk, Jan
dc.contributor.author	Psutka, Josef V.
dc.contributor.author	Zelinka, Jan
dc.contributor.author	Trmal, Jan
dc.date.accessioned	2015-12-10T13:20:32Z
dc.date.available	2015-12-10T13:20:32Z
dc.date.issued	2010
dc.description.abstract	Je dobře známo, že akustické modely, založené na informaci o pohlaví řečníka, jsou více akusticky homogenní, a proto dosahují lepších výsledků rozpoznávání než jeden univerzální akustický model v případě, že je pohlaví řečníka úspěšně detekováno, nebo předem známo. Řečníci ovšem nemusí být rozděleni jen do dvou skupin. V tomto článku je popsán algoritmus, který je schopen vytvořit větší množství shluků řečníků. Dále se tento článek zabývá problémem vhodného použití těchto modelů v systémech rozpoznávání řeči pracujících v reálném čase, kde informace od detektoru správného shluku řečníků je často zpožděná nebo nesprávná. Dále jsou ještě v článku diskutovány různé přístupy k začlenění diskriminativních metod při trénování těchto akustických modelů.	cs
dc.description.abstract-translated	It is well known that gender-dependent (male/female) acoustic models are more acoustically homogeneous and therefore give better recognition performance than single gender-independent model in the case where the gender is successfully detected or a priory known. Speakers do not need to be split to two groups only. An algorithm to make higher number of speaker clusters is described in this paper. Further, the paper deals with a problem how to use these gender-based or speaker-clustered acoustic models in a real-time LVCSR where information from an automatic cluster detector is often delayed or incorrect. Moreover, various ways, how to incorporate discriminative training methods into training of the speaker-clustered acoustic models, are discussed in the paper.	en
dc.format	7 s.	cs
dc.format.mimetype	application/pdf
dc.identifier.citation	VANĚK, Jan; PSUTKA, Josef V.; ZELINKA, Jan; TRMAL, Jan. Training of speaker-clustered discriminative acoustic models for use in real-time recognizers. In: Speech processing. Prague: Institute of photonics and electronics AS CR , 2010, p. 152-158. ISBN 978-80-86269-21-4.	en
dc.identifier.isbn	978-80-86269-21-4
dc.identifier.uri	http://hdl.handle.net/11025/16957
dc.language.iso	en	en
dc.publisher	Institute of photonics and electronics AS CR	en
dc.rights	© Jan Vaněk - Josef V. Psutka - Jan Zelinka - Jan Trmal	cs
dc.rights.access	openAccess	en
dc.subject	model shlukování řečníků	cs
dc.subject	akustické modelování	cs
dc.subject	automatické rozpoznávání řeči	cs
dc.subject.translated	speaker-clustered model	en
dc.subject.translated	acoustics modeling	en
dc.subject.translated	automatic speech recognition	en
dc.title	Training of speaker-clustered discriminative acoustic models for use in real-time recognizers	en
dc.title.alternative	Trénování diskriminativních akustických modelů založených na shlucích řečníků pro rozpoznávání řeči pracujícím v reálném čase	cs
dc.type	článek	cs
dc.type	article	en
dc.type.status	Peer-reviewed	en
dc.type.version	publishedVersion	en

Files

Original bundle

Showing 1 - 1 out of 1 results

Name:: VanekJan_2010_Trainingof.pdf
Size:: 184.32 KB
Format:: Adobe Portable Document Format
Description:: Plný text

Download

License bundle

Showing 1 - 1 out of 1 results

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Articles (KKY)
Articles (KIV)