Optimalizace herní strategie agenta zpětnovazebním učením (Reinforcement Learning for Optimizing Agent Strategies)

Seják, Michal

dc.contributor.advisor	Konopík Miloslav, Ing. Ph.D.
dc.contributor.author	Seják, Michal
dc.contributor.referee	Sido Jakub, Ing.
dc.date.accepted	2020-6-16
dc.date.accessioned	2020-11-10T00:39:00Z
dc.date.available	2019-10-7
dc.date.available	2020-11-10T00:39:00Z
dc.date.issued	2020
dc.date.submitted	2020-5-7
dc.description.abstract	Agenti zpětnovazebného učení v současnosti patří mezi nejlepší způsoby, jak řešit obecné úlohy. Konkrétně algoritmus AlphaGo Zero (AZ) se v hraní mnoha deskových her drží v současnosti na nejvyšších příčkách. Nicméně, hodí se pouze na práci s deterministickými adverzálními prostředími a jako takový nenachází ve skutečném světě mnohá uplatnění, jelikož obdržení veškeré informace o běžných procesech je takřka nemožné. V této práci analyzujeme způsob, jakým AZ dosahuje svých výsledků a jak lze tento algoritmus upravit tak, aby řešil obecné stochastické neadverzální problémy, přičemž zavádíme techniku kontroly redundance, pomocí níž lze efektivněji prořezávat stavový strom. Na závěr navrhneme vlastní prostředí a otestujeme, jakých výsledků dosahuje obyčejný algoritmus DQN ve srovnání s upraveným AZ bez a s kontrolou redundance, kde ukážeme, že verze AZ využívající kontrolu redundance dosahuje mnohem kvalitnějších výsledků než ostatní dva algoritmy.	cs
dc.description.abstract-translated	Reinforcement learning agents are currently among the best methods for general problem solving. The AlphaGo Zero (AZ) algorithm in particular achieves state-of-the-art results in many board games. However, it is suited only to deterministic adversarial environments and therefore finds few real-life applications, since obtaining complete information about real-life processes is next to impossible. In this work, we analyze how AZ achieves its results and how the algorithm can be adjusted to solve general stochastic non-adversarial problems, and we introduce a redundancy checking technique that prunes the state tree more effectively. Finally, we design a custom environment and examine how the plain DQN algorithm compares to the adjusted AZ both with and without redundancy checking, showing that the AZ version using redundancy checking considerably outperforms both DQN and the adjusted AZ without it.	en
dc.description.result	Defended	en
dc.format	74 pages	en
dc.format.mimetype	application/pdf
dc.identifier	82945
dc.identifier.uri	http://hdl.handle.net/11025/41802
dc.language.iso	en	en
dc.publisher	Západočeská univerzita v Plzni	cs
dc.rights	The full text of the thesis is accessible without restriction.	en
dc.rights.access	openAccess	en
dc.subject	zpětnovazebné učení	cs
dc.subject	umělá inteligence	cs
dc.subject	prostředí	cs
dc.subject	agent	cs
dc.subject	strategie	cs
dc.subject.translated	reinforcement learning	en
dc.subject.translated	artificial intelligence	en
dc.subject.translated	environment	en
dc.subject.translated	agent	en
dc.subject.translated	strategy	en
dc.thesis.degree-grantor	Západočeská univerzita v Plzni. Fakulta aplikovaných věd	cs
dc.thesis.degree-level	Bachelor's	en
dc.thesis.degree-name	Bc.	cs
dc.thesis.degree-program	Engineering Informatics	en
dc.title	Optimalizace herní strategie agenta zpětnovazebním učením	cs
dc.title.alternative	Reinforcement Learning for Optimizing Agent Strategies	en
dc.type	bachelor's thesis	en
local.relation.IS	https://portal.zcu.cz/StagPortletsJSR168/CleanUrl?urlid=prohlizeni-prace-detail&praceIdno=82945

Files

Original bundle

Name: bachelors.pdf
Size: 849.86 KB
Format: Adobe Portable Document Format
Description: Full text of the thesis

Name: A17B0344P_Posudek.pdf
Size: 130.18 KB
Format: Adobe Portable Document Format
Description: Opponent's review of the thesis

Name: A17B0344P_Hodnoceni.pdf
Size: 104.32 KB
Format: Adobe Portable Document Format
Description: Supervisor's review of the thesis

Name: A17B0344P_Obhajoba.pdf
Size: 74.62 KB
Format: Adobe Portable Document Format
Description: Record of the thesis defense

Collections

Bachelor's works (KIV)