Discrete Time Dynamic Programming Using Tensor Trains

Tichavský, Petr

Discrete Time Dynamic Programming Using Tensor Trains

dc.contributor.author	Tichavský, Petr
dc.contributor.author	Straka, Ondřej
dc.contributor.author	Punčochář, Ivo
dc.date.accessioned	2026-04-02T18:05:34Z
dc.date.available	2026-04-02T18:05:34Z
dc.date.issued	2025
dc.date.updated	2026-04-02T18:05:34Z
dc.description.abstract	Discrete time dynamic programming has many applications in decision-making and econometrics. In it, one is looking for a so-called value function that obeys a functional equation called the Bellman equation. The difficulty is that the number of variables of the value function can be very high, and a brute-force iteration of the Bellman equation is not feasible. Some authors solve this problem with deep neural networks, which have disadvantages. In this paper, we propose to handle the (sampled) value function in terms of a tensor train in a rectangular grid. Two novel techniques for the function interpolation were proposed. The decomposition has to be repeated in each Bellman iteration. Since the number of the tensor samples is still astronomically large, we propose to decompose the tensor using the TT-cross technique which only uses a fraction of the tensor elements. In this way, it is possible to find approximate solutions to the problem in dimensions where the traditional methods fail. Next, we propose a smoothing operation that may improve the convergence and a novel way of computing the approximation error and estimating the time when the iteration should be halted. The method’s performance is demonstrated in the example of the linear quadratic controller, where the ideal solution is known as the ground truth. Next, the proposed technique is applied to the problem of active fault detection, and its performance is compared to that of the neural network technique.	en
dc.format	24
dc.identifier.document-number	001668929200004
dc.identifier.doi	10.1137/24M1672341
dc.identifier.issn	1064-8275
dc.identifier.obd	43947697
dc.identifier.orcid	Tichavský, Petr 0000-0003-0621-4766
dc.identifier.orcid	Straka, Ondřej 0000-0003-3066-5882
dc.identifier.orcid	Punčochář, Ivo 0000-0003-0528-7998
dc.identifier.uri	http://hdl.handle.net/11025/67492
dc.language.iso	en
dc.project.ID	GA22-11101S
dc.relation.ispartofseries	SIAM Journal on Scientific Computing
dc.rights.access	A
dc.subject	control design	en
dc.subject	Bellman equation	en
dc.subject	tensor train	en
dc.title	Discrete Time Dynamic Programming Using Tensor Trains	en
dc.type	Článek v databázi WoS (Jimp)
dc.type	ČLÁNEK
dc.type.status	Published Version
local.files.count	1	*
local.files.size	758676	*
local.has.files	yes	*
local.identifier.eid	2-s2.0-105025201874

Files

Original bundle

Showing 1 - 1 out of 1 results

Name:: articleSIAMJSC_TiStPu.pdf
Size:: 740.89 KB
Format:: Adobe Portable Document Format

Download

License bundle

Showing 1 - 1 out of 1 results

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Articles (KKY)