Discrete Time Dynamic Programming Using Tensor Trains

dc.contributor.authorTichavský, Petr
dc.contributor.authorStraka, Ondřej
dc.contributor.authorPunčochář, Ivo
dc.date.accessioned2026-04-02T18:05:34Z
dc.date.available2026-04-02T18:05:34Z
dc.date.issued2025
dc.date.updated2026-04-02T18:05:34Z
dc.description.abstractDiscrete time dynamic programming has many applications in decision-making and econometrics. In it, one is looking for a so-called value function that obeys a functional equation called the Bellman equation. The difficulty is that the number of variables of the value function can be very high, and a brute-force iteration of the Bellman equation is not feasible. Some authors solve this problem with deep neural networks, which have disadvantages. In this paper, we propose to handle the (sampled) value function in terms of a tensor train in a rectangular grid. Two novel techniques for the function interpolation were proposed. The decomposition has to be repeated in each Bellman iteration. Since the number of the tensor samples is still astronomically large, we propose to decompose the tensor using the TT-cross technique which only uses a fraction of the tensor elements. In this way, it is possible to find approximate solutions to the problem in dimensions where the traditional methods fail. Next, we propose a smoothing operation that may improve the convergence and a novel way of computing the approximation error and estimating the time when the iteration should be halted. The method’s performance is demonstrated in the example of the linear quadratic controller, where the ideal solution is known as the ground truth. Next, the proposed technique is applied to the problem of active fault detection, and its performance is compared to that of the neural network technique.en
dc.format24
dc.identifier.document-number001668929200004
dc.identifier.doi10.1137/24M1672341
dc.identifier.issn1064-8275
dc.identifier.obd43947697
dc.identifier.orcidTichavský, Petr 0000-0003-0621-4766
dc.identifier.orcidStraka, Ondřej 0000-0003-3066-5882
dc.identifier.orcidPunčochář, Ivo 0000-0003-0528-7998
dc.identifier.urihttp://hdl.handle.net/11025/67492
dc.language.isoen
dc.project.IDGA22-11101S
dc.relation.ispartofseriesSIAM Journal on Scientific Computing
dc.rights.accessA
dc.subjectcontrol designen
dc.subjectBellman equationen
dc.subjecttensor trainen
dc.titleDiscrete Time Dynamic Programming Using Tensor Trainsen
dc.typeČlánek v databázi WoS (Jimp)
dc.typeČLÁNEK
dc.type.statusPublished Version
local.files.count1*
local.files.size758676*
local.has.filesyes*
local.identifier.eid2-s2.0-105025201874

Files

Original bundle
Showing 1 - 1 out of 1 results
No Thumbnail Available
Name:
articleSIAMJSC_TiStPu.pdf
Size:
740.89 KB
Format:
Adobe Portable Document Format
License bundle
Showing 1 - 1 out of 1 results
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections