The Discounted Cost Model
2020
In this chapter, let \(\alpha >0\) be a fixed discount factor. We shall consider the \(\alpha \)-discounted CTMDP problems ( 1.15) and ( 1.16). Without loss of generality we can restrict to (relaxed) \(\pi \)-strategies because (Markov) \(\pi \)-strategies are sufficient for \(\alpha \)-discounted problems; see Remark 1.3.1. (In fact, in this chapter, the class of natural Markov strategies is also sufficient, according to ( 2.92). However, we choose to stay with the more general class of \(\pi \)-strategies. The same is true for some of the subsequent chapters.) Although most \(\pi \)-strategies are not realizable, under appropriate conditions, the solution to the unconstrained problem is given by a realizable deterministic stationary strategy (see Theorem 3.1.2). Moreover, as was explained in Sect. 1.3.5, for every \(\pi \)-strategy there exists an equivalent standard \(\xi \)-strategy which is realizable.
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
0
References
0
Citations
NaN
KQI