Markov Decision Making Flashcards Preview

Flashcards in Markov Decision Making Deck (7)
1

Deterministic Dynamic Programming (DDP)

for any node, the next node is fully determined by action K

2

Stages

vertical levels
e.g. time

3

States

horizontal levels
e.g. stock levels, success

4

States at a stage

nodes

5

Actions

arcs between nodes from consecutive stages

6

Return

immediate gains from an action (associated with an arc)
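Cards 1 to 6 can be tied together in a short sketch of deterministic dynamic programming solved by backward recursion. The network, the state names, the action names and the function `solve_ddp` are all hypothetical, chosen only for illustration; the sketch also assumes the goal is to maximise total return and that terminal nodes are worth 0.

```python
# Hypothetical network: arcs[stage][state] is a list of
# (action, next_state, return) tuples, one per arc leaving that node.
arcs = [
    {"A": [("up", "B", 2.0), ("down", "C", 5.0)]},        # stage 0
    {"B": [("up", "D", 6.0)], "C": [("up", "D", 1.0)]},   # stage 1
]
terminal_states = ["D"]  # nodes after the last stage (assumed value 0)


def solve_ddp(arcs, terminal_states):
    """Backward recursion over stages; returns best totals and best actions per node."""
    n_stages = len(arcs)
    value = {(n_stages, s): 0.0 for s in terminal_states}
    policy = {}
    for stage in range(n_stages - 1, -1, -1):
        for state, options in arcs[stage].items():
            best_action, best_total = None, float("-inf")
            for action, next_state, ret in options:
                # Deterministic: the action alone fixes the next node.
                total = ret + value[(stage + 1, next_state)]
                if total > best_total:
                    best_action, best_total = action, total
            value[(stage, state)] = best_total
            policy[(stage, state)] = best_action
    return value, policy


ddp_value, ddp_policy = solve_ddp(arcs, terminal_states)
print(ddp_value[(0, "A")], ddp_policy[(0, "A")])
```

Here each dictionary in `arcs` is one stage, each key is a state at that stage (a node), and each tuple is an action (an arc) with its immediate return.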

7

Stochastic Dynamic Programming (SDP)

for any node, the next node depends on both the action K and the outcome d, where d is a random variable
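A minimal sketch of the stochastic case, under stated assumptions: each action leads to several possible outcomes d, each with a known probability, and the best action is the one with the highest expected total return. The stock example, the probabilities, the function `solve_sdp` and the choice to let the return depend on the outcome are hypothetical and not taken from the deck.

```python
# Hypothetical network: sdp_arcs[stage][state][action] is a list of
# (probability, next_state, return) tuples, one per random outcome d.
sdp_arcs = [
    {
        "low_stock": {
            "order": [(0.5, "sold", 10.0), (0.5, "unsold", -2.0)],
            "wait": [(1.0, "unsold", 0.0)],
        }
    },
]
sdp_terminal_states = ["sold", "unsold"]  # assumed terminal value 0


def solve_sdp(arcs, terminal_states):
    """Backward recursion that maximises expected total return at each node."""
    n_stages = len(arcs)
    value = {(n_stages, s): 0.0 for s in terminal_states}
    policy = {}
    for stage in range(n_stages - 1, -1, -1):
        for state, actions in arcs[stage].items():
            best_action, best_expected = None, float("-inf")
            for action, outcomes in actions.items():
                # Stochastic: average over the outcomes d of this action.
                expected = sum(
                    p * (ret + value[(stage + 1, next_state)])
                    for p, next_state, ret in outcomes
                )
                if expected > best_expected:
                    best_action, best_expected = action, expected
            value[(stage, state)] = best_expected
            policy[(stage, state)] = best_action
    return value, policy


sdp_value, sdp_policy = solve_sdp(sdp_arcs, sdp_terminal_states)
print(sdp_value[(0, "low_stock")], sdp_policy[(0, "low_stock")])
```

The only structural change from the deterministic case is that an action no longer maps to a single next node but to a probability distribution over next nodes.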