Dynamic systems can be described by differential and difference equations. Without a loss of generality, consider a dynamic system represented by a difference equation: xk+1=f(xk). The state of the system is represented by x and the function f is the mapping from one state to the next. One way to characterize a dynamic system is with an additive cost: J. An additive cost summarizes the cost of operation for the system from some initial state, x0. To ensure that the sums are finite an exponential weighting factor, alpha, is introduced. This factor has a value between between 0 and 1. Under some circumstances, alpha can be equal to one. One special case is where the system is guaranteed to eventually enter a zero cost state. However, in general, it will need to be less than one. This additive cost is a function of each initial condition:
J(x0)=∑∞k=0(αk⋅c(xk)). | (1) |
The value of the additive cost can be solved using the dynamic programming equations:
V(x)=c(x)+αk⋅V(f(x)). | (2) |
The function V is referred to as the value function.
No comments:
Post a Comment