Processing math: 100%

Sunday, July 31, 2011

Notes on Dynamic Systems and the Value Function

 

Dynamic systems can be described by differential and difference equations. Without a loss of generality, consider a dynamic system represented by a difference equation: xk+1=f(xk). The state of the system is represented by x and the function f is the mapping from one state to the next. One way to characterize a dynamic system is with an additive cost: J. An additive cost summarizes the cost of operation for the system from some initial state, x0. To ensure that the sums are finite an exponential weighting factor, alpha, is introduced. This factor has a value between between 0 and 1.  Under some circumstances, alpha can be equal to one. One special case is where the system is guaranteed to eventually enter a zero cost state. However, in general, it will need to be less than one. This additive cost is a function of each initial condition:

 

J(x0)=k=0(αkc(xk)).

(1)

The value of the additive cost can be solved using the dynamic programming equations:

 

V(x)=c(x)+αkV(f(x)).

(2)

The function V is referred to as the value function.

This work is licensed under a Creative Commons Attribution By license.

No comments:

Post a Comment