Theorem (Bellman’s Optimality Principle)
Consider the finite horizon optimization problem J(X,γ)=Exγ[k=0∑N−1c(Xk,Uk)+cN(XN)]If ∃J0,…,JN−1,f0,…,fN−1 where JN(x)=cN(x)and for 0≤t≤N−1 Jt(x)=u∈Umin(c(x,u)+E[Jt+1(xt+1)∣xt=x,ut=u])=c(x,ft(x))+E[Jt+1(xt+1)∣xt=x,ut=ft(x)]then we have that γ∈ΓAinfJN(x)=J0(x)and γ∗={f0,…,fN−1} is optimal.