Suppose the cost function is non-negative. Consider the successive iteration vn(x)=umin⎩⎨⎧c(x,u)+βX∫vn−1(y)T(dy∣x,u)⎭⎬⎫,∀x,n≥1with v0(x)=0,∀x∈X. Then, vn is a monotonically non-decreasing sequence. If this sequence converges point wise to a function v where v(x)=c(x,f(x))+β∫v(y)T(dy∣x,f(x))is such that with γ={f,f,…} then n→∞limβnExγ[v(xn)]=0which means γ is optimal and v is the value function.