A Partially Observed Markov Decision Process a.k.a. POMDP, is a seven tuple (X,U,Y,K,T,Q,c) where: - X is the state space, a subset of a Polish space. - U is the action space, a subset of a Polish space. - Y is the observation space, a subset of a Polish space. - K={(x,u):uāU(x),xāX} is the set of state-control pairs that are feasible. - T:XĆUā[0,1]ā£X⣠is the state transition kernel i.e.Ā T(Aā£xtā,utā)=P(xt+1āāAā£xtā,utā)where AāB(X). - Q:Xā[0,1]ā£Y⣠is the observation channel i.e.Ā Q(Aā£xtā)=P(ytāāAā£xtā)where AāB(Y). - c:KāR is the cost function