FIND ME ON

GitHub

LinkedIn

Information Signal

🌱

Definition
StochasticControl

Definition

The information signal, yi,iNy^{i},i\in\mathcal{N}, is what any agent in a team problem is said to observe. ## Static For each iNi\in\mathcal{N} we define it in terms of the static information function, ηi\eta^{i} s.t. yi=ηi(ξ)η~i(ω),ωΩy^{i}=\eta^{i}(\xi)\equiv \tilde{\eta}^{i}(\omega),\quad\omega \in\Omegawhere ξΞ\xi\in\mathit\Xi is the random state of nature (which is a RV mapping from Ξ\mathit\Xi to Ω\Omega). Given how we tend to treat Ξ    Ω\mathit\Xi\iff\Omega in the definition of ηi\eta^{i} we do similarly here with yi=η~i(ω)y^{i}=\tilde{\eta}^{i}(\omega).

Dynamic

We know that a decision problem is said to be dynamic if the measurements of at least one of the agents involves past actions hence the information function also takes in past measurements as input: yi=ηi(ξ;u),iN(1)\tag{1}y^{i}=\eta^{i}(\xi;\mathbf{u}),\quad i\in\mathcal{N}where the dependence on u\mathbf{u} (the NN-tuple of past actions by each agent) is assumed to be Strictly Causal, which means that under a given fixed clock the information received by each agent can depend only on actions taken in the past.

Let T:={1,,T}\mathcal{T}:=\{ 1,\dots,T \} and tTt\in\mathcal{T}. Let uti,ytiu_{t}^{i},y_{t}^{i} denote the action variable and information signal of agent Ai\mathbf{A}i at time tTt\in\mathcal{T}. Furthermore let ut:{ut1,,utN}yt={yt1,,ytN}\mathbf{u}_{t}:\{ u_{t}^{1},\dots,u_{t}^{N} \}\quad \mathbf{y}_{t}=\{ y_{t}^{1},\dots,y_{t}^{N} \}and u[t0,t1)u[t0,t11]:={ut0,ut0+1,,ut11}{u[t0,t1)1,,u[t0,t1)N}\mathbf{u}_{[t_{0},t_{1})}\equiv \mathbf{u}_{[t_{0},t_{1}-1]}:=\{ \mathbf{u}_{t_{0}},\mathbf{u}_{t_{0}+1},\dots,\mathbf{u}_{t_{1}-1} \}\equiv \{ u^{1}_{[t_{0},t_{1})},\dots,u^{N}_{[t_{0},t_{1})} \}Then, under the strictly causal assumption, (1)(1) becomes equivalent to yti=nti(ξ,u[1,t)),tT,iNy_{t}^{i}=n_{t}^{i}(\xi,\mathbf{u}_{[1,t)}),\quad t\in\mathcal{T},i\in\mathcal{N}for some information functions nti,tT,iNn_{t}^{i},t\in\mathcal{T},i\in\mathcal{N}. The variable ytiYtiy_{t}^{i}\in\mathbb{Y}^{i}_{t} is the on-line information available to Ai\mathbf{A}i which we can use for the construction of utiu_{t}^{i} through an appropriate policy γti:YtiUti\gamma_{t}^{i}:\mathbb{Y}_{t}^{i}\to \mathbb{U}_{t}^{i}: uti=γti(yti)γti(ηti[ξ;u[1,t)]),tT,iNu_{t}^{i}=\gamma_{t}^{i}(y_{t}^{i})\equiv \gamma_{t}^{i}(\eta_{t}^{i}[\xi;\mathbf{u}_{[1,t)}]),\quad t\in\mathcal{T},i\in\mathcal{N}