Radner Krainak Theorem

Theorem (2.4.5)

Let

$\{ J;\Gamma^{i},i\in\mathcal{N} \}$ be a static stochastic team problem where $\mathbb{U}^{i}\equiv \mathbb{R}^{m_{i}},i\in\mathcal{N}$ (i.e. uncountable);
the loss function $L(\xi,\mathbf{u})$ is convex and continuously differentiable in $\mathbf{u}$ a.s.;
$J(\underline{\gamma})$ is bounded from below on $\mathbf{\Gamma}$;
$\underline{\gamma}^{*}$ be a policy $N$-tuple with a finite cost, and suppose that for every $\underline{\gamma}\in\mathbf{\Gamma}$ s.t. $J(\underline{\gamma})<\infty$,

$$\sum_{i\in\mathcal{N}}E\left[\nabla_{u^{i}}L(\xi;\underline{\gamma}^{*}(\mathbf{y}))[\gamma^{i}(y^{i})-\gamma^{i*}(y^{i})]\right]\ge 0, \tag{⭐}$$

where $\nabla_{u^{i}}L(\xi;\underline{\gamma}^{*}(\mathbf{y}))$ denotes the partial derivatives of $L$ with respect to $u^{i}$, evaluated under the policy $\underline{\gamma}^{*}$.

Then, $\underline{\gamma}^{*}$ is a team-optimal policy, and it is unique if $L$ is strictly convex in $\mathbf{u}$.
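
A short sketch of why (⭐) is sufficient may help here. It assumes the usual expected-loss definition $J(\underline{\gamma})=E[L(\xi,\underline{\gamma}(\mathbf{y}))]$ (not restated in this note) and that the expectation of the right-hand side below is well defined. Convexity and differentiability of $L$ in $\mathbf{u}$ give, a.s., the first-order inequality

$$L(\xi,\underline{\gamma}(\mathbf{y}))-L(\xi,\underline{\gamma}^{*}(\mathbf{y}))\ge\sum_{i\in\mathcal{N}}\nabla_{u^{i}}L(\xi;\underline{\gamma}^{*}(\mathbf{y}))[\gamma^{i}(y^{i})-\gamma^{i*}(y^{i})].$$

Taking expectations on both sides and then applying (⭐),

$$J(\underline{\gamma})-J(\underline{\gamma}^{*})\ge\sum_{i\in\mathcal{N}}E\left[\nabla_{u^{i}}L(\xi;\underline{\gamma}^{*}(\mathbf{y}))[\gamma^{i}(y^{i})-\gamma^{i*}(y^{i})]\right]\ge 0,$$

so $J(\underline{\gamma}^{*})\le J(\underline{\gamma})$ for every $\underline{\gamma}$ with finite cost, and policies with infinite cost cannot do better. If $L$ is strictly convex in $\mathbf{u}$, the first inequality is strict wherever $\underline{\gamma}(\mathbf{y})\neq\underline{\gamma}^{*}(\mathbf{y})$, which is what suggests uniqueness (up to a.s. equality).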

Remark

As noted in the textbook, this theorem is needed because we now consider uncountable (but still finite-dimensional) measurement spaces. In that setting, hypothesis one, $(c.1)$, no longer implies that the policy space $\mathbf{\Gamma}$ is compact.

Assumptions

$(c.5)$

For all $\underline{\gamma}\in\mathbf{\Gamma}$ s.t. $J(\underline{\gamma})<\infty$, the following random variables are integrable: $\nabla_{u^{i}}L(\xi;\underline{\gamma}^{*}(\mathbf{y}))[\gamma^{i}(y^{i})-\gamma^{i*}(y^{i})],\quad i\in\mathcal{N}$.

$(c.6)$

$\Gamma^{i}$ is a Hilbert space for each $i\in\mathcal{N}$, and $J(\underline{\gamma})<\infty$ for all $\underline{\gamma}\in\mathbf{\Gamma}$. Furthermore, $E_{\xi|y^{i}}\left[ \nabla_{u^{i}}L(\xi;\underline{\gamma}^{*}(\mathbf{y})) \right]\in\Gamma^{i},\quad i\in\mathcal{N}$.

>[!thm] Stationary Radner Krainak
>Let $\{ J;\Gamma^{i},i\in\mathcal{N} \}$ be a static stochastic team problem which satisfies all of the hypotheses of Theorem 2.4.5, with the exception of (⭐). Instead, let either $(c.5)$ or $(c.6)$ hold. Then, if $\underline{\gamma}^{*}\in\mathbf{\Gamma}$ is a stationary policy, it is also team-optimal. Such a policy is unique if $L(\xi;\mathbf{u})$ is strictly convex in $\mathbf{u}$, a.s.
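
A sketch of how stationarity can lead back to (⭐) under $(c.5)$. This note does not state the definition of a stationary policy, so the sketch assumes it can be read as $E_{\xi|y^{i}}\left[\nabla_{u^{i}}L(\xi;\underline{\gamma}^{*}(\mathbf{y}))\right]=0$ a.s. for each $i\in\mathcal{N}$, i.e. that the gradient and the conditional expectation may be interchanged. Since $\gamma^{i}(y^{i})-\gamma^{i*}(y^{i})$ depends on $y^{i}$ alone and the product is integrable by $(c.5)$, conditioning on $y^{i}$ gives

$$E\left[\nabla_{u^{i}}L(\xi;\underline{\gamma}^{*}(\mathbf{y}))[\gamma^{i}(y^{i})-\gamma^{i*}(y^{i})]\right]=E\left[E_{\xi|y^{i}}\left[\nabla_{u^{i}}L(\xi;\underline{\gamma}^{*}(\mathbf{y}))\right][\gamma^{i}(y^{i})-\gamma^{i*}(y^{i})]\right]=0,\quad i\in\mathcal{N}.$$

Summing over $i\in\mathcal{N}$ yields (⭐) with equality, so under these assumptions the conclusion of Theorem 2.4.5 applies to $\underline{\gamma}^{*}$.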
