Let (X,d) be a Polish metric space, and let 1≤p<∞. For any two Probability Measures μ,ν on X the Wasserstein Metric of order p between μ and ν is defined by the formula Wp(μ,ν)=π∈Π(μ,ν)infX×X∫d(x,y)pdπ(x,y)1/p=inf{[E[d(X,Y)p]1/p,law(X)=μ,law(Y)=ν}
Definition (Wasserstein-1)
The Wasserstein metric of order 1 for any two Measures μ,ν∈P(X) is defined as W1(μ,ν)=η∈H(μ,ν)infX×X∫η(dx,dy)∣x−y∣where H(μ,ν) denotes the set of probability measures on X×X with the first marginal μ and second marginal ν. Furthermore we equivalently have W1(μ,ν)=∥f∥Lip≤1sup∫fdμ−∫fdνwhere ∥f∥Lip:=x=ysupdX(x,y)f(x)−f(y)and dX is the metric on X.
Definition (Wasserstein convergence)
Let (μn)n∈N⊂P(X),μ∈P(X). We say that μn→μ in W1(μn,μ) (the Wasserstein metric) if W1(μn,μ)→0 as n→∞
Definition (Wasserstein space)
With the same convention defined in the Wasserstein metric the Wasserstein space of order p is defined as Pp(X):=⎩⎨⎧μ∈P(X)X∫d(x0,x)pμ(dx)<+∞⎭⎬⎫x0∈Xwhere x0∈X is arbitrary. This space does not depend on the choice of the point x0. Then Wp defines a finite Metric on Pp(X).
Theorem (Wasserstein distance metrizes Wasserstein space (Villani))
Let (X,d) be a Polish space, and 1≤p<∞; then the Wasserstein metricWpmetrizes the weak convergence inPp(X). i.e. if (μk)k∈N⊂Pp(X) and μ∈Pp(X), then the statements μk converges weakly in Pp(X)toμand Wp(μk,μ)→0are equivalent.
Definition (Weak convergence in Wasserstein space)
Let (X,d) be a Polish space, and 1≤p<∞. Let (μk)k∈N⊂Pp(X) and μ∈Pp(X) where Pp(X) is the . Then μk is said to converge weakly in Pp(X) if any one of the following properties is satisfied for any x0∈X:
μk→μ and ∫d(x0,x)pdμk(x)→∫d(x0,x)pdμ(x)
μk→μ and k→∞limsup∫d(x0,x)pdμk(x)≤∫d(x0,x)pdμ(x)
μk→μ and R→∞limk→∞limsupd(x0,x)≥R∫d(x0,x)pdμk(x)=0
For all continuous functions φ with ∣φ(x)∣≤C(1+d(x0,x)p), C∈R, one has ∫φ(x)dμk(x)→∫φ(x)dμ(x)