Definition (Typical Set)
For a DMS { X i } i = 1 ∞ \{X_i\}_{i=1}^\infty { X i } i = 1 ∞ with pmf p X p_X p X on X \mathcal{X} X and entropy H ( X ) H(X) H ( X ) , given integer n ≥ 1 n\ge 1 n ≥ 1 and ϵ > 0 \epsilon>0 ϵ > 0 , the typical set A ϵ ( n ) A_\epsilon^{(n)} A ϵ ( n ) with respect to the source is A ϵ ( n ) = { a n ∈ X n : ∣ − 1 n log 2 ( p X n ( a n ) ) − H ( X ) ∣ ≤ ϵ } = { a n ∈ X n : 2 − n ( H ( X ) + ϵ ) ≤ p X n ( a n ) ≤ 2 − n ( H ( X ) − ϵ ) } \begin{align*}
A_\epsilon^{(n)}&=\left\{a^{n}\in X^{n}:\left|-\frac{1}{n}\log_2(p_{X^{n}}(a^{n}))-H(X)\right|\le\epsilon\right\}\\\\
&=\left\{a^{n}\in X^{n}:2^{-n(H(X)+\epsilon)}\le p_{X^{n}}(a^{n})\le2^{-n(H(X)-\epsilon)}\right\}
\end{align*} A ϵ ( n ) = { a n ∈ X n : − n 1 log 2 ( p X n ( a n )) − H ( X ) ≤ ϵ } = { a n ∈ X n : 2 − n ( H ( X ) + ϵ ) ≤ p X n ( a n ) ≤ 2 − n ( H ( X ) − ϵ ) } or the typical set is the set of all n-tuples a n = ( a 1 , ⋯ , a n ) a^n=(a_1,\cdots,a_n) a n = ( a 1 , ⋯ , a n ) drawn iid from X \mathcal{X} X according to p X p_X p X whose probability is roughly 2 − n H ( X ) 2^{-nH(X)} 2 − n H ( X ) ; or whose “apparent entropy” is within the “true entropy” H ( X ) H(X) H ( X ) .
Theorem (Asymptotic Equipartition Property)
For a DMS { X i } i = 1 ∞ \{X_i\}_{i=1}^\infty { X i } i = 1 ∞ with pmf p X p_X p X and alphabet X \mathcal{X} X , − 1 n log 2 ( p X n ( X n ) ) → n → ∞ H ( X ) \mbox i n p r o b a b i l i t y -\frac{1}{n}\log_2(p_{X^n}(X^n))\xrightarrow{n\to\infty}H(X)\mbox{ in probability} − n 1 log 2 ( p X n ( X n )) n → ∞ H ( X ) \mbox in p r o babi l i t y for the average information of X i , n ≥ 1 X_i,n\ge1 X i , n ≥ 1 converges in probability to the entropy of X X X . For a stationary ergodic source { X i } i = 1 ∞ \{X_i\}_{i=1}^\infty { X i } i = 1 ∞ we have − 1 n log 2 ( p X n ( X n ) ) → n → ∞ H ( X ) \mbox i n p r o b a b i l i t y -\frac{1}{n}\log_2(p_{X^n}(X^n))\xrightarrow{n\to\infty}H(\mathcal{X})\mbox{ in probability} − n 1 log 2 ( p X n ( X n )) n → ∞ H ( X ) \mbox in p r o babi l i t y
Theorem (Consequence of AEP)
For a DMS { X i } i = 1 ∞ \{X_i\}_{i=1}^\infty { X i } i = 1 ∞ with pmf p X p_X p X on X \mathcal{X} X and entropy H ( X ) H(X) H ( X ) , the typical set A ϵ ( n ) A_\epsilon^{(n)} A ϵ ( n ) satisfies:
lim n → ∞ P ( A ϵ ( n ) ) = 1 \lim_{n\to\infty}P(A_\epsilon^{(n)})=1 lim n → ∞ P ( A ϵ ( n ) ) = 1
∣ A ϵ ( n ) ∣ ≤ 2 n ( H ( X ) + ϵ ) |A_\epsilon^{(n)}|\le2^{n(H(X)+\epsilon)} ∣ A ϵ ( n ) ∣ ≤ 2 n ( H ( X ) + ϵ )
∣ A ϵ ( n ) ∣ ≥ ( 1 − ϵ ) 2 n ( H ( X ) − ϵ ) |A_\epsilon^{(n)}|\ge(1-\epsilon)2^{n(H(X)-\epsilon)} ∣ A ϵ ( n ) ∣ ≥ ( 1 − ϵ ) 2 n ( H ( X ) − ϵ ) for n n n sufficiently large
Definition (Joint typical set)
Let { ( X i , Y i ) } i = 1 ∞ \{(X_{i},Y_{i})\}^\infty_{i=1} {( X i , Y i ) } i = 1 ∞ be a |DMS with common pmf p X Y p_{XY} p X Y on X × Y \mathcal{X}\times\mathcal{Y} X × Y . Given δ > 0 \delta>0 δ > 0 and integer n ≥ 1 n\ge1 n ≥ 1 , the jointly typical set A δ ( n ) A_\delta^{(n)} A δ ( n ) (or jointly δ \delta δ -typical set) with respect to the source is A δ ( n ) = { ( x n , y n ) ∈ X n × Y n : ∣ − 1 n log 2 ( p X n ( x n ) ) − H ( X ) ∣ ≤ δ , ∣ − 1 n log 2 ( p Y n ( y n ) ) − H ( Y ) ∣ ≤ δ , ∣ − 1 n log 2 ( p X n Y n ( x n , y n ) ) − H ( X , Y ) ∣ ≤ δ } \begin{align*}
A_{\delta}^{(n)}=\left\{(x^{n},y^{n})\in \mathcal{X}^{n}\times\mathcal{Y}^{n}:\left|-\frac{1}{n}\log_{2}(p_{X^{n}}(x^{n}))-H(X)\right|\le\delta,\right.\\
\left|-\frac{1}{n}\log_{2}(p_{Y^{n}}(y^{n}))-H(Y)\right|\le\delta,\\
\left.\left|-\frac{1}{n}\log_{2}(p_{X^{n}Y^{n}}(x^{n},y^{n}))-H(X,Y)\right|\le\delta \right\}\\
\end{align*} A δ ( n ) = { ( x n , y n ) ∈ X n × Y n : − n 1 log 2 ( p X n ( x n )) − H ( X ) ≤ δ , − n 1 log 2 ( p Y n ( y n )) − H ( Y ) ≤ δ , − n 1 log 2 ( p X n Y n ( x n , y n )) − H ( X , Y ) ≤ δ }
Theorem (Joint Asymptotic Equipartition Property )
For a DMS { ( X i , Y i ) } i = 1 ∞ \{(X_{i},Y_{i})\}^\infty_{i=1} {( X i , Y i ) } i = 1 ∞ with pmf p X Y p_{XY} p X Y on X × Y \mathcal{X}\times\mathcal{Y} X × Y , then the joint typical set A δ ( n ) A_{\delta}^{(n)} A δ ( n ) defined with respect to source p X Y p_{XY} p X Y satisfies:
P X n Y n ( A δ ( n ) ) = P ( ( X n , Y n ) ∈ A δ ( n ) ) > 1 − δ P_{X^{n}Y^{n}}(A_\delta^{(n)})=P((X^{n},Y^{n})\in A_{\delta}^{(n)})>1-\delta P X n Y n ( A δ ( n ) ) = P (( X n , Y n ) ∈ A δ ( n ) ) > 1 − δ for n n n sufficiently large
∣ A δ ( n ) ∣ ≤ 2 n ( H ( X , Y ) + δ ) |A_\delta^{(n)}|\le2^{n(H(X,Y)+\delta)} ∣ A δ ( n ) ∣ ≤ 2 n ( H ( X , Y ) + δ )
∣ A δ ( n ) ∣ ≥ ( 1 − δ ) 2 n ( H ( X , Y ) − δ ) |A_\delta^{(n)}|\ge(1-\delta)2^{n(H(X,Y)-\delta)} ∣ A δ ( n ) ∣ ≥ ( 1 − δ ) 2 n ( H ( X , Y ) − δ ) for n n n sufficiently large
Definition (Typical set)
Fix ϵ > 0 \epsilon>0 ϵ > 0 and n n n , the typical set A ϵ ( n ) A_{\epsilon}^{(n)} A ϵ ( n ) for a CMS with pdf f X f_{X} f X is given by A ϵ ( n ) = { x n ∈ S X n : ∣ − 1 n log 2 f X n ( X n ) − h ( X ) ∣ ≤ ϵ } A_{\epsilon}^{(n)}=\{x^{n}\in S_{X^{n}}:|- \frac{1}{n}\log_{2}f_{X^{n}}(X^{n})-h(X)|\le\epsilon\} A ϵ ( n ) = { x n ∈ S X n : ∣ − n 1 log 2 f X n ( X n ) − h ( X ) ∣ ≤ ϵ }
Theorem (AEP For Continuous IID RVs)
Let { X i } i = 1 ∞ \{X_{i}\}_{i=1}^{\infty} { X i } i = 1 ∞ be a memoryless real-valued source , with common pdf f X f_{X} f X with support S X ⊂ R S_{X}\subset \mathbb{R} S X ⊂ R . Then − 1 n log 2 f X n ( X 1 , … , X n ) → n → ∞ E [ − log 2 f X ( X ) ] = h ( X ) - \frac{1}{n}\log_{2}f_{X^{n}}(X_{1},\ldots,X_{n})\stackrel{ n\to\infty}\to E\left[-\log_{2}f_{X}(X)\right]=h(X) − n 1 log 2 f X n ( X 1 , … , X n ) → n → ∞ E [ − log 2 f X ( X ) ] = h ( X ) in probability .
Theorem (Consequence of the continuous AEP)
For the memoryless continuous source { X i } i = 1 ∞ \{X_{i}\}^\infty_{i=1} { X i } i = 1 ∞ with pdf f X f_{X} f X defined on S X ⊂ R S_{X}\subset\mathbb{R} S X ⊂ R and differential entropy h ( X ) h(X) h ( X ) , the following hold:
lim n → ∞ P X n ( A ϵ ( n ) ) = 1 , \mbox i . e . P X n ( A ϵ ( n ) ) > 1 − ϵ \mbox f o r n \mbox s u f f i c i e n t l y l a r g e \lim_{n\to\infty}P_{X^{n}}\left(A_{\epsilon}^{(n)}\right)=1,\ \mbox{ i.e. }P_{X^{n}}\left(A_{\epsilon}^{(n)}\right)>1-\epsilon \ \mbox{ for }n\mbox{ sufficiently large} n → ∞ lim P X n ( A ϵ ( n ) ) = 1 , \mbox i . e . P X n ( A ϵ ( n ) ) > 1 − ϵ \mbox f or n \mbox s u f f i c i e n tl y l a r g e
v o l ( A ϵ ( n ) ) ≤ 2 n ( h ( X ) ) + ϵ ∀ n vol(A_{\epsilon}^{(n)})\le 2^{n(h(X))+\epsilon} \ \forall n v o l ( A ϵ ( n ) ) ≤ 2 n ( h ( X )) + ϵ ∀ n
v o l ( A ϵ ( n ) > ( 1 − ϵ ) 2 n ( h ( X ) − ϵ ) ) \mbox f o r n \mbox s u f f i c i e n t l y l a r g e vol(A_{\epsilon}^{(n)}>(1-\epsilon)2^{n(h(X)-\epsilon)}) \ \mbox{for }n\mbox{ sufficiently large} v o l ( A ϵ ( n ) > ( 1 − ϵ ) 2 n ( h ( X ) − ϵ ) ) \mbox f or n \mbox s u f f i c i e n tl y l a r g e