Theorem (Shannon Fixed-Length Lossless Source Coding Theorem for DMS)
Given integer D≥2, consider a DMS {Xi}i=1∞ with alphabet X, pmf pX and source entropy HD(X)=−a∈X∑pX(a)logDpX(a)=(logD2)H(X)then the following results hold:
- Forward Part (Achievability): For any ϵ∈(0,1) and 0<δ<ϵ, there exists a sequence of (K,n) D-ary fixed-length codes Cn for the source such that HD(X)<n→∞limsupnk≤HD(X)+δ and Pe(Cn)<ϵ\mboxfornsufficientlylarge
- “Strong” Converse Part: For any ϵ∈(0,1) and any sequence of (K,n) D-ary fixed-length codes Cn for the source with n→∞limsupnk<HD(X) we have that Pe(Cn)>1−ϵ\mboxfornsufficientlylarge
Conclusion
HD(X)=inf{R:R achievable} where R \text{ achievable}\iff \forall\epsilon>0, \ \exists\mathcal{C_{n}}\text{ s.t. } \begin{array} \ \limsup_{n\to\infty} \frac{k}{n}\le R \text{ and} \\ P_e<\epsilon \text{ for n sufficiently large}\end{array}
Intuition
The theorem shows that for a DMS, its source entropy HD(X) is the “smallest” compression rate for which there exists D-ary block codes for the source with asymptotically vanishing error probability (i.e. limn→∞Pe=0).
Theorem (Shannon’s VLC Lossless Source Coding Theorem for DMS)
Given integer D≥2, consider a DMS {Xi}i=1∞ with alphabet X, pmf pX, and source entropy HD(X)=−a∈X∑pX(a)logDpX(a)=(logD2)H(X)Then the following results hold:
- Forward Part (Achievability) For any ϵ∈(0,1), there exists a sequence of D-ary n-th order prefix (hence UD) VLCs f:Xn→{0,1,⋯,D−1}∗for the source with average code rate satisfying Rˉn<HD(X)+ϵfor n sufficiently large.
- Converse Part Every D-ary n-order UD VLC fn:Xn→{0,1,⋯,D−1}∗for the source has an average code rate Rˉn≥HD(X)
Cor ((Achievability of Entropy for DMS))
For any n≥1, ∃ a D-ary n-th order prefix code f:Xn→{0,1,⋯,D−1}∗for a DMS {Xi} with alphabet X with average code rate: HD(X)≤Rˉn<HD(X)+n1
Theorem (Shannon’s VLC Lossless Source Coding Theorem for Stationary Sources)
Given integer D≥2, consider a stationary source {Xi}i=1∞ with alphabet X, pmf pX, and source entropy rate HD(X)=n→∞limn1HD(Xn)Then the following results hold:
- Forward Part (Achievability) For any ϵ∈(0,1), there exists a sequence of D-ary n-th order prefix (hence UD) VLCs f:Xn→{0,1,⋯,D−1}∗for the source with average code rate satisfying Rˉn<HD(X)+ϵfor n sufficiently large.
- Converse Part Every D-ary n-order UD VLC fn:Xn→{0,1,⋯,D−1}∗for the source has an average code rate Rˉn≥HD(X)
Cor ((Achievability of Source Entropy for Stationary Source))
For any n≥1, ∃ a D-ary n-th order prefix code f:Xn→{0,1,⋯,D−1}∗for a stationary source {Xi} with alphabet X with avg code rate n1HD(Xn)≤Rˉn<n1HD(Xn)+n1as n→∞, Rˉn→H(X).