Lemma
If C is the Shannon-Fano Code for the distribution q(xn) and Xn has pmf p(xn) then we have the following bound for the redundancy n1D(p∥q)≤R(Cn,p)<n1D(p∥q)+n1 the n1 becomes a n2 if arithmetic coding is used instead.
Cor (Zero Penalty iff Universal Code)
A sequence of Shannon-Fano (or arithmetic) codes {Cn} obtained from a sequence of coding distributions {qn} is universal for a source class P if and only if n→∞limn1D(p∥qn)=0, ∀p∈P