Self-Information

Definition (Self-information)

Let EE denote an event with probability p>0p>0 of occurring. We call I(E)I(E) or I(p)I(p) as it is a function of p, the self-information of EE and use it to represent the “amount of information” one gains about event EE when learning that EE has occurred.

Remark

Equivalently, we can also think of it as a measure of the “amount of uncertainty” one had prior to the occurrence of event EE.

Proposition (Properties of I(p)I(p))

  1. Certain events are not surprising: If some event EE will most definitely happen, p(E)=1p(E)=1, then that event occurring should provide us with no surprise (or new information): H(E)=0H(E)=0.
  2. Impossible events are infinitely surprising: If some event EE, has zero chance of occurring, p(E)=0p(E)=0, then we should be infinitely surprised that the event is occurring H(E)=H(E)=\infty.
  3. Non-Increasing: I(p)I(p) should be non-increasing in p (i.e. the less likely event EE is, the more information one gains from it happening).
  4. Continuity: I(p)I(p) should be continuous in p. Intuitively, one would expect that a small change in p corresponds to a small change in the amount of information about EE.
  5. Continuity of Independence: If E1E_1 and E2E_2 are independent with probabilities p1>0p_1>0 and p2>0p_2>0, respectively, then I(E1E2)=I(p1p2)=I(p1)+I(p2)I(E_1\cap E_2)=I(p_1*p_2)=I(p_1)+I(p_2) This property is “reasonable” as E1E_1 and E2E_2 are independent.

Remark

  • I(1)=0I(1)=0
    • Setting p1=p2p_1=p_2 yields I(11)=I(1)+I(1)I(1*1)=I(1)+I(1) which implies that I(1)=0I(1)=0
  • I(p)0I(p)\ge 0 p\forall p
    • By the first remark and the fact that I(p)I(p) is non-increasing we see that this holds true.

Theorem (Representation of I(p)I(p))

The only function I(p)I(p), 0p10\le p\le1, satisfying properties 1-5 above is given by I(p)=clogb(p)I(p)=-c\log_b(p) where c>0c>0 and b>1b>1 are constants (b is for base unit).

Remark

We usually set c=1c=1 and b={2,e}b=\{2,e\} in this course.

Proposition (b unit table)

bunits of I(p)2bitsenats3ternary unitsqq-ary digits \begin{array} {|r|r|}\hline b & \text{units of }I(p) \\ \hline 2 & \text{bits} \\ \hline e & \text{nats} \\ \hline 3 & \text{ternary units} \\ \hline q & \text{q-ary digits} \\ \hline \end{array}

Linked from