FIND ME ON

GitHub

LinkedIn

Self-Information

🌱

Definition
InfoTheory

Let EE denote an event with probability p>0p>0 of occurring. We call I(E)I(E) or I(p)I(p) as it is a function of p, the self-information of EE and use it to represent the “amount of information” one gains about event EE when learning that EE has occurred.

Equivalently, we can also think of it as a measure of the “amount of uncertainty” one had prior to the occurrence of event EE.

1. Certain events are not surprising: If some event EE will most definitely happen, p(E)=1p(E)=1, then that event occurring should provide us with no surprise (or new information): H(E)=0H(E)=0. 2. Impossible events are infinitely surprising: If some event EE, has zero chance of occurring, p(E)=0p(E)=0, then we should be infinitely surprised that the event is occurring H(E)=H(E)=\infty. 3. Non-Increasing: I(p)I(p) should be non-increasing in p (i.e. the less likely event EE is, the more information one gains from it happening). 4. Continuity: I(p)I(p) should be continuous in p. Intuitively, one would expect that a small change in p corresponds to a small change in the amount of information about EE. 5. Continuity of Independence: If E1E_1 and E2E_2 are independent with probabilities p1>0p_1>0 and p2>0p_2>0, respectively, then I(E1E2)=I(p1p2)=I(p1)+I(p2)I(E_1\cap E_2)=I(p_1*p_2)=I(p_1)+I(p_2) This property is “reasonable” as E1E_1 and E2E_2 are independent.

- I(1)=0I(1)=0 - Setting p1=p2p_1=p_2 yields I(11)=I(1)+I(1)I(1*1)=I(1)+I(1) which implies that I(1)=0I(1)=0 - I(p)0I(p)\ge 0 p\forall p - By the first remark and the fact that I(p)I(p) is non-increasing we see that this holds true.

The only function I(p)I(p), 0p10\le p\le1, satisfying properties 1-5 above is given by I(p)=clogb(p)I(p)=-c\log_b(p) where c>0c>0 and b>1b>1 are constants (b is for base unit).

We usually set c=1c=1 and b={2,e}b=\{2,e\} in this course.

bunits of I(p)2bitsenats3ternary unitsqq-ary digits \begin{array} {|r|r|}\hline b & \text{units of }I(p) \\ \hline 2 & \text{bits} \\ \hline e & \text{nats} \\ \hline 3 & \text{ternary units} \\ \hline q & \text{q-ary digits} \\ \hline \end{array}

Linked from