🌱
Let denote an event with probability of occurring. We call or as it is a function of p, the self-information of and use it to represent the “amount of information” one gains about event when learning that has occurred.
Equivalently, we can also think of it as a measure of the “amount of uncertainty” one had prior to the occurrence of event .
1. Certain events are not surprising: If some event will most definitely happen, , then that event occurring should provide us with no surprise (or new information): . 2. Impossible events are infinitely surprising: If some event , has zero chance of occurring, , then we should be infinitely surprised that the event is occurring . 3. Non-Increasing: should be non-increasing in p (i.e. the less likely event is, the more information one gains from it happening). 4. Continuity: should be continuous in p. Intuitively, one would expect that a small change in p corresponds to a small change in the amount of information about . 5. Continuity of Independence: If and are independent with probabilities and , respectively, then This property is “reasonable” as and are independent.
- - Setting yields which implies that - - By the first remark and the fact that is non-increasing we see that this holds true.
The only function , , satisfying properties 1-5 above is given by where and are constants (b is for base unit).
We usually set and in this course.