Indicator Random Variables and the Fundamental Bridge
This section is devoted to indicator random variables, which we have encountered previously but will treat in much greater detail here.
In particular, we will show that indicator r.v.s are an extremely useful tool for calculating expected values. Recall from the previous chapter that the indicator r.v. (or ) for an event is defined to be 1 if occurs and 0 otherwise. So is a Bernoulli random variable, where success is defined as '' occurs'' and failure is defined as '' does not occur''. Some useful properties of indicator r.v.s are summarized below.
Theorem: Indicator r.v. Properties
Let and be events. Then the following properties hold.
- for any positive integer .
- .
- .
Proof:
-
Property 1 holds since and for any positive integer .
-
Property 2 holds since is if does not occur and 0 if occurs.
-
Property 3 holds since is if both and are 1, and 0 otherwise.
-
Property 4 holds since
Indicator r.v.s provide a link between probability and expectation; we call this fact the fundamental bridge.
Theorem: Fundamental Bridge between Probability and Expectation
There is a one-to-one correspondence between events and indicator r.v.s, and the probability of an event is the expected value of its indicator r.v. :
Proof:
The fundamental bridge connects events to their indicator r.v.s, and allows us to express any probability as an expectation.
Conversely, the fundamental bridge is also extremely useful in many expected value problems. We can often express a complicated discrete r.v. whose distribution we don't know as a sum of indicator r.v.s, which are extremely simple. The fundamental bridge lets us find the expectation of the indicators; then, using linearity, we obtain the expectation of our original r.v.
Recognizing problems that are amenable to this strategy and then defining the indicator r.v.s takes practice, so it is important to study a lot of examples and solve a lot of problems. In applying the strategy to a random variable that counts the number of [noun]s, we should have an indicator for each potential [noun]. This [noun] could be a person, place, or thing; we will see examples of all three types.
Example Putnam Problem
Solution:
Law of the Unconscious Statistician (LOTUS)
As we saw in the St. Petersburg paradox, does not equal in general if is not linear. So how do we correctly calculate ?
Since is an r.v., one way is to first find the distribution of and then use the definition of expectation. Perhaps surprisingly, it turns out that it is possible to find directly using the distribution of , without first having to find the distribution of . This is done using the law of the unconscious statistician (LOTUS).
Theorem: LOTUS
If is a discrete r.v. and is a function from to , then
where the sum is taken over all possible values of .
This means that we can get the expected value of knowing only , the PMF of ; we don't need to know the PMF of .
The name comes from the fact that in going from to it is tempting just to change to in the definition, which can be done very easily and mechanically, perhaps in a state of unconsciousness. On second thought, it may sound too good to be true that finding the distribution of $g(X) is not needed for this calculation, but LOTUS says it is true. We will omit a general proof of LOTUS, but let's see why it is true in some special cases.
Let have support with probabilities , so the PMF is . Then has support with probabilities so
As claimed by LOTUS, to edit the expression for into an expression for , we can just change the in front of the to an ; the is unchanged, and we can still use the PMF of .