Variance
One important application of LOTUS is for finding the* variance* of a random variable. Like expected value, variance is a single-number summary of the distribution of a random variable. While the expected value tells us the center of mass of a distribution, the variance tells us how spread out the distribution is.
Definition: Variance and Standard Deviation
The variance of an r.v. is
The square root of the variance is called the standard deviation (SD):
Recall that when we write , we mean the expectation of the random variable , not (which is by linearity).
The variance of measures how far is from its mean on average, but instead of simply taking the average difference between and its mean , we take the average squared difference. To see why, note that the average deviation from the mean, , always equals 0 by linearity; positive and negative deviations cancel each other out. By squaring the deviations, we ensure that both positive and negative deviations contribute to the overall variability. However, because variance is an average squared distance, it has the wrong units: if is in dollars, is in squared dollars. To get back to our original units, we take the square root; this gives us the standard deviation.
One might wonder why variance isn't defined as , which would achieve the goal of counting both positive and negative deviations while maintaining the same units as . This measure of variability isn't nearly as popular as , for a variety of reasons. The absolute value function isn't differentiable at 0, so it doesn't have as nice properties as the squaring function. Squared distances are also connected to geometry via the distance formula and the Pythagorean theorem, which turn out to have corresponding statistical interpretations.
An equivalent expression for variance is . This formula is often easier to work with when doing actual calculations. Since this is the variance formula we will use over and over again, we state it as its own theorem.
Theorem
For any r.v. ,
Proof: Let . Expand and use linearity:
Variance has the following properties. The first two are easily verified from the definition, the third will be addressed in a later chapter, and the last one is proven just after stating it.
To prove the last property, note that is the expectation of the nonnegative r.v. , so . If for some constant , then and , so .
Conversely, suppose that . Then , which shows that has probability , which in turn shows that equals its mean with probability .