[FrontPage] [TitleIndex] [WordIndex

Note: You are looking at a static copy of the former PineWiki site, used for class notes by James Aspnes from 2003 to 2012. Many mathematical formulas are broken, and there are likely to be other bugs as well. These will most likely not be fixed. You may be able to find more up-to-date versions of some of these notes at http://www.cs.yale.edu/homes/aspnes/#classes.

The expected value or expectation of some quantity X (written E[X] or sometimes just EX) is just the average of all possible values of X weighted by their probabilities. Formally, if X takes on values in some set S, then

From the definition it is not hard to show several useful properties about expectations:

These properties together are known as linearity of expectation.

The additive property applies for any random values X and Y, even if they are not independent. For example, consider an algorithm that splits an input array of n elements into two arrays, where one contains anywhere from 0 to n elements (with all n+1 possibilities equally likely) and the other contains the rest; it then runs a Theta(n2) subroutine on both subarrays. Let T1 be the running time of the subroutine on the first array, and let T2 be the running time on the second array. Let T be the total running time. Then E[T] = E[T1+T2] = E[T1]+E[T2] = 2E[T1] (by symmetry). The value of E[T1] can be calculated from the definition (it's Theta(n2)). In this analysis, it is possible to consider each of T1 and T2 separately, even though their actual values are closely linked.

For more complicated calculations it can sometimes be useful to split the possibilities into several cases. The conditional expectation of a variable X given an event A (written E[X|A]) is the average value of X over all cases where A occurs. Formally, it is defined for a variable X that takes on values in S by

where Pr[X=x|A] is the probability that X equals x conditioned on A, which is in turn defined formally by

The usefulness of conditional probabilities is that if we have a set of mutually exclusive events A1, A2, An whose probabilities sum to 1---representing some set of n cases that together exhaust all possibilities---we can compute the expectation of X by taking a weighted average over the conditional expectations in each case:

The formal definition of E[X] that we gave up above is actually a special case of this formula.

1. Examples


CategoryAlgorithmNotes


2014-06-17 11:58