Probability Densities

The last module dealt with the uniform distribution, where any one outcome is as likely as another. This module deals with experiments whose outcomes have different probabilities. For example, consider an unfair coin which has a probability $p \neq 1/2$ of landing heads and a probability $1 - p$ of landing tails. Another example is time spent on hold with customer service, where it is more likely that the call is answered in the first hour than in the second hour.

Random variable and probability density function (PDF)

A random variable $X$ is a function whose output should be thought of as the outcome of an experiment. Associated with a random variable is a probability density function (PDF) $\rho(x)$, which is defined by

$$P(a \le X \le b) = \int_a^b \rho(x)\,dx.$$

That is, the probability that the random variable falls in a certain range of values is given by integrating the PDF over that range of values.

Phrased another way, we can think of probability $P$ as the quantity we want to compute over a certain range of values, and the probability element is given by

$$dP = \rho(x)\,dx.$$

Example

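Consider the spinner from the last module. The outcome of a spin is some angle (relative to the positive $x$-axis) between $0$ and $2\pi$. If $X$ is the random variable which gives the output of a spin, then

$$P(a \le X \le b) = \frac{b - a}{2\pi}$$

since the spinner was assumed to be fair. This holds for all $0 \le a \le b \le 2\pi$. Then the associated PDF is

$$\rho(x) = \begin{cases} \dfrac{1}{2\pi} & \text{if } 0 \le x \le 2\pi, \\ 0 & \text{otherwise.} \end{cases}$$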

Note

Sometimes a PDF is only defined on a certain domain $D$. $D$ can be thought of as the set of all possible outcomes of the experiment (the possible values of $X$). In this case, it is assumed that $\rho(x) = 0$ for $x$ not in that domain. So another way of defining the PDF for the spinner is $\rho(x) = \frac{1}{2\pi}$ for $0 \le x \le 2\pi$.
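As a quick numerical illustration of integrating a density over a range, the following Python sketch approximates the probability that the fair spinner lands in the first quarter turn. It assumes the spinner density is the constant 1/(2π) on [0, 2π] and 0 elsewhere; the function names and the choice of the midpoint rule are illustrative and not part of the module.

```python
import math

def spinner_pdf(x):
    """Density of a fair spinner: 1/(2*pi) on [0, 2*pi], 0 elsewhere."""
    return 1.0 / (2.0 * math.pi) if 0.0 <= x <= 2.0 * math.pi else 0.0

def integrate(f, a, b, n=10_000):
    """Midpoint-rule approximation of the integral of f over [a, b]."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

# Probability that the angle falls between 0 and pi/2 (a quarter turn).
quarter = integrate(spinner_pdf, 0.0, math.pi / 2.0)
# Integrating over a range outside the domain gives probability 0.
outside = integrate(spinner_pdf, 7.0, 8.0)
print(quarter, outside)
```

Because the density is constant, the quarter-turn probability should come out very close to 1/4, matching the formula (b - a)/(2π).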

Properties of a probability density function

The following are defining properties of a PDF. In other words, a function $\rho$ is a PDF on the domain $D$ if and only if it satisfies these properties.

  1. $\rho(x) \ge 0$ for all $x \in D$.
  2. $\int_D \rho(x)\,dx = 1$.

The first property is necessary since probabilities must be non-negative. The second property reflects the fact that the random variable associated with $\rho$ must have some outcome in the domain $D$ (since $D$ is the set of all possible outcomes), and so integrating over all of these outcomes should give 1.

Note

If $\rho$ is defined on some specific domain $D$, then the integral over that specific domain should equal 1. This is because $\rho(x) = 0$ outside of that domain, as mentioned in the above note.
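The two properties can be spot-checked numerically for a candidate function on a bounded interval. The sketch below is only a sampling check, not a proof: it tests non-negativity at finitely many points and approximates the integral with the midpoint rule. The helper name `looks_like_pdf` and the three candidate functions are hypothetical examples chosen for illustration.

```python
def looks_like_pdf(f, a, b, n=100_000, tol=1e-6):
    """Sampled check of both PDF properties for f on the interval [a, b].

    Property 1: f is non-negative at every midpoint sample.
    Property 2: the midpoint-rule integral of f over [a, b] is within tol of 1.
    """
    h = (b - a) / n
    samples = [f(a + (i + 0.5) * h) for i in range(n)]
    nonnegative = all(value >= 0.0 for value in samples)
    total = h * sum(samples)
    return nonnegative and abs(total - 1.0) < tol

# Hypothetical candidates on the domain [0, 1]:
ok = looks_like_pdf(lambda x: 2.0 * x, 0.0, 1.0)                   # both properties hold
wrong_area = looks_like_pdf(lambda x: x, 0.0, 1.0)                 # integral is 1/2
negative_part = looks_like_pdf(lambda x: 3.0 * x - 0.5, 0.0, 1.0)  # integral is 1, but negative near 0
print(ok, wrong_area, negative_part)
```

The third candidate shows why both properties are needed: its integral over the domain is 1, yet it takes negative values, so it is not a density.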

Example

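Find the value of the constant $c$ so that $\rho(x) = \dfrac{c}{1 + x^2}$ for all $x$ is a PDF.

As long as $c \ge 0$, the first property for a PDF will be met, since $1 + x^2 > 0$ for all $x$. To satisfy the second property, compute

$$\int_{-\infty}^{\infty} \frac{c}{1 + x^2}\,dx = c \arctan x \,\Big|_{x = -\infty}^{\infty} = c\left(\frac{\pi}{2} - \left(-\frac{\pi}{2}\right)\right) = c\pi.$$

Since this integral is supposed to be 1, we find that $c = \dfrac{1}{\pi}$.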

Several specific density functions

Uniform density

As hinted at above and in the previous module, the uniform density function (or uniform distribution) on $[a, b]$ is given by (with $\rho(x) = 0$ if $x$ is not in $[a, b]$):

$$\rho(x) = \frac{1}{b - a}.$$

More generally, the uniform distribution on the domain $D$ (whatever the dimension) is given by

$$\rho = \frac{1}{\operatorname{Vol}(D)}.$$

In dimension 0, where outcomes are discrete (as in the rolling of a die or the flipping of a coin), remember that volume is just counting. So in this case the probability of a particular outcome is

$$\frac{1}{n},$$

where $n$ is the number of outcomes in the domain (e.g. $n = 6$ for the roll of a die; $n = 2$ for a coin flip).

Exponential density

Another density function used to model many common experiments is the exponential density function. This is actually a whole family of density functions given by $\rho(x) = \alpha e^{-\alpha x}$ for $x \ge 0$, with $\alpha > 0$ some constant. The reason a parameter is used is that the exponential density is often used to model experiments with a time outcome, and the parameter $\alpha$ lets the density be matched to the time scale of the experiment.

Example

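Show that the exponential density $\rho(x) = \alpha e^{-\alpha x}$ (for $x \ge 0$) satisfies the properties of a density function.

The exponential function is never negative and $\alpha > 0$, so one need only check the integral. One finds

$$\int_0^{\infty} \alpha e^{-\alpha x}\,dx = -e^{-\alpha x}\,\Big|_{x = 0}^{\infty} = 0 - (-1) = 1,$$

as desired. So the exponential density is in fact a density.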
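The improper integral can also be watched converging numerically. The sketch below assumes the exponential density has the form α e^{-αx} for x ≥ 0 and integrates it from 0 up to increasing cutoffs; the value α = 0.5, the cutoffs, and the helper names are arbitrary choices made for illustration.

```python
import math

def exponential_pdf(x, alpha):
    """Exponential density alpha * exp(-alpha * x) for x >= 0, and 0 for x < 0."""
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    return alpha * math.exp(-alpha * x) if x >= 0 else 0.0

def integrate(f, a, b, n=200_000):
    """Midpoint-rule approximation of the integral of f over [a, b]."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

alpha = 0.5  # illustrative rate, not taken from the text
areas = []
for cutoff in (5.0, 20.0, 60.0):
    area = integrate(lambda x: exponential_pdf(x, alpha), 0.0, cutoff)
    areas.append(area)
    # The exact area up to the cutoff is 1 - exp(-alpha * cutoff).
    print(cutoff, area, 1.0 - math.exp(-alpha * cutoff))
```

The area approaches 1 as the cutoff grows, and the shortfall at each cutoff is the tail e^{-α·cutoff}, which is what the antiderivative in the computation above predicts.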

Example

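Consider a call made to customer service at Acme company. The number of minutes $T$ spent on hold before the call is answered is often modeled with an exponential density function

$$\rho(t) = \alpha e^{-\alpha t}.$$

Find, in terms of $\alpha$, the probability that the waiting time for a call is less than 30 minutes.

To find the probability that $0 \le T \le 30$, use the relationship between probability and the PDF, which is

$$P(0 \le T \le 30) = \int_0^{30} \alpha e^{-\alpha t}\,dt = -e^{-\alpha t}\,\Big|_{t = 0}^{30} = 1 - e^{-30\alpha}.$$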
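For concrete numbers, the closed form from integrating the exponential density, P(0 ≤ T ≤ t) = 1 - e^{-αt}, can be evaluated directly. This is a minimal sketch assuming the density α e^{-αt} for t ≥ 0; the function name and the sample values of α are hypothetical, since the example leaves α unspecified.

```python
import math

def prob_answered_within(minutes, alpha):
    """P(0 <= T <= minutes) for the density alpha * exp(-alpha * t), t >= 0.

    Uses -expm1(-x) = 1 - exp(-x), which stays accurate when alpha * minutes is small.
    """
    if alpha <= 0 or minutes < 0:
        raise ValueError("alpha must be positive and minutes non-negative")
    return -math.expm1(-alpha * minutes)

# Illustrative rates (per minute); larger alpha means calls are answered sooner.
for alpha in (0.01, 0.05, 0.1):
    print(alpha, prob_answered_within(30, alpha))
```

The probability of being answered within 30 minutes increases with α, consistent with the expression 1 - e^{-30α}.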

Example

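Again consider customer service call waiting time $T$ at Acme company, and again assume an exponential density function

$$\rho(t) = \alpha e^{-\alpha t}.$$

Suppose half of all customers are answered within 5 minutes. Find $\alpha$ and then find the probability that a call takes more than 10 minutes to be answered.

Since half of all customers are answered within 5 minutes, we have that

$$P(0 \le T \le 5) = \frac{1}{2}.$$

On the other hand, we know that this can be expressed as the integral of the density function, so we have

$$P(0 \le T \le 5) = \int_0^{5} \alpha e^{-\alpha t}\,dt = -e^{-\alpha t}\,\Big|_{t = 0}^{5} = 1 - e^{-5\alpha}.$$

So we have that

$$1 - e^{-5\alpha} = \frac{1}{2}, \quad \text{that is,} \quad e^{-5\alpha} = \frac{1}{2}.$$

Taking the log of both sides, dividing by $-5$ and simplifying, we have

$$\alpha = -\frac{1}{5}\ln\frac{1}{2} = \frac{\ln 2}{5}.$$

For the second part, we want to know the probability of waiting more than 10 minutes. This is (leaving $\alpha$ as a constant for now)

$$P(T > 10) = \int_{10}^{\infty} \alpha e^{-\alpha t}\,dt = -e^{-\alpha t}\,\Big|_{t = 10}^{\infty} = e^{-10\alpha}.$$

Now plugging in the value of $\alpha$, we have

$$P(T > 10) = e^{-10 \cdot \frac{\ln 2}{5}} = e^{-2\ln 2} = \frac{1}{4}.$$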
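The two steps of this example can be replayed numerically as a check on the algebra. The sketch assumes the exponential density α e^{-αt} for t ≥ 0, so that the condition "half of all calls are answered within 5 minutes" reads e^{-5α} = 1/2 and the tail probability is P(T > t) = e^{-αt}; the function names are hypothetical.

```python
import math

def rate_from_half_time(half_time):
    """Solve exp(-alpha * half_time) = 1/2 for alpha."""
    if half_time <= 0:
        raise ValueError("half_time must be positive")
    return math.log(2.0) / half_time

def prob_wait_longer_than(minutes, alpha):
    """P(T > minutes) for the density alpha * exp(-alpha * t), t >= 0."""
    return math.exp(-alpha * minutes)

alpha = rate_from_half_time(5.0)                 # ln(2)/5, about 0.1386 per minute
p_over_10 = prob_wait_longer_than(10.0, alpha)   # about 0.25
print(alpha, p_over_10)
```

Ten minutes is two "half-times" of five minutes, so the probability of still waiting is halved twice, which is one way to see the answer 1/4 without computing α at all.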

Gaussian density

The last probability density function is the Gaussian, or normal, density function. This is an important density function and is expanded on in the next module. Like the exponential, the Gaussian density function usually has parameters (see the next module), but in its simplest form, the Gaussian is given by

$$\rho(x) = \frac{1}{\sqrt{2\pi}}\, e^{-x^2/2}.$$

The Gaussian has all real $x$ as its domain, but because it tails off so quickly in both directions, the probability of getting values far from the center (in this case $x = 0$) is very small.
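To put numbers on how quickly the tails fall off, the sketch below assumes the simplest (standard) form of the Gaussian, e^{-x²/2}/√(2π), centered at 0. For that density the probability of landing more than k away from the center equals erfc(k/√2), where erfc is the complementary error function available in Python's `math` module; the function names are illustrative.

```python
import math

def gaussian_pdf(x):
    """Standard Gaussian density, assumed here to be exp(-x^2 / 2) / sqrt(2*pi)."""
    return math.exp(-x * x / 2.0) / math.sqrt(2.0 * math.pi)

def prob_farther_than(k):
    """P(|X| > k) for the standard Gaussian, via the complementary error function."""
    return math.erfc(k / math.sqrt(2.0))

for k in (1, 2, 3, 4):
    print(k, gaussian_pdf(k), prob_farther_than(k))
```

The tail probabilities come out to roughly 0.32, 0.046, 0.0027 and 0.00006 for k = 1, 2, 3, 4, so values more than a few units from the center are already very unlikely.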


EXERCISES

  • Which of the following are probability density functions?