Random Variables and Discrete Probability Distribution

Random Variable

A random variable is a variable whose value is determined by chance, as the examples below suggest:

Random variables can be either discrete or continuous.

Discrete variables have a finite or countable number of values associated with them. For example, any variable whose description starts out with "The number of..." is a discrete variable.

Continuous variables, on the other hand can assume any real value in some interval. For example, a certain species of fish might measure anywhere between 10.6 inches to 26.3 inches. That said, don't worry if you never actually see a fish of length 15.32975612386561 inches. We always have to round these types of variables to some more reasonable number of decimal places given the limits of our measuring devices.

Discrete Probability Distributions

A probability distribution for a discrete random variable consists of the values a random variable can assume and the corresponding probabilities that those values occur.

The related function that outputs the probabilities for the respective values the random variable can assume is called the probability density function, or pdf. (This is also called the probability function, the frequency function, or the probability mass function, depending on what statistics book you read.)

For example, suppose we toss a coin three times.

The sample space consists of 8 equally likely possibilities:


Suppose the random variable X counts the number of heads seen in three tosses.

We can't see a negative number of heads in three tosses, nor can we see more than 3, so the values that X can assume are: 0, 1, 2, or 3

(By the way, the probability distribution for this random variable is called discrete because there are a countable number of values that our random variable can assume. In this case there are 4 of them.)

Looking at the sample space, it is easy to see that:

Adopting the notation, P(x), for the probability density function (i.e., P(x) gives the probability of seeing exactly x heads in this example), we can summarize these results with the following table:


The above table constitutes the probability distribution for X.

It should be remembered that all of the probabilities should be legitimate (between zero and one) and the sum of the probabilities should equal one (as 100% of the time, there is an outcome).


$$0 \le P(X) \le 1 \quad \textrm{ and } \quad \sum P(X) = 1$$

As seen above, a discrete probability distribution might take the form of a table.

However, one can can also specify the outcomes and probabilities with a formula -- as in the case of the binomial distribution, which gives the probability of observing X successes in n Bernoulli trials:

$$P(X) = {}_nC_x p^x q^{n-x} \quad \textrm{where } X = 0,1,2,\ldots, n$$

The fact that all of the probabilities so produced are between zero and one, with a sum of exactly one is less obvious here -- but still present. Indeed, if we consider the case when $n=8$ and $p=1/2$, we produce precisely the same probabilities as seen in the above table!