The Poisson Distribution – Explanation & Examples

The definition of the Poisson distribution is:

“The Poisson distribution is a discrete probability distribution that describes the probability of the number of events occurring in a fixed interval.”

In this topic, we will discuss the Poisson distribution from the following aspects:

What is a Poisson distribution?
When to use Poisson distribution?
Poisson distribution formula.
How to do the Poisson distribution?
Practice questions.
Answer key.

What is a Poisson distribution?

The Poisson distribution is a discrete probability distribution that describes the probability of the number of events (discrete random variable) from a random process in a fixed interval.

Discrete random variables take a countable number of integer values and cannot take decimal values. Discrete random variables are usually counts.

The fixed interval can be:

Time, such as the number of calls received per hour in a call center or the number of goals per football match.
Distance, such as the number of mutations per unit length of a DNA strand.
Area, such as the number of bacteria found per unit area of an agar plate.
Volume, such as the number of bacteria found per milliliter of a liquid.

The Poisson distribution is named after the French mathematician Siméon Denis Poisson.

When to use Poisson distribution?

You can apply the Poisson distribution to random processes with a large number of possible events, each of which is rare.

However, the average rate (the average number of events per interval) can be any number and does not always have to be small.

For the Poisson distribution to describe a random process, the following conditions must hold:

The number of events occurring in an interval can take the values 0, 1, 2, and so on. No decimal numbers are allowed because it is a discrete (count) distribution.
The occurrence of one event does not affect the probability that a second event will occur. That is, events occur independently.
The average rate (the average number of events per interval) is constant and does not change over time.
Two events cannot occur at the same time. It means that in each very small sub-interval, either one event occurs or none does.
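The last condition suggests one way to picture the process: split the interval into many tiny sub-intervals, each of which either contains one event or none. The sketch below, offered only as an illustration, compares the binomial probability for such a setup with the Poisson probability. The rate of 10, the 100,000 sub-intervals, and the function names are illustrative assumptions, not values from the text above.

```python
import math

def binomial_pmf(k, n, p):
    """Probability of exactly k events in n independent sub-intervals."""
    return math.comb(n, k) * p**k * (1 - p)**(n - k)

def poisson_pmf(k, lam):
    """Probability of exactly k events when the average rate is lam."""
    return lam**k * math.exp(-lam) / math.factorial(k)

lam = 10        # illustrative average number of events per interval
n = 100_000     # hypothetical number of tiny sub-intervals
p = lam / n     # chance that any one sub-interval contains an event

# With many sub-intervals and a small p, the two columns nearly agree.
for k in (5, 10, 15):
    print(k, round(binomial_pmf(k, n, p), 4), round(poisson_pmf(k, lam), 4))
```

The agreement improves as the number of sub-intervals grows while the average rate stays fixed, which is why the Poisson distribution suits processes with many possible events that are each rare.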

– Example 1

Data from a certain call center shows a historical average of 10 calls received per hour. What is the probability of receiving 0, 10, 20, or 30 calls per hour in this center?

We can use the Poisson distribution to describe this process because:

The number of calls per hour can take the values 0, 1, 2, and so on. No decimal numbers can occur.
The occurrence of one event does not affect the probability that a second event will occur. There is no reason to expect a caller to affect the chances of another person calling, and so the events occur independently.
We may assume the average rate (the number of calls per hour) to be constant.
Two calls cannot occur at the same time. It means that at each sub-interval, like second or minute, either a call occurs or not.

This process is not a perfect fit for the Poisson distribution. For example, the average rate of calls per hour may decrease in the night hours.

Practically speaking, the process (the number of calls per hour) is close to a Poisson process, so the Poisson distribution can be used to describe its behavior.

Using the Poisson distribution, we can calculate the probability of 0, 10, 20, or 30 calls per hour:

The probability of zero calls per hour = 0.00005, or approximately 0%.

The probability of 10 calls per hour = 0.125 or 12.5%.

The probability of 20 calls per hour = 0.002 or 0.2%.

The probability of 30 calls per hour is less than 0.000001, or approximately 0%.

We see that 10 calls have the highest probability (tied with 9 calls, because the average rate is a whole number), and as we move away from 10, the probability fades away.
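These probabilities can be reproduced with a few lines of Python. This is a minimal sketch using only the standard library; the function name `poisson_pmf` is an illustrative choice.

```python
import math

def poisson_pmf(k, lam):
    """Probability of exactly k events when the average rate is lam."""
    return lam**k * math.exp(-lam) / math.factorial(k)

lam = 10  # historical average number of calls per hour, from the example

for k in (0, 10, 20, 30):
    print(f"P({k} calls per hour) = {poisson_pmf(k, lam):.6f}")
```

Printing six decimal places makes it visible that the probabilities of 0 and 30 calls are tiny but not exactly zero.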

We can connect the points to draw a curve:

The average rate of 10 calls per hour has the highest probability (curve peak). As we move away from 10, the probability fades away.

The average rate (the average number of events per interval) can take a decimal value. In that case, the number of events with the highest probability is the largest whole number below the average rate (for example, 6 when the average rate is 6.5), as we will see in the following example.
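A brute-force check can confirm which count is most likely for a decimal rate. The sketch below tries the two decimal rates used in the later examples (6.5 and 2.5); the helper name `most_likely_count` and the search limit `k_max` are illustrative assumptions.

```python
import math

def poisson_pmf(k, lam):
    """Probability of exactly k events when the average rate is lam."""
    return lam**k * math.exp(-lam) / math.factorial(k)

def most_likely_count(lam, k_max=100):
    """Return the count in 0..k_max with the highest Poisson probability."""
    return max(range(k_max + 1), key=lambda k: poisson_pmf(k, lam))

for lam in (6.5, 2.5):
    print(f"rate {lam}: most likely count = {most_likely_count(lam)}")
```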

– Example 2

Data from the maternity ward in a certain hospital shows 2372 babies born in this hospital in the last year. The average per day = 2372/365 = 6.5.

What is the probability that 10 babies will be born in this hospital tomorrow?

On how many days of the next year should we expect exactly 10 babies to be born in this hospital?

The number of babies born per day in this hospital can be described using the Poisson distribution because:

The number of babies born per day can take the values 0, 1, 2, and so on. No decimal numbers can occur.
The occurrence of one event does not affect the probability that a second event will occur. We do not expect that a newborn baby will affect another baby’s chances to be born in that hospital unless the hospital is full, so the events occur independently.
The average rate (the number of babies born per day) may be assumed to be constant.
Two babies cannot be born at the same time. It means that either a baby is born or not at each sub-interval, like second or minute.

The number of babies born per day is close to the Poisson distribution. We can use the Poisson distribution to describe the process’s behavior.

The Poisson distribution can help us to calculate the probability of 10 babies born per day:

The probability of 10 babies born per day = 0.056 or 5.6 %.

We see that 6 babies have the highest probability.

When the number of babies is larger than 16, the probability is very small and can be considered zero.
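Both statements can be checked numerically. The sketch below computes the probability of exactly 10 births and the combined probability of more than 16 births, using the rounded rate of 6.5 from the example; the variable names are illustrative.

```python
import math

def poisson_pmf(k, lam):
    """Probability of exactly k events when the average rate is lam."""
    return lam**k * math.exp(-lam) / math.factorial(k)

lam = 6.5  # average births per day (2372 / 365, rounded as in the example)

p_ten = poisson_pmf(10, lam)

# P(more than 16) = 1 - P(0) - P(1) - ... - P(16)
p_more_than_16 = 1 - sum(poisson_pmf(k, lam) for k in range(17))

print(f"P(exactly 10 births) = {p_ten:.3f}")
print(f"P(more than 16 births) = {p_more_than_16:.5f}")
```

Subtracting the sum of the first 17 probabilities from 1 avoids having to add up an open-ended tail.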

We can connect the points to draw a curve:

The 6 babies per day have the highest probability (curve peak), and as we move away from 6, the probability fades away.

To find the number of days in the next year on which this hospital can expect each number of births:

1. We construct a table with each outcome (number of babies) and its probability.

babies	probability
0	0.002
1	0.010
2	0.032
3	0.069
4	0.112
5	0.145
6	0.157
7	0.146
8	0.119
9	0.086
10	0.056
11	0.033
12	0.018
13	0.009
14	0.004
15	0.002
16	0.001
17	0.000
18	0.000
19	0.000
20	0.000

2. Add another column for the expected days. Fill that column by multiplying each probability value by the number of days in a year (365).

babies	probability	days
0	0.002	0.730
1	0.010	3.650
2	0.032	11.680
3	0.069	25.185
4	0.112	40.880
5	0.145	52.925
6	0.157	57.305
7	0.146	53.290
8	0.119	43.435
9	0.086	31.390
10	0.056	20.440
11	0.033	12.045
12	0.018	6.570
13	0.009	3.285
14	0.004	1.460
15	0.002	0.730
16	0.001	0.365
17	0.000	0.000
18	0.000	0.000
19	0.000	0.000
20	0.000	0.000

We expect that on about 20 of the 365 days of the next year, exactly 10 babies will be born in this hospital.
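The two-step table can be generated in one pass. This is a sketch under the same assumptions as the example (a rate of 6.5 and a 365-day year); the names `expected_days` and `days_in_year` are illustrative.

```python
import math

def poisson_pmf(k, lam):
    """Probability of exactly k events when the average rate is lam."""
    return lam**k * math.exp(-lam) / math.factorial(k)

lam = 6.5           # average births per day, from the example
days_in_year = 365

# Expected number of days for each count from 0 to 20 births.
expected_days = [days_in_year * poisson_pmf(k, lam) for k in range(21)]

print("babies\tprobability\tdays")
for k, days in enumerate(expected_days):
    print(f"{k}\t{poisson_pmf(k, lam):.3f}\t{days:.1f}")
```

The code multiplies unrounded probabilities by 365, so the days column can differ slightly from a table built from probabilities already rounded to three decimals.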

– Example 3

The average number of goals in a World Cup soccer match is approximately 2.5.

The number of goals per football match can be described using the Poisson distribution because:

The number of goals per football match can take the values 0, 1, 2, and so on. No decimal numbers can occur.
The occurrence of one event (goal) does not affect the probability that a second event will occur, and so the events occur independently.
The average rate (the number of goals per match) may be assumed to be constant.
Two goals cannot occur at the same time. It means that at each sub-interval of the match, like second or minute, either a goal occurs or not.

The number of goals per match is close to the Poisson distribution. We can use the Poisson distribution to describe the process’s behavior.

The Poisson distribution can help us to calculate the probability of each number of goals in a football match:

We see that 2 goals per match have the highest probability = 0.257 or 25.7%.
Examples of 2 goals per match are a score of 2-0 or 1-1.

When the number of goals is larger than 9, the probability is very small and can be considered zero.

We can connect the points to draw a curve:

The 2 goals per match have the highest probability (curve peak), and as we move away from 2, the probability fades away.

A World Cup tournament consists of 64 matches. We can use the Poisson distribution to calculate how many of those matches are likely to contain each number of goals:

1. We construct a table with each outcome (number of goals) and its probability.

goals	probability
0	0.082
1	0.205
2	0.257
3	0.214
4	0.134
5	0.067
6	0.028
7	0.010
8	0.003
9	0.001
10	0.000

2. Add another column for the expected matches.

Fill that column by multiplying each probability value by the number of matches in World Cup soccer (64).

goals	probability	matches
0	0.082	5.248
1	0.205	13.120
2	0.257	16.448
3	0.214	13.696
4	0.134	8.576
5	0.067	4.288
6	0.028	1.792
7	0.010	0.640
8	0.003	0.192
9	0.001	0.064
10	0.000	0.000

We expect that:

About 5 matches will contain no goals.

About 13 matches will contain 1 goal.

About 16 matches will contain 2 goals.

About 14 matches will contain 3 goals, and so on.

3. We can add another column for the observed number of matches with each goal count in the 2018 World Cup in Russia to see how closely the Poisson distribution predicts the number of goals:

goals	probability	matches	matches 2018
0	0.082	5.248	1
1	0.205	13.120	15
2	0.257	16.448	17
3	0.214	13.696	19
4	0.134	8.576	5
5	0.067	4.288	2
6	0.028	1.792	2
7	0.010	0.640	3
8	0.003	0.192	0
9	0.001	0.064	0
10	0.000	0.000	0

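We see that the expected number of matches found by the Poisson distribution is close to the observed number for most goal counts. The largest gaps are at 0 goals (about 5 expected, 1 observed) and 3 goals (about 14 expected, 19 observed).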
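The comparison can be scripted so that the gap for each goal count is explicit. In the sketch below, the observed counts are copied from the "matches 2018" column of the table above; the variable names are illustrative.

```python
import math

def poisson_pmf(k, lam):
    """Probability of exactly k events when the average rate is lam."""
    return lam**k * math.exp(-lam) / math.factorial(k)

lam = 2.5        # average goals per match, from the example
n_matches = 64   # matches in the tournament

# Observed matches with 0, 1, ..., 10 goals (the "matches 2018" column).
observed_2018 = [1, 15, 17, 19, 5, 2, 2, 3, 0, 0, 0]

expected = [n_matches * poisson_pmf(k, lam) for k in range(len(observed_2018))]

print("goals\texpected\tobserved\tdifference")
for k, (exp_k, obs_k) in enumerate(zip(expected, observed_2018)):
    print(f"{k}\t{exp_k:.1f}\t{obs_k}\t{obs_k - exp_k:+.1f}")

# Average goals per match implied by the observed column.
observed_mean = sum(k * obs for k, obs in enumerate(observed_2018)) / n_matches
print(f"observed average goals per match = {observed_mean:.2f}")
```

The last line shows the average implied by the observed column, which can be compared with the assumed rate of 2.5.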

The Poisson distribution describes the behavior of this process reasonably well. Similarly, you can use it to predict the number of goals per match in the next World Cup of 2022.

Poisson distribution formula

If the random variable X follows the Poisson distribution with λ average number of events per fixed interval, the probability of getting exactly k events in this fixed interval is given by:

$$f(k,\lambda) = P(k \text{ events in the interval}) = \frac{\lambda^{k} e^{-\lambda}}{k!}$$

where:

f(k,λ) is the probability of k events per fixed interval.

λ is the average number of events per fixed interval.

e is a mathematical constant approximately equal to 2.71828.

k! is the factorial of k and equals k × (k-1) × (k-2) × … × 1, with 0! defined as 1.
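The formula translates almost symbol for symbol into code. The sketch below gives a direct version and, as an optional variant, a version that works with logarithms so that large values of k do not overflow. Both function names are illustrative, and the logarithmic form is an implementation choice that the formula itself does not require.

```python
import math

def poisson_pmf_direct(k, lam):
    """Direct translation of the formula: lam^k * e^(-lam) / k!"""
    return lam**k * math.exp(-lam) / math.factorial(k)

def poisson_pmf_log(k, lam):
    """Same probability, computed through logarithms for numerical safety."""
    # log f = k*log(lam) - lam - log(k!), and math.lgamma(k + 1) equals log(k!)
    return math.exp(k * math.log(lam) - lam - math.lgamma(k + 1))

# The two versions agree for moderate k.
print(poisson_pmf_direct(2, 2.5), poisson_pmf_log(2, 2.5))

# The probabilities over all possible counts add up to 1.
print(sum(poisson_pmf_log(k, 10) for k in range(200)))
```

For a very large k with a decimal rate, `lam**k` in the direct version can raise an `OverflowError`, while the logarithmic version still returns a value.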

How to do the Poisson distribution?

To calculate the Poisson distribution for the number of events in a fixed interval, we only need the average number of events in a fixed interval.