Definitions

Define the following terms/concepts

Simple probability

\(\bf (1)\) A fair coin is thrown 3 times. What is the probability that at least one heads is observed?

\[P(\text{At least one heads}) = 1-P(\text{No Heads})\] The sample space of this event is \[\mathbb{S} = \begin{bmatrix} \text{No heads} = (H,H,H) \\ \text{1 Heads} = (H,T,T),(T,H,T),(T,T,H) \\ \text{2 Heads} = (H,H,T),(H,T,H),(T,H,H) \\ \text{All Heads} = (H,H,H) \end{bmatrix} \] The sample space consists of 8 possible outcomes and there is only one outcome with “No Heads”. \[ P(\text{At least one heads}) = 1-P(\text{No Heads}) = 1-\frac{1}{8} = \frac{7}{8} \]

\(\bf (2)\) Suppose that in a class of 30 students, 8 students are in band, 15 students play a sport, and 5 students are both in band and play a sport. Let \(A\) be the event that a student is in band and let \(B\) be the event that a student plays a sport. Create a Venn diagram that models this situation. What is the probability that a student is in the band or plays a sport?

Since these events are non-disjoint, we must use the formula for a union of two non-disjoint events: \[P(A\cup B) = P(A) + P(B) - P(A\cap B)\] \[= \frac{8}{30}+\frac{15}{30} - \frac{5}{30} =\frac{18}{30}\]

\(\bf (3)\) A standard deck of 52 cards contains four suits (Hearts, Diamonds, Clubs, and Spades). Each suit contains 13 cards (9 numbered cards and 3 face cards, and an Ace).

\[P(\text{Card is numbered}) = \frac{\text{Number of ways to get numbered card}}{\text{Total number of cards}} = \frac{9 \cdot 4}{52} \]

Let \(A\) be the event that a card drawn at random is an Ace and \(B\) be the event that a card drawn at random is a Spades. We are interested in the \(P(A\cap B)\) and we are told to assume independence. Therefore \[P(A\cap B) = P(A)\cdot P(B) = \frac{4}{52} \cdot \frac{13}{52} = \frac{1}{52}\]

\(\bf (4)\) An urn contains 5 green balls, 7 red balls, and 4 yellow balls. Two balls are selected at random with replacement. What is the probability that first ball is yellow and the second ball is red?

Since the balls are selected with replacement the two events are independent. Therefore, the intersection of the two events: \(A\) first ball yellow and \(B\) second ball is red is given by \[P(A\cap B) = \frac{4}{16} \cdot \frac{7}{16} = \frac{7}{64}\]

\(\bf (5)\) The probability that it rains \(R\) on a given day in Moscow, Idaho is \(P(R) = 0.1\). The probability that it is Windy \(W\) is \(P(W) = 0.2\), and the probability that it is cloudy \(C\) is \(P(C) = 0.7\). Assume that these three types of weather events are independent. What is the probability that a given day is both Rainy, Cloudy, and Windy?

We are told the events are independent and we wish to find the intersection of \(R\), \(W\) and \(C\): \[P(R\cap W\cap C) = (0.1)(0.2)(0.7) = 0.014 \]

Sampling Distributions, Means, and Variances

\(\bf (6)\) A random variable \(X\) has the following probability distribution

X P(X)
1 0.25
2 0.25
3 0.25
4 0.25
The sample space for a mean of two observations of \(X\) is given below.
Observation 1 Observation 2 \(\bar{X}\)
1 1 1.0
2 1 1.5
3 1 2.0
4 1 2.5
1 2 1.5
2 2 2.0
3 2 2.5
4 2 3.0
1 3 2.0
2 3 2.5
3 3 3.0
4 3 3.5
1 4 2.5
2 4 3.0
3 4 3.5
4 4 4.0
The sampling distribution of \(\bar{X}\) is given by:
\(\bar{X}\) \(P(\bar{X})\)
1.0 \(\color{red}{1(0.25^2)=0.0625}\)
1.5 \(\color{red}{2(0.25^2)=0.125}\)
2.0 \(\color{red}{3(0.25^2)=0.1875}\)
2.5 \(\color{red}{4(0.25^2)=0.25}\)
3.0 \(\color{red}{3(0.25^2)=0.1875}\)
3.5 \(\color{red}{2(0.25^2)=0.125}\)
4.0 \(\color{red}{1(0.25^2)=0.0625}\)

\[\mu =\sum_x xP(x) \] \[= 1(0.0625)+2(0.125)+\cdots +3.5(0.125)+4(0.0625)\] \[= 2.5 \]

\[\sigma =\sqrt{\sum_x (x-\mu)^2P(x)} \] \[= \sqrt{(1-2.5)^2(0.0625)+(2-2.5)^2(0.125)+\cdots +(3.5-2.5)^2(0.125)+(4-2.5)^2(0.0625)} \] \[= \sqrt{0.625} \] \[= 0.791\]

\(\bf (7)\) A random variable \(Y\) has two possible outcomes. The outcome \(Y = 1\) occurs with probability \(p = 0.23\). Derive the sampling distribution for the proportion of outcomes with \(Y = 1\) in a sample of \(n = 3\) observations of \(Y\).

The population distribution of \(Y\) is given by:
\(Y\) \(P(Y)\)
0 0.77
1 0.23

The sample space for the sample proportion of 3 observations of \(Y\) is given below.

Observation 1 Observation 2 Observation 3 \(\hat{p}\)
0 0 0 0.00
1 0 0 0.33
0 1 0 0.33
1 1 0 0.67
0 0 1 0.33
1 0 1 0.67
0 1 1 0.67
1 1 1 1.00

The sampling distribution of \(\hat{p}\) is given by:

\(\hat{p}\) \(P(\hat{p})\)
0.00 \(\color{red}{1(0.77^3)=0.457}\)
0.33 \(\color{red}{3(0.23 \times 0.77^2)=0.409}\)
0.67 \(\color{red}{3(0.23^2\times 0.77)=0.122}\)
1.00 \(\color{red}{1(0.23^3)=0.012}\)

\(\bf (8)\) Consider a survey of tree volume. Suppose that in the population of trees \(10\%\) have a volume of 10 cubic feet, \(20\%\) have a volume of 20 cubic feet, \(30\%\) have a volume of 30 cubic feet, and \(40\%\) have a volume of 40 cubic feet. Let \(X\) denote the volume of a randomly selected tree, assuming that every tree has an equal chance of being selected. The probability distribution of \(X\) is given in the table below.

X P(X)
10 0.1
20 0.2
30 0.3
40 0.4

\[\mu = \sum_x xP(x)\] \[ = 10(0.1)+20(0.2)+30(0.3)+40(0.4)\] \[ = 30\]

\[\sigma = \sqrt{\sum_x (x-\mu)^2P(x)}\] \[ = \sqrt{(10-30)^2(0.1)+(20-30)^2(0.2)+(30-30)^2(0.3)+(40-30)^2(0.4)}\] \[ = \sqrt{100}\] \[ = 10\]

The sample space for the sample mean of 2 observations of tree volume is given below.
Observation 1 Observation 2 \(\bar{X}\)
10 10 10
20 10 15
30 10 20
40 10 25
10 20 15
20 20 20
30 20 25
40 20 30
10 30 20
20 30 25
30 30 30
40 30 35
10 40 25
20 40 30
30 40 35
40 40 40

The sampling distribution of \(\bar{X}\) is given by:

\(\bar{X}\) \(P(\bar{X})\)
10 \(\color{red}{(0.1)^2=0.01}\)
15 \(\color{red}{2(0.1 \times 0.2)=0.04}\)
20 \(\color{red}{2(0.1\times 0.3)+0.2^2=0.1}\)
25 \(\color{red}{2(0.4\times 0.1)+2(0.2\times 0.3)=0.2}\)
30 \(\color{red}{2(0.2\times 0.4)+0.3^2=0.25}\)
35 \(\color{red}{2(0.3\times 0.4)=0.24}\)
40 \(\color{red}{(0.4)^2=0.16}\)

Deriving Probabilities From Discrete Distributions

\(\bf (9)\) Michael Dickson, who plays for the Seattle Seahawks, is a punter in the NFL. The following sampling distribution gives the mean punting distance (in yards) for a sample of \(n=4\) punts. Use this distribution to answer the following questions:

\(\bar{X} =\) Mean Punt Distance \(P(\bar{X})\)
35.33 0.40
45.67 0.50
55.00 0.08
60.33 0.01
65.67 0.01

\[ P(\bar{X} \geq 55) = 1- P(\bar{X} < 55)\] \[ = 1-(0.4+0.5) = 0.1 \]

\[ P(\bar{X} \geq 35.33) = 1\]

\[P(35.33\leq \bar{X} \leq 45.67) = P(X = 35.33)+P(X = 45.67) = 0.9\]

Let \(A\) be the event \(\bar{X} = 35.33\) and \(B\) be the event \(\bar{X} = 55\), then \[P(A\cup B) = P(X = 35.33)+P(X = 55) = 0.3+0.08 = 0.38 \]

\(\bf (10)\) A random variable \(X\) follows a binomial distribution with with probability of success \(p = 0.43\) and number of trials \(n = 8\). What is the probability of observing fewer than 3 successes, that is what is \(P(X < 3)\)?

Let \(X \sim Binomial(p = 0.43, n = 8)\) then

\[ \color{red}{P(X < 3) = P(X = 0)+P(X = 1)+P(X = 2)}\]

\[ \color{red}{= \sum_{k = 0}^2 \frac{8!}{k!(8 - k)!} \times 0.43^{k} \times (1-0.43)^{8-k} \approx 0.256}\]

\(\bf (11)\) Tom Brady is a famous quarterback who recently retired from playing in the National Football League. During his career, the probability that Brady would complete a pass in a given play was \(64.3\%\). If a typical NFL team runs about \(35\) offensive pass plays, what is the probability that Brady would complete exactly \(28\) pass plays in a game?

Let \(X \sim Binomial(p = 0.643, n = 35)\) then

\[\color{red}{P(X = 28) = \frac{35!}{28!(35 - 28)!} \times 0.643^{28}\times (1-0.643)^{35-28} \approx 0.021}\]

\(\bf (12)\) A random variable \(W\) follows a Poisson distribution with rate parameter \(\lambda = 1.8\) events per hour. What is the probability of observing exactly four events in a given hour, that is what is \(P(W = 4)\)?

Let \(W\sim Poisson(\lambda = 1.8)\) \[P(W = 4) = \frac{1.8^4\cdot e^{-1.8}}{4!} \approx 0.072 \]


\(\bf (13)\) A factory experiences on average 3 equipment malfunctions per day, following a Poisson distribution with a rate parameter of 3. Calculate the probability that on a randomly selected day, the factory will observe 3 or more equipment malfunctions.

let \(X\) represent the number of equipment failures in a given day for the factor, then \[X\sim Poisson(\lambda = 3)\] \[P(X\geq 3) = 1 - P(X< 3)\] \[P(X < 3) = P(X = 0)+P(X = 1)+P(X = 2)\] \[= 1-\sum_{k=0}^2 \frac{3^k\cdot e^{-3}}{k!} \approx 0.577\]

Deriving Probabilities From Continuous Distributions

\({\bf (14)}\) Use the plot below of the probability distribution of a continuous random variable \(X\) to answer the following questions

Use the plot to find the following probabilities:

In this plot the area comprising a single grid square is \[ \text{Area of a rectangle} = \text{Base}\times\text{Height} = 0.167\times 0.3 \approx 0.05\] there are six grid squares between \(0\) and \(0.6\), therefore \[P(X<0.6)= 6\times 0.05 = 0.3\]

\(P(X>0.3)= 1-P(X\leq 0.3) = 1-(2*0.05) = 0.9\)

\(P(1.5<X<1.8)=0.05\)

\({\bf (15)}\) Use the plot below of the probability distribution of a continuous random variable \(Y\) to answer the following questions

Use the plot to find the following probabilities (recall that the area of a triangle is \(\frac{1}{2} \times \text{base} \times \text{height}\))

In this plot the area comprising a single grid square is \[ \text{Area of a rectangle} = \text{Base}\times\text{Height} = 0.25\times 0.16 \approx 0.04\] There are \(14\) grid squares in the area between \(-0.25\) and \(1.0\) \(P(Y>-0.25)=14\times 0.04 = 0.56\)

There are \(16\) grid squares in the area between \(-0.5\) and \(0.25\) therefore \(P(-0.5<Y<0.25)=16\times 0.04 = 0.64\)

\({\bf (16)}\) Use the plot below of the probability distribution of a continuous random variable \(Z\) to answer the following questions

Use the plot to find the following probabilities (recall that the area of a triangle is \(\frac{1}{2} \times \text{base} \times \text{height}\))

\(P(Z<2)=0.5(2)(0.3) = 0.3\)

In this plot a single grid square has an area of \(0.05\) thus the area between \(Z = 2\) and \(Z = 3\) is given by \(P(2<Z<3)=2(0.05)+(0.5)(1)(0.2) = 0.2\)

\(P(Z>4) = P(Z<2) =0.3\)

\(Q1 \approx 2\)

\({\bf 17.)}\) Assume the continuous random variable \(X\) follows a normal distribution with mean \(\mu = 25\) and variance \(\sigma^2 = 16\).

Using a \(Z\)-table, find the following (approximate) probabilities for \(X\). You can also use the app in the course website or StatDistributions.com to check your result

\[ z = \frac{15-25}{4} = -2.5\] \[ P(Z<-2.5) = 0.006\]

\[ z = \frac{28-25}{4} = 0.75\] \[ P(Z>0.75) = 1-P(Z\leq 0.75) = 1-0.773 = 0.227\]

\[ z_{\text{lower}} = \frac{12-25}{4} = -3.25\] \[ z_{\text{upper}} = \frac{20-25}{4} = -1.25\] \[ P(-3.25 < Z < -1.25) = P(Z<-1.25) - P(Z<-3.25) = 0.106 - 0.001 = 0.105\]

\[ z = \frac{30-25}{4} = 1.25\] \[ P(Z>1.25) = 1-P(Z\leq 1.25) = 1-0.894 = 0.106 \]

\[ z_{\text{lower}} = \frac{25-25}{4} = 0\] \[ z_{\text{upper}} = \frac{33-25}{4} = 2\] \[ P(0<Z<2) = P(Z<2) - P(Z<0) = 0.977 - 0.5 = 0.477\]

the \(95th\) percentile of a standard normal distribution is \(z=1.645\) plugging this into the \(z\)-score equation and solving for \(X\) gives \[ (z\times \sigma) + \mu = (1.645\times 4) + 25 = 31.58 \]

the \(40th\) percentile of a standard normal distribution is \(z=-0.253\) \[ (z\times \sigma) + \mu = (-0.253\times 4)+25 = 23.9 \]

\({\bf 18.)}\) Suppose that a random variable \(X\) follows a normal distribution with mean \(\mu = -5\) and standard deviation \(\sigma = 1.8\). Using the plot of a the \(Z\)-distribution below, shade in the area corresponding to the probability \(P(-2< X <0.5)\)

start by finding the \(z\)-scores\[ P(-2<X<0.5) = P(X<0.5) - P(X<-2) \] \[ z_{\text{lower}} = \frac{0.5 - (-5)}{1.8} = 3.06\] \[ z_{\text{upper}} = \frac{-2 - (-5)}{1.8} = 1.67\] \[P(-2<X<0.5) = P(1.67< Z < 3.06) \]