\(\bf (1)\) Rocket League is a popular online video game and esport that was released in 2015. The game enjoys a healthy following of around 93 million players per month. It features players from around the world who compete in sports like soccer (football), basketball, and hockey while controlling RC-like vehicles. The most popular game mode is doubles soccer, in which two teams of two players try to outscore each other by hitting a soccer ball into the opposing goal. In the course of each match, the player-controlled vehicles often collide in what is commonly referred to as a “bump”. The following histogram shows the distribution of the number of “bumps” per match for a given player of Rocket League.
Assuming a mean of \(\bar{x} = 355\) and standard deviation of \(s = 69\), in approximately what proportion of matches did the player have more than 423 bumps?
Approximately what proportion of matches did the player have more than 217 bumps but fewer than 492 bumps?
How many bumps would the player need to have in a match to be in the top \(2.5\%\) of matches?
After playing a new match, the player logs 620 bumps. Calculate the \(z\)-score for this match and explain what the \(z\)-score means in relation to the average number of bumps. How many standard deviations is the player’s performance from the mean number of bumps?
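As a rough check on the arithmetic in this problem, the sketch below computes the empirical-rule bands and a \(z\)-score from the stated mean and standard deviation. The helper name `z_score` and the `bands` dictionary are illustrative choices, not part of the assignment, and the bands are only meaningful if the histogram is roughly bell-shaped.

```python
def z_score(x, mean, sd):
    """Number of standard deviations that x lies above (+) or below (-) the mean."""
    return (x - mean) / sd


mean, sd = 355, 69  # values given in the problem statement

# Empirical rule (assumes an approximately normal shape): roughly 68%, 95%,
# and 99.7% of observations fall within 1, 2, and 3 SDs of the mean.
bands = {k: (mean - k * sd, mean + k * sd) for k in (1, 2, 3)}

z_new = z_score(620, mean, sd)  # the new match with 620 bumps

print(bands)
print(round(z_new, 2))
```

The cutoffs quoted in the questions (423, 217, 492) sit within one bump of these band edges, which is why the empirical rule gives approximate rather than exact proportions.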
Figure 1: Pinniped whisker morphology (Ginter et al. 2012) | A juvenile Harp Seal |
---|---|
\(\bf (2)\) Pinnipeds (seals and sea lions) possess the largest vibrissae (whiskers) among mammals, and their vibrissal hair shafts demonstrate a diversity of shapes (see Figure 1). In a study by Ginter et al. (2012), researchers measured 9 characteristics of pinniped whiskers in individuals from 9 species of seals and sea lions. Their goal was to better characterize whisker morphology and evolution. Table 1 gives the recorded whisker lengths of 20 Harp Seals. Note that the data have been sorted by increasing “Total Length (cm)” for your convenience. Use the table to answer the following questions:
Observation | Whisker Total Length (cm) | Number of Beads |
---|---|---|
1 | 4.18 | 2 |
2 | 4.23 | 2 |
3 | 4.31 | 2 |
4 | 4.31 | 2 |
5 | 4.33 | 2 |
6 | 4.34 | 2 |
7 | 4.35 | 2 |
8 | 4.38 | 2 |
9 | 4.40 | 2 |
10 | 4.41 | 2 |
11 | 4.43 | 3 |
12 | 4.44 | 2 |
13 | 4.46 | 2 |
14 | 4.47 | 3 |
15 | 4.48 | 3 |
16 | 4.51 | 2 |
17 | 4.59 | 3 |
18 | 4.60 | 2 |
19 | 4.62 | 2 |
20 | 5.08 | 3 |
## [1] "comma separated values = 4.18,4.23,4.31,4.31,4.33,4.34,4.35,4.38,4.4,4.41,4.43,4.44,4.46,4.47,4.48,4.51,4.59,4.6,4.62,5.08"
Make a boxplot of the variable “Total Length” and make note of any outliers.
Observation 20 has a total whisker length of 5.08 cm. How many standard deviations is this observation from the mean whisker length? (Hint: compute the \(z\)-score for this observation.)
\(\bf (3)\) Use the following pair of boxplots constructed from the pinniped whisker data in \(\textbf{Table 1}\). The boxplots show the distribution of total whisker length (cm) for whiskers with 2 beads vs. whiskers with 3 beads. Answer the following questions:
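One way to check a hand-drawn boxplot is to compute the quartiles and the \(1.5 \times IQR\) fences directly. The sketch below does this for the 20 lengths in Table 1 and also computes the \(z\)-score of the largest observation. It assumes the "inclusive" quartile convention of Python's `statistics.quantiles`; other software (including R's default) may give slightly different quartiles, so treat the fences as approximate.

```python
import statistics

# Whisker total lengths (cm) from Table 1, already sorted
lengths = [4.18, 4.23, 4.31, 4.31, 4.33, 4.34, 4.35, 4.38, 4.40, 4.41,
           4.43, 4.44, 4.46, 4.47, 4.48, 4.51, 4.59, 4.60, 4.62, 5.08]

# Quartiles (assumed convention: linear interpolation, "inclusive" method)
q1, median, q3 = statistics.quantiles(lengths, n=4, method="inclusive")
iqr = q3 - q1

# Boxplot fences: points beyond 1.5 * IQR from the quartiles are flagged
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [x for x in lengths if x < lower_fence or x > upper_fence]

# z-score of observation 20, using the sample standard deviation (n - 1)
mean = statistics.mean(lengths)
sd = statistics.stdev(lengths)
z_obs20 = (lengths[-1] - mean) / sd

print(q1, median, q3, outliers)
print(round(z_obs20, 2))
```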
## [1] "-----------------2 bead whisker lengths-----------------"
## [1] "comma separated values = 4.3,4.6,4.5,4.6,4.2,4.3,4.3,4.2,4.4,4.3,4.5,4.3,4.4,4.4,4.4"
## [1] "-----------------3 bead whisker lengths-----------------"
## [1] "comma separated values = 4.6,5.1,4.5,4.4,4.5"
Using the boxplots above, do the whisker lengths for 2-bead and 3-bead whiskers follow a normal distribution? Why or why not?
Suppose researchers observe two new Harp Seals (denoted Seal 1 and Seal 2) and record their total whisker lengths and the number of beads on the whiskers. Seal 1 has a total whisker length of \(4.32\) cm and the whisker has 3 beads. Seal 2 has a total whisker length of \(4.34\) cm but the whisker has only 2 beads. Suppose we want to know how they compare relative to the distribution of whiskers with the same number of beads. Confirm that the whisker length of Seal 1 has a \(z\)-score of \(-1.07\) while Seal 2 has a \(z\)-score of \(-0.43\). Refer to the boxplots above.
If the researchers were to observe the whisker length of a new seal and the whisker had 3 beads, how long or short would the whisker of this observation have to be for it to be considered an outlier? (Assume the data are approximately normal.)
\(\bf (4)\) Use the following plot of the cumulative distribution of a quantitative variable \(x\) to answer questions (a)-(c).
## [1] "Comma separated values = -4.7,-2.5,-2.3,-1.3,-1.3,-1.2,0,0,1,1.8,2.1,2.3,2.6,2.8,2.8,2.9,3.5,3.6,3.7,3.7,4.3,4.8,4.9,5.2,5.3,5.5,6,6.5,6.6,7.1,7.1,7.2,7.4,7.7,7.8,8.1,8.1,8.1,8.5,8.7,9,9,10,10.2,10.2,11,11,11.1,11.1,11.5"
Assume that the mean and standard deviation of \(X\) are \(\bar{x} = 5\) and \(s = 4\). Use these values to find the 2.5th and 97.5th percentiles of \(X\). Compare your answer with the cumulative distribution plot above. Why might the two answers differ? (Hint: think about shape and how that relates to the empirical rule.)
Assume the above cumulative distribution represents an approximately normal distribution (symmetric, bell-shaped) with a standard deviation of \(s = 4\) and a mean of \(\bar{x} = 5\). Under the empirical rule, what percentage of the observations will have a value between \(-3\) and \(13\)?
Using the mean and standard deviation from parts (a) and (b), compute and interpret the \(z\)-scores for the observations \(x_1 = 11.5\) and \(x_2 = -4.7\). Would these two observations be considered outliers under the \(\pm 2s\) rule?
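The comparison between empirical-rule percentiles and the observed data can be sketched numerically. The code below uses the 50 values printed above together with the stated \(\bar{x} = 5\) and \(s = 4\). The percentile convention (`method="inclusive"`) is an assumption, so the sample percentiles are approximate and may not match what is read off the plot exactly.

```python
import statistics

# The 50 observations listed in the comma separated output above
x = [-4.7, -2.5, -2.3, -1.3, -1.3, -1.2, 0, 0, 1, 1.8,
     2.1, 2.3, 2.6, 2.8, 2.8, 2.9, 3.5, 3.6, 3.7, 3.7,
     4.3, 4.8, 4.9, 5.2, 5.3, 5.5, 6, 6.5, 6.6, 7.1,
     7.1, 7.2, 7.4, 7.7, 7.8, 8.1, 8.1, 8.1, 8.5, 8.7,
     9, 9, 10, 10.2, 10.2, 11, 11, 11.1, 11.1, 11.5]

mean, sd = 5, 4  # values stated in the problem

# Empirical rule: about 95% of a bell-shaped distribution lies within
# mean +/- 2 SD, so these limits approximate the 2.5th and 97.5th percentiles.
lower, upper = mean - 2 * sd, mean + 2 * sd

# Percentiles taken from the sample itself (40 equal slices of 2.5% each;
# the first and last cut points are the 2.5th and 97.5th percentiles).
cuts = statistics.quantiles(x, n=40, method="inclusive")
p2_5, p97_5 = cuts[0], cuts[-1]

# Share of observations that actually fall inside mean +/- 2 SD
share_within = sum(lower <= v <= upper for v in x) / len(x)

# z-scores of the largest and smallest observations
z_max = (11.5 - mean) / sd
z_min = (-4.7 - mean) / sd

print(lower, upper, p2_5, p97_5, share_within)
print(z_max, z_min)
```

If the sample percentiles sit well inside the \(\pm 2s\) limits on one side and not the other, that asymmetry is a sign the distribution is not bell-shaped.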
\(\bf (5)\) Define the following terms:
Explanatory variable
Response variable
Experimental study
Observational study
Survey
\(\bf (6)\) Name the sampling method used in each of the following situations:
A man is standing outside of a grocery store handing out questionnaires to shoppers asking them to evaluate their shopping experience and the quality of service. He does not ask shoppers who have their hands full carrying groceries, but instead asks all shoppers who have only a few items or have both hands free to fill out the questionnaire.
A gardener has a garden box in which he has planted 10 rows of corn. Each row consists of eight corn plants. The gardener wishes to know the average height of the corn plants in the garden box, so he randomly selects rows 1, 5, and 9 and measures the height of all corn plants in those rows.
A company wishes to know how employees feel about compensation and benefits. Over the next few weeks, the company asks 30 employees from each of its branch locations to fill out a survey regarding compensation.
A local polling company wishes to know what proportion of citizens in Moscow, Idaho are voting for the Republican candidate in the mayoral race. The polling company records this information for every 5th voter who leaves the voting booth.
A lumber company wishes to estimate the amount of usable lumber on a plot of land. The company randomly selects 100 trees and measures their diameter, height, and volume, in order to estimate the number of board feet of each tree.
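To make the mechanics of the common probability sampling designs concrete, here is a minimal sketch on a hypothetical population of 10 groups with 8 units each. The population, the function names, and the fixed seed are all illustrative assumptions. Convenience sampling is not included because it involves no random selection to simulate.

```python
import random

rng = random.Random(0)  # fixed seed so the sketch is reproducible

# Hypothetical population: 10 groups of 8 units, labelled (group, unit)
population = [(g, u) for g in range(1, 11) for u in range(1, 9)]


def simple_random_sample(pop, n):
    """Every subset of n units is equally likely to be chosen."""
    return rng.sample(pop, n)


def systematic_sample(pop, k):
    """Pick a random starting point, then take every k-th unit."""
    start = rng.randrange(k)
    return pop[start::k]


def stratified_sample(pop, n_per_group):
    """Draw a random sample of fixed size from every group (stratum)."""
    sample = []
    for g in sorted({grp for grp, _ in pop}):
        members = [p for p in pop if p[0] == g]
        sample.extend(rng.sample(members, n_per_group))
    return sample


def cluster_sample(pop, n_groups):
    """Randomly choose whole groups and keep every unit inside them."""
    groups = sorted({grp for grp, _ in pop})
    chosen = set(rng.sample(groups, n_groups))
    return [p for p in pop if p[0] in chosen]


srs = simple_random_sample(population, 16)
systematic = systematic_sample(population, 5)
stratified = stratified_sample(population, 2)
cluster = cluster_sample(population, 3)
print(len(srs), len(systematic), len(stratified), len(cluster))
```

The key contrast is between the last two functions: a stratified sample takes some units from every group, while a cluster sample takes every unit from some groups.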