For the following data score represents the test score,
prep_time represents the preparation time (in hours) and
attend represents if a person attended the lectures.
| score | prep_time | attend |
|---|---|---|
| 1 | 0 | none |
| 5 | 5 | some |
| 10 | 10 | all |
| 9 | 14 | all |
| 4 | 3 | some |
| 7 | 5 | all |
| 11 | 14 | all |
| 8 | 8 | all |
| 3 | 6 | some |
| 2 | 5 | none |
Draw a histogram for score using cutoffs
0,3,6,9,12.
What is the score range?
What are the means and standard deviations of score
and prep_time?
You got 13 score on the test. Find z-score.
If the histogram was symmetric and bell-shaped, would it be a good z-score?
Draw score vs prep_time scatteplot. Do
you think there is a relationship?
What is the correlation between score and
prep_time? Does this confirm your scatterplot
findings.
Compute the distribution table (relative frequencies) for the
attend variable.
Draw a stacked diagram for attend.
Find the average value of score and
prep_time for each category of attend. (Try to
interpret these values :)