a. Make a scatterplot of the data. Which variable is the explanatory variable?
b. Is the association between the variables positive or negative?
c. Find the least squares regression line for predicting cancer deaths from the index of exposure. For each of the exposure indexes, compute the predicted value of cancer mortality and the associated residual.
d. What percentage of the variation in cancer deaths is explained by using the index of exposure?
e. Interpret the value of the slope in the least squares line. i.e., explain what this slope says about the change in cancer death rates for different exposure indexes.
f. Plot the residuals versus the index of exposure. What does the plot indicate about the adequacy of the linear fit?
g. Make another scatterplot of the data and include the least squares line on the plot.
h. Suppose you lived in a county with radioactive contamination index of exposure equal to 5. Use the least squares line to predict the cancer mortality in your home county.
i. Compute the correlation coefficient r between index of exposure and cancer mortality.
j. Create two new variables x* = 10x and y*=y/10. This can be done easily by using Calc > Calculator to create:c4=10*c1k. Make a scatterplot of the transformed indexes and mortality rates. Does this plot have the same appearance as the plot you constructed in part a?
c5=c2/10
l. Is the correlation coefficient for the transformed values the same as the correlation coefficient for the original values?
m. Does the slope of the least squares line of y* on x* have the same slope as the regression line of y on x?
MTB > correlation 'enroll' 'gate'
Correlation of Enroll and Gate = 0.701
MTB > describe 'enroll' 'gate'
N | MEAN | MEDIAN | TRMEAN | STDEV | SEMEAN | |
Enroll | 17 | 2009 | 2007 | 2024 | 657 | 159 |
Gate | 17 | 247235 | 254116 | 247827 | 104807 | 25419 |
MIN | MAX | Q1 | Q3 | |
Enroll | 810 | 2980 | 1622 | 2642 |
Gate | 54738 | 430858 | 151011 | 347350 |
a. Find the equation of the least squares line for predicting the gate count from enrollment.
b. What percentage of the variation in the gate counts is explained by enrollments?
c. Predict the number of persons that will use the library at a small liberal arts college with an enrollment of 1445.
d. One of the reporting colleges has an enrollment of 2200 and a gate count of 130000. Find the value of the residual for this college.