1) p. 641, problem 5
Hypothesized
proportions
1 0.033333
2 0.066667
3 0.100000
4 0.133333
5 0.166667
6 0.166667
7 0.133333
8 0.100000
9 0.066667
10 0.033333
H0: pi
= (5.5-|i - 5.5|)/30
Ha: at
least one pi differs from this value.
Chi-Square Goodness-of-Fit Test for
Observed Counts in Variable: obs counts
Using category names in
location
Test Contribution
Category Observed
Proportion Expected to Chi-Sq
1 4 0.033333
6.6667 1.06667
2 15 0.066667
13.3333 0.20833
3 23 0.100000
20.0000 0.45000
4 25 0.133333
26.6667 0.10417
5 38 0.166667
33.3333 0.65333
6 31 0.166667
33.3333 0.16333
7 32 0.133333
26.6667 1.06667
8 14 0.100000
20.0000 1.80000
9 10 0.066667
13.3333 0.83333
10 8 0.033333
6.6667 0.26667
N DF Chi-Sq P-Value
200 9 6.6125 0.677
Since the p-value
> .10, we fail to reject H0.
There is no evidence that the data is not consistent with the previously determined proportions.
2) p. 662, problem 32
First, looking at the data through segmented bar graphs:


We see that in the sample conservatives tend to use marijuana less frequently (5% of conservatives in frequently category) and “other” more frequently (28% of others in frequently category). To see if these differences are statistically significant, we can run a chi-square test.
H0: no relationship between marijuana usage and political views in the population.
Ha: there is a relationship.
Expected counts are printed
below observed counts
Chi-Square contributions are
printed below expected counts
never rarely
frequently Total
1
479 173 119
771
494.38 151.46 125.17
0.478
3.064 0.304
2
214 47 15
276
176.98
54.22 44.81
7.746
0.961 19.828
3
172 45 85
302
193.65
59.33 49.03
2.420
3.459 26.394
Total 865
265 219 1349
Chi-Sq = 64.654, DF = 4,
P-Value = 0.000
From Minitab, all expected cell counts exceed 5 (smallest = 44.81).
The chi-square value is 64.654 with df=4 and p-value =.000. The p-value is < .01, so we reject the null hypothesis.
There is strong evidence of a relationship between marijuana usage and political views for this population of high school and college students. Since this was an observational study, no causal conclusions can be drawn.
The largest contributions to the chi-square sum come from “other, frequently” and “conservative, frequently” supporting what we observed in the segmented bar graphs.
3) p. 666, problem 47
(a) Since we want to
compare proportions in three groups, we will use a chi-square test.
If we assume these
were three independent samples (soccer players, nonsoccer
athletes, controls), the best way to state the null hypothesis is

Descriptively, it
appears that soccer players do get more concussions (about 50%) compared to 29%
and 15% in the other groups.
H0: p1
= p2 = p3 (the population proportions with
at least one concussion are the same)
Ha: at
least one pi differs
Chi-Square Test: soccer, nonsoccer, control
Expected counts are printed
below observed counts
Chi-Square contributions are
printed below expected counts
soccer nonsoccer control
Total
1
45 28
8 81
30.71 32.40
17.89
6.647 0.598
5.465
2
46 68 45
159
60.29 63.60
35.11
3.386 0.304
2.784
Total 91
96 53 240
Chi-Sq = 19.184, DF = 2,
P-Value = 0.000
Since the p-value is
small (.000 < .001), we reject the null hypothesis and conclude that at
least one population has a different proportion with concussions.
(b) This shows a
moderate negative relationship indicating (as long as the relationship is
linear) that an increase in soccer exposure is associated with a lower score on
the immediate memory recall test.
Other it didn’t tell
us to, we can check whether this sample proportion is statistically
significant.
H0: r = 0 (no association in the population)
Ha: r ≠ 0 (there is an association)
t = r
= -.220(91-2)/(1-.2202)
= -2.13
Comparing this to
the t table with 89 degrees of freedom, we can round down to df = 60 and see that our two-sided
p-value would be between 2(.01) =.02 and 2(.025) = .05. This indicates that the p-value is small
enough to reject the null hypothesis. We
have moderate evidence (below .05 but above .01) of an association between
memory performance and how much soccer is played for this population of soccer
players.
(c) Let m1 = population mean test score for soccer players and m2 for the nonsoccer players.
Using a two-sample t-test
in Minitab:
Two-Sample T-Test and CI
Sample N Mean StDev SE Mean
1 26 37.50
9.13 1.8
2 56
39.6 10.2 1.4
Difference = mu (1) - mu (2)
Estimate for
difference: -2.13000
95% CI for difference: (-6.63998, 2.37998)
T-Test of difference = 0 (vs not =): T-Value = -0.95 P-Value = 0.348 DF = 54
With a large p-value
(.348>.1), we do not have evidence that the mean oral word-association test
score differs between soccer players and nonsoccer
athletes.
(d) Now we want to
run an ANOVA to compare the three population means
H0: m1 = m2 = m3 (the three populations have the
same amount of concussions on average)
Ha: at least one
population has a different amount of concussions on average
We can calculate MSE
by pooling the variances:
= .5244
We can calculate MSTr by comparing the sample means to the overall mean
overall
mean = [92(.3) + 97(.49) + 53(.19)]/237 = .359
SSTr =
91(.3-.359)2 + 96(.49-.359)2 + 52(.19-.359)2 =
3.47
So MSTr = 3.47/(3-1) = 1.73
F = 1.73/.5244 =
3.30
The degrees of
freedom are 2 and 237. Using Table A.9,
with df = 2, 200, the
p-value is between .01 and .05. So we have moderate evidence that that the
average number of concussions differs in at least one of these populations.
4) p. 708, problem 5

5) p. 708, problem 7
Need mean of
’s and mean of s’s: 12.95+
3(.526)/(.940sqrt(5)) gives LCL=12.20, UCL=13.70 Every one of the 22
values is well within
these limits, so the process appears to be in control with respect to location.
6) p. 708, problem 12 No points fall outside 2-sigma limits and only two points fall outside the 1-sigma limits. There are no runs of eight on the same side of the center line.
7) p. 717, problem 19 (a) pbar=578/(100*25) = .231
(b) .231 + 3sqrt(.231(.769)/100) gives LCL=.105 and UCL=.357
39/100 exceeds UCL generating an out-of-control signal.
8) p. 717, problem 23
=4.08, 4.08 + 3 sqrt(4.08) gives LCL=-2.0 or 0 and UCL=10.14. No values exceed 10.1 so the process is in
control.