Stat 322 – HW 5 Solutions

 

1) problem 1 (p. 453)

(a) F = MSA/SSA = (30.6/4)/(59.2/4 × 3) =  1.55

Compare this to F(4, 12).

            From Table: 1.55 < 2.48 so p-value > .100  (F > 3.26 so p-value > .05)

            From Minitab: 1-.7499 = .2501

F distribution with 4 DF in numerator and 12 DF in denominator

   x  P( X <= x )

1.55     0.749909

With such a large p-value, we fail to reject the null hypothesis and conclude that the true average tire lifetimes do not differ across the makes of cars.

 

(b) F = MSB/SSE = (44.1/3)/(59.2/12) = 2.98

Compare this to F(3,12, .05) = 3.49

Since our F-statistic is less than 3.49, this test is not significant at the.05 level.  We conclude that there is no difference in true average tire lifetime due to different brands of tires.

 

2) problem 18 (p. 464), feel free to use Minitab

Note: You can get Minitab to store the fitted values and residuals automatically!

Using Stat > ANOVA > Two-way:

Two-way ANOVA: Yield versus Speed, Formulat

 

Source       DF       SS       MS       F      P

Speed         2   230.81   115.41   19.27  0.000

Formulat      1  2253.44  2253.44  376.27  0.000

Interaction   2    18.58     9.29    1.55  0.252

Error        12    71.87     5.99

Total        17  2574.70

 

S = 2.447   R-Sq = 97.21%   R-Sq(adj) = 96.05%

 

(a) F = (18.58/(3-1)×(2-1))/(71.87/12) = 9.29/5.99 = 1.55

            We could compare to F(2,12,.05) = 3.89

            Since our F-statistic < 3.89, this is not significant at the 5% level.

            Minitab gives us a large p-value.

Fail to reject H0: no interaction between speed and formulation.

 

(b) It appears that yield depends on both formulation (F = 376.27, p-value < .001) and speed (F = 19.27, p-value < .001).

 

(c) The main effects are just the group means (the row and column totals in bold).  The fitted values are just the cell means (the average of the 3 values inside the cell).

 

 

 

speed

 

 

 

 

60

70

80

 

formulation

1

189.47

180.6

191.03

187.03

 

2

166.20

161.03

166.73

164.66

 

 

177.83

170.82

178.88

175.85

 

To find the estimated interaction effects, take the cell average, subtract each main effect for that cell and add the overall mean:

                        g11-hat = 189.47 – 187.03-177.83 + 175.85 = .46

 

The first residual is 189.7 – 189.47 = .233 etc.

 

3) A 1989 study (Capron & Duyme found in Ramsey and Schafer) investigated the effect of heredity and environment on intelligence.  From adoption registers in France, researchers elected samples of adopted children whose biological parameters and adoptive parameters came from either the very highest or the very lowest socioeconomic status (SES) categories (based on years of education and occupation).  They attempted to obtain samples of size 10 from each combination: (1) high adoptive SES and high biological SES, (2) high adoptive SES and low biological SES, (3) low adoptive SES and high biological SES, and (4) low SES for both parents.  They 38 selected children were given intelligence quotient (IQ) tests.

SES of Adoptive

SES of biological

IQ scores of adopted children

High

High

136

99

121

33

125

131

103

115

116

117

High

Low

94

103

99

125

111

93

101

94

125

91

Low

High

98

99

91

124

100

116

113

119

 

 

Low

Low

92

91

98

83

99

68

76

115

86

116

(a) Identify the observational units, explanatory variable(s), and response variable in this study.  Identify each variable as quantitative or qualitative.  Is this an observational study or an experiment? 

observational units = adopted children

explanatory variable 1 = SES of adoptive parents (categorized)

explanatory variable 2 = SES of biological parents (categorized)

response variable = IQ score (Quantitative)

This is an observational study since the explanatory variables were observed, not manipulated (imposed) by the researchers.

 

(b) Enter these data into a blank Minitab worksheet, being careful to treat each column as a different variable.  How many rows should you end up with?

There should be 38 rows, one for each child.

 

(c) Run a one-way ANOVA to see if the mean IQ scores differ significantly depending on the SES of the adoptive parents.  Report the F statistic, p-value, and your conclusion.

Analysis of Variance for IQ

 

Source        DF       SS     MS     F      P

SES adoptive   1    531.3  531.3  1.42  0.241

Error         36  13454.6  373.7

Total         37  13985.9

 

F = 1.42, p-value = .241.  From this large p-value (> .05), we would fail to reject the null hypothesis that the average IQ of the children is the same for the High SES and Low SES groups of the adoptive parents.

 

(d) Run a two-sample t-test to decide whether there is a difference in  the mean IQ scores for the high and low SES adoptive parents.  How does your p-value compare to that in (d)?  What technical condition is required by the procedure in (c) but not in (d).

Two-sample T for IQ

 

SES

adoptive   N   Mean  StDev  SE Mean

high      20  106.6   22.1      4.9

low       18   99.1   15.6      3.7

 

 

Difference = mu (high) - mu (low)

Estimate for difference:  7.48889

95% CI for difference:  (-5.04656, 20.02434)

T-Test of difference = 0 (vs not =): T-Value = 1.21  P-Value = 0.233  DF = 34

 

The p-value is similar.  They would be identical if we had used a pooled t-test but that (and ANOVA) require equal population variances.

 

(e) What happens if you try to run Stat > ANOVA > Two-Way, using the IQ scores as the response, the SES of the adoptive parents as one factor and the SES of the biological parents as the other?

Since we are missing two of the low, high children, we do not have a balanced design so this command will not work.

 

(f) Instead, choose Stat > ANOVA > General Linear Model, entering the IQ scores as the response, and using the two SES variables in the Model box.  Click OK.

 

Analysis of Variance for IQ, using Adjusted SS for Tests

 

Source          DF   Seq SS   Adj SS  Adj MS     F      P

SES adoptive     1    531.3    452.0   452.0  1.27  0.267

SES biological   1    998.5    998.5   998.5  2.81  0.103

Error           35  12456.0  12456.0   355.9

Total           37  13985.9

 

- What is the p-value for the SES adoptive factor?  What hypotheses are being tested?

p-value = .267, we fail to reject that the mean IQ is the same for the high SES adoptive and low SES adoptive populations (H0: mAdopt low = madopt high vs. Ha: madopt lowmadopt ­high or equivalently H0: a1 = a2 = 0 vs. Ha: at least one a ≠ 0)

 

- What is the p-value for the SES biological factor? What hypotheses are being tested?

p-value = .103, we fail to reject that the mean IQ is the same for the high SES adoptive and low SES biological populations. (H0: mbio low = mbio high vs. Ha: mbio lowmbio ­high or equivalently H0: a1 = a2 = 0 vs. Ha: at least one a ≠ 0)

 

(g)

- What does the interactions plot reveal about the overall effect of the adoptive parents’ SES?

The children’s IQs tend to be higher when the adoptive parents’ SES is higher.

 

- What does the interactions plot reveal about the overall effect of the biological parents’ SES?

The children’s IQs tend to be higher when the biological parents’ SES is higher.

 

- Does the difference in mean scores for those with high and low SES biological parents depend on whether the adoptive parents were high or low SES?

Yes, the difference (in mean IQ for high and low SES biological parents) is larger for the children of low SES adoptive parents than for children of high SES adoptive parents.

 

Analysis of Variance for IQ, using Adjusted SS for Tests

 

Source                       DF   Seq SS   Adj SS  Adj MS     F      P

SES adoptive                  1    531.3    416.2   416.2  1.15  0.290

SES biological                1    998.5   1047.6  1047.6  2.90  0.097

SES adoptive*SES biological   1    194.8    194.8   194.8  0.54  0.467

Error                        34  12261.2  12261.2   360.6

Total                        37  13985.9

 

(h) Return the Session window and report the p-value for the Interaction effect.  Does the size of this value make sense based on the graph?

The p-value is rather large, indicating that the interaction effects are not statistically significant.  This makes sense since the lines are not that far from parallel.

 

(i) If there is no interaction (your answer to the last question in (f) is no), then how much is the mean IQ score affected by the SES of adoptive parents, and how much is it affected by the SES of the biological parents? Is one of these effects larger than the other? Hint: Use multiple comparison procedures to help answer this question.

 

Tukey 95.0% Simultaneous Confidence Intervals

Response Variable IQ

All Pairwise Comparisons among Levels of SES adoptive

SES adoptive = high  subtracted from:

 

SES

adoptive   Lower  Center  Upper  -------+---------+---------+---------

low       -19.23  -6.650  5.930  (-----------------*----------------)

                                 -------+---------+---------+---------

                                    -14.0      -7.0       0.0

Tukey Simultaneous Tests

Response Variable IQ

All Pairwise Comparisons among Levels of SES adoptive

SES adoptive = high  subtracted from:

 

SES       Difference       SE of           Adjusted

adoptive    of Means  Difference  T-Value   P-Value

low           -6.650       6.190   -1.074    0.2903

 

 

Tukey 95.0% Simultaneous Confidence Intervals

Response Variable IQ

All Pairwise Comparisons among Levels of SES biological

SES biological = high  subtracted from:

 

SES biological   Lower  Center  Upper   ---+---------+---------+---------+---

low             -23.13  -10.55  2.030   (-----------------*-----------------)

                                        ---+---------+---------+---------+---

                                       -21.0     -14.0      -7.0       0.0

Tukey Simultaneous Tests

Response Variable IQ

All Pairwise Comparisons among Levels of SES biological

SES biological = high  subtracted from:

 

                Difference       SE of           Adjusted

SES biological    of Means  Difference  T-Value   P-Value

low                 -10.55       6.190   -1.704    0.0974

 

The mean IQ score is up to 19.23 points higher in the high SES adoptive group compared to the low SES adoptive group though zero is included in the confidence interval.

The mean IQ score is up to 23.13 points higher in the high SES biological group compared to the low SES biological group, though again zero is included in the interval.  However, comparing the two, this effect appears to be larger.