Stat 217 -
HW 2 Solutions
1) (2 pts) Take the “Learning Styles Survey”
2) For each research question below
(i)
Identify the observational units and variable of interest
(ii)
Identify the population of interest
(iii)
Describe in words the parameter of interest
(iv)
Specify a sampling frame (p. 34) you
could use (e.g., list of all registered students at Cal Poly) to select a
random sample from the population and briefly outline your sampling process.
(a) Estimate
the average number of words on a page of a history textbook.
(i)
observational units = pages, variable = number of words
(ii)
population = all pages in history textbook
(iii)
parameter = average number of words per page in the book
(iv)
sampling frame = list of page numbers with words
(b) Do a
majority of Cal Poly professors classify themselves as liberals?
(i)
observational units = people, variable = political self-classification
(ii)
population = Cal Poly professors
(iii)
parameter = proportion of all Cal Poly professors that are liberal
(iv)
sampling frame = staff roster with all faculty members lists
(c) Estimate
the average price paid for textbooks by Cal Poly students this quarter.
(i)
observational units = Cal Poly study, variable = amount paid for books this
quarter
(ii)
population = all Cal Poly students
(iii)
average price paid by all CP students this quarter
(iv)
registration list this quarter
(d) Do college freshmen really tend to gain an
average of 15 pounds during their first term at college?
(i)
observational units = college freshmen, variable = amount of weight gained
(ii)
population = all college freshmen
(iii)
parameter = average amount of weight gained during their first term by all
college freshmen
(iv)
sampling frame = a list of all college freshmen from registration records
3) Activity 5-14 (p. 86) – 10 pts
a. Parts
a and b only have one variable so you cannot distinguish between explanatory
and response variables in these cases (you could consider the one variable a
response variable in each case).
For part c, the explanatory variable is whether their version of the exam asks them to indicate
race; the response variable is score on SAT-like exam.
For part d, the explanatory variables are gender and
race of “customer”; the response variable is price negotiated for the car.
b. The
observational studies are part a (height of American CEOs) and part b (marriage
counselors). The experiments are part c (SAT-like exam given to African
American students) and part d (best prices at car dealerships).
c. Because
parts a and b are observational studies, you cannot draw any cause-and effect
conclusions from either of them. In part a, because the
economist took a random sample of American CEOs, you are probably safe in
generalizing the results to the population of American CEOs.
In part b, you are not told that the psychologist
interviewed a random sample of marriage counselors, so you might hesitate to
generalize these results to any larger population of counselors.
In part c, you should be able to draw a cause-and-effect
conclusion if an effect is found, because a randomized, controlled experiment
was performed. You should be cautious in generalizing your results to African
American college students at similar colleges, however, because you were not
told how the 200 students were selected for the study.
In part d, if a significant difference is found in
average price among the four types of customers, you should be able to
attribute the difference to race, gender, or both because you used a comparative,
randomized experiment. You should be cautious about generalizing these results
because you are not told how the 10 dealerships were selected, and they were
all apparently in the same city.
4) (Topics 4 and 5, 7 pts)
(a) The
observational units are pedestrians downtown
Response
variable is whether or not the person agreed to hold her hand (categorical and
binary)
(b) Answers will vary but one example is whether she approached from the left or the
right. The researcher could flip a
coin to decide before approaching the pedestrian.
(c) Random
sampling would be involved if she got a list of all pedestrians, assigned them
numbers, randomly selected a set of numbers, and then only approached those
individuals. So random sampling would deal with how those in the study were
selected from the population.
Random
assignment would be involved if she randomly assigned them to different
treatment conditions as in (b).
(d) Answers
will vary. One possibility is people
crossing the street in the daytime in downtown SLO? Probably not willing to say
all Californians or even all SLO residents but can probably say something
beyond just the people interviewed or even people there that evening.
5) (Topic 6, 6 pts)
(a)
conjecture
(b)
|
|
Positive test |
Negative test |
Total |
|
Carries AIDS virus |
(2) 4885 |
(2) 115 |
(1) 5000 |
|
Does not carry AIDS |
(3)73630 |
(3) 921370 |
(1) 995,000 |
|
Total |
(4) 78515 |
(4) 921485 |
1,000,000 |
Now calculate from the table the
proportion of people with positive tests that actually carry the aids virus.
4885/78515 = .062
(c) compare to conjecture
(d) .062 / (115/921485) = 496.8
You are almost 500 times more likely
to have AIDS if you have a positive test than if you test negative. Of course
500 times a very small number is still pretty small.