CH7
CH7
CH7
Learning Objectives
1. Understand the importance of sampling and how results from samples can be used to provide
estimates of population characteristics such as the population mean, the population standard
deviation and / or the population proportion.
2. Know what simple random sampling is and how simple random samples are selected.
4. Understand the central limit theorem and the important role it plays in sampling.
5. Specifically know the characteristics of the sampling distribution of the sample mean ( x ) and the
sampling distribution of the sample proportion ( p ).
6. Learn about a variety of sampling methods including stratified random sampling, cluster sampling,
systematic sampling, convenience sampling and judgment sampling.
7-1
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter 7
Solutions:
c. E and C because 8 and 0 do not apply; 5 identifies E; 7 does not apply; 5 is skipped since E is
already in the sample; 3 identifies C; 2 is not needed since the sample of size 2 is complete.
2. Using the last 3-digits of each 5-digit grouping provides the random numbers:
Numbers greater than 350 do not apply and the 147 can only be used once. Thus, the
simple random sample of four includes 22, 147, 229, and 289.
4. a. 5, 0, 5, 8
N! 10! 3, 628,800
b. 120
n !( N n)! 3!(10 3)! (6)(5040)
5. 283, 610, 39, 254, 568, 353, 602, 421, 638, 164
7. 108, 290, 201, 292, 322, 9, 244, 249, 226, 125, (continuing at the top of column 9) 147, and 113.
9. 102, 115, 122, 290, 447, 351, 157, 498, 55, 165, 528, 25
10. a. Finite population. A frame could be constructed obtaining a list of licensed drivers from the New
York State drivers license bureau.
b. Infinite population. Sampling from a process. The process is the production line producing boxes of
cereal.
c. Infinite population. Sampling from a process. The process is one of generating arrivals to the Golden
Gate Bridge.
d. Finite population. A frame could be constructed by obtaining a listing of students enrolled in the
course from the professor.
e. Infinite population. Sampling from a process. The process is one of generating orders for the mail-
order firm.
7-2
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Sampling and Sampling Distributions
54
11. a. x xi / n 9
6
( xi x ) 2
b. s
n 1
48
s= 31
.
6 1
b. p = 55/150 = .3667
465
13. a. x xi / n 93
5
b.
xi ( xi x ) ( xi x ) 2
94 +1 1
100 +7 49
85 -8 64
94 +1 1
92 -1 1
Totals 465 0 116
( xi x ) 2 116
s 5.39
n 1 4
14. a. Eighteen of the 40 funds in the sample are load funds. Our point estimate is
18
p .45
40
b. Six of the 40 funds in the sample are high risk funds. Our point estimate is
6
p .15
40
c. The below average fund ratings are low and very low. Twelve of the funds have a rating of low and
6 have a rating of very low. Our point estimate is
18
p .45
40
$45,500
15. a. x xi / n $4,550
10
7-3
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter 7
( xi x )2 9,068,620
b. s $1003.80
n 1 10 1
16. a. The sampled population is U. S. adults that are 50 years of age or older.
b. We would use the sample proportion for the estimate of the population proportion.
350
p .8216
426
c. The sample proportion for this issue is .74 and the sample size is 426.
d. We would use the sample proportion for the estimate of the population proportion.
354
p .8310
426
e. The inferences in parts (b) and (d) are being made about the population of U.S. adults who are age
50 or older. So, the population of U.S. adults who are age 50 or older is the target population. The
target population is the same as the sampled population. If the sampled population was restricted to
members of AARP who were 50 years of age or older, the sampled population would not be the
same as the target population. The inferences made in parts (b) and (d) would only be valid if the
population of AARP members age 50 or older was representative of the U.S. population of adults
age 50 and over.
b. 299/999 = .30
c. 291/999 = .29
d. The sampled population is all subscribers to the American Association of Individual Investors
Journal. This is also the target population for the inferences made in parts (a), (b), and (c). There is
no statistical basis for making inferences to a target population of all investors. That is not the group
from which the sample is drawn.
18. a. E ( x ) 200
b. x / n 50 / 100 5
d. It shows the probability distribution of all possible sample means that can be observed with random
samples of size 100. This distribution can be used to compute the probability that x is within a
specified from
E ( x ) = = 200
x / n 50 / 100 5
7-4
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Sampling and Sampling Distributions
x 5
At x = 205, z 1 P( z 1) = .8413
x 5
x 5
At x = 195, z 1 P ( z 1) = .1587
x 5
x 10
At x = 210, z 2 P( z 2) = .9772
x 5
x 10
At x = 190, z 2 P ( z 2) = .0228
x 5
20. x / n
x 25/ 50 3.54
The standard error of the mean decreases as the sample size increases.
21. a. x / n 10 / 50 141
.
b. n / N = 50 / 50,000 = .001
Use x / n 10 / 50 141
.
c. n / N = 50 / 5000 = .01
Use x / n 10 / 50 141
.
7-5
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter 7
d. n / N = 50 / 500 = .10
N n 500 50 10
Use x 134
.
N 1 n 500 1 50
Note: Only case (d) where n /N = .10 requires the use of the finite population correction factor.
x
51,800
E( x )
b. For n = 120, E ( x ) remains $51,800 and the sampling distribution of x can still be approximated
by a normal distribution. However, x is reduced to 4000 / 120 = 365.15.
c. As the sample size is increased, the standard error of the mean, x , is reduced. This appears logical
from the point of view that larger samples should tend to provide sample means that are closer to the
population mean. Thus, the variability in the sample mean, measured in terms of x , should
decrease as the sample size is increased.
4000
23. a. With a sample of size 60 x 516.40
60
52,300 51,800
At x = 52,300, z .97
516.40
51,300 51,800
At x = 51,300, z .97
516.40
7-6
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Sampling and Sampling Distributions
4000
b. x 365.15
120
52,300 51,800
At x = 52,300, z 1.37
365.15
51,300 51,800
At x = 51,300, z 1.37
365.15
x / n 4 / 50 .57
18.5 17.5
At x = 18.5, z 1.75 P(z 1.75) = .9599
.57
18.0 17.5
At x = 18.0, z .88 P(z .88) = .8106
.57
25. x / n 100 / 90 10.54 This value for the standard error can be used for parts (a) and (b)
below.
512 502
a. z .95 P(z .95) = .8289
10.54
492 502
z .95 P(z < -.95) = .1711
10.54
7-7
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter 7
525 515
b. z .95 P(z .95) = .8289
10.54
505 515
z .95 P(z < -.95) = .1711
10.54
The probability of being within 10 of the mean on the Mathematics portion of the test is exactly the
same as the probability of being within 10 on the Critical Reading portion of the SAT. This is
because the standard error is the same in both cases. The fact that the means differ does not affect the
probability calculation.
c. x / n 100 / 100 10.0 The standard error is smaller here because the sample size is larger.
504 494
z 1.00 P(z 1.00) = .8413
10.0
484 494
z 1.00 P(z < -1.00) = .1587
10.0
The probability is larger here than it is in parts (a) and (b) because the larger sample size has made
the standard error smaller.
x 939
26. a. z
/ n
The z value for x - 939 = -25 is just the negative of the z value for x - 939 = 25. So we just show
the computation of z for x - 939 = 25.
25
n = 30 z .56 P(-.56 z .56) = .7123 - .2877 = .4246
245 / 30
25
n = 50 z .72 P(-.72 z .72) = .7642 - .2358 = .5284
245 / 50
25
n = 100 z 1.02 P(-1.02 z 1.02) = .8461 - .1539 = .6922
245 / 100
25
n = 400 z 2.04 P(-2.04 z 2.04) = .9793 - .0207 = .9586
245 / 400
b. A larger sample increases the probability that the sample mean will be within a specified
distance of the population mean. In the automobile insurance example, the probability of
being within 25 of ranges from .4246 for a sample of size 30 to .9586 for a sample of
size 400.
7-8
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Sampling and Sampling Distributions
x 22.18 21.68
At x = 22.18, z 1.54 P(z 1.54) = .9382
/ n .3253
At x = 21.18, z = -1.54
b. x / n 2.05 / 50 .2899
x 19.30 18.80
At x = 19.30, z 1.72 P(z 1.72) = .9573
/ n .2899
c. In part (b) we have a higher probability of obtaining a sample mean within $.50 of the population
mean because the standard error for female graduates (.2899) is smaller than the standard error for
male graduates (.3253).
18.50 18.80
At x = 18.50, z 1.60
.1871
x / n 14 / 30 2.56
98 95 92 95
z 1.17 z 1.17
2.56 2.56
The probability the sample means will be within 3 strokes of the population mean of 95 is .7580.
c. x / n 14 / 45 2.09
7-9
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter 7
The probability the sample means will be within 3 strokes of the population mean of 106 is .8502.
d. The probability of being within 3 strokes for female golfers is higher because the sample size is
larger.
29. = 183 = 50
x 8
z .88
/ n 50 / 30
x 8
z 1.13
/ n 50 / 50
x 8
z 1.60
/ n 50 / 100
d. None of the sample sizes in parts (a), (b), and (c) are large enough. The sample size will need to
be greater than n = 100, which was used in part (c).
30. a. n / N = 40 / 4000 = .01 < .05; therefore, the finite population correction factor is not necessary.
N n 4000 40 8.2
x 129
.
N 1 n 4000 1 40
x / n 130
.
Including the finite population correction factor provides only a slightly different value for x than
when the correction factor is not used.
7 - 10
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Sampling and Sampling Distributions
x 2
c. z 154
. P(z 1.54) = .9382
130
. 130
.
31. a. E( p ) = p = .40
p(1 p) .40(.60)
b. p .0490
n 100
32. a. E( p ) = .40
p(1 p) .40(.60)
p .0346
n 200
p p .03
z .87 P(z .87) = .8078
p .0346
p p .05
b. z 1.44 P(z 1.44) = .9251
p .0346
p(1 p)
33. p
n
(.55)(.45)
p .0497
100
(.55)(.45)
p .0352
200
(.55)(.45)
p .0222
500
7 - 11
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter 7
(.55)(.45)
p .0157
1000
(.30)(.70)
34. a. p .0458
100
p p .04
z .87 P(z .87) = .8078
p .0458
(.30)(.70)
b. p .0324
200
p p .04
z 1.23 P(z 1.23) = .8907
p .0324
(.30)(.70)
c. p .0205
500
p p .04
z 1.95 P(z 1.95) = .9744
p .0205
(.30)(.70)
d. p .0145
1000
p p .04
z 2.76 P(z 2.76) = .9971
p .0145
7 - 12
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Sampling and Sampling Distributions
e. With a larger sample, there is a higher probability p will be within .04 of the population
proportion p.
35. a.
p(1 p) .30(.70)
p .0458
n 100
p
.30
The normal distribution is appropriate because np = 100(.30) = 30 and n(1 - p) = 100(.70) = 70 are
both greater than 5.
b. P (.20 p .40) = ?
.40 .30
z 2.18 P(z 2.18) = .9854
.0458
c. P (.25 p .35) = ?
.35 .30
z 1.09 P(z 1.09) = .8621
.0458
7 - 13
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter 7
e. No, the probabilities are exactly the same. This is because p , the standard error, and the width of
the interval are the same in both cases. Notice the formula for computing the standard error. It
involves p(1 p) . So whenever p = 1 - p the standard error will be the same. In part (b), p = .45
and 1 p = .55. In part (d), p = .55 and 1 p = .45.
The probability is larger than in part (b). This is because the larger sample size has reduced the
standard error from .0352 to .0249.
E ( p ) .12
p p .03
b. z 2.14 P(z 1.94) = .9838
p .0140
7 - 14
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Sampling and Sampling Distributions
p p .015
c. z 1.07 P(z 1.07) = .8577
p .0140
E( p ) = .42
p(1 p) (.42)(.58)
p .0285
n 300
p p .03
b. z 1.05 P(z 1.05) = .8531
p .0285
p p .05
c. z 1.75 P(z 1.75) = .9599
p .0285
d. The probabilities would increase. This is because the increase in the sample size makes the standard
error, p , smaller.
p p .04
b. z 1.96 P(z 1.96) = .9750
p .0204
7 - 15
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter 7
p p .04
d. z 1.31 P(z 1.31) = .9049
.75(1 .75) .0306
200
e. The probability of the sample proportion being within .04 of the population mean was reduced from
.9500 to .8098. So there is a gain in precision by increasing the sample size from 200 to 450. If the
extra cost of using the larger sample size is not too great, we should probably do so.
40. a. E ( p ) = .76
.79 .76
b. z 1.40 P(z 1.40) = .9192
.0214
.79 .76
z 1.92 P(z 1.92) = .9726
.0156
41. a. E( p ) = .17
.19 .17
b. z 1.51 P(z 1.51) = .9345
.0133
7 - 16
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Sampling and Sampling Distributions
.19 .17
z 2.13 P(z 2.13) = .9834
.0094
42. The random numbers corresponding to the first seven universities selected are
The third, fourth and fifth columns of Table 7.1 were needed to find 7 random numbers of 133 or
less without duplicate numbers.
Authors note: The universities identified are: Clarkson U. (122), U. of Arizona (99), UCLA (25),
U. of Maryland (55), U. of New Hampshire (115), Florida State U. (102), Clemson U. (61).
43. a. With n = 100, we can approximate the sampling distribution with a normal distribution having
E( x ) = 8086
2500
x 250
n 100
x 200
b. z .80 P(z .80) = .7881
/ n 2500 / 100
The probability that the sample mean will be within $200 of the population mean is .5762.
9000 8086
c. At 9000, z 3.66
2500 / 100
Yes, the research firm should be questioned. A sample mean this large is extremely unlikely (almost
0 probability) if a simple random sample is taken from a population with a mean of $8086.
E ( x ) = 406
80
x 10
n 64
7 - 17
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter 7
x 15
b. z 1.50 P(z 1.50) = .9332
/ n 80 / 64
x 380 406
c. At x = 380, z 2.60
/ n 80 / 64
Yes, this is an unusually low performing group of 64 stores. The probability of a sample mean
annual sales per square foot of $380 or less is only .0047.
45. With n = 60 the central limit theorem allows us to conclude the sampling distribution is
approximately normal.
a. This means 14 x 16
16 15
At x = 16, z 1.94 P(z 1.94) = .9738
4 / 60
15.75 15
At x = 15.75, z 1.45 P(z 1.45) = .9265
4 / 60
a. x 7400 / 60 955
x 0
b. z 0
x 955
Note: This could have been answered easily without any calculations ; 27,175 is the expected value
of the sampling distribution of x .
x 1000
c. z 1.05 P(z 1.05) = .8531
x 955
7 - 18
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Sampling and Sampling Distributions
x 1000
z 1.35 P(z 1.35) = .9115
x 740
N n
47. a. x
N 1 n
N = 2000
2000 50 144
x 2011
.
2000 1 50
N = 5000
5000 50 144
x 20.26
5000 1 50
N = 10,000
10,000 50 144
x 20.31
10,000 1 50
Note: With n / N .05 for all three cases, common statistical practice would be to ignore
144
the finite population correction factor and use x 20.36 for each case.
50
b. N = 2000
25
z 1.24 P(z 1.24) = .8925
20.11
N = 5000
25
z 1.23 P(z 1.23) = .8907
20.26
7 - 19
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter 7
N = 10,000
25
z 1.23 P(z 1.23) = .8907
20.31
All probabilities are approximately .78 indicating that a sample of size 50 will work well for all 3
firms.
500
48. a. x 20
n n
b. For 25,
25
z 1.25 P(z 1.25) = .8944
20
x
n 30
0.05 0.05
x
1.9 2.1
1.9 + 2.1 = 2
=
2
The area below x = 2.1 must be 1 - .05 = .95. An area of .95 in the standard normal table shows
z = 1.645.
Thus,
2.1 2.0
z 1.645
/ 30
7 - 20
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Sampling and Sampling Distributions
Solve for
(.1) 30
.33
1.645
50. p = .28
p(1 p) (.40)(.60)
51. p .0245
n 400
P ( p .375) = ?
.375 .40
z 1.02 P(z < -1.02) = .1539
.0245
7 - 21
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter 7
b. We want P( p .45)
p p .45 .40
z 1.99
p .0251
p(1 p) (.15)(.85)
p .0292
n 150
b. P (.12 p .18) = ?
.18 .15
z 1.03 P(z 1.03) = .8485
.0292
p(1 p) .25(.75)
54. a. p .0625
n n
Solve for n
.25(.75)
n 48
(.0625) 2
c. P ( p .30) = ?
.30 .25
z .80 P(z .80) = .7881
.0625
7 - 22
2013 Cengage Learning. All Rights Reserved.
May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.