Z Tests and P Values
Z Tests and P Values
Z Tests and P Values
Tests of the true value of an unknown population mean can be either one-tailed (left-
tailed or right-tailed) or two-tailed.
1. two-tailed (non-directional)
H0: µ = µ0 ← a possibility you want to test (null hypothesis)
Ha: µ ≠ µ0 ← what the sample evidence suggests (alternative hypothesis)
Reject H0 if ( x - µ0) is a large positive number or a large negative number
2. left-tailed
H0: µ ≥ µ0} Reject H0 if ( x - µ0) is a large negative number.
Ha: µ < µ0}
3. right-tailed
H0: µ ≤ µ0} Reject H0 if ( x - µ0) is a large positive number.
Ha: µ > µ0}
σ / n = 4.6/ 50
Remember 2: SPSS’ p-values are presented as derived from two-tailed tests. If your
alternative hypothesis being tested reflects a one-tailed test, you must divide the given
SPSS p-value by 2 to reflect the one-tailed nature of your alternative hypothesis. From
example 1 on the previous page, the p-value of .1236, reflecting a two-tailed test, would
be readjusted by .1236 / 2 = one-tailed p-value of .0618.
1. H0: µ = µ o
2. Ha: µ ≠ µ o ; µ < µ o ; µ > µ o
3. Test statistic: Assume that H0 is true and see if you have enough data/evidence to
reject it.
3B. How far sample mean x is from µ
x − µo
z=
σ
n
1. Example:
The level of calcium in the blood of healthy, young adults varies with a mean of 9.5 mg
per deciliter and a SD of 0.4. A clinic in rural Illinois measures the blood calcium level
of 180 healthy pregnant women and finds x = 9.57mg. Is this an indication that the
mean calcium level in this population differs from 9.5mg?
H0: µ = 9.5
Ha: µ ≠ 9.5
x − µ o 9.57 − 9.5 .07
Z= = = 2.33
σ .4 180 .03
n
With alpha = .05 and because this is a two-tailed test (i.e., the = and the ≠ ), the critical
region would consist of a Z score beyond ± 1.96 (note: this is found in the proportion of
the tail, where .0250 is closest to Z = 1.96, so .0250 x 2 = .05).
Thus, with our Z value of 2.33 (look at table), the p-value is determined by .0099
(proportion of the tail section) x 2 (because of the two-tailed nature of this test) = .0198
Conclusion:
At the .05 level, we would reject H0 and say there is enough evidence to show the mean is
different from 9.5. Thus, we have shown that the average level of calcium in the blood of
pregnant Illinois women is different from 9.5 (or all other healthy young adults).
II. Example:
Mike gave the SAT math test to a simple random sample of 500 seniors from Illinois.
These students had a mean score of 461 ( x ). Is this good evidence that the mean for all
Illinois seniors is > 450. σ = 100
H0: µ ≤ 450
Ha: µ > 450
x−µ 461 − 450 11
z = = = = 2.46
σ n 100 500 4.472
With alpha = .05 and because this is a one-tailed test (i.e., the ≤ and the >), the critical
region would consist of a Z score beyond ± 1.645 (i.e., because a Z value of 1.64 and
1.65 are of equal distance in this case, we take the average).
We reject H0 at the .05 level and say that there is a less than 1% chance of getting a
x of 450 or more extreme.
Conclusion:
A Z value of 2.46 indicates that our sample mean is in the critical region and this is a very
unlikely outcome if H0 is true and, thus, the decision to reject H0. The mean test score
on the mathematics portion of the SAT for Illinois seniors is greater than 450.