Hypothesis Testing

Chapter 8
Hypothesis Testing
with Two Samples
8.1
Testing the
Difference Between
Means
(Large
Independent
Samples)
Two Sample Hypothesis

Testing
In a two-sample hypothesis test, two parameters
from two populations are compared.
For a two-sample hypothesis test,
1. the null hypothesis H0 is a statistical hypothesis that
usually states there is no difference between the
parameters of two populations. The null hypothesis
always contains the symbol , =, or .
2. the alternative hypothesis Ha is a statistical
hypothesis that is true when H0 is false. The alternative
hypothesis always contains the symbol >, , or <.
Larson & Farber, Elementary Statistics: Picturing the World, 3e
Two Sample Hypothesis

Testing
To write a null and alternative hypothesis for a twosample hypothesis test, translate the claim made
about the population parameters from a verbal
statement to a mathematical statement.
H0: 1 =
2
H a: 1 2
H0: 1 2
Ha: 1 > 2
Ha: 1 < 2
H0: 1
Regardless of which hypotheses used, 1 = 2 is

always assumed to be true.
Two Sample z-Test

Three conditions are necessary to perform a z-test
for the difference between two population means 1
and 2.
1. The samples must be randomly selected.
2. The samples must be independent. Two
samples are independent if the sample
selected from one population is not related to
the sample selected from the second
population.
3. Each sample size must be at least 30, or, if not,
each population must have a normal
distribution with a known standard deviation.
Two Sample z-Test

If these requirements are met, the sampling
x1 x2 for
distribution
(the difference of the sample
means) is a normal distribution with mean and
standard error of
x x x x 1 2
1
and
x x
1
2
2
x2 x2 1 2 .
n1 n2
1
Sampling distribution
for x1 x2
x x
1
1 2
2
x x
1
x1 x2
Two Sample z-Test

Two-Sample z-Test for the Difference Between
Means
A two-sample z-test can be used to test the difference
between two population means 1 and 2 when a
large sample (at least 30) is randomly selected from
x1 x2 and the samples are independent.
each population
The test xstatistic
is
and the standardized
test
2
2
2
1
2
z is1
where x x 1 2 .
statistic
x x
n1 n2
1
When the samples are large, you can use s1 and s2 in

place of 1 and 2. If the samples are not large, you
can still use a two-sample z-test, provided the
Two Sample z-Test for the

Means
Using a Two-Sample z-Test for the Difference
Between Means (Large Independent Samples)
In Words
In Symbols
1. State the claim mathematically.
Identify the null and
alternative hypotheses.
State H0 and Ha.
2. Specify the level of

significance.
Identify .
3. Sketch the sampling

distribution.
Use Table 4 in
Appendix B.
4. Determine the critical value(s).

5. Determine the rejection
regions(s).
Continued.
8

Means
Between Means (Large Independent Samples)
In Words
In Symbols
x1 x2 1
6. Find the standardized test

statistic.
7. Make a decision to reject or fail

to reject the null hypothesis.
If z is in the
rejection region,
reject H0.
Otherwise, fail to
reject H0.
8. Interpret the decision in the

context of the original claim.
x x
1

Means
Example:
A high school math teacher claims that students in
her class will score higher on the math portion of the
ACT than students in a colleagues math class. The
mean ACT math score for 49 students in her class is
22.1 and the standard deviation is 4.8. The mean
ACT math score for 44 of the colleagues students is
19.8 and the standard deviation is 5.4. At = 0.10,
can the teachers claim be supported?
H0: 1 2
=
0.10
Ha: 1 > 2 (Claim)

-3 -2
-1
z0 = 1.28
Continued.
10

Means
Example continued:
H0: 1 2
Ha: 1 > 2 (Claim)
z0 = 1.28
-3 -2
-1
The standardized error is

x x
1
2
2
12 22
4.8
5.4
1.0644.
n1 n2
49
44
Reject H0.
The standardized test statistic is

x1 x
2 1
z
x x
1
22.1 19.8 0
2.161
1.0644
There is enough evidence at the 10% level to support th

teachers claim that her students score better on the
ACT.
11
8.2
Testing the
Difference Between
Means
(Small
Independent
Samples)
Two Sample t-Test

If samples of size less than 30 are taken from
normally-distributed populations, a t-test may be
used to test the difference between the population
means 1 and 2.
Three conditions are necessary to use a t-test for
small independent samples.
2. The samples must be independent. Two
samples are independent if the sample
selected from one population is not related to
the sample selected from the second
population.
3. Each population must have a normal
13
Two Sample t-Test

Two-Sample t-Test for the Difference Between
Means
A two-sample t-test is used to test the difference
between two population means 1 and 2 when a sample is
randomly selected from each population. Performing this
test requires each population to be normally distributed,
and the samples
x1 x
1 2 be independent. The
2 should
t
.
standardized test
x xstatistic is
1
.
If the population variances
are equal, then information
from the two
samples is combined to calculate a pooled
n1 1 s12 n2 1 s22
of the standard deviation

estimate
n n 2
1
Continued.
14
Two Sample t-Test

Two-Sample t-Test (Continued)
The standard error for the sampling distribution xof
1 x2
is
1 1
x x
Variances equal
n n
1
and d.f.= n1 + n2 2.
If the population variances are not equal, then the standard
error is
s12 s22
x x
Variances not equal

n1 n2
1
and d.f = smaller of n1 1 or n2 1.

15
Normal or t-Distribution?
Are both sample sizes
at least 30?
Yes
Use the z-test.
No
Are both populations

normally distributed?
No
Yes
Are both population

standard deviations
known?
No
You cannot use

the z-test or the ttest.
Are the population
variances equal?
No
Yes
Use the z-test.
Use the t-test

with
1 1
Yes x x n n
1
2
and d.f =
1
n1 + n2 2.
Use the t-test with

x x
1
s12 s22
n1 n2
and d.f = smaller of n1 1

or n2 1.
16
Two Sample t-Test for the

Means
Using a Two-Sample t-Test for the Difference
Between Means (Small Independent Samples)
In Words
In Symbols
State H0 and Ha.

significance.
Identify .
3. Identify the degrees of freedom

and sketch the sampling
distribution.
d.f. = n1+ n2 2 or
d.f. = smaller of n1
1 or n2 1.
Use Table 5 in
Appendix B.
Continued.
17

Means
In Words
In Symbols
regions(s).
statistic.
x1 x2 1
x x
1
If t is in the
rejection region,
reject H0.
Otherwise, fail to
reject H0.

context ofLarson
the& original
claim.
Farber, Elementary
Statistics: Picturing the World, 3e
18

Means
Example:
A random sample of 17 police officers in Brownsville
has a mean annual income of $35,800 and a standard
deviation of $7,800. In Greensville, a random sample
of 18 police officers has a mean annual income of
$35,100 and a standard deviation of $7,375. Test the
claim at = 0.01 that the mean annual incomes in
the two cities are not the same. Assume the
population variances are equal.
H0: 1 = 2
Ha: 1 2 (Claim)
= 0.005
= 0.005
t
-3 -2 -1
0
1
2
3
d.f. = n1 + n2 2
t0 = 2.576
t0 = 2.576
= 17 + 18 2 =
Continued.
33
19

Means
Example continued:
H0: 1 = 2
Ha: 1 2 (Claim)
-3 -2
-1
t0 = 2.576
t0 = 2.576
The standardized error is

x x
1
1 1

n1 n2
n1 1 s12 n2 1 s22
n1 n2 2
1 1
n1 n2
17 1 78002 18 1 73752
17 18 2
1 1
17 18
7584.0355(0.3382)
2564.92
Continued.
20

Means
Example continued:
H0: 1 = 2
Ha: 1 2 (Claim)
-3 -2
-1
t0 = 2.576
t0 = 2.576

t
x1 x
2 1
x x
1
35800 35100 0
2564.92
0.273
Fail to reject H0.

There is not enough evidence at the 1% level to
support the claim that the mean annual incomes
differ.
21
8.3
Testing the
Difference Between
Means (Dependent
Samples)
Independent and Dependent

Samples
Two samples are independent if the sample selected
from one population is not related to the sample
selected from the second population. Two samples
are dependent if each member of one sample
corresponds to a member of the other sample.
Dependent samples are also called paired samples
or matched samples.
Independent
Samples
Dependent Samples
23
Independent and Dependent

Samples
Example:
Classify each pair of samples as independent or
dependent.
Sample 1: The weight of 24 students in a first-grade clas
Sample 2: The height of the same 24 students
These samples are dependent because the weight
and height can be paired with respect to each
student.
Sample 1: The average price of 15 new trucks
Sample 2: The average price of 20 used sedans
These samples are independent because it is not
possible to pair the new trucks with the used
sedans. The data represents prices for different
vehicles.Larson & Farber, Elementary Statistics: Picturing the World, 3e
24
t-Test for the Difference Between

Means
To perform a two-sample hypothesis test with
dependent samples, the difference between each
data pair is first found:
d = x 1 x2
Difference between entries for a data pair
d
The test statistic is the mean
of these differences.
Mean of the differences between
d
d
.
paired data entries in the dependent
n
samples.
Three conditions are required to conduct the test.
25

Means
2. The samples must be dependent (paired).
3. Both populations must be normally distributed.
If these conditions are met, then the sampling
d
distribution
for
is approximated by a t-distribution
with n 1 degrees of freedom, where n is the number
of data pairs.
t0
t0
26

Means
The following symbols are used for the t-test for
d.
Symb
ol
Description
n
d
d
The number of pairs of data
The mean of the differences between the paired data

entries in the dependent samples
d
d
n
The standard deviation of the differences between
the paired data entries in the dependent samples
sd
The difference between entries for a data pair, d = x1

The
x2 hypothesized mean of the differences of paired
data in the population
n(d 2) d
sd
n(n 1)
27

Means
t-Test for the Difference Between Means
A t-test can be used to test the difference of two
population means when a sample is randomly selected
from each population. The requirements for performing
the test are that each population must be normal and each
member of the first sample must be paired with a member
of the second sample.
d
The test
d statistic is
n
d d
and thet standardized
test statistic is
.
sd
The degrees of freedom are

d.f. = n 1.
28

Means
Using the t-Test for the Difference Between
Means (Dependent Samples)
In Words
In Symbols
State H0 and Ha.

significance.
Identify .
3. Identify the degrees of freedom

and sketch the sampling
distribution.
d.f. = n 1
Use Table 5 in
Appendix B.
Continued.
29

Means
In Words
In Symbols
region(s).
d
sd .
6. Calculate and
Use a table.
d
n
n(d 2) (d )2
sd
n(n 1)
t

statistic.
d d
sd n
30

Means
In Words
In Symbols
8. Make a decision to reject or
fail to reject the null
hypothesis.
If t is in the
rejection region,
reject H0.
Otherwise, fail to
reject H0.

31

Means
Example:
A reading center claims that students will perform
better on a standardized reading test after going
through the reading course offered by their center.
The table shows the reading scores of 6 students
before and after the course. At = 0.05, is there
enough evidence to conclude that the students
scores after the course are better than the scores
before Student
the course? 1 2 3 4 5 6
Score (before)
85 96 70 76 81 78
Score (after)
88 85 89 86 92 89
H0: d 0
Ha: d > 0 (Claim)
Continued.
32

Means
Example continued:
H0: d 0
d.f. = 6 1 = 5
=
0.05
Ha: d > 0 (Claim)

d = (score before) (score after)
Student
1
2
Score
85 96
(before)
Score (after) 88 85
d
3 11
3
70
4
76
-3 -2
5
81
-1
6
78
t0 = 2.015
89 86 92 89 d 43
2
1 1 1 1 d 833
9
0
1
1
2 d
43 9 12 36 10 12 12
d
7.167
d
6
n
1
1
0
1
1
n( d 2) ( d )2 6(833) 1849 104.967 10.245
sd
6(5)
n(n 1)
Continued.
33

Means
Example continued:
H0: d 0
Ha: d > 0 (Claim)
-3 -2
-1

d d 7.167 0
t
1.714.
sd n 10.245 6
t0 = 2.015
Fail to reject H0.

support the claim that the students scores after the
course are better than the scores before the course.
34
8.4
Testing the
Difference Between
Proportions
Two Sample z-Test for

Proportions
A z-test is used to test the difference between two
population proportions, p1 and p2.
Three conditions are required to conduct the test.
2. The samples must be independent.
3. The samples must be large enough to use a
normal sampling distribution. That is,
n1p1 5, n1q1 5,
n2p2 5, and n2q2 5.
36

Proportions
If these conditions are met, then the sampling
p1 p2 for
distribution
is a normal distribution with
mean
p p p1 p2
1
and standard error

p p
1
pq
1
1
, where q 1 p.
n1 n21
A weighted estimate of p1 and p2 can be found by usin

x1 x2
p
, where x1 n1p1 and x2 n2p2.
n1 n2
37

Proportions
Two Sample z-Test for the Difference Between
Proportions
A two sample z-test is used to test the difference between
two population proportions p1 and p2 when a sample is
randomly selected from each population.
p1 statistic
p1
The test
is
(p p2) (p1 p2) test statistic is
and the
standardized
z 1
pq
1 1

n
1 n2
wherep x1 x2 and q 1 p.
n1 n2
Note:
n1p, n1q , n2p, and n2q
must be at least 5.
38

Proportions
Between Proportions
In Words
In Symbols
1. State the claim. Identify the
null
and alternative
hypotheses.
significance.
State H0 and Ha.

Identify .
Use Table 4 in
Appendix B.

region(s).
x1 x2
p
n1 n2
5. Find the weighted

estimate of
Continued.
39

Proportions
Between Proportions
In Words
statistic.

In Symbols
z
(p1 p2) (p1 p2)

pq
1 1

n
1 n2
If z is in the
rejection region,
reject H0.
Otherwise, fail to
reject H0.
40

Proportions
Example:
A recent survey stated that male college students
smoke less than female college students. In a survey
of 1245 male students, 361 said they smoke at least
one pack of cigarettes a day. In a survey of 1065
female students, 341 said they smoke at least one
pack a day. At = 0.01, can you support the claim
that the proportion of male college students who
smoke at least one pack of cigarettes a day is lower
then the proportion of female college students who
smoke at least one pack a day?
=
H0: p1 p2
0.01
Ha: p1 < p2 (Claim)
-3 -2 -1
z0 = 2.33
Continued.
41

Proportions
Example continued:
H0: p1 p2
Ha: p1 < p2 (Claim)
-3 -2 -1
z0 = 2.33
x1 x 2 n1p1 n2p2
361 341
702
0.304
n1 n2
n1 n2
1245 1065 2310
q 1 p 1 0.304 0.696
Because 1245(0.304), 1245(0.696), 1065(0.304), and
1065(0.696) are all at least 5, we can use a two-sample ztest.
Continued.
42

Proportions
Example continued:
H0: p1 p2
Ha: p1 < p2 (Claim)
z
(p1 p2) (p1 p2)

pq
1 1

n
1 n2
-3 -2 -1
z0 = 2.33
(0.29 0.32) 0
1.56
1
1
(0.304)(0.696)
1245 1065
Fail to reject H0.

support the claim that the proportion of male college
students who smoke is lower then the proportion of
female college students who smoke.
43

Hypothesis Testing

Uploaded by

Copyright:

Available Formats

Hypothesis Testing

Uploaded by

Document Information

Original Description:

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Hypothesis Testing

Uploaded by

Copyright:

Available Formats

Chapter 8

Two Sample Hypothesis

Larson & Farber, Elementary Statistics: Picturing the World, 3e

Two Sample Hypothesis

Regardless of which hypotheses used, 1 = 2 is

Larson & Farber, Elementary Statistics: Picturing the World, 3e

Two Sample z-Test

Two Sample z-Test

Larson & Farber, Elementary Statistics: Picturing the World, 3e

Two Sample z-Test

When the samples are large, you can use s1 and s2 in

Two Sample z-Test for the

State H0 and Ha.

2. Specify the level of

3. Sketch the sampling

4. Determine the critical value(s).

Two Sample z-Test for the

6. Find the standardized test

7. Make a decision to reject or fail

8. Interpret the decision in the

Larson & Farber, Elementary Statistics: Picturing the World, 3e

Two Sample z-Test for the

Ha: 1 > 2 (Claim)

Two Sample z-Test for the

The standardized error is

The standardized test statistic is

There is enough evidence at the 10% level to support th

Two Sample t-Test

Larson & Farber, Elementary Statistics: Picturing the World, 3e

Two Sample t-Test

of the standard deviation

Larson & Farber, Elementary Statistics: Picturing the World, 3e

Two Sample t-Test

Variances not equal

and d.f = smaller of n1 1 or n2 1.

Use the z-test.

Are both populations

Are both population

You cannot use

Use the z-test.

Use the t-test

Use the t-test with

and d.f = smaller of n1 1

Two Sample t-Test for the

State H0 and Ha.

2. Specify the level of

3. Identify the degrees of freedom

Larson & Farber, Elementary Statistics: Picturing the World, 3e

Two Sample t-Test for the

8. Interpret the decision in the

Two Sample t-Test for the

Two Sample t-Test for the

The standardized error is

Two Sample t-Test for the

The standardized test statistic is

Fail to reject H0.

Independent and Dependent

Larson & Farber, Elementary Statistics: Picturing the World, 3e

Independent and Dependent