Hypothesis Testing

Download as ppt, pdf, or txt
Download as ppt, pdf, or txt
You are on page 1of 43

Chapter 8

Hypothesis Testing
with Two Samples

8.1

Testing the
Difference Between
Means
(Large
Independent
Samples)

Two Sample Hypothesis


Testing
In a two-sample hypothesis test, two parameters
from two populations are compared.
For a two-sample hypothesis test,
1. the null hypothesis H0 is a statistical hypothesis that
usually states there is no difference between the
parameters of two populations. The null hypothesis
always contains the symbol , =, or .
2. the alternative hypothesis Ha is a statistical
hypothesis that is true when H0 is false. The alternative
hypothesis always contains the symbol >, , or <.

Larson & Farber, Elementary Statistics: Picturing the World, 3e

Two Sample Hypothesis


Testing
To write a null and alternative hypothesis for a twosample hypothesis test, translate the claim made
about the population parameters from a verbal
statement to a mathematical statement.
H0: 1 =
2
H a: 1 2

H0: 1 2

Ha: 1 > 2

Ha: 1 < 2

H0: 1

Regardless of which hypotheses used, 1 = 2 is


always assumed to be true.

Larson & Farber, Elementary Statistics: Picturing the World, 3e

Two Sample z-Test


Three conditions are necessary to perform a z-test
for the difference between two population means 1
and 2.
1. The samples must be randomly selected.
2. The samples must be independent. Two
samples are independent if the sample
selected from one population is not related to
the sample selected from the second
population.
3. Each sample size must be at least 30, or, if not,
each population must have a normal
distribution with a known standard deviation.
Larson & Farber, Elementary Statistics: Picturing the World, 3e

Two Sample z-Test


If these requirements are met, the sampling
x1 x2 for
distribution
(the difference of the sample
means) is a normal distribution with mean and
standard error of
x x x x 1 2
1

and
x x
1

2
2

x2 x2 1 2 .
n1 n2
1

Sampling distribution
for x1 x2

x x
1

1 2
2

Larson & Farber, Elementary Statistics: Picturing the World, 3e

x x
1

x1 x2

Two Sample z-Test


Two-Sample z-Test for the Difference Between
Means
A two-sample z-test can be used to test the difference
between two population means 1 and 2 when a
large sample (at least 30) is randomly selected from
x1 x2 and the samples are independent.
each population
The test xstatistic
is
and the standardized
test
2
2

2
1
2
z is1
where x x 1 2 .
statistic
x x
n1 n2
1

When the samples are large, you can use s1 and s2 in


place of 1 and 2. If the samples are not large, you
can still use a two-sample z-test, provided the
Larson & Farber, Elementary Statistics: Picturing the World, 3e

Two Sample z-Test for the


Means
Using a Two-Sample z-Test for the Difference
Between Means (Large Independent Samples)
In Words
In Symbols
1. State the claim mathematically.
Identify the null and
alternative hypotheses.

State H0 and Ha.

2. Specify the level of


significance.

Identify .

3. Sketch the sampling


distribution.

Use Table 4 in
Appendix B.

4. Determine the critical value(s).


5. Determine the rejection
regions(s).
Larson & Farber, Elementary Statistics: Picturing the World, 3e

Continued.
8

Two Sample z-Test for the


Means
Using a Two-Sample z-Test for the Difference
Between Means (Large Independent Samples)
In Words
In Symbols

x1 x2 1

6. Find the standardized test


statistic.

7. Make a decision to reject or fail


to reject the null hypothesis.

If z is in the
rejection region,
reject H0.
Otherwise, fail to
reject H0.

8. Interpret the decision in the


context of the original claim.

x x
1

Larson & Farber, Elementary Statistics: Picturing the World, 3e

Two Sample z-Test for the


Means

Example:
A high school math teacher claims that students in
her class will score higher on the math portion of the
ACT than students in a colleagues math class. The
mean ACT math score for 49 students in her class is
22.1 and the standard deviation is 4.8. The mean
ACT math score for 44 of the colleagues students is
19.8 and the standard deviation is 5.4. At = 0.10,
can the teachers claim be supported?
H0: 1 2
=
0.10

Ha: 1 > 2 (Claim)


-3 -2

-1

z0 = 1.28
Larson & Farber, Elementary Statistics: Picturing the World, 3e

Continued.
10

Two Sample z-Test for the


Means

Example continued:

H0: 1 2
Ha: 1 > 2 (Claim)

z0 = 1.28
-3 -2

-1

The standardized error is


x x
1

2
2
12 22
4.8
5.4

1.0644.

n1 n2
49
44

Reject H0.

The standardized test statistic is


x1 x

2 1
z
x x
1

22.1 19.8 0

2.161

1.0644

There is enough evidence at the 10% level to support th


teachers claim that her students score better on the
ACT.
Larson & Farber, Elementary Statistics: Picturing the World, 3e

11

8.2

Testing the
Difference Between
Means
(Small
Independent
Samples)

Two Sample t-Test


If samples of size less than 30 are taken from
normally-distributed populations, a t-test may be
used to test the difference between the population
means 1 and 2.
Three conditions are necessary to use a t-test for
small independent samples.
1. The samples must be randomly selected.
2. The samples must be independent. Two
samples are independent if the sample
selected from one population is not related to
the sample selected from the second
population.
3. Each population must have a normal

Larson & Farber, Elementary Statistics: Picturing the World, 3e

13

Two Sample t-Test


Two-Sample t-Test for the Difference Between
Means
A two-sample t-test is used to test the difference
between two population means 1 and 2 when a sample is
randomly selected from each population. Performing this
test requires each population to be normally distributed,
and the samples
x1 x
1 2 be independent. The

2 should
t
.
standardized test
x xstatistic is
1

.
If the population variances
are equal, then information
from the two
samples is combined to calculate a pooled
n1 1 s12 n2 1 s22

of the standard deviation


estimate
n n 2
1

Larson & Farber, Elementary Statistics: Picturing the World, 3e

Continued.
14

Two Sample t-Test


Two-Sample t-Test (Continued)
The standard error for the sampling distribution xof
1 x2
is
1 1
x x

Variances equal
n n
1

and d.f.= n1 + n2 2.
If the population variances are not equal, then the standard
error is
s12 s22
x x

Variances not equal


n1 n2
1

and d.f = smaller of n1 1 or n2 1.


Larson & Farber, Elementary Statistics: Picturing the World, 3e

15

Normal or t-Distribution?
Are both sample sizes
at least 30?

Yes

Use the z-test.

No

Are both populations


normally distributed?

No

Yes

Are both population


standard deviations
known?

No

You cannot use


the z-test or the ttest.
Are the population
variances equal?
No

Yes

Use the z-test.

Use the t-test


with
1 1
Yes x x n n
1
2
and d.f =
1

n1 + n2 2.

Use the t-test with


x x
1

s12 s22

n1 n2

and d.f = smaller of n1 1


or n2 1.
Larson & Farber, Elementary Statistics: Picturing the World, 3e

16

Two Sample t-Test for the


Means
Using a Two-Sample t-Test for the Difference
Between Means (Small Independent Samples)
In Words
In Symbols
1. State the claim mathematically.
Identify the null and
alternative hypotheses.

State H0 and Ha.

2. Specify the level of


significance.

Identify .

3. Identify the degrees of freedom


and sketch the sampling
distribution.
4. Determine the critical value(s).

d.f. = n1+ n2 2 or
d.f. = smaller of n1
1 or n2 1.
Use Table 5 in
Appendix B.
Continued.

Larson & Farber, Elementary Statistics: Picturing the World, 3e

17

Two Sample t-Test for the


Means
Using a Two-Sample t-Test for the Difference
Between Means (Small Independent Samples)
In Words
In Symbols
5. Determine the rejection
regions(s).
6. Find the standardized test
statistic.
7. Make a decision to reject or fail
to reject the null hypothesis.

x1 x2 1
x x
1

If t is in the
rejection region,
reject H0.
Otherwise, fail to
reject H0.

8. Interpret the decision in the


context ofLarson
the& original
claim.
Farber, Elementary
Statistics: Picturing the World, 3e

18

Two Sample t-Test for the


Means

Example:
A random sample of 17 police officers in Brownsville
has a mean annual income of $35,800 and a standard
deviation of $7,800. In Greensville, a random sample
of 18 police officers has a mean annual income of
$35,100 and a standard deviation of $7,375. Test the
claim at = 0.01 that the mean annual incomes in
the two cities are not the same. Assume the
population variances are equal.
H0: 1 = 2
Ha: 1 2 (Claim)

= 0.005

= 0.005
t

-3 -2 -1
0
1
2
3
d.f. = n1 + n2 2
t0 = 2.576
t0 = 2.576
= 17 + 18 2 =
Continued.
33
Larson & Farber, Elementary Statistics: Picturing the World, 3e
19

Two Sample t-Test for the


Means

Example continued:
H0: 1 = 2
Ha: 1 2 (Claim)

-3 -2

-1

t0 = 2.576

t0 = 2.576

The standardized error is


x x
1

1 1

n1 n2

n1 1 s12 n2 1 s22
n1 n2 2

1 1

n1 n2

17 1 78002 18 1 73752
17 18 2

1 1

17 18

7584.0355(0.3382)
2564.92
Larson & Farber, Elementary Statistics: Picturing the World, 3e

Continued.
20

Two Sample t-Test for the


Means

Example continued:
H0: 1 = 2
Ha: 1 2 (Claim)

-3 -2

-1

t0 = 2.576

t0 = 2.576

The standardized test statistic is


t

x1 x

2 1

x x
1

35800 35100 0
2564.92

0.273

Fail to reject H0.


There is not enough evidence at the 1% level to
support the claim that the mean annual incomes
differ.
Larson & Farber, Elementary Statistics: Picturing the World, 3e

21

8.3

Testing the
Difference Between
Means (Dependent
Samples)

Independent and Dependent


Samples
Two samples are independent if the sample selected
from one population is not related to the sample
selected from the second population. Two samples
are dependent if each member of one sample
corresponds to a member of the other sample.
Dependent samples are also called paired samples
or matched samples.

Independent
Samples

Dependent Samples

Larson & Farber, Elementary Statistics: Picturing the World, 3e

23

Independent and Dependent


Samples

Example:
Classify each pair of samples as independent or
dependent.
Sample 1: The weight of 24 students in a first-grade clas
Sample 2: The height of the same 24 students
These samples are dependent because the weight
and height can be paired with respect to each
student.
Sample 1: The average price of 15 new trucks
Sample 2: The average price of 20 used sedans
These samples are independent because it is not
possible to pair the new trucks with the used
sedans. The data represents prices for different
vehicles.Larson & Farber, Elementary Statistics: Picturing the World, 3e

24

t-Test for the Difference Between


Means
To perform a two-sample hypothesis test with
dependent samples, the difference between each
data pair is first found:
d = x 1 x2

Difference between entries for a data pair

d
The test statistic is the mean
of these differences.
Mean of the differences between
d
d
.
paired data entries in the dependent
n
samples.

Three conditions are required to conduct the test.

Larson & Farber, Elementary Statistics: Picturing the World, 3e

25

t-Test for the Difference Between


Means
1. The samples must be randomly selected.
2. The samples must be dependent (paired).
3. Both populations must be normally distributed.
If these conditions are met, then the sampling
d
distribution
for
is approximated by a t-distribution
with n 1 degrees of freedom, where n is the number
of data pairs.

t0

t0

Larson & Farber, Elementary Statistics: Picturing the World, 3e

26

t-Test for the Difference Between


Means
The following symbols are used for the t-test for
d.
Symb
ol

Description

n
d
d

The number of pairs of data

The mean of the differences between the paired data


entries in the dependent samples
d
d
n
The standard deviation of the differences between
the paired data entries in the dependent samples

sd

The difference between entries for a data pair, d = x1


The
x2 hypothesized mean of the differences of paired
data in the population

n(d 2) d
sd
n(n 1)

Larson & Farber, Elementary Statistics: Picturing the World, 3e

27

t-Test for the Difference Between


Means
t-Test for the Difference Between Means
A t-test can be used to test the difference of two
population means when a sample is randomly selected
from each population. The requirements for performing
the test are that each population must be normal and each
member of the first sample must be paired with a member
of the second sample.
d
The test
d statistic is
n
d d
and thet standardized
test statistic is

.
sd

The degrees of freedom are


d.f. = n 1.

Larson & Farber, Elementary Statistics: Picturing the World, 3e

28

t-Test for the Difference Between


Means
Using the t-Test for the Difference Between
Means (Dependent Samples)
In Words
In Symbols
1. State the claim mathematically.
Identify the null and
alternative hypotheses.

State H0 and Ha.

2. Specify the level of


significance.

Identify .

3. Identify the degrees of freedom


and sketch the sampling
distribution.
4. Determine the critical value(s).

d.f. = n 1

Use Table 5 in
Appendix B.
Continued.

Larson & Farber, Elementary Statistics: Picturing the World, 3e

29

t-Test for the Difference Between


Means
Using a Two-Sample t-Test for the Difference
Between Means (Small Independent Samples)
In Words
In Symbols
5. Determine the rejection
region(s).
d
sd .
6. Calculate and
Use a table.

d
n

n(d 2) (d )2
sd
n(n 1)
t

7. Find the standardized test


statistic.

d d
sd n

Larson & Farber, Elementary Statistics: Picturing the World, 3e

30

t-Test for the Difference Between


Means
Using a Two-Sample t-Test for the Difference
Between Means (Small Independent Samples)
In Words
In Symbols
8. Make a decision to reject or
fail to reject the null
hypothesis.

If t is in the
rejection region,
reject H0.
Otherwise, fail to
reject H0.

9. Interpret the decision in the


context of the original claim.

Larson & Farber, Elementary Statistics: Picturing the World, 3e

31

t-Test for the Difference Between


Means
Example:
A reading center claims that students will perform
better on a standardized reading test after going
through the reading course offered by their center.
The table shows the reading scores of 6 students
before and after the course. At = 0.05, is there
enough evidence to conclude that the students
scores after the course are better than the scores
before Student
the course? 1 2 3 4 5 6
Score (before)

85 96 70 76 81 78

Score (after)

88 85 89 86 92 89

H0: d 0
Ha: d > 0 (Claim)
Larson & Farber, Elementary Statistics: Picturing the World, 3e

Continued.
32

t-Test for the Difference Between


Means
Example continued:
H0: d 0

d.f. = 6 1 = 5

=
0.05

Ha: d > 0 (Claim)


d = (score before) (score after)
Student
1
2
Score
85 96
(before)
Score (after) 88 85
d
3 11

3
70

4
76

-3 -2

5
81

-1

6
78

t0 = 2.015

89 86 92 89 d 43
2
1 1 1 1 d 833
9
0
1
1
2 d
43 9 12 36 10 12 12
d

7.167
d
6
n
1
1
0
1
1
n( d 2) ( d )2 6(833) 1849 104.967 10.245
sd
6(5)
n(n 1)
Larson & Farber, Elementary Statistics: Picturing the World, 3e

Continued.
33

t-Test for the Difference Between


Means
Example continued:
H0: d 0
Ha: d > 0 (Claim)
-3 -2

-1

The standardized test statistic is


d d 7.167 0
t
1.714.

sd n 10.245 6

t0 = 2.015

Fail to reject H0.


There is not enough evidence at the 5% level to
support the claim that the students scores after the
course are better than the scores before the course.
Larson & Farber, Elementary Statistics: Picturing the World, 3e

34

8.4

Testing the
Difference Between
Proportions

Two Sample z-Test for


Proportions
A z-test is used to test the difference between two
population proportions, p1 and p2.
Three conditions are required to conduct the test.
1. The samples must be randomly selected.
2. The samples must be independent.
3. The samples must be large enough to use a
normal sampling distribution. That is,
n1p1 5, n1q1 5,
n2p2 5, and n2q2 5.
Larson & Farber, Elementary Statistics: Picturing the World, 3e

36

Two Sample z-Test for


Proportions
If these conditions are met, then the sampling
p1 p2 for
distribution
is a normal distribution with
mean
p p p1 p2
1

and standard error


p p
1

pq

1
1
, where q 1 p.
n1 n21

A weighted estimate of p1 and p2 can be found by usin


x1 x2
p
, where x1 n1p1 and x2 n2p2.
n1 n2
Larson & Farber, Elementary Statistics: Picturing the World, 3e

37

Two Sample z-Test for


Proportions
Two Sample z-Test for the Difference Between
Proportions
A two sample z-test is used to test the difference between
two population proportions p1 and p2 when a sample is
randomly selected from each population.
p1 statistic
p1
The test
is
(p p2) (p1 p2) test statistic is
and the
standardized
z 1
pq

1 1

n
1 n2

wherep x1 x2 and q 1 p.
n1 n2

Note:
n1p, n1q , n2p, and n2q
must be at least 5.

Larson & Farber, Elementary Statistics: Picturing the World, 3e

38

Two Sample z-Test for


Proportions
Using a Two-Sample z-Test for the Difference
Between Proportions
In Words
In Symbols
1. State the claim. Identify the
null
and alternative
hypotheses.
2. Specify the level of
significance.

State H0 and Ha.


Identify .
Use Table 4 in
Appendix B.

3. Determine the critical value(s).

4. Determine the rejection


region(s).

x1 x2
p
n1 n2

5. Find the weighted


estimate of
Larson & Farber, Elementary Statistics: Picturing the World, 3e

Continued.
39

Two Sample z-Test for


Proportions
Using a Two-Sample z-Test for the Difference
Between Proportions
In Words
6. Find the standardized test
statistic.

7. Make a decision to reject or fail


to reject the null hypothesis.
8. Interpret the decision in the
context of the original claim.

In Symbols
z

(p1 p2) (p1 p2)


pq

1 1

n
1 n2

If z is in the
rejection region,
reject H0.
Otherwise, fail to
reject H0.

Larson & Farber, Elementary Statistics: Picturing the World, 3e

40

Two Sample z-Test for


Proportions

Example:
A recent survey stated that male college students
smoke less than female college students. In a survey
of 1245 male students, 361 said they smoke at least
one pack of cigarettes a day. In a survey of 1065
female students, 341 said they smoke at least one
pack a day. At = 0.01, can you support the claim
that the proportion of male college students who
smoke at least one pack of cigarettes a day is lower
then the proportion of female college students who
smoke at least one pack a day?
=
H0: p1 p2
0.01

Ha: p1 < p2 (Claim)

-3 -2 -1

z0 = 2.33

Larson & Farber, Elementary Statistics: Picturing the World, 3e

Continued.
41

Two Sample z-Test for


Proportions

Example continued:
H0: p1 p2
Ha: p1 < p2 (Claim)

-3 -2 -1

z0 = 2.33

x1 x 2 n1p1 n2p2
361 341
702

0.304
n1 n2
n1 n2
1245 1065 2310

q 1 p 1 0.304 0.696
Because 1245(0.304), 1245(0.696), 1065(0.304), and
1065(0.696) are all at least 5, we can use a two-sample ztest.
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e

42

Two Sample z-Test for


Proportions

Example continued:
H0: p1 p2
Ha: p1 < p2 (Claim)
z

(p1 p2) (p1 p2)


pq

1 1

n
1 n2

-3 -2 -1

z0 = 2.33

(0.29 0.32) 0
1.56
1
1
(0.304)(0.696)

1245 1065

Fail to reject H0.


There is not enough evidence at the 1% level to
support the claim that the proportion of male college
students who smoke is lower then the proportion of
female college students who smoke.
Larson & Farber, Elementary Statistics: Picturing the World, 3e

43

You might also like