Hypothesis
Hypothesis
Hypothesis
Hypothesis Tests
Hypothesis Testing
• Hypothesis testing can be used to determine whether a statement about the
value of a population parameter should or should not be rejected.
• The null hypothesis, denoted by H0 , is a tentative assumption about a
population parameter.
• The alternative hypothesis, denoted by Ha, is the opposite of what is stated
in the null hypothesis.
• The hypothesis testing procedure uses data from a sample to test the two
competing statements indicated by H0 and Ha.
Objective of Hypothesis Testing
The objective of hypothesis testing is to either reject or
retain a null hypothesis.
• Children who drink the health drink Complan (a health drink owned by the
company Heinz in India) are likely to grow taller
• If you drink Horlicks, you can grow taller, stronger, and sharper (3 in 1).
• Beautiful people are likely to have girl child (Miller and Kanazawa, 2007).
Description of Hypothesis
Hypotheses are claims that are usually stated in simple
words as listed below:
• Average annual salary of machine learning experts is
different for males and females.
• On average people with Ph.D. in analytics earn more
than people with Ph.D. in engineering.
• The average box-office collection of comedy genre
movies is more than that of action movies.
• Average life of vegetarians is more than meat eaters.
• Proportion of married people defaulting on loan
repayment is less than proportion of singles defaulting
on loan repayment.
Null and Alternative Hypothesis
analytics.
engineering.
H0: m 100,000
HA: m > 100,000
Criteria Decision
Z-statistic = X
/ n
• The critical value in this case will depend on the significance
value and whether it is a one-tailed or two-tailed test
Condition for rejection of null hypothesis H0
16 16 30 37 25 22 19 35 27 32
34 28 24 35 24 21 32 29 24 35
28 29 18 31 28 33 32 24 25 22
21 27 41 23 23 16 24 38 26 28
Solution
Null and alternative hypotheses in this case are given by
H0: 30
HA: < 30
From the data in Table 6.6, the estimated sample mean is
27.05 days.
The standard deviation of the sampling distribution is 12.5.
This is left tail problem.
� 12.5
� = 30, � = 27.05, � = 12.5, � = 40, �� ̅ = =
� 40
12.5
P-value= norm.dist(27.05,12.5, 40
,1)= 0.067772
Solution Continued…
• = 0.05
• The p-value is greater than the value of . So, we fail to reject the null
hypothesis.
• That is, there is no strong evidence against null hypothesis so we
retain the null hypothesis, which is 30
Example
According to the company IQ Research, the average
Intelligence Quotient (IQ) of Indians is 82 derived based
on a research carried out by Professor Richard Lynn, a
British Professor of Psychology, using the data collected
from 2002 to 2006 (Source: IQ Research). The population
standard deviation of IQ is estimated as 11.03. Based on
a sample of 100 people from India, the sample IQ was
estimated as 84.
(a) Conduct an appropriate hypothesis test at = 0.05 to
validate the claim of IQ Research (that average IQ of
Indians is 82).
(b) Ministry of education believes that the IQ is more
than 82. If the actual IQ (population mean) of Indians is
86, calculate the Type II error and the power of
hypothesis test.
Solution
a)Hypothesis test: It is given that = 82, = 11.03, n = 100, and
X =84. � = � = 11.03
�̅ � 100
The null and alternative hypotheses in this case are:
H0: = 82
HA: 82
Since the direction of alternative hypothesis is both ways, we have a two-
tailed t-test.
X is 84 which is greater than = 82. So, we have to find right side area
for sample information. According to confidence level=95% the right side
area α/2=5%/2=0.025
P-value=0.034898, p-value > α/2. So, fail to reject null hypothesis.
Statistic, critical values, and the rejection region
Hypothesis Testing for Population Proportion:
Z-Test for Proportion
According to the central limit theorem of proportions,
the sampling distribution of proportions p̂ for a large
sample follows an approximate normal distribution with
mean p (the population proportion) and standard
deviation p ( 1 n p ).
601 627 330 364 562 353 583 254 528 470
408 601 593 729 402 530 708 599 439 762
292 636 444 286 636 667 252 335 457 632
Solution
It is given that the production cost of Bollywood movies
follows a normal distribution; however, the standard
deviation of the population is not known and we need to
estimate the standard deviation value from the sample.
Thus, we have to use the t-test for testing the hypothesis.
From the sample data in Table we get the following values:
n = 40, X =429.55, and S = 195.0337
X 429.55 500
t - statistic 2.2845
S / n 195.0337 / 40
X 1 9 .5 1 6 .8
t 2 .8 9 2 7
S / n 6 .6 / 50
Solution Continued…
• The critical t-value for two-tailed t-test when = 0.01 and
degrees of freedom = 49 is 2.67
• Since the calculated t-statistic value is greater than the t-critical
value, we reject the null hypothesis. The corresponding p-value is
0.0057. The values of t-statistic, t-critical value, rejection and
retention regions are shown in Figure
One-Tailed Tests About a Population Mean: σ
Known
Example: Metro EMS
The EMS director wants to perform a hypothesis test, with a .05 level of
significance, to determine whether the service goal of 12 minutes or less is
being achieved.
56
One-Tailed Tests About a Population Mean: σ
Known
1. Develop the hypotheses.
p-value = = 0.0068
p-value = = 0.0062
The locations where H0 is rejected are deemed the best locations for radar
traps. At Location F, a sample of 64 vehicles shows a mean speed of 66.2 mph
with a standard deviation of 4.2 mph. Use α = 0.05 to test the hypothesis.
One-Tailed Test About a Population Mean: σ
Unknown
1. Develop the hypotheses.
For a Christmas and New Year’s week, the National Safety Council estimated
that 500 people would be killed and 25,000 injured on the nation’s roads. The
NSC claimed that 50% of the accidents would be caused by drunk driving.
A sample of 120 accidents showed that 67 were caused by drunk driving. Use
these data to test the NSC’s claim with α = 0.05.
Two-Tailed Test About a Population Proportion
Two-Tailed Test About a Population Proportion
p –Value Approach
4. Compute the p –value.