Statistics2 PastExamQuestions
Statistics2 PastExamQuestions
Statistics2 PastExamQuestions
Department of Economics
Chair of Econometrics
Prof. Dr. Christoph Wunder
Problem 1
In a random sample of size n = 621, researchers find that 232 respondents never exercise or do sport.
1.1 Test whether the population proportion of individuals who never exercise or do sport is greater than one-third.
State the hypotheses, rejection rule, test statistic and conclusion (α = 0.05).
1 1
• Hypotheses: H0 : p 6 3 vs. H1 : p > 3
• Rejection rule: reject H0 if Z > zα = z0.05 = 1.645
• Empirical value of test statistic:
232
p̂ = = 0.374
621
• Conclusion: As the empirical value of the test statistic is larger than the critical value,
we reject the null hypothesis that individuals who never exercise or do sport is less than
one-third.
Problem 2
Let x1 , x2 , ..., xn denote a random sample of size n from a uniform population with lower bound a = 0 and upper
bound b. Show that the estimator θ̂ = 2X̄ is an unbiased estimator of b. Use E(Xi ) = b−a
2 as the expected value of
the uniform random variable.
E θ̂ = E (2X̄)
1
= E 2 · ∑ Xi
n
2
= E (Xi )
n∑
2 b
= ∑
n 2
2·n·b
=
2·n
=b
Problem 3
A teacher believes that girls do better on her exams than boys do. A sample of 10 girls (n1 = 10) and 12 boys
(n2 = 12) yields x̄1 = 7, x̄2 = 5.5, s21 = 1, s22 = 1.7. Assume normality of both populations and equal variances.
Carry out an appropriate test for the teacher’s hypothesis. State the hypotheses, degrees of freedom, rejection rule,
empirical value of the test statistic and conclusion (α = 0.05).
• Definition of random variables: X1i is the result in the exam for girl i, i = 1, ..., n1 , X2 j is the
result in the exam for boy j, j = 1, ..., n2 .
• Hypotheses: H0 : µ1 6 µ2 vs. H1 : µ1 > µ2
• Degrees of freedom: n1 + n2 − 2 = 12 + 10 − 2 = 20
• Rejection rule: reject H0 if t > tn1 +n2 −2;α = t20;0.05 = 1.725
• Empirical value of test statistic:
x̄1 − x̄2
t = r
(n1 −1)s21 +(n2 −1)s22
1
n1 + n12 n1 +n2 −2
7 − 5.5
=q (10−1)·1+(12−1)·1.72
1 1
10 + 12 10+12−2
1.5
=q
1 1
9+18.7
10 + 12 20
1.5
=√
0.373
= 2.45
• Conclusion: As the empirical value of the test statistic is greater than the critical value, we
reject the null hypothesis at the 5% level. We conclude that girls do better on her exams than
boys do.
2
Problem 4
Consider an empirical investigation to assess the effectiveness of a training program designed to increase monthly
wages (measured in Euros). If the program is effective, the participants will exhibit higher monthly wages after the
program as compared to before the program. The table below shows data for a random sample of 10 participants.
Do you believe that the program exerts a significant influence (α = 0.10)? Carry out a sign test. State hypotheses,
rejection rule, test statistic, and conclude.
• Hypotheses: H0 : p ≥ 0.5 vs. H1 : p < 0.5, where p = P(X > Y ) is the probability that X (wage before
program) is greater than Y (wage after program).
• Rejection rule: We reject H0 if P(V ≤ v|H0 , n) ≤ α.
• Test statistic:
Problem 5
A sample of size n = 13 from a normal population yields an average of 3 and a standard deviation of 2.
5.1 Calculate the 99% confidence interval for the population mean.
3
• Quantile: tn−1, α2 = t12,0.005 = 3.055
• Confidence interval:
2
ll: 3 − 3.055 · √ = 1.305
13
2
ul: 3 + 3.055 · √ = 4.695
13
5.2 Calculate the 90% confidence interval for the population variance. (5 points)
• Confidence interval:
n−1 2 12
ll: 2
s = · 4 = 2.282
χn−1, α 21.026
2
n−1 12
ul: s2 = · 4 = 9.184
χ2n−1,1− α 5.226
2
Problem 6
The table below shows summary statistics on hourly wages (in euros) before and after participating in an on-the-
job training program for n = 10 workers.
• Decision: As the test statistic is higher than the critical value, we reject the null hypothesis
of an average difference of zero.
4
Problem 7
The table below shows summary statistics of life satisfaction for men and women:
Assume that life satisfaction is normally distributed in the populations. Carry out a test for equality of the two
population variances. State the hypotheses, degrees of freedom, rejection rule, empirical value of the test statistic,
and conclude (α = 0.05).
• Hypotheses: H0 : σ21 = σ22 vs. σ21 6= σ22 , where σ21 and σ22 are the population variances for men and
women, respectively.
• Degrees of freedom: n1 − 1 = 11 − 1 = 10, n2 − 1 = 10 − 1 = 9
1 1
• Rejection rule: we reject H0 if F > Fn1 −1;n2 −1, α2 = F10,9,0.025 = 3.96 or if F < Fn −1;n −1, α = F9,10,0.025 = 0.26
2 1 2
s21 1.002
F= = = 0.6504
s22 1.242
• Decision: As the test statistic is in the acceptance area, we are unable to reject the null
hypothesis that the population variances are equal.
Problem 8
A business magazine claims that the average market value of all pharma companies in the US is less than 250
million euros. A random sample of 30 firms reveals a sample mean of 235 million euros and a standard deviation
of 85 million euros. Carry out an appropriate test for the magazine’s claim. State the hypotheses, degrees of
freedom, rejection rule, empirical value of the test statistic, and conclude (α = 0.10).
• Decision: As the test statistic is not lower than the negative critical value, we can not reject
the null hypothesis that the average market value of all pharma companies in the US is more than
or equal 250 million euros.
Problem 9
Consider a TV debate between two political parties, left (l) and right (r). The order of speakers is as follows:
5
speaker 1 2 3 4 5 6 7 8 9 10
political party l l l r r r r r l l
Do you believe that the order in which speakers of the political parties make their appearance is random? Carry
out a runs test. Give the hypotheses, rejection rule, test statistic, and conclusions (α = 0.05).
• Since 0.040 > 0.025 and 0.992 > 0.025, one cannot reject the H0 that the samples are generated
randomly.
Problem 10
Statisticians want to estimate the proportion of immigrants in Germany with an accuracy of ± 0.01 and 99%
confidence. The proportion is believed to be at most 0.25. Find the minimum sample size required.
p̂ (1 − p̂) z20.005
n=
B2
= 12384.187
Problem 11
The table below records the test scores of a random sample of 16 students from school A, for semester 1 and
semester 2. In semester 1 the students were in large classrooms with a teacher student ratio of 1:50. In semester
2 the students were spilt into smaller classrooms with a teacher student ratio of 1:20. A researcher wants to test
whether the students’ test scores increase when they are assigned to smaller classroom sizes. Assume that test
scores of all the students in school A are normally distributed.
Student 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
Semester 1 67 49 42 37 52 72 84 56 43 91 67 28 70 29 42 51
Semester 2 69 52 50 38 72 74 82 62 49 89 64 33 77 28 50 55
11.1 Carry out an appropriate test at the significance level of 5% to test the researcher’s claim. State the hypotheses,
the test statistic (and its distribution), the rejection rule, the empirical value of the test statistic, and your test
decision.
6
Step 1: State the hypotheses
• H0 :µD = µ2 − µ1 ≤ 0
• H1 :µD = µ2 − µ1 > 0, where µ1 and µ2 denote the population test scores in semester 1 and 2,
respectively.
• α = 1 − 0.95 = 0.05
• tα=0.05,d f =15 = 1.753
• We reject H0 in favour of H1 at the 5% level if t > t = 1.753
Step 5: Decision
• We reject the null hypothesis in favour of the alternative hypothesis at the 5% significance
level.
11.2 The researcher has now obtained the test scores of a random sample of 20 students from school B for semester
2. School B maintains a teacher student ratio of 1:50, which has not undergone any changes in the recent
years. The researcher wants to test his claim against the new data he has obtained from school B. Assume
that test scores for all the students in school B are normally distributed and have a different variance to
school A. Carry out an appropriate test at the significance level of 5% to test the researchers claim. State the
hypotheses, the test statistic, degrees of freedom, the rejection rule, the empirical value of the test statistic,
and your test decision.The test scores of the random sample of 20 students are as follows:
51, 52, 54, 61, 67, 48, 47, 71, 87, 48, 55, 52, 69, 67, 58, 71, 58, 32, 81, 91
Set up:
1 n 1 16 994
x̄A = ∑ xi = ∑ xi = 16 = 59
n i=1 16 i=1
1 1 4686
s2A = (∑ xi − x̄A )2 = · 4686 = = 312.4
n−1 15 15
• School B
1 n 1 20 1220
x̄B = ∑ xi = ∑ xi = 20 = 61
n i=1 20 i=1
7
1 1 4052
s2B = ( xi − x̄B )2 = · 4052 = = 213.2632
n−1 ∑ 19 19
• H0 :µA − µB ≤ (µA − µB )0
• H1 :µA − µB > (µA − µB )0
• α = 1 − 0.95 = 0.05
• Degrees of freedom:
2 2
SA SB2
213.2632 2
+ 312.4
nA nB
16 + 20
911.325
df = 2 = = = 29.02 ∼ 29
2 2
(312.4/16)2 (213.2632/20)2 25.415 + 5.984
(SA /nA ) (SB2 /nB ) +
nA −1 + nB −1 15 19
Step 5: Decision
• We cannot reject the null hypothesis in favour of the alternative hypothesis at the 5%
significance level.
11.3 Which test would you prefer to use and why? Explanations should be clear and contextualized.