Mock Midterm
Mock Midterm
Mock Midterm
Remark: this is not a complete mid-term exam, but different exam type questions
from previous mid-terms to practice.
QUESTION 1. Using the Household expenditure survey for 2017, you get the following
information for 96 Spanish households:
Savings Annual savings (euros)
Income Annual income (euros)
Source: INE, 2017.
6000
5000
4000
3000
Savings
2000
1000
-1000
-2000
-3000
0 5000 10000 15000 20000 25000 30000
Income
Figure 1
a) According to Figure 1, by how much do average annual savings increase when annual
income increases by 1000 euros? Explain why some actual values for savings fall
below zero.
b) According to Figure 1, in a SLRM of savings on income, which of the Gauss Markov
assumptions is very likely to be violated? Explain
c) Given the OLS regression, compute the residual for a household with annual income=
5000 euros and annual savings = 863 euros.
1
DATA ANALYSIS FOR ECONOMICS
Variable Description
cigs Average number of cigarettes smoked per day
cigprice State cigarette price, cents per pack
educ Years of schooling
age Age in years
income Annual income, (thousand dollars)
Three different models have been estimated using cigs as dependent variable.
Model 1 Model 2 Model 3
const 10.67 9.565 14.74
(6.173) (6.210) (6.543)
cigprice -0.032 -0.032 -0.032
(0.102) (0.102) (0.102)
income 0.081 0.118
(0.053) (0.056)
educ -0.376
(0.169)
age -0.041
(0.028)
n 807 807 807
Adj. R2 0.005 0.006 0.010
SSR 151734 151294 150157
Note: Standard errors in parentheses
a) Which is the unit of analysis in this dataset and the sample size?
b) Postulate the population model of the SLRM
c) Write the OLS line of the SLRM.
d) Does the price of cigarettes explain a large fraction of the variability in cigarettes
smoked across individuals? Explain.
e) Interpret the estimated coefficients from Model 2.
f) Compare the estimated slope parameters associated to cigprice between the three
models. Could you give an explanation for this happening?
g) Robert is 25 years-old and has completed 14 years of schooling. His annual income
is around 26 thousand dollars and lives in Iowa where the price of the pack of
2
DATA ANALYSIS FOR ECONOMICS
cigarettes is 58 cents. Predict Robert’s number of cigarettes smoked per day (integer
number) using Model 3.
h) It has been argued that, controlling for other factors, individuals tend to give up
smoking as they get older. Is this result consistent with the regression in Model 3?
Explain
i) Which would be your preferred Model? Explain
QUESTION 3
A group of researchers from the World Bank Development Department conducted a study
in 2019 on the main determinants of human development across countries. For that
purpose, they collected information of an Index provided by the United Nations called
Human Development Index (HDI) ranging the level of development from 0 (the worst)
to 1 (the best). They ended up with a sample of 169 countries around the world. All the
information they gathered for the different countries is described in Table 1.
3
DATA ANALYSIS FOR ECONOMICS
Choose the correct answer for the OLS estimator of the intercept, 𝛽𝛽̂0
a. 0.319
b. 1.091
c. 0.045
d. 8.466
If the Total Sum of Squares of educ equals 160.62 and the unbiased estimator of the
error variance equals 2.89, the standard error of the slope is equal to
a. 0.134
b. 0.018
c. 55.43
d. 7.44
The fitted value of hdi for a country with mean values for both educ and gdppc is
a. 0.368
b. 0.702
c. 0.954
d. 0.377
Qatar is the richest country in the sample. The mean years of schooling in Qatar is 9.67 and it
presents a high level of human development index (its hdi equals 0.85). The OLS residual for
Qatar is:
a. -0.08
b. 0.08
4
DATA ANALYSIS FOR ECONOMICS
c. 0.15
d. -0.15
log�
(ℎ𝑑𝑑𝑑𝑑)𝑖𝑖 = 0.108 + 0.032𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑖𝑖 + 0.117 log(𝑔𝑔𝑔𝑔𝑔𝑔𝑔𝑔𝑔𝑔 )𝑖𝑖
𝑛𝑛 = 169 𝑆𝑆𝑆𝑆𝑆𝑆 = 0.46 𝑆𝑆𝑆𝑆𝑆𝑆 = 5
a. 0.908
b. 0.906
c. 0.46
d. 0