Solutions Chapter 3
Solutions Chapter 3
Solutions Chapter 3
Exercise Solutions
31
EXERCISE 3.1
(a)
(b)
t1 =
b1 1 83.416 0
=
= 1.92
se(b1 )
43.410
Since the t = 1.92 value does not exceed the 5% critical value tc = t(0.975,38) = 2.024 , we do
not reject H 0 . The data do not reject the zero-intercept hypothesis.
(c)
The p-value 0.0622 represents the sum of the areas under the t distribution to the left of t =
1.92 and to the right of t = 1.92. Since the t distribution is symmetric, each of the tail
areas that make up the p-value are p / 2 = 0.0622 2 = 0.0311. The level of significance, ,
is given by the sum of the areas under the PDF for | t |>| tc |, so the area under the curve for
t > tc is / 2 = .025 and likewise for t < tc . Therefore not rejecting the null hypothesis
because / 2 < p / 2, or < p, is the same as not rejecting the null hypothesis because
tc < t < tc . From Figure xr3.1(a) we can see that having a p-value > 0.05 is equivalent to
having tc < t < tc .
Testing H 0 : 1 = 0 against H1 : 1 > 0, uses the same t-value as in part (b), t = 1.92.
Because it is a one-tailed test, the critical value is chosen such that there is a probability of
0.05 in the right tail. That is, tc = t(0.95,38) = 1.686 . Since t = 1.92 > tc = 1.69, H 0 is
rejected, the alternative is accepted, and we conclude that the intercept is positive. In this
case p-value = P(t > 1.92) = 0.0311. We see from Figure xr3.1(b) that having the p-value
< 0.05 is equivalent to having t > 1.69.
0.4
Rejection Region
0.3
0.2
0.1
1.92
0.0
-3
-2
-1
tc
Figure xr3.1(b) Rejection region and observed t value for Exercise 3.1(d)
(e)
The term "level of significance" is used to describe the probability of rejecting a true null
hypothesis when carrying out a hypothesis test. The term "level of confidence" refers to
the probability of an interval estimator yielding an interval that includes the true
parameter. When carrying out a two-tailed test of the form H 0 : k = c versus H1 : k c,
non-rejection of H 0 implies c lies within the confidence interval, and vice versa,
providing the level of significance is equal to one minus the level of confidence.
(f)
False. The test in (d) uses the level of significance 5%, which is the probability of a Type
I error. That is, in repeated samples we have a 5% chance of rejecting the null hypothesis
when it is true. The 5% significance is a probability statement about a procedure not a
probability statement about 1 . It is careless and dangerous to equate 5% level of
significance with 95% confidence, which relates to interval estimation procedures, not
hypothesis tests.
EXERCISE 3.2
(a)
The coefficient of EXPER indicates that, on average, a technical artist's quality rating goes
up by 0.076 for every additional year of experience.
3.9
3.8
3.7
RATING
3.6
3.5
3.4
3.3
3.2
3.1
0
EXPER
(b)
Using the value tc = t(0.975, 22) = 2.074 , the 95% confidence interval for 2 is given by
(d)
To test H 0 : 2 = 0 against H1 : 2 > 0, we use the t-value from part (c), namely t = 1.727 ,
but the right-tail critical value tc = t(0.95, 22) = 1.717 . Since 1.727 > 1.717 , we reject H 0 and
conclude that 2 is positive. Experience has a positive effect on quality rating.
The p-value of 0.0982 is given as the sum of the areas under the t-distribution to the left of
1.727 and to the right of 1.727. We do not reject H 0 because, for = 0.05, p-value >
0.05. We can reject, or fail to reject, the null hypothesis just based on an inspection of the
p-value. Having the p-value > is equivalent to having t < tc = 2.074 .
36
EXERCISE 3.3
(a)
(b)
(c)
Hypotheses:
Calculated t-value:
Critical t-value:
H 0 : 2 = 0 against H1 : 2 0
t = 0.310 0.082 = 3.78
tc = t(0.995, 22) = 2.819
Decision:
Hypotheses:
Calculated t-value:
Critical t-value:
H 0 : 2 = 0 against H1 : 2 > 0
t = 0.310 0.082 = 3.78
tc = t(0.99, 22) = 2.508
Decision:
Hypotheses:
Calculated t-value:
Critical t-value:
H 0 : 2 = 0 against H1 : 2 < 0
t = 0.310 0.082 = 3.78
tc = t(0.05, 22) = 1.717
Decision:
(d)
(e)
Hypotheses:
Calculated t-value:
Critical t-value:
Decision:
37
EXERCISE 3.4
(a)
MIM
16
12
8
4
0
0
10
20
30
40
50
60
70
80
90
100
PMHS
(b)
(c)
(d)
The estimated slope b2 = 0.18 indicates that a 1% increase in males 18 and older, who are
high school graduates, increases average income of those males by $180. The positive sign
is as expected; more education should lead to higher salaries.
(e)
Using tc = t(0.995,49) = 2.68 , a 99% confidence interval for the slope is given by
38
t=
0.180 0.2
= 0.639
0.0313
The critical values for a two-tailed test with a 5% significance level and 49 degrees of
freedom are tc = 2.01. Since t = 0.634 lies in the interval (2.01, 2.01), we do not
reject H 0 . The null hypothesis suggests that a 1% increase in males 18 or older, who are
high school graduates, leads to an increase in average income for those males of $200.
Non-rejection of H 0 means that this claim is compatible with the sample of data.
EXERCISE 3.5
(a)
( 7.3835 )( 0.1121)
(se)
(b)
The relationship in part (a) indicates that, as income increases, the amount of life
insurance increases, as is expected. If taken literally, the value of b1 = 6.8550 implies that
if a family has no income, then they would purchase $6855 worth of insurance. However,
given the lack of data in the region where INCOME = 0 , this value is not reliable.
(i) If income increases by $1000, then an estimate of the resulting change in the amount
of life insurance is $3880.20.
(ii) The standard error of b2 is 0.1121. To test a hypothesis about 2 the test statistic is
b2 2
~t
se ( b2 ) ( N 2 )
An interval estimator for 2 is b2 tc se ( b2 ) , b2 + tc se ( b2 ) , where tc is the critical
value for t with ( N 2) degrees of freedom at the level of significance.
(c)
To test the claim, the relevant hypotheses are H0: 2 = 5 versus H1: 2 5. The alternative
2 5 has been chosen because, before we sample, we have no reason to suspect 2 > 5 or
2 < 5. The test statistic is that given in part (b) (ii) with 2 set equal to 5. The rejection
region (18 degrees of freedom) is | t | > 2.101. The value of the test statistic is
t=
b2 5 3.8802 5
=
= 9.99
se ( b2 )
0.1121
As t = 9.99 < 2.101 , we reject the null hypothesis and conclude that the estimated
relationship does not support the claim.
40
To test the hypothesis that the slope of the relationship is one, we proceed as we did in
part (c), using 1 instead of 5. Thus, our hypotheses are H0: 2 = 1 versus H1: 2 1. The
rejection region is | t | > 2.101. The value of the test statistic is
t=
3.8802 1
= 25.7
0.1121
Since t = 25.7 > tc = 2.101, we reject the null hypothesis. We conclude that the amount of
life insurance does not increase at the same rate as income increases.
(e)
Life insurance companies are interested in household characteristics that influence the
amount of life insurance cover that is purchased by different households. One likely
important determinant of life insurance cover is household income. To see if income is
important, and to quantify its effect on insurance, we set up the model
INSURANCEi = 1 + 2 INCOMEi + ei
where INSURANCEi is life insurance cover by the i-th household, INCOMEi is
household income, 1 and 2 are unknown parameters that describe the relationship, and ei
is a random uncorrelated error that is assumed to have zero mean and constant variance
2 .
To estimate our hypothesized relationship, we take a random sample of 20 households,
collect observations on INSURANCE and INCOME and apply the least-squares estimation
procedure. The estimated equation, with standard errors in parentheses, is
( se )
( 7.3835)( 0.1121)
The point estimate for the response of life-insurance coverage to an income increase of
$1000 (the slope) is $3880 and a 95% interval estimate for this quantity is ($3645, $4116).
This interval is a relatively narrow one, suggesting we have reliable information about the
response. The intercept estimate is not significantly different from zero, but this fact by
itself is not a matter for concern; as mentioned in part (b), we do not give this value a
direct economic interpretation.
The estimated equation could be used to assess likely requests for life insurance and what
changes may occur as a result of income changes.
41
EXERCISE 3.6
(a)
A 95% interval estimator for 2 is b2 t(0.975,14) se(b2). Using our sample of data the
corresponding interval estimate is
0.3857 2.145 0.03601 = (0.4629, 0.3085)
If we used the interval estimator in repeated samples, then 95% of interval estimates like
the above one would contain 2. Thus, 2 is likely to lie in the range given by the above
interval.
(b)
We set up the hypotheses H0: 2 = 0 versus H1: 2 < 0. The alternative 2 < 0 is chosen
because we would expect the unit costs of production to decline as cumulative production
increases if there is learning. The test statistic, given H0 is true, is
t=
b2
~ t(14)
se(b2 )
The rejection region is t < 1.761. The value of the test statistic is
t=
0.3857
= 10.71
0.03601
Since t = 10.71 < 1.761, we reject H0 and conclude that learning does exist. We
conclude in this way because 10.71 is an unlikely value to have come from the t
distribution which is valid when there is no learning.
42
EXERCISE 3.7
(a)
t=
bj 1
se ( b j )
~ t(118)
The rejection region is t < 1.980 and t > 1.980 , where t(0.975,118) = 1.980 .
The results for each company are given in the following table:
Stock
t-value
Decision rule
Disney
t=
0.9593 1
= 0.287
0.1420
GE
t=
0.9830 1
= 0.162
0.1047
GM
t=
1.0744 1
= 0.478
0.1558
IBM
t=
1.2683 1
= 1.726
0.1554
Microsoft
t=
1.4299 1
= 2.284
0.1882
Mobil-Exxon
t=
0.4030 1
= 7.256
0.08228
For Disney, GE, GM and IBM, we failed to reject the null hypothesis, indicating that the
sample data are consistent with the conjecture that the Disney, GE, GM and IBM stocks
have the same volatility as the market portfolio. For Microsoft and Mobil-Exxon, we
rejected the null hypothesis, and concluded that these two stocks do not have the same
volatility as the market portfolio.
43
We set up the hypotheses H 0 : j 1 versus H1 : j < 1 . The relevant test statistic, given
H0 is true, is
t=
bj 1
se ( b j )
~ t(118)
The rejection region is t < 1.658 where tc = t(0.05,118) = 1.658 . The value of the test
statistic is
t=
0.4030 1
= 7.256
0.08228
Since t = 7.256 < tc = 1.658, we reject H0 and conclude that Mobil-Exxons beta is less
than 1. A beta equal to 1 suggests a stock's variation is the same as the market variation. A
beta less than 1 implies the stock is less volatile than the market; it is a defensive stock.
(c)
We set up the hypotheses H 0 : j 1 versus H1 : j > 1 . The relevant test statistic, given
H0 is true, is
t=
bj 1
se ( b j )
~ t(118)
The rejection region is t > 1.658 where tc = t(0.95,118) = 1.658 . The value of the test statistic
is
t=
1.4299 1
= 2.284
0.1882
Since t = 2.284 > tc = 1.658, we reject H0 and conclude that Microsofts beta is greater
than 1. A beta equal to 1 suggests a stock's variation is the same as the market variation. A
beta greater than 1 implies the stock is more volatile than the market; it is an aggressive
stock.
(d)
A 95% interval estimator for Microsofts beta is b j t(0.975,118) se(b j ) . Using our sample
of data the corresponding interval estimate is
1.4299 1.980 0.1882 = (1.057, 1.803)
Thus we estimate, with 95% confidence, that Microsofts beta falls in the interval 1.057 to
1.803. It is possible that Microsofts beta falls outside this interval, but we would be
surprised if it did, because the procedure we used to create the interval works 95% of the
time. The problem with the interval estimate is that it is wide. We feel sure that Microsoft
is more volatile than the market, but how much more is not known precisely.
44
The two hypotheses are H0: j = 0 versus H1: j 0. The test statistic, given H0 is true, is
t=
aj
se ( a j )
~ t(118)
The rejection region is t < 1.980 and t > 1.980 , where t(0.975,118) = 1.980 .
The results for each company are given in the following table:
Stock
t-value
Decision rule
Disney
t=
0.0010
= 0.152
0.0067
GE
t=
0.0059
= 1.199
0.0049
GM
t=
0.0023
= 0.317
0.0073
IBM
t=
0.0068
= 0.940
0.0073
Microsoft
t=
0.0102
= 1.156
0.0088
Mobil-Exxon
t=
0.0073
= 1.904
0.0039
We do not reject the null hypothesis for any of the stocks. This indicates that the sample
data is consistent with the conjecture from economic theory that the intercept term equals
0.
45
EXERCISE 3.8
(a)
We set up the hypotheses H0: 2 = 0 versus H1: 2 < 0. The alternative 2 < 0 is chosen
because an inverse relationship is one where the dependent variable increases as the
independent variable decreases, and visa versa. Thus, a negative 2 suggests an inverse
relationship between variables. The test statistic, given H0 is true, is
t=
b2
~ t(182)
se(b2 )
The rejection region is t < t(0.05,182) = 1.653 . The value of the test statistic is
t=
194.233
= 19.031
10.2061
Since t = 19.03 < 1.653, we reject the null hypothesis that 2 = 0 and accept the
alternative that 2 < 0 . We conclude that there is a statistically significant inverse
relationship between the number of house starts and the 30-year fixed interest rate.
(b)
We set up the hypotheses H 0 : 2 = 150 versus H1 : 2 150 . The test statistic, given
H0 is true, is
t=
b2 2
~ t(182)
se(b2 )
The rejection region is t < 1.973 and t > 1.973 , with t(0.975,182) = 1.973 . The value of the
test statistic is
t=
194.233 + 150
= 4.334
10.2061
Since t = 4.334 < 1.973, we reject the null hypothesis 2 = 150 and accept the
alternative that 2 150 . The data indicate that, if the 30-year fixed interest rate
increases by 1%, house starts will not fall by 150,000.
(c)
A 95% interval estimate of the slope from the regression estimated in part (a) is:
46
EXERCISE 3.9
(a)
We set up the hypotheses H0: 2 = 0 versus H1: 2 > 0 . The alternative 2 > 0 is chosen
because we assume that growth, if it does influence the vote, will do so in a positive way.
The test statistic, given H0 is true, is
t=
b2
~ t(29)
se(b2 )
The rejection region is t > 1.699 = t(0.95,29) . The value of the test statistic is
t=
0.6599
= 4.0460
0.1631
Since t = 4.0460 > 1.699, we reject the null hypothesis that 2 = 0 and accept the
alternative that 2 > 0 . We conclude that economic growth has a positive effect on the
percentage vote.
(b)
A 95% interval estimate for 2 from the regression in part (a) is:
b2 t(0.975,29) se(b2 ) = 0.6599 2.045 0.1631 = (0.3264, 0.9934)
This interval estimate suggests that, with 95% confidence, the true value of 2 is between
0.3264 and 0.9934. Since 2 represents the change in percentage vote due to economic
growth, we expect that a 1% increase in the growth rate will increase the percentage vote
by an amount between 0.3264 to 0.9934 percent.
(c)
We set up the hypotheses H0: 2 = 0 versus H1: 2 < 0 . The alternative 2 < 0 is chosen
because we assume that inflation, if it does influence the vote, will do so in a negative
way. The test statistic, given H0 is true, is
t=
b2
~ t(29)
se(b2 )
The rejection region is t < 1.699 = t(0.05,29) . The value of the test statistic is
t=
0.4450
= 0.856
0.5197
Since 0.856 > 2.045 , we do not reject the null hypothesis. There is not enough evidence
to suggest inflation has a negative effect on the vote.
(d)
A 95% interval estimate for 2 from the regression in part (c) is:
b2 t(0.975,29) se(b2 ) = 0.4450 2.045 0.5197 = ( 1.508, 0.618)
This interval estimate suggests that, with 95% confidence, the true value of 2 is between
1.508 and 0.618. It suggests that a 1% increase in the inflation rate could increase or
decrease or have no effect on the percentage vote.
EXERCISE 3.10
(a)
The coefficient 2 represents the increase in price from an extra square foot of living area.
We can refer to it as the marginal price per square foot.
(i) A 95% interval estimate of 2 for all houses is:
The results for testing the hypotheses H0: 2 = 80 versus H1: 2 80 are given in the
following table. In each case the test statistic is t = (b2 80) se(b2 ) which has a t( N 2)
distribution if H 0 is true. The rejection region is t < tc and t > tc where tc = t(0.975, N 2) .
Sample
t-value
N 2
tc
Decision rule
All
houses
t=
92.7473 80
= 5.29
2.4105
1078
1.962
Town
houses
t=
55.5853 80
= 3.44
7.0999
68
1.995
French
Style
t=
184.1667 80
= 10.25
10.1626
95
1.985
All cases lead to the rejection of the null hypothesis. We conclude that an additional
square foot does not add $80 to the average sale price of all houses, the sale price of town
houses, nor the sale price of French style houses.
EXERCISE 3.11
(a)
We conclude that the data is consistent with the conjecture that an additional square foot
of living space is associated with an increase in the sale price of the house by $80.
(b)
We conclude that, for houses that are vacant at time of sale, an additional square foot of
living space is not associated with an increase in the sale price of the house by $80.
(c)
We conclude that, for houses that are occupied at time of sale, an additional square foot of
living space is not associated with an increase in the sale price of the house by $80.
(d)
We conclude that, for houses that are occupied at time of sale, an additional square foot of
living space increases the sale price of the house by more than $80.
(e)
We conclude that, for houses that are vacant at time of sale, an additional square foot of
living space increases the sale price by less than $80.
(i) A 95% interval estimate for 2 from the full sample is given by
EXERCISE 3.12
(a)
Estimated equation:
( 0.3787 ) ( 0.0173)
( 22.88) ( 4.77 )
The estimated equation tells us that with every year of experience the associated increase
in hourly wage is $0.0824. Furthermore, it tells us that the average wage for those without
experience is $8.6658. The relatively large t-values suggest that the least squares
estimates are statistically significant at a 5% level of significance.
70
60
WAGE
50
40
30
20
10
0
0
10
20
30
40
50
60
EXPER
(b)
t=
b2
~ t(998)
se(b2 )
(se)
(t )
( 0.4797 ) ( 0.0218)
(17.67 ) ( 0.958)
With every extra year of experience the associated increase in average hourly wage for
females is $0.0209. This estimate is not significantly different from zero, however. The
average wage for females without experience is $8.4747.
50
WAGE
40
30
20
10
0
0
10
15
20
25
30
35
40
45
EXPER
(se)
(t )
( 0.5549 ) ( 0.0254 )
(15.89 ) ( 5.698)
With every extra year of experience, the associated increase in average hourly wage for
males is $0.1448. The average wage for males without experience is $8.8200.
70
60
50
WAGE
(c)
40
30
20
10
0
0
10
20
30
40
50
EXPER
( se )
(t )
( 0.9973) ( 0.0461)
( 6.022 ) ( 2.594 )
With every extra year of experience, the associated increase in average hourly wage for
blacks is $0.1197. The average wage for blacks without experience is $6.0054.
28
24
WAGE
20
16
12
8
4
0
0
10
20
30
40
50
EXPER
n = 9.0315 + 0.1451EXPER
WAGE
(se)
(t )
( 0.5808) ( 0.0266 )
(15.55) ( 5.452 )
With every extra year of experience the associated increase in average hourly wage for
white males is $0.1451. The average wage for white males without experience is $9.0315.
70
60
50
WAGE
(c)
40
30
20
10
0
0
10
20
30
40
50
60
EXPER
Figure xr3.12(e) Fitted regression line and observations for white males
Comparing the estimated wage equations for the four categories, we find that experience
counts the most, or leads to the largest increase in wages, for white males. The effect is
only slightly less for males in general. It is less for blacks and very small for females. For
those with no experience the wage ranking is white males, males, females, blacks.
(d)
Residual plots
50
40
RESID
30
20
10
0
-10
0
10
20
30
40
50
60
EXPER
40
RESID
30
20
10
-10
0
12 16 20 24 28 32 36 40 44 48
EXPER
50
40
RESID
30
20
10
0
-10
-20
0
10
20
30
40
50
60
EXPER
RESID
10
5
0
-5
-10
0
10
20
30
40
50
60
EXPER
50
40
RESID
30
20
10
0
-10
-20
0
10
20
30
40
50
60
EXPER
The main observation that can be made from all the residual plots is that the pattern of
positive residuals is quite different from the pattern of negative residuals. There are very
few negative residuals with an absolute magnitude larger than 10, whereas the positive
residuals are often larger than 10, with a few very large ones, and one over 40. These
characteristics suggest a distribution of the errors that is not normally distributed, but
skewed to the right.
EXERCISE 3.13
(a)
Estimated equation:
(se)
(t )
( 0.1738) ( 0.0078)
( 49.40 ) (10.76 )
With every extra year of experience the associated increase in hourly wage is $0.0842.
The average wage for those without experience is $8.5837. The relatively large t-values
imply the least squares estimates are statistically significant at a 5% level of significance.
80
70
60
WAGE
50
40
30
20
10
0
0
10
20
30
40
50
EXPER
Figure xr3.13(a) Fitted regression line and observations using all data
(b)
t=
b2
~ t(4731)
se(b2 )
n = 8.0375 + 0.0501EXPER
WAGE
(se)
(t )
( 0.2285 ) ( 0.0103)
( 35.18) ( 4.856 )
With every extra year of experience the associated increase in average hourly wage for
females is $0.0501. The average wage for females without experience is $8.0375.
80
70
60
WAGE
50
40
30
20
10
0
0
10
20
30
40
50
EXPER
n = 9.1170 + 0.1153EXPER
WAGE
(se)
(t )
( 0.2510 ) ( 0.0113)
( 36.32 ) (10.216 )
With every extra year of experience the associated increase in average hourly wage for
males is $0.1153. The average wage for males without experience is $9.1170.
80
70
60
50
WAGE
(c)
40
30
20
10
0
0
10
20
30
40
50
EXPER
(se)
(t )
( 0.5002 ) ( 0.0233)
(14.76 ) ( 2.860 )
With every extra unit of experience the associated increase in average hourly wage for
blacks is $0.0667. The average wage for blacks without experience is $7.3825.
40
WAGE
30
20
10
0
0
10
20
30
40
50
EXPER
(se)
(t )
( 0.2644 ) ( 0.0118 )
( 35.02 ) ( 9.847 )
With every extra year of experience the associated increase in average hourly wage for
white males is $0.1164. The average wage for white males without experience is $9.2606.
80
70
60
50
WAGE
(c)
40
30
20
10
0
0
10
20
30
40
50
60
EXPER
Figure xr3.13(e) Fitted regression line and observations for white males
Comparing the estimated wage equations for the four categories, we find that experience
counts the most, or leads to the largest increase in wages, for white males. The effect is
only slightly less for males in general. For blacks experience is worth slightly more than
half of what it is for white males. For females experience is worth slightly less than half of
what it is for white males. For those with no experience the wage ranking is white males,
males, females, blacks.
(d)
Residual plots
70
60
50
RESID
40
30
20
10
0
-10
-20
0
10
20
30
40
50
60
EXPER
70
60
50
RESID
40
30
20
10
0
-10
0
10
20
30
40
50
60
EXPER
RESID
40
30
20
10
0
-10
-20
0
10
20
30
40
50
60
EXPER
RESID
30
20
10
-10
0
10
20
30
40
50
60
EXPER
RESID
40
30
20
10
0
-10
-20
0
10
20
30
40
50
60
EXPER
Figure xr3.13(j)
In all residual plots the pattern of positive residuals is quite different from the pattern of
negative residuals. There are very few negative residuals with an absolute magnitude
larger than 10, whereas the positive residuals are often larger than 10, with a few very
large ones, and one over 40. These characteristics suggest a distribution of the errors that
is not normally distributed, but skewed to the right.