Tute6Answers ECON339
Tute6Answers ECON339
Tute6Answers ECON339
Tutorial assignment:
Ans: The Ramsey’s RESET test is a test to determine the correct functional form.
Ramsey’s RESET test is a test of whether the functional form of the regression is
appropriate. In other words, we test whether the relationship between the dependent
variable and the independent variables really should be linear or whether a non-linear
form would be more appropriate. The test works by adding powers of the fitted values
from the regression into a second regression. If the appropriate model was a linear
one, then the powers of the fitted values would not be significant in this second
regression.
What could be done if it were found that the RESET test failed?
Ans: The test is performed under the null hypothesis of a linear model. The rejection
of the null implies that a nonlinear model is supported by the data. However, the test
does not provide the functional form of the nonlinear model.
If we fail Ramsey’s RESET test, then the easiest “solution” is probably to transform
all of the variables into logarithms. This has the effect of turning a multiplicative
model into an additive one.
If this still fails, then we really have to admit that the relationship between the
dependent variable and the independent variables was probably not linear after all so
that we have to either estimate a non-linear model for the data (which is beyond the
scope of this course) or we have to go back to the drawing board and run a different
regression containing different variables.
Question 1
The data is available in the file “hedonic1.xls” Consider the following multiple
regression model for new houses only (age=0).
Quick>Estimate equation>
Log(selling_price) c sfla beds baths stories vacant
1
Dependent Variable: LOG(SELLING_PRICE)
Method: Least Squares
Date: 07/09/05 Time: 10:42
Sample: 1 6660 IF AGE=0
Included observations: 151
b) Do the coefficients take the expected signs? Check for any evidence of
multicollinearity.
The coefficients for beds and stories do not take the expected signs we expect the
more bedrooms and more stories the greater the selling price however this is not
demonstrated by the data for new houses. Also vacant houses on average are more
expensive than non-vacant houses but this is not significant at the 5% level.
One way to check for multicollinearity is to check the correlation coefficients of all
the explanatory variables.
In the workfile window> hold control and click all the explanatory variables> right
click and select to open them as a group.
2
Any correlations over 0.8 are considered to be evidence of multicollinearity.
However with multicollinearity there is no cut off there is just more of a
possible effect the higher the correlations between the variables.
For example:
Quick>Estimate Equation>
Sfla c beds baths stories vacant
c) What are the possible effects of multicollinearity? What are the possible solutions
to the problem?
Possible consequences include – incorrect signs and sizes of the coefficients and
possible large std errors, so that the variables appear individually not significant
when in fact they are and together they may be significant when doing an F-test.
3
d) Create a dummy variable for the entire dataset which has a value of 1 for a new
house and 0 for any other house.
Alternatively, in the blank window above the workfile type in dum1= age=0.
You will be able to find a new variable dum1 created in the workfile.
To check whether the dummy is properly created, you can graph the dum1 variable and find
the many spikes that have the value 1 at age=0.
e) Do a Chow test for the complete dataset to see if the equation changes depending
on whether the house is a new house or not.
Quick>Estimate equation>
Log(selling_price) c sfla beds baths stories vacant new new*sfla new*beds
new*baths new*stories new*vacant
4
Durbin-Watson stat 0.818210 Prob(F-statistic) 0.000000
Wald Test:
Equation: EQ01
Ho: The coefficients are the same regardless of whether it is a new house or not
H1: The coefficients change depending on whether it is a new house or not
Assume a 5% level of significance
p-value=.0004<0.05 Reject the null
At the 5% level we can conclude that there are different effects for new and old
houses