Bookbinders Case 2
Q3. Interpret the Confusion Matrix, for how many cases (i.e., respondents)
does the choice logit model predict their choice accurately? For each cell
in the matrix, give an exemplary respondent from the Estimation sheet.
Do you consider this as a successful estimation for the actual dataset?
Why?
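As shown above, the confusion matrix shows 80% accuracy. We find this by
adding the two bold numbers, one from each column (160 + 1120 = 1280), and
then dividing by the total number of respondents (1600): 1280 / 1600 = 0.80.
The model therefore predicts the choice correctly for 1280 respondents.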
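The arithmetic above can be checked with a short Python sketch. It uses only
the two correctly classified counts and the respondent total quoted in the
answer; the variable names are our own and are not taken from the case
spreadsheet.

```python
# Correctly classified counts (the bold cells of the confusion matrix)
# and the total number of respondents, as quoted in the answer above.
correct_counts = [160, 1120]
total_respondents = 1600

# Accuracy = correctly predicted respondents / all respondents.
correct = sum(correct_counts)
accuracy = correct / total_respondents

print(f"{correct} of {total_respondents} correct -> accuracy = {accuracy:.0%}")
```

The remaining 1600 - 1280 = 320 respondents are the misclassified cases in
the other two cells of the matrix.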
Q4. Interpret the Coefficient Estimates (i.e., which variables affect
customer choice significantly?; Based on the significant variables, which of
them affect customer choice positively and negatively?) Please follow the
methodology we did in class and do not worry about the green and red
colors. In answering this question, do not just write down the list of
variables but try to explain these relationships in meaningful sentences as
if you are reporting it to your manager!
Q5. Run a linear regression model on the data. In the regression output,
find the Coefficients column and interpret the coefficients with their
t-statistics (similar to the choice logit model output). Are the relationships
consistent between two models? Also, report the R-squared value in the
output and interpret the goodness-of-fit of the regression.
The linear regression model is shown above. The R-squared value (highlighted)
is 0.56, meaning the regression explains 56% of the variation in the
dependent variable, which does not indicate a good fit.
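As a reminder of what the R-squared value measures, here is a minimal Python
sketch of the standard definition, R-squared = 1 - SS_res / SS_tot. The
function name and the four data points are hypothetical, chosen only for
illustration; they are not taken from the Bookbinders dataset or from the
regression output.

```python
def r_squared(actual, predicted):
    """Share of the variation in `actual` explained by `predicted`."""
    mean_actual = sum(actual) / len(actual)
    # Total variation of the observed values around their mean.
    ss_tot = sum((y - mean_actual) ** 2 for y in actual)
    # Variation left unexplained by the model (squared residuals).
    ss_res = sum((y - p) ** 2 for y, p in zip(actual, predicted))
    return 1 - ss_res / ss_tot

# Hypothetical toy data: observed 0/1 choices and fitted values.
toy_actual = [1, 0, 1, 0]
toy_predicted = [0.8, 0.2, 0.6, 0.4]

toy_r2 = r_squared(toy_actual, toy_predicted)
print(f"R-squared on the toy data: {toy_r2:.2f}")
```

A value of 1 would mean the fitted values reproduce the observed choices
exactly, while a value near 0 would mean the model does no better than
predicting the average.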
Q6. Based on the insights you gained, summarize the advantages and
limitations of the customer choice and regression models.
Figure 6: