Credit GrantingA Comparative Analysis of
Credit GrantingA Comparative Analysis of
Credit GrantingA Comparative Analysis of
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at .
http://www.jstor.org/page/info/about/policies/terms.jsp
.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact [email protected].
Wiley and American Finance Association are collaborating with JSTOR to digitize, preserve and extend access
to The Journal of Finance.
http://www.jstor.org
ABSTRACT
Financialclassificationissues, and particularlythe financial distress problem,continue
to be subject to vigorousinvestigation.The corporatecredit grantingprocess has not
receivedas much attention in the literature.This paperexaminesthe relativeeffective-
ness of parametric, nonparametricand judgemental classification procedureson a
sample of corporate credit data. The judgemental model is based on the Analytic
Hierarchy Process. Evidence indicates that (nonparametric)recursive partitioning
methods provide greater informationthan simultaneouspartitioningprocedures.The
judgementalmodel is found to performas well as statistical models. A complementary
relationship is proposed between the statistical and the judgemental models as an
effectiveparadigmfor grantingcredit.
In the context of a single period, expression (1) is simply the E(NPV) from
grantingcredit in the first period. Assuming for convenience that variable costs
(v) can be expressed as a proportionof sales (s), formulation (1) can be restated
as follows to determinethe maximumcredit limit:
where, CLmaxis the maximumcredit line that should be grantedto the customer.
The above descriptionimplies that the credit grantingprocess consists of two
stages: (i) estimation of default probabilities; and (ii) integration of default
probabilitiesin (1) or (2) to estimate credit limits. Two basic approacheshave
evolved to facilitate the assessment of default risk: statistical and judgemental
systems. All credit analysis, whether based on judgementalor statistical systems,
operate on similar principles. Further, in estimating default probabilities, the
suggestedmultistage investigation process essentially determinesthe feasible set
of measurements,X, that can be used to estimate the probabilities,Pli and P2i.
A. StatisticalModels
The two parametric methodologies we employ, MDA and logit, have been
extensively investigated in prior classification studies. Interested readers are
referredto Altman et al. (1981) for a detailed discussion. It will suffice to mention
here that the default probabilitiesfor each observationusing MDA and logit will
be the posteriorprobabilitiesof groupmembershipgeneratedfrom the respective
MDA and logit models. These probabilitiescan then be substituted in (1) or (2)
to determinecredit limits.
Both MDA and logit necessitate restrictiveassumptionsof distributionalform,
beingparametricin nature.In particular,the problemsin using MDA for financial
classificationare well documented(see, e.g., Altman et al. [1]). Several nonpara-
metric classification alternatives have been suggested in the literature. Among
these are the rank transformeddiscriminant analysis (Conover and Iman [6]),
the recursive partitioning algorithm (RPA) (Breiman et al. [2]), the Quinlan
algorithm (Quinlan [17]), and goal programming(GP) (Freed and Glover [12]).
We focus on the GP and the RPA in this study.
MDA, logit and GP can be characterizedas simultaneous partitioning proce-
dures since they consider all the potential discriminatoryvariables simultane-
ously in the classification model. However, models like the RPA can be charac-
1. MathematicalProgramming(MP)
Recently, the interest in MP-based alternatives to parametric classification
proceduresseems to have been revived by Freed and Glover [12]. Formally,in a
MP context, the discriminant problem can be stated as one of finding a linear
transformationX and boundary value b to categorize group k, given points Ai
and groupsGk. Thus, for a 2-groupproblem,the objective is to find X such that:
AiXs b AiE G1
AiX > b Ai E G2
where b is any positive constant. Since, however,the Ai's may be distributedin
a manner that may not permit complete discrimination, we need to relax the
constraints to ensure feasible solutions. The slack (surplus) variable for the
purpose,a, can take two forms: (i) it can be constrained at the grouplevel, ak; or
(ii) it can be constrained at the observationlevel, ai. Constrainingthe a's at the
grouplevel yields LPI:
Min E2=1 ak
such that
AiX c b + ak i =(1,... Q), i E Gl, k = (1, 2).
AiX > b - ak i= (Q + 1, ... ., R), i E G2, k = (1, 2).
where the X's are unrestricted in sign. Constrainingthe a's at the observation
level, on the other hand, yields LPII:
Min E2=1 ak
such that
AiX <b +ai i =(l, .
Q), i E Gl, k =(1, 2).
2. RecursivePartitioning
RPA yields a binary classification tree, similar to a spanning tree, that enables
the assignment of objects into one or k known groups.Binary classification trees
are constructed by repeated splits of X, where X is the measurement space
containing all relevant measurement vectors. The construction of the binary
classification tree revolves aroundthe following three elements:
1. the selection of splitting rules;
2. the decision when to declare a node as terminal;and
3. the assignment of each terminal node to a group.
The performanceof the model critically depends on the first two elements. It
turns out that the third element is trivially resolved, given solutions to the first
two elements. The fundamental notion behind each split of a subset is one
obtainingdescendant subsets that are 'purer'than the data in the present subset.
The best splitting rule for the given sample is defined as the one which
maximizes the decrease in the sum of the impurities of the two resulting
subsamples comparedwith the impurity of the parent sample. In order to find
the best splitting rule, the algorithmfirst searches for the best splitting point for
each explanatory variable and then the best of these splits is selected. The
splitting process terminates when further splitting does not lead to any decrease
in the impurityof the currenttree. The currenttree then is referredto as Tfin.
The next step in the process is to search for a tree that is less complex than
Tfir but has a smaller cross-validated classification error rate. This step is
motivated by the observationthat Tfin usually overfit the data and are complex.
The final optimal tree is selected based on a criterion of minimizing cross-
validated resubstitution risk. The cross-validation risk of tree T is computed
using a V-foldcross-validationprocess, where V is the numberof folds. All cases
in the sample are randomlydivided into V groups of approximatelyequal sizes.
Observationsin V - 1 groupsare used to construct the tree correspondingto the
penalty chosen from the range of values of penalty parametersfor which tree T
was optimal, based on all of the observations.The observationsin the groupleft
out are classified by the newly constructed tree. The procedure is repeated V
times, each time with a different group being left out. The resubstitution risks
from all cross-validationtests for a particularcandidate T are averagedto obtain
cross-validatedrisk for T.
6In some exceptional cases, LPI and LPII can be shown to be degenerate (Markowskiand
Markowski[15]). However,simple normalizationscan be undertakento remedythe problem (see,
Freedand Glover[11]). In the data under study, we did not have to resortto any normalizations.
Note that if we post-multiply 0 by the column vector iv, we obtain the vector
nw. To recoverthe weights, w, we can solve the system (O - nI) = 0, which must
have a non-trivial solution, since 0 has unit rank and all the eigenvalues of 0,
(L = 1, 2, ..., n), are zero except one (Lma.). Further, the trace (0) = sum of
the eigenvalues = sum of the diagonal elements of 0 = n.
The matrix 0 satisfies the 'cardinal'consistency property 0ij?>k = Oik. Thus,
given any row of A, we can determine the rest of the entries from this relation.
However,suppose we have estimates of the ratios in the matrix. In this case, the
cardinal consistency relation above need not hold, nor need an 'ordinal'transi-
tivity relation of the form:Oi> Oj, Oj> ?k imply Oi > Ok hold (wherethe Oiare
the rows of 0). It can be shown that in a positive reciprocal matrix, small
perturbations in the coefficients imply small perturbations in the eigenvalues
and hence the eigenvector is insensitive to small changes in judgement [18, p.
192].
'This model is more fully describedin Srinivasanand Kim [23]. Further,note that even though
we have includedthe descriptionof the AHP-basedmodel underclassificationmethods,it is strictly
not a classificationmodel. The model is, however,directlyusable to determinecredit limits.
8AHP has been applied to many varied problems including planning for a national waterway
(Saaty and Vargas [20]) and marketingdecisions (Wind and Saaty [27]). Srinivasanand Kim [22]
illustratethe applicabilityof AHP to manyfinancialdecisions.For an extensive listing of applications
of AHP, see Zahedi [28]. For theoreticaldetails, refer Saaty [18]. For an axiomatictreatmentof the
process,refer Saaty [19].
B. Methodology
The real test of the classification models in the context of this study is their
ability to replicatethe expert'sjudgements.Maraiset al. [14] identify three types
of overfittingbias that can result if expected misclassificationlosses are estimated
from the same sample used for model estimation: (i) overfitting in the choice of
explanatory variables; (ii) sensitivity to a particular loss function; and (iii)
statistical overfitting bias in the computation of the expected loss rate.
The first type of bias is clearly not relevant for this study as we do not use any
data reduction procedure. Further, loss functions are typically very hard to
estimate as also admitted by Marais et al. [14]. A variety of methods have been
proposedto deal with the statistical overfitting bias. In most prior classification
IV. Results
A. Classification
The cross validated results of a linear MDA model using proportionalprior
probabilitiesare presentedin Exhibit 1. On the average,the linear MDA correctly
classifies 88.89% of all the customers in the bootstrap sample. The average
classification accuracydrops to 85.05%in the case of the holdout sample. There
is some variation in the classification accuracywithin the two groups:90.9%of
the non-HR customers were classified correctly and only 80% of the HR cus-
tomers were correctly classified. Cross validated classification accuracyis lower
but not appreciably.The overall resubstitution loss rate is about 3.84% but is
more severe in the HR cases.
The homogeneityof dispersionmatriceswas tested using a likelihoodratio test
[16] which is approximatelychi-squaredistributed.The results indicatedthat all
the 25 bootstrapsamples did not meet the homogeneityassumption.A quadratic
Exhibit1
Summary of Classification Results (% Correctly Classified)
i. Bootstrap Sample
1. All figures are averages over the 25 replications. Figures in parentheses represent the estimated standard error
of percent correctly classified. Details of replication results are available from the authors.
2. Results for the RPA 1 are from using 10-fold cross-validation. RPA models were based on the symmetric Gini
criterion and no linear combinations were used.
discriminant rule was, therefore, applied and the results are summarized in
Exhibit 1. The classification accuracyhas improvedin the case of the bootstrap
sample to 90.74% compared to 88.89% using the linear rule. However, the
resubstitutionloss rates are slightly higher.This may be a result of the overfitting
that accompaniesa quadraticrule. Replication results do not reveal any signifi-
cant variation over the bootstraptrials.'2
Relative influence of variables was ascertainedusing a forwardstepwise selec-
12 Note than even though the data do not satisfy some assumptions of the MDA models, the
B. CreditLimits13
We conducteda judgementalinput elicitation process for stage IV customers.
All judgementswere providedby the senior credit manager('expert') who is also
responsible for classifying stage IV customers as HR customers. After many
13
We present the AHP model first and subsequentlyprovide a comparativeassessment of the
methodologiesin determiningcredit limits.
HR NHR HR NHR
6 22 13 30
3 2 6 0
The numbersbeloweach node indicatenode membership.The top numberis the frequencyof the groupto which
the node has been assigned. The tree is based on a 10-foldcross-validation.
PAY:
HR = High Risk Good = 2
NHR = Non-High Risk Fair = 1
Poor = 0
were relevant. Not all of the criteria are quantifiableand, therefore,ceterisparibus,we can expect
some divergencebetweenthe results using AHP's subjectiveprobabilitiesand objectiveprobabilities
fromMDA, logit, RPA and GP.
Factor 1 2 3 4 5
1 1 0.33 9 3 .125
2 3 1 9 3 .125
5 8 8 9 8 1
Factor Definition:
1 : Customer Background
2 : Payment Record
3 : Geographical Location
4 : Business Potential
5 : Financial Soundness
each of the factors for stage IV customers is also shown in Exhibit 3. Each
customer was then evaluated on the five dimensions of interest (corresponding
to the five factors) and a set of subjectiveprobabilitieswas developed.The firm
provided us estimates of benefits and losses that are actually used (albeit
subjectively) currently in setting limits. The contribution margin on sales was
estimated at 10% before-tax and the variable costs were estimated at 90% of
sales. Most of the sales by the firm involve a credit period of 15 days only and as
a matter of convenience, we ignoredthe present value effect without loss of any
generality.
Subjectiveprobabilitiesfor each customerwere then integratedalong with the
estimates for benefits and costs in formulation (1) or (2) to determine credit
limits. Similarly, objective probabilities using MDA, logit, RPA, and GP were
also integrated. The results are presented in Exhibit 4. The results reveal that
the AHP limits were the closest to actual limits, which is not totally surprising.
Consistency,for the purposes of comparison,was defined to refer to cases where
the model suggested limits and actual limits were within 10% of each other.
Further, since the operational decision is to grant the credit requested or less,
positive NPV situations are considered consistent if the actual limit granted is
equal to the full amount of credit requested."5Some of the cases (=6) in which
15
If a customer requests for a credit of $100,000, then the operative decision for the firm is to
determineif the customeris worthy of being grantedfull credit. In such cases, it is possible that the
tradeofffunction specified in formulationmay imply a limit in excess of $100,000.However,this is
suggestedlimits are less than actual are the exceptions where extraneous factors
have had an influence on the decision. Elimination of these cases will further
improvethe overall consistency of the results from all the models.
The above results provide some insight on the ability of statistical and judge-
mental models to replicate expert decisions. As stated earlier, we have not
addressed the broader issue of which factors are relevant for corporate credit
granting. In our opinion, this issue motivates a complementary relationship
betweenjudgementaland statistical models. Statistical models like the RPA can
perhaps be used with significant success to isolate from a universe of potential
factors those few variablesthat have a significant bearing on the default risk of
the customer. This information can then be used in a judgemental model, like
the AHP, where objective measurements are considered along with subjective
measurements that are difficult to quantify, to provide an optimal setting for
corporatecredit granting decisions.
V. Summary
The corporatecredit grantingprocess has not received adequateattention in the
literature, perhaps because of the lack of publicly available data. We provide
comparativeanalysis of the ability of statistical classification models to replicate
the decisions of a corporate credit granting expert. It is shown that sequential
partitioning proceduresprovide slightly superior classification results. The su-
periority of sequential partitioning proceduresis likely to be a function of the
complexity of the multivariate data being analyzed. A complementary role is
suggested for statistical classification models in the corporate credit granting
process. Such models can serve the useful purposeof reducingand/or identifying
not to be treatedas an inconsistent case. Formulation(2) only specifies the maximumcredit limit to
grant.
REFERENCES
1. E. I. Altman, R. B. Avery, R. A. Eisenbeis and J. F. Sinkey, Jr., Applicationof Classification
Techniquesin Business,Bankingand Finance,CT: JAI Press, Inc., 1981.
2. H. Bierman and W. Hauseman. "The Credit GrantingDecision," ManagementScience (April
1980), 519-532.
3. L. Breiman,J. H. Freidman,R. A. Olshen and C. J. Stone, Classificationand RegressionTrees,
Wadsworth:California,1984.
4. S. Chatterjeeand S. Chatterjee. "Estimation of Misclassification Probabilitiesby Bootstrap
Methods,"Communicationsin Statistics, (1983), 645-656.
5. S. Chatterjeeand V. Srinivasan,"GraphicalAnalysisand FinancialClassification:A Case Study,"
WorkingPaper, NortheasternUniversity, Revised,February1987.
6. W. J. Conoverand R. J. Iman. "The Risk TransformationAs A Method of Discriminationwith
some Examples,"Communicationsin Statistics Theoryand Methods,9, (1980), 465-487.
7. T. G. Diettrichand R. S. Michalski,"Learningand Generalizationof CharacteristicDescriptions:
EvaluationCriteriaand ComparativeReview of Selected Methods,"Sixth InternationalJoint
Conferenceon ArtificialInteUigence,IJCAI,Tokyo, Japan, (1979), 223-231.
8. B. Efron, "Estimatingthe ErrorRate of a PredictionRule: Improvementon Cross-Validation,"
Journalof the AmericanStatisticalAssociation,(June 1983), 316-331.
9. R. A. Eisenbeis. "Selection and Disclosure of Reasons for Adverse Action in Credit Granting
Systems,"FederalReserveBulletin, September,(1980), 727-736.
10. G. Emery,"PositiveTheoriesof Trade Credit,"In Y. H. Kim (Ed.),Advancesin WorkingCapital
Management,Vol. 1, (1987), forthcoming.
11. N. Freedand F. Glover,"ResolvingCertainDifficulties and Improvingthe ClassificationPower
of LP DiscriminantAnalysis Formulations,"DecisionSciences,(Fall (1986), 589-595.
12. N. Freed and F. Glover, "Simple But Powerful Goal ProgrammingModels for Discriminant
Problems,"EuropeanJournalof OperationsResearch,7, (1981), 44-60.
13. H. Frydman, E. I. Altman and D. Kao, "IntroducingRecursive Partitioning for Financial
Classification:The Case of FinancialDistress,"Journalof Finance,XL, (1984), 269-291.
14. M. L. Marais,J. M. Patell and M. A. Wolfson,"TheExperimentalDesign of ClassificationTests:
The Caseof CommercialBank Loan Classification,"JournalofAccountingResearch,22, (1985),
Supplement.
15. E. P. Markowskiand C. A. Markowski,"SomeDifficultiesand Improvementsin ApplyingLinear
ProgrammingFormulationsto the DiscriminantProblem,"DecisionSciences, 16, (1985), 237-
247.
16. D. Mehta. "The Formulationof Credit Policy Models,"ManagementScience, (October 1968),
B30-B50.
17. J. R. Quinlan,"LearningEfficient ClassificationProceduresand Their Applicationto Chess End
Games,"In R. S. Michalski,J. F. Carbonelland T. M. Mitchell (Eds.), MachineLearning,Palo
Alto, CA:Tioga Publishing,1983.
18. T. L. Saaty, The AnalyticalHierarchyProcess,New York:McGraw-Hill,1980.
19. T. L. Saaty, "The Axiomatic Foundation of the Analytic Hierarchy Process," Management
Science,(July 1986),841-855.
20. T. L. Saaty and L. G. Vargas, The Logicof Priorities:Applicationsin Business, Energy,Health
and Transportation, Boston: Kluwer-Nijhoff, 1982.
21. T. L. Saaty and R. S. Mariano, "RationingEnergy to Industries:Priorities and Input-Output
Dependence,"EnergySystemsand Policy, (Winter 1979).
22. V. Srinivasanand Y. H. Kim, "FinancialApplicationsof the Analytic HierarchyProcess,"TIMS
InternationalConference,Australia,(July 1986).
23. V. Srinivasanand Y. H. Kim, "Modelingthe Credit GrantingProcess in a Fortune500 Corpo-
DISCUSSION