Академический Документы
Профессиональный Документы
Культура Документы
Technical University of Crete, School of Production Engineering and Management, Financial Engineering Laboratory, University Campus, 73100 Chania, Greece
Audencia Group, Nantes School of Management, France
c
ESCP Europe Business School, Research Centre for Energy Management, UK
b
a r t i c l e
i n f o
Article history:
Available online 24 January 2014
JEL classication:
C44
G24
G13
Keywords:
Credit ratings
Rating agencies
BlackScholesMerton model
Multi-criteria decision making
a b s t r a c t
Ratings issued by credit rating agencies (CRAs) play an important role in the global nancial environment.
Among other issues, past studies have explored the potential for predicting these ratings using a variety
of explanatory factors and modeling approaches. This paper describes a multi-criteria classication
approach that combines accounting data with a structural default prediction model in order to obtain
improved predictions and test the incremental information that a structural model provides in this context. Empirical results are presented for a panel data set of European listed rms during the period 2002
2012. The analysis indicates that a distance-to-default measure obtained from a structural model adds
signicant information compared to popular nancial ratios. Nevertheless, its power is considerably
weakened when market capitalization is also considered. The robustness of the results is examined over
time and under different rating category specications.
2014 Elsevier B.V. All rights reserved.
1. Introduction
Credit ratings are important ingredients of the credit risk management process, and they are widely used for estimating default
probabilities, supporting credit-granting decisions, pricing loans,
and managing loan portfolios. Credit ratings are either obtained
through models developed internally by nancial institutions
(Treacy and Carey, 2000) or provided externally by credit rating
agencies (CRAs). The latter, despite the criticisms on their scope
and accuracy (e.g., Frost, 2007; Pagano and Volpin, 2010; Tichy
et al., 2011), are widely used by investors, nancial institutions,
and regulators, and they have been extensively studied in academic
research (for a recent overview, see Jeon and Lovo, 2013). In this
context, models that explain and replicate the ratings issued
by CRAs can be useful in various ways, as they can facilitate an
understanding of the factors that drive CRAs evaluations, provide
investors and regulators with early-warning signals and information for important rating changes, and support the credit risk
assessment process for rms not rated by the CRAs.
Previous studies have focused on analyzing and predicting credit ratings using mostly rm-specic data (usually in the form of
Corresponding author. Tel.: +30 28210 37318; fax: +30 28210 69410.
E-mail addresses: mdoumpos@dpem.tuc.gr (M. Doumpos), dniklis@isc.tuc.gr
(D. Niklis), kostas@dpem.tuc.gr (C. Zopounidis), kandriosopoulos@escpeurope.eu
(K. Andriosopoulos).
0378-4266/$ - see front matter 2014 Elsevier B.V. All rights reserved.
http://dx.doi.org/10.1016/j.jbankn.2014.01.010
600
2. Methodology
2.1. Market model
The works of Black and Scholes (1973) and Merton (1974) led to
the development of the research on structural models for credit
risk modeling. In this framework (henceforth referred to as BSM),
a rm is assumed to have a simple debt structure, consisting of a
single liability with face value L maturing at time T. The rm defaults on its debt at maturity if its assets market value is lower
than L. In this context, the rms market value of equity (E) is modeled as a call option on the underlying assets (A), whose value is
given by the BlackScholes option pricing formula:
p
E AN d1 Lerf T N d1 r T
d1
lnA=L r f 0:5r2 T
p
r T
Furthermore, under Mertons assumption that equity is a function of assets and time, the following equation is derived from Its
lemma (Hull, 2011):
A
E
rE rN d1
DD
lnA=L l 0:5r2 T
p
r T
Vxi
K
X
wk v k xik
k1
where xi xi1 ; . . . ; xiK is the data vector for rm i on K independent attributes (evaluation criteria), wk is the (non-negative)
trade-off constant for criterion k (the trade-offs are normalized
to sum up to 1), and v k is the corresponding marginal value
function scaled in [0, 1]. The credit scores obtained from the additive model (4) range in [0, 1] and are inversely related to the risk
level of the rms (i.e., high credit scores indicate low risk and
vise-versa). The marginal value functions decompose the overall
credit score of a rm into partial scores at the criteria level; they
are non-decreasing for prot-related attributes and non-increasing for cost-related indicators. For example, the higher the protability of a rm (according to a protability indicator such as the
return on assets ROA), the higher (i.e., closer to 1) its performance on the corresponding marginal value function and the
higher its overall credit score (i.e., lower risk level). The marginal
value functions have a functional-free form (piece-wise linear)
that is inferred directly through the model-tting process. This enables the additive model to capture the nonlinear (monotone)
relationships between the independent attributes and the ratings
of the rms. Nevertheless, in contrast to popular nonlinear data
mining algorithms (e.g., neural networks), the additive form of
the model makes it easy to comprehend as it adopts the structure
of a simple credit scorecard. On the other hand, the criteria tradeoffs act as proxies for the relative importance of the independent
attributes (on a 01 scale). Given the additive and compensatory
structure of the evaluation model, a high risk level implied by
the low performance of an attribute with a high trade-off constant
is not easily compensated by high performance by other attributes
with low trade-offs.1
On the basis of its overall credit score as dened by (4), a rm i
is classied into risk grade R if and only if
where 1 > t 1 > t 2 > t N1 > 0 are the thresholds that distinguish a
set of N ordered rating classes R1 ; R2 ; . . . ; RN (class R1 is the lowrisk grade and RN is the high-risk one). Given a training sample consisting of m observations from each rating class R , the additive
model can be developed through the solution of the following
optimization problem:
min
N
N
X
1 XX
yih yih
m
1
i2R h1
8
9
w1 w2 wK 1
10
11
12
The objective of this optimization formulation is to t an additive scoring model to the data, so that the total weighted classication error (downgrade and upgrade errors) is minimized. The
classication errors in the objective function are weighted by the
number of training observations from each rating class, thereby
avoiding the construction of a scoring model that is biased toward
larger classes.2 Constraint (7) denes the downgrade errors
y
for a rm i from rating class R
i ; yi;1 ; . . . ; yi;N1
1
For instance, if a protability indicator has a high trade-off constant and a
leverage indicator has a low trade-off, then poor protability cannot be easily
compensated for by lower debt levels in order to achieve an overall high credit risk
score.
2
The objective function can be easily modied to introduce different weights for
the downgrade and upgrade errors, thus enabling analysts to take into consideration
the different costs of such errors. However, due to the lack of such information, this
possibility is not explored in this study.
601
(a)
(b)
Fig. 1. Classication errors. (a) Two rms from the low-risk class, R1 , are
downgraded to categories R2 and R3 . (b) Two rms from the high-risk class, R3 ,
are upgraded.
and y
i;1 yi;N1 0, where d is a small positive constant used
to model the strict inequalities in classication rule (5). If there is a
two-notch downgrade compared to the actual rating of the rm,
then
Vxi < t 1 < t
and
the
error
variables
are
y
i t Vxi d; yi;1 t 1 Vxi d, and the others equal to
602
Table 1
Sample composition (number of observations) by year, country, and business sector.
Year
No. of
obs.
Country
No. of
obs.
2002
2003
38
102
Germany
France
308
303
2004
115
UK
298
2005
126
Switzerland
135
2006
2007
2008
2009
2010
2011
2012
135
138
140
139
149
151
92
Netherlands
Italy
Spain
Belgium
130
88
33
30
Total
1325
Sector
No. of
obs.
Manufacturing
Information and
communication
Wholesale and
retail trade
Transportation
and storage
Construction
1325
853
220
130
Table 3
Averages of independent variables by rating group.
ROA
EBIT/IE
EQ/TA
EQ/LTD
CAP
DD
R1
R2
R3
R4
R5
Invest.
Specul.
13.69
18.33
46.86
2.29
18.01
8.96
8.02
8.42
34.74
1.27
16.75
5.94
6.58
5.54
30.48
1.06
15.74
4.17
4.26
3.12
27.82
0.87
14.65
2.89
2.79
0.54
21.33
0.78
13.12
1.31
7.64
7.56
33.26
1.23
16.27
5.17
2.47
2.46
26.17
0.85
14.26
2.49
90
32
1325
considered for this purpose. In the rst, the sample rms are
grouped into ve major risk classes, as follows: (1) class R1 , consisting of the lowest-risk cases, with ratings in the range from
AA- up to AAA; (2) class R2 , consisting of cases with ratings A,
A, and A+; (3) classes R3 , consisting of cases from low investment-level grades (BBB, BBB, BBB+); (4) class R4 , consisting of
speculative ratings (BB, BB, BB+); and (5) the high-risk category
R5 , which includes highly speculative ratings (i.e., D up to B+).
Alternatively, we also consider a two-group setting distinguishing
between speculative (D to BB+) and investment grades (BBB to
AAA). The percentage of observations in each rating group under
these schemes is shown in Table 2.
For every observation in the sample for year t, the S&P longterm rating is recorded at the end of June, while annual nancial
data are taken from the end of year t 1. The nancial data involve
four nancial ratios: ROA (prot before taxes/total assets), interest
coverage (earnings before interest and taxes/interest expenses,
EBIT/IE), solvency (equity/total assets, EQ/TA), and the long-term
debt leverage ratio (equity/long term debt, EQ/LTD). ROA is the primary indicator used to measure corporate protability. Interest
coverage assesses rms ability to cover their debt obligations
through their operating prots. The solvency ratio analyzes the
capital adequacy of the rms, whereas the long-term debt leverage
ratio takes into consideration the long-term debt burden of rms
relative to their equity. In addition to these nancial ratios, we also
take into account the sizes of rms, as measured by the logarithm
of their market capitalization (CAP), and a country risk indicator
Table 2
Percentage of sample observations in each risk category.
R1
R2
R3
R4
R5
Investment
Speculative
2002
2003
2004
2005
2006
2007
2008
2009
2010
2011
2012
5.3
5.9
7.8
7.1
7.4
7.2
5.7
4.3
4.0
4.0
5.4
31.6
30.4
31.3
28.6
24.4
23.9
24.3
25.9
24.8
23.2
27.2
39.5
43.1
39.1
39.7
43.7
45.7
42.1
41.0
41.6
43.7
45.7
18.4
12.7
13.0
16.7
20.0
19.6
22.9
20.1
22.1
23.2
15.2
5.3
7.8
8.7
7.9
4.4
3.6
5.0
8.6
7.4
6.0
6.5
76.3
79.4
78.3
75.4
75.6
76.8
72.1
71.2
70.5
70.9
78.3
23.7
20.6
21.7
24.6
24.4
23.2
27.9
28.8
29.5
29.1
21.7
Overall
5.8
26.3
42.4
19.0
6.5
74.5
25.5
R1 includes ratings from AA to AAA; R2 includes A, A, and A+; R3 consists of
BBB, BBB, BBB+; R4 includes BB, BB, BB+, R5 includes ratings from D to B+.
Speculative grades range from D to BB+, and investment grades range from BBB to
AAA.
(CRI) dened as a binary variable that takes the value of 1 for countries rated by S&P as AAA (in a given year) and 0 otherwise.
The DD measure from the BSM model is also employed, estimated from the daily stock prices of the rms in year t 1. The
risk-free rate for countries in the Eurozone is taken from the
three-month Euribor rate, whereas the three-month treasury-bill
and LIBOR rates are employed for the UK and Switzerland, respectively. The face value of debt is dened using current liabilities plus
half the long-term debt, in accordance with the KMVs implementation of the BSM model (Dwyer et al., 2004) and the arguments
and empirical results presented by Vassalou and Xing (2004).3
Finally, following simplications similar to those of Bharath and
Shumway (2008) and Li and Miu (2010), the annual equity returns
and the corresponding volatilities over year t 1 are used as proxies
of the expected return on assets (l) and its volatility (r), respectively. However, to avoid using a negative or near-zero return on
assets, we adopt the transformation of Hillegeist et al. (2004) and
set l maxfr f ; r E g, where rf is the risk-free rate and r E denotes
the annual equity return.
Table 3 presents the averages (over all years) of the nancial
variables and the DD indicator for each rating group in the sample.
All variables have a clear monotone (increasing) relationship with
the ratings. In particular, highly rated rms (e.g., rating group R1 )
are more protable, have higher interest coverage, are better capitalized and leveraged in terms of long-term debt, and have higher
market capitalization. The DD is also considerable larger for highly
rated rms than for low-rated ones. The differences between the
rating groups are statistically signicant at the 1% level for all variables under the KruskalWallis non-parametric test. The country
indicator was also found signicant at the 5% level according to
the v2 test (p 0:045).
The relationship between the selected variables and the rating
of the rms is also veried with Kendalls s rank correlation coefcient, as illustrated in Table 4. The results are reported both for
the ve rating categories dened above and for the full rating scale
of S&P. For comparison purposes, we also include the correlations
for the logarithm of total assets (TA) as an alternative size indicator
to market capitalization. Results for two alternative procedures for
estimating the DD measure are also reported, including the iterative
procedure of Vassalou and Xing (2004) and the model of Bharath
and Shumway (2008). These are denoted as DD-VX and DD-BS,
respectively. All correlations are found signicant at the 1% level.
The negative signs indicate that (as expected) all variables are
negatively related to credit risk (i.e., the higher the considered variables the lower the risk). Market capitalization has the strongest
correlation with the ratings. Total assets are also strongly associated with the ratings, but the correlation is considerably weaker
compared to capitalization, thus conrming that capitalization
3
Vassalou and Xing (2004) note that considering long-term debt in the BSM model
(even though a time horizon of one year is assumed) is important because rms make
interest payments in the short term for their long-term liabilities. Additionally, longterm debt enables rms to roll over their short-term liabilities. The authors also
conducted tests using different portions of long-term debt, and concluded that there
is no signicant variation in the results of the BSM model compared to the base
setting where half of the long-term liabilities are used.
ROA
EBIT/IE
EQ/TA
EQ/LTD
CAP
TA
DD
DD-VX
DD-BS
Five risk-grades
0.335
0.391
0.199
0.202
0.565
0.404
0.382
0.361
0.370
0.327
0.372
0.181
0.186
0.570
0.396
0.375
0.352
0.362
incorporates additional information when compared to accounting-based measures of a rms size, such as its total assets. Among
the three estimates of DD, the one obtained with the procedure
adopted in this study has the highest correlation with the ratings.
Finally, it is also worth noting the limited differences between the
ve-grade scheme and the full rating scale of S&P, which conrms
that there is no signicant information lost due to the adopted simplication of the rating scale.
4. Results
4.1. Empirical setting
Given that, during the time period spanned by the data, the
European economic environment has experienced signicant
changes (e.g., the outbreak of the global crisis and the European
sovereign debt crisis), we test the dynamics and robustness of
the results over time by developing and validating a series of models through a walk-forward approach. In particular, the data for the
period 20022005 are used rst for model tting, whereas the subsequent period, 20062012, serves as the holdout sample. In a second run, the training data are extended up to 2006, and the holdout
data span the period 20072012. The same process is repeated up
to the case where the training data cover the period 20022010.
Thus, six training and test runs are performed. Henceforth, these
walk-forward runs will be referred to as F05 (model tting on data
up to 2005) up to F10 (model tting on data up to 2010).
In each run of the walk-forward approach, four models are
developed. The rst model (henceforth referred to as M1) is based
solely on the four nancial ratios and the country indicator. The
second model (M2) additionally considers the DD. A third model
(M3) extends M1 by adding capitalization, whereas the most comprehensive model (M4) combines all independent attributes. The
consideration of these four models provides a decomposition of
the results, covering both the information content of the market
model compared to accounting ratios, as well as its explanatory
power as opposed to the size of the rms, which was found to be
the strongest predictor.
In the section below, we analyze the results obtained using the
ve-point rating classication described in the previous section. As
a robustness test, the two-class investment-speculative scheme
will be discussed in a separate section.
4.2. Overall results
Table 5 summarizes the trade-offs of all variables in the models,
averaged over all six tests of the walk-forward approach. As a measure of the variability of the results over the six tests, the coefcients of variation are also reported (in parentheses).
Under the two models that do not consider the capitalization of
rms (i.e., models M1 and M2), ROA and interest coverage (EBIT/IE)
are the strongest variables, followed by solvency (EQ/TA), whereas
the country risk rating indicator appears to be practically irrelevant
603
604
Table 5
Trade-offs (in %) of the independent variables (averages over all years with coefcients of variation in parentheses).
ROA
EBIT/IE
EQ/TA
EQ/LTD
CI
DD
CAP
M1
40.17
(0.04)
31.37
(0.13)
18.34
(0.17)
8.26
(0.28)
1.86
(0.09)
M2
27.13
(0.09)
27.20
(0.16)
13.71
(0.18)
6.89
(0.24)
1.67
(0.07)
23.39
(0.11)
M3
19.98
(0.16)
15.49
(0.11)
9.08
(0.06)
8.38
(0.22)
1.03
(0.08)
46.03
(0.05)
M4
15.12
(0.20)
13.43
(0.13)
7.13
(0.12)
7.80
(0.22)
0.93
(0.13)
10.70
(0.09)
44.89
(0.05)
50%
45%
ROA
40%
35%
30%
25%
20%
15%
10%
EBIT/IE
5%
F05
F06
M1
F07
F08
M2
25%
F09
M3
F10
F05
M4
M1
F06
13%
EQ/TA
20%
11%
15%
9%
10%
7%
F07
F08
M2
F09
M3
F10
M4
EQ/LTD
5%
5%
F05
F06
M1
F07
M2
29%
F08
F09
M3
F10
F05
M4
M1
F06
M2
50%
DD
23%
48%
17%
45%
11%
43%
F07
F08
F09
M3
F10
M4
CAP
40%
5%
F05
F06
F07
M2
F08
F09
F10
M4
F05
F06
F07
M3
F08
F09
F10
M4
solvency) may have been the actual reason for the increase in the
relative importance of the solvency ratio that is observed in models
M1 and M2. However, after controlling for the market capitalization of the rms, the effect of solvency becomes weaker and appears to be steady over time.
Table 6 presents the results of the comparison of the classication accuracies of all models in the holdout samples.4 The rows in
4
The classication accuracy of a rating model is dened as the ratio between the
number of correct rating assignments by the model (i.e., cases where the ratings
estimated by the model coincide with the actual ratings) to the total number of
observations in the sample.
the two panels correspond to the six walk-forward tests, while the
columns correspond to the years in the holdout samples. For each
year in the holdout samples, Panel A presents the differences between the classication accuracy of model M2 as opposed to model
M1, whereas Panel B presents the same information for the comparison of M4 to M3. The two last columns of the table present the overall classication accuracies of the models for the full holdout panel
data.
Both models M1 and M2 clearly perform poorly, as their classication accuracies are below 50%. The accuracies for model M1,
which only considers the four nancial ratios and the country indicator, range from 37% to 40%. The introduction of DD into model
605
F05
F06
F07
F08
F09
F10
Overall accur.
2006
2007
2008
2.22
2.90
6.52
0.71
2.14
2.14
2009
2.88
2.16
0.72
0.00
2010
2011
2012
M1
M2
4.03
3.36
0.67
3.36
4.70
4.64
5.30
4.64
3.97
5.30
5.96
11.96
14.13
9.78
14.13
10.87
8.70
38.77
37.70
37.11
39.17
40.05
39.92
41.74
42.03
39.34
43.69
46.43
46.91
M3
M4
54.03
57.97
56.18
56.87
58.42
60.91
55.40
58.71
56.93
58.00
60.20
62.14
Panel B: M4 M3 differences
F05
F06
F07
F08
F09
F10
0.74
5.07
1.45
0.71
0.71
0.00
1.44
0.72
2.16
0.72
1.34
1.34
0.67
2.68
2.68
2.65
3.97
4.64
2.65
3.31
0.66
98%
Table 7
Average classication matrix for model M4.
R1
M1
M2
M3
M4
95%
92%
Actual rating
R1
90.23
R2
17.53
R3
1.49
R4
0.00
R5
0.00
2.17
2.17
0.00
1.09
2.17
2.17
R2
R3
R4
R5
6.32
48.09
12.60
1.60
0.00
3.45
31.35
57.82
27.67
2.17
0.00
3.03
26.94
60.43
35.65
0.00
0.00
1.16
10.29
62.17
Percentage of sample observations with actual rating Ri (rows) that are classied by
the model in class Rj (columns).
89%
86%
83%
80%
F05
F06
F07
F08
F09
F10
606
Table 8
Average trade-offs (in %) of the independent variables (coefcients of variation in parentheses) under the two-class setting.
ROA
EBIT/IE
EQ/TA
EQ/LTD
CI
DD
CAP
M1
36.38
(0.15)
31.85
(0.16)
20.51
(0.36)
9.47
(0.23)
1.78
(0.09)
M2
23.03
(0.23)
25.26
(0.28)
14.91
(0.36)
6.79
(0.22)
1.76
(0.21)
28.24
(0.25)
M3
14.78
(0.27)
18.24
(0.13)
11.94
(0.44)
7.32
(0.10)
1.48
(0.44)
46.24
(0.03)
M4
11.26
(0.22)
15.50
(0.18)
10.34
(0.46)
6.41
(0.12)
1.31
(0.44)
10.94
(0.19)
44.24
(0.03)
85%
M1
M2
M3
91%
M4
M1
M2
M3
M4
88%
80%
AUROC
Accuracy
85%
75%
70%
82%
79%
76%
65%
73%
60%
70%
F05
F06
F07
F08
F09
F10
F05
F06
F07
F08
F09
F10
Fig. 4. Classication accuracy and AUROC for the two-class models (averaged over all years).
References
Ak, Z., Arad, O., Galil, K., 2012. Using Merton Model: An Empirical Assessment of
Alternatives. Tech. Rep., Ben Gurion University, Israel.
Agarwal, V., Tafer, R., 2008. Comparing the performance of market-based and
accounting-based bankruptcy prediction models. J. Bank. Finance 32 (8), 1541
1551.
Alali, F., Anandarajan, A., Jiang, W., 2012. The effect of corporate governance on
rms credit ratings: further evidence using governance score in the United
States. Account. Finance 52 (2), 291312.
Baker, M., Wurgler, J., 2002. Market timing and capital structure. J. Finance 57 (1),
132.
Bharath, S.T., Shumway, T., 2008. Forecasting default with the Merton distance to
default model. Rev. Financ. Stud. 21 (3), 13391369.
Black, F., Scholes, M., 1973. The pricing of options and corporate liabilities. J. Polit.
Econ. 81 (3), 637659.
Charitou, A., Dionysiou, D., Lambertides, N., Trigeorgis, L., 2013. Alternative
bankruptcy prediction models using option-pricing theory. J. Bank. Finance 37
(7), 23292341.
Chen, L., Zhao, X., 2006. On the relation between the market-to-book ratio, growth
opportunity, and leverage ratio. Finance Res. Lett. 3 (4), 253266.
Cheng, M., Neamtiu, M., 2009. An empirical analysis of changes in credit rating
properties: timeliness, accuracy and volatility. J. Account. Econ. 47 (12), 108130.
Das, S.R., Hanouna, P., Sarin, A., 2009. Accounting-based versus market-based crosssectional models of CDS spreads. J. Bank. Finance 33 (4), 719730.
Doumpos, M., Pasiouras, F., 2005. Developing and testing models for replicating
credit ratings: a multicriteria approach. Comput. Econ. 25 (4), 327341.
Doumpos, M., Zopounidis, C., 2002. Multicriteria Decision Aid Classication
Methods. Springer, New York.
607