Академический Документы
Профессиональный Документы
Культура Документы
How Well Does the Vasicek-Basel AIRB Model Fit the Data?
Evidence from a Long Time Series of Corporate Credit
Ratings Data
by
Paul H. Kupiec
-
Preliminary
September 2009
EXTENDED ABSTRACT
The Basel II AIRB framework uses Vasiceks asymptotic single factor model to set minimum
regulatory capital requirements for bank credit risk. I develop an estimation approach that
produces consistent estimates of the parameters in the Vasicek model as well as consistent
estimates of the common macro factor realizations that drive credit defaults. The model is
estimated using Moodys data on the default rates of rated corporate credits over the period 1920-
2008. Model fit is assessed using robust statistics and small samples estimation issues are
examined. The treatment of observations with recorded default rates of zero is identified as an
important estimation issue that requires further study.
The Vasicek-Basel II AIRB default model is not capable of reproducing the observed variation in
Moodys corporate default rate data. The analysis shows that the true correlation among default
patterns for different credit grades is inconsistent with the Vasicek single common factor model
specification. Over the period 1920-2008, corporate rated credits either do not default or default
at much higher frequencies than are predicted by the model. In contrast to the models
assumptions, the macro factor that drives corporate defaults exhibits strong positive
autocorrelation. Observed credit cycles last many years on average. As a consequence, long time
series are required to produce reliable model parameter estimates. It is impossible, for example,
to produce reliable estimates of unconditional probability of default from sample as short as 5- or
10-years. In contrast to the Basel AIRB framework, highly-rated corporate credits exhibit much
higher default rate correlations compared to lower-rated credits.
I use the econometric model to develop procedures to correct for business cycle effects when
estimating unconditional default rates when a credit class has only a limited sample of default
data. These procedures are useful not only for estimation, but also for back testing or attempting
to validate a banks Basel AIRB parameter assignments.
-
Federal Deposit Insurance Corporation. The views expressed are those of the author and do not
reflect the views of the FDIC. I am grateful to Ed Kane and Matt Pritsker for comments on an
early draft of thisstudy. Email: pkupiec@fdic.gov
2
HOW WELL DOES THE VASICEK-BASEL AIRB MODEL FIT THE DATA?
EVIDENCE FROM A LONG TIME SERIES OF CORPORATE CREDIT RATINGS DATA
I. INTRODUCTION
The Basel II Advanced Internal Ratings-Based (AIRB) framework is used to set
minimum regulatory capital requirements of the largest, most sophisticated
internationally-active banks. For example, the Financial Stability Institute (2006) reports
that 95 countries plan to implement Basel II by 2015, and more than 60 percent of the
Basel II adopters plan on including the AIRB option for credit risk capital requirements.
1
The AIRB regulatory framework uses an asymptotic version of Vasiceks (1987)
portfolio credit loss model to approximate the annual default rate distributions on
portfolios of credits that are differentiated by a bank-assigned credit rating. The AIRB
framework uses the Vasicek default rate distribution and estimates of loss given default
(LGD) and exposure at default (EAD) to approximate the credit loss distribution for each
credit grade portfolio.
2
Regulatory capital requirements are then set equal to the 99.9
percent upper-tail critical value of a credit grades potential portfolio loss distribution.
Much has been written about Basel II, but few if any studies have formally
analyzed how well the Basel II Advanced Rating Based (AIRB) model fits credit default
data produced by portfolios of credits categorized under a consistent credit rating system.
This paper develops a new approach for estimating the parameters of the Basel AIRB
1
The 60 percent figure includes both Basel Committee members and nonmember
countries that plan on adopting the AIRB approach as an option for credit risk regulatory
capital calculations.
2
The LGD and EAD estimates are not part of the Vasicek model. They are estimated
independently of the parameters of the Vasicek model.
3
model and new techniques for assessing the model fit relative to historical default rate
data on corporate credits rated by Moodys Investors Services.
The approach I propose uses panel regression methods to estimate the Vasicek-
Basel AIRB model parameters using time series data on a cross section of failure rates
from a consistent credit rating system. The methodology produces consistent estimates
of the unconditional default rates associated with each credit grade by correcting for
business cycle effects that are modeled using a single common factor. A consistent
estimate of the AIRB correlation parameter is derived directly from the default rate data.
The methodology also produces estimates of the common macro factor that is assumed to
drive default correlations.
3
This common factor can be used to control for
macroeconomic effects when circumstances require an estimate of the unconditional
default rates for additional credit grades for which only brief time series histories are
available.
The ability to identify the common market factor and to control for its impact on
observed default rate realizations is particularly useful for back testing and model
validation analysis when true default rates (and thus the common market factor
realizations) are autocorrelated. The methodology allows the researcher to estimate or
test the unconditional default rate associated with a rating grade while correcting for the
sample dependent common factor realization. This approach offers a significant
3
The methodology can also be used to estimate multiple correlation parameters if the
data includes a sufficiently large number of credit grades.
4
improvement over tests that ignore time dependence in default rate realizations and treat
sample default rate estimates as unconditional mean default rate values.
4
The proposed estimation methodology is implemented using default rate data
from Moodys Investors Services on rated corporate bond issues from 1920-2008. The
data are used to derive consistent estimates for the unconditional default rates associated
with the Aa, A, Baa, Ba, B and CaaC rating categories as well as for the Vasicek default
correlation parameter.
The estimation exercise highlights the importance of zero default rate
observations in historical data. Under the asymptotic Vasicek-Basel AIRB model
assumptions, zero default rates should almost never occur, and yet one-third of the
Moodys sample reports a zero annual default rate. A reported default rate of zero can
arise because a portfolio is not truly asymptotic and so the reported default rate is
downward biased as a consequence of measurement error. Estimates can be adjusted to
account for a reasonable upper bound on the magnitude of the measurement error
associated with zero default rate observations, but the resulting model parameter
estimates are very sensitive to the treatment accorded zero default rate observations.
An important issue related to model parameter estimation is the behavior of
unconditional default parameter estimates from small samples. Because of the assumed
importance of the macro factor, default realizations are driven by market conditions
which must be controlled for when estimating the unconditional default rates associated
with a rating grade. While the Vasicek model specification assumes that the macro factor
4
Unconditional default rate tests include those proposed by Cantor and Falkenstein
(undated copy), Pluto and Tasche (undated copy), Cantor Hamilton and Tennant (2007)
and Schuermann and Hanson (2004).
5
realizations are independent across time but in reality they have strong positive
autocorrelation. This positive autocorrelation makes it impossible to estimate
unconditional default rates from small samples unless the common factor effects are
properly controlled for in the estimation process using estimates of the common factor
realizations derived from external data.
The econometric model specification provides an intuitive process for correcting
for the aforementioned small sample bias in unconditional default rates. If consistent
estimates of the Vasicek common factor realizations and correlation parameter can be
recovered from a long time series panel data set of default rates, these estimates can be
used to construct consistent estimates of the unconditional default rates for auxiliary
rating grades when the new credit class has only a limited sample realized default rates. I
derive the algorithm to make these adjustments and demonstrate the technique for default
data on Moodys alpha-numeric rating scale over the period 2001-2008.
The next section reviews the Vasicek-Basel AIRB portfolio default rate model.
Section III discusses the new technique for estimating the Vasicek-Basel model
parameters. Section IV discusses important estimation issues that arise because observed
default rates exhibit strong positive correlation. Section V discusses the Moodys
corporate default rate data. Section VI discusses specific estimation issues that arise
because of the prevalence of zero default rates in the data. Section VII discusses the
model parameter estimates and parameter test statistics. Section VIII analyzes the small
sample estimation bias problem. Section IX discusses the small sample correction
algorithm that controls for the common factor realizations and produces consistent
estimates. A final section summarizes the results and concludes.
6
II. THE VASICEK PORTFOLIO CREDIT LOSS DISTRIBUTION MODEL
The Gaussian single factor model of portfolio credit losses (a.k.a. the Vasicek
model), developed by Vasicek (1987), Finger (1999), Schnbucher (2000), Gordy (2003)
and others, provides an approximation for the distribution of the default rate on a well-
diversified credit portfolio. The Vasicek model is a default-mode model meaning that all
credits are assumed to either perform or default within the models risk measurement
horizon. The asymptotic version of Vasicek model focuses on a large diversified
portfolio in which idiosyncratic risk is fully diversified and the only source of portfolio
loss uncertainty is the default rate that is driven by the common latent Gaussian factor.
5
The model measures the aggregate value of the losses generated by defaulting credits and
the income earned on non-defaulting credits is not recognized.
6
The Vasicek model assumes that uncertainty on credit i is driven by a latent
unobserved factor,
i
V
~
, with the following properties:
. , , 0 )
~ ~
( )
~ ~
(
), ( ~
) ( ~
~
~
1
~
~
j i e e E e e E
e e
e e
e e V
j M j i
i id
M M
i V M V i
= =
+ =
|
|
(1)
where
) ( |
represents the standard normal density function.
i
V
~
is distributed standard
normal, ( ) , 0
~
=
i
V E and ( ) ( ) ( ) . 1
~ ~ ~ 2
2 2
= =
i i i
V E V E V o
i
V
~
is often interpreted as a proxy for
5
The model assumes the unconditional probability of default, exposure at default, and
loss rates in default (LGD) are known non-stochastic quantities for all obligors.
6
Kupiec (2006, 2007) develops a more general version of the Vasicek model in which
interest income on defaulting credits is recognized and offsets losses on defaulting
credits.
7
the market value of the firm that issued credit . i The common factor,
M
e
~
, induces
correlation between individual credit latent factor realizations,
( )
( ) ( )
.
~ ~
~
,
~
j i
j i
V
V V
V V Cov
o o
=
Credit i is assumed to default when its latent factor takes on a value less than a
credit-specific threshold,
i i
D V <
~
. The unconditional probability that credit i defaults is
( ),
i
D PD u = where ( ) u represents the cumulative standard normal density function.
Time is not an independent factor in this model, but is implicitly recognized through the
calibration of input values for PD.
The loss rate on a portfolio of credits that have identical correlations, , and
default thresholds, , D D
i
= is determined as follows. Define a default indicator function
for each credit,
<
=
otherwise
D V if
I
i
i
0
~
1 ~
(2)
i
I
~
has a binomial distribution with an expected value of ( ). D u Define X
~
to be the
proportion of credits in the portfolio that default,
n
I
X
i
n
i
~
~
1
=
= .
In an asymptotic portfolio, the number of individual credits is assumed to increase
without bound, . n In the limit, idiosyncratic risks are completely diversified within
the portfolio and portfolio default rate uncertainty is driven by the common factor alone.
The unconditional distribution function of X
~
, the asymptotic portfolios default rate, is
given by,
| |
( ) ( )
| | 1 , 0 ,
1 ~
Pr
1 1
e
|
|
.
|
\
|
u u
u = s
x
PD x
x X
(3)
Conditional on a specific draw of the common factor,
M M
e e =
~
, the conditional
value of the default indicator for a single credit is,
8
| |
<
= =
otherwise
e
e D
if
e e I
i
M
M M i
0
~
1
1
~
|
~
(4)
When the correlation and default thresholds are identical for all credits in the portfolio,
the conditional indictor functions assigned for each credit are independent and identically
distributed. As the number of credits in the portfolio increase without bound, conditional
on a value of
M
e , the portfolio default rate distribution converges (almost surely) to a
non-random value that depends on
M
e ,
| | | |
( )
|
|
.
|
\
|
u
u =
|
|
.
|
\
|
u =
=
(
(
(
(
=
= =
1 1
~
|
~
~
|
~
lim
~
|
~
lim
1
.
1
M M
M M i
s a
M M
n
i
i
n
M M
n
e PD e D
e e I E
n
e e I
e e X
. (5)
Equation (5) implies, conditional on a specific realization of the common factor, an
asymptotic portfolios default rate is a non-random value fully determined by two
parameters, the credits unconditional default rate, PD, and the Vasicek correlation
parameter, .
Intuition can be gained by examining a simulated time series of default rates on a
set of asymptotic portfolios generated by equation (5). Consider a simulated times series
of default rates generated by four portfolios, each representing a different credit grade
within a rating system. These portfolios are distinguished by their credits unconditional
probabilities of default which are assumed to be: ( , 00 . 1 3 , 70 . 2 , 25 . 0 1 = = = P P P
and 00 . 2 4 = P ). All of portfolios are assumed to have an identical correlation parameter,
20 . 0 = .
9
Figure 1: Simulated Time Series of Default Rates on Four
Asymptotic Portfolios
0.00%
1.00%
2.00%
3.00%
4.00%
5.00%
6.00%
7.00%
8.00%
1 3 5 7 9
1
1
1
3
1
5
1
7
1
9
2
1
2
3
2
5
2
7
2
9
3
1
3
3
3
5
3
7
3
9
4
1
4
3
4
5
4
7
4
9
time
P
o
r
t
f
o
l
i
o
D
e
f
a
u
l
t
R
a
t
e
P1
P2
P3
P4
Figure 1 plots simulated times series of default rates on these four hypothetical
asymptotic portfolios. The plots show that the default rates on these portfolios are very
highly correlated because the idiosyncratic risk of default is completely diversified and
the default rate is driven by only a single common factor. Table 1 reports the sample
default rate correlations. The Vasicek model correlation parameter is 20 percent for all
credits and yet the portfolio default rate realizations are nearly perfectly correlated.
7
Figure 1 and Table 1 show that, under the asymptotic Vasicek model, realized portfolio
default rates will be nearly perfectly correlated regardless of the magnitude of the
correlation parameter .
8
7
The default rates would be exactly perfectly correlated except equation (5) applies
different non-linear transformations to the common Gaussian term .
M
e
8
Somewhat paradoxically, Table A1 in the Appendix shows that the correlations among
portfolio default rates actually decline as the Vasicek correlation parameter, , increases.
10
P1 P2 P3 P4
P1 1.000 0.996 0.992 0.977
P2 1.000 0.999 0.992
P3 1.000 0.996
P4 1.000
Table 1: Simulated Asymptotic Portfolio Sample Default Rate
Correlations
III. ESTIMATION OF THE ASYMPTOTIC VASICEK MODEL PARAMETERS
The parameters of the asymptotic Vasicek portfolio model can be consistently
estimated using panel data regression techniques. I adopt the common practice of
identifying a credit rating or grade with its unconditional probability of default. Equation
(5) implies that the default rate realization on an asymptotic portfolio of credits from an
identical grade is a nonlinear function of the unconditional default rate and correlation
parameters of its constituent credits.
Let
jt
P represent the realized default rate on portfolio
j
P in year t. Equation (5)
implies,
( )
Mt
j
jt
e
PD
P
u
= u
1 1
) (
1
1
(6)
Where
j
PD is the unconditional probability of default for a credit in rating category j ;
Mt
e is the realized value of the unobserved common Gaussian factor; and , , , 1 M j =
represent M individual credit rating categories.
Equation (6) is consistent with the theoretical predictions of the asymptotic
Vasick default rate model, but in reality, observed default rates my deviate from their
theoretically predicted value by some mean-zero error term,
it
c
~
. To complete the
11
empirical model, I assume that error terms are independent and identically distributed
across time, and uncorrelated cross-sectionally,
( )
( )
( )
2 2
~
0 )
~ ~
(
, , , 3 , 2 , 1 , 0
~ ~
0 )
~ ~
(
, 0
i it
jt it
k jt it
jt it
it
E
i j E
j and k for E
i j E
t i E
o c
c c
c c
c c
c
=
= =
= =
= =
=
+
(7)
The final condition allows each credit grade to be characterized by a different error
variance. Recognizing the possibility of the mean-zero model error, the empirical
specification of the Vasicek model is,
( )
it Mt
A
jt
e
PD
P c
~
1 1
)
~
(
1
1
+
u
= u
(8)
Consistent Estimation of the Transformed Model Parameters
Define ) (
1
jt jt
P y
u = ;
jt
y is the observed portfolio default rate transformed
through the inverse normal distribution function. Unlike the observed default rate,
jt
y is
not bounded between 0 and 1, but is instead a continuous variable in the range . Let
( )
u
=
1
1
j
j
PD
a ;
j
a is a time-invariant constant determined by the characteristic credit
class s j' unconditional probability of default and the Vasicek correlation parameter.
Define
Mt t
e b
=
1
;
t
b is a scalar multiple of the common Gaussian factor
realization. Notice that the scalar multiple for
Mt
e is not dependent of the asymptotic
portfolios credit rating but only depends of the credits correlation parameter. Using
these definitions, equation (8) can be written in more simplified notation as,
12
jt t j jt
b a y c
~ ~
+ + = (9)
Under the model assumptions, the parameter
j
a can be consistently estimated as
the sample average from a time series of default rate realizations on the credits in rating
category j . The sample average estimate is,
j
T
t
jt
T
t
t
j
T
t
jt
a
T T
b
a
T
y
~ ~
1 1 1
= + + =
= = =
c
(10)
as , 0
.
1
=
s a
T
t
t
T
b
and 0
~
.
1
=
s a
T
t
jt
T
c
under the model assumptions.
A consistent estimate for
t
b can be derived from the average of the residuals from
a cross section of credit grades after each of the observed default rates are corrected for a
time series estimate of its constant term,
( )
t
M
j
jt
t
M
j
j jt
b
M
b
M
a y
1 1
= + =
= =
c
. (11)
The model assumptions require 0
. .
1
=
s a
M
j
jt
M
c
as the number of independent credit
grades becomes large.
Ignoring the precision of the constant parameter estimates ( ) ' s a
j
, the precision
of the scaled macro factor estimates will depend on the number of independent credit
grades included in a cross section as well as the magnitude of the model error variance
associated each credit grade. Smaller model error variances ( M i
i
, , 3 , 2 , 1 ,
2
= o ) and a
larger number of cross sections will improve the precision on the scaled macro factor
13
estimates. If the model fits the data from each credit category poorly (large residual
variances) and there are few independent credit grades, the estimates cannot be expected
to provide a very accurate representation of the actual macro factor realizations. I will
provide a more detailed analysis of the small sample properties of the common factor
estimates from the model in a subsequent section.
The empirical model can be written in a panel regression format. To simplify the
notation and maintain clarity, I write the model in terms of three credit grades but the
notation generalizes to include additional credit grades. Let ( )
it it it it
D D D X 3 2 1 = be
a vector of selection covariates that indicate membership in a credit grade (rating
category); for example, if
it
y is the transformed default rate associate with credit grade 1,
t it
y y
1
= and ( ) 0 0 1
1
=
t
X . Similarly, if
it
y is the transformed default rate associated
with credit grade 2,
t it
y y
2
= and ( ) 0 1 0
2
=
t
X . Define ) 2 1 (
2 it i it it
T t t t t = to
be a selection matrix that identifies the year associated with observation
it
y . For example,
when
1 i it
y y = , a default rate observation from year 1, ) 0 0 0 0 1 (
1
=
i
t ; when
the observation is from year 3, ) 0 0 1 0 0 (
3
=
i
t . Using this notation, an
empirical model for a generic portfolio default rates observation is,
( ) ( )
it
T
it T
T
it it
b b b b X a a a y c t
~
3 2 1 3 2 1
+ + = (12)
where
it
c
~
is the residual term.
Normally, in an analysis of covariance setting, the parameters in equation (12)
would be estimated using least squares (OLS) or weighted least squares after dropping a
14
dummy variable for one year.
9
In the present case, we can make use of an additional
model restriction to identify a complete set of individual year effects. The Vasicek model
assumes that the common Gaussian factor is a standard normal variable. This assumption
imposes a restriction that the average time effect is 0. This additional assumption allows
for the consistent estimation of all of the model parameters by estimating equation (12)
under the restriction, . 0
1
=
=
T
t
t
b The efficiency of model parameter estimates can be
improved by correcting heteroskedasticity using generalized least squares, as the residual
variances differ among rating classes.
Estimation of the Correlation and Common Factor Realizations
Restricted OLS provides consistent estimates of the models parameters, including
the rating-grade dependent intercepts and consistent estimates of the individual time
effects, . , , 3 , 2 , 1
T T b
t
= Because the Gaussian common factor, ,
~
Mt
e in the Vasicek
model has a standard deviation of 1 by assumption and the
t
b
=
T
t
t
b
T
1
2
1
is a consistent estimator for,
1
. It follows that the consistent estimators
for the
Mt
e series are given by ,
=
=
T
t
t
t
Mt
b
T
b
e
1
) , , 3 , 2 , 1 ( T T = and
=
=
+
=
T
t
t
T
t
t
b
T
b
T
1
2
1
2
1
1
1
is a consistent estimator for . Finally, ( ) 1 u
i
a is a consistent estimator of the
unconditional probability of default for rating grade , 3 , 2 , 1 , = i i and so all the
9
Weighted least squares could correct for heteroskedasticity if the variance of the error
term was different for each rating grade.
15
underlying parameters of the Vasicek model are identified using this panel regression
approach to model estimation.
IV. AUTOCORRELATION ISSUES
It is intuitively reasonable to think that, should the model be estimated using a
long time series of portfolio default rate data, the economic effect of the common factor
on the default rate should average out over the sample since the Vasicek model assumes
that the common factors average effect is 0. The longer the time series, the better the
sample average of the unobserved common factor realizations should approximate 0
because, under model assumptions,
Mt
e
~
are independent identically distributed random
variables with mean 0. In reality, observed default rates are positively correlated across
time which may be indicative of positive autocorrelation in the common factor
realizations.
The Vasicek model structure does not include the possibility of business cycles in
the data as there is no autocorrelation or time-series structure in the common Gaussian
factor specification. If credit ratings are updated annually to project a constant
conditional default rate for each credit grade, and these implicit performance forecasts are
efficient, there is no reason to expect autocorrelation in the deviations from the credit
grades unconditional default rate. These arguments notwithstanding, observed default
rate data on rated corporate bonds exhibit positive autocorrelation; corporate default rates
exhibit clear evidence of credit cycles. Table 2 reports first-order autocorrelation
estimates for the annual default rates reported on selected corporate credits rated by
Moodys Investors Services and for the transformed default rates using the inverse
16
cumulative normal distribution. These estimates show evidence of strong positive
autocorrelation for the realized default rates and transformed default rates for all credits
except those rated Aa by Moodys.
intercept p-value p-value R
2
Aa default rate 0.055 0.005 -0.079 0.470 0.006
A default rate 0.049 0.077 0.412 <.001 0.172
Baa default rate 0.140 0.010 0.452 <.001 0.208
Ba default rate 0.506 0.007 0.511 <.001 0.261
B default rate 1.697 0.002 0.520 <.001 0.270
CaaC default rate 9.477 <.001 0.303 0.005 0.091
(Aa default rate) -3.860 <.001 -0.078 0.491 0.006
(A default rate) -1.170 <.001 0.666 <.001 0.444
(Baa default rate) -1.715 <.001 0.478 <.001 0.231
(Ba default rate) -1.031 <.001 0.628 <.001 0.398
(B default rate) -1.176 <.001 0.494 <.001 0.245
(CaaC default rate) -1.218 <.001 0.314 0.003 0.100
Estimates are based on Moody's Corporate Default Rate Data, 1920-2008. Default rates
are the number of defaults in the year following a Moody's rating designation divided by
the number of rated credits in a credit grade. Default rates are measured as perc
Table 2: Credit Cycles in the Realized Default Rates on Rated Corporate Credits
lagged
dependent
variable dependent variable
There is little published literature on the properties of credit cycles, e.g., their
average length, symmetry, amplitude of boom and bust phases, or other features. The
AR 1 default rate model estimates reported in Table 2 suggest that shocks to default rates
have lingering measurable effects for about three years, but the simple specification may
be too simplistic to fully capture the full dynamics of default rate behavior.
While the Vasicek model does not recognize the possibility of autocorrelation in
realized default rates, if the model is estimated over a long time series, even if default
rates are autocorrelated, the effect of credit cycles will average out to 0, and the
identifying restriction 0
1
=
=
T
t
t
b will be appropriate. Consider a case in which a sample
includes multiple complete cycles and a partial cycle. Estimating equation (12) while
17
imposing the restriction 0
1
=
=
T
t
t
b will result in biased estimates of the time effects as the
actual sample will not be balanced between positive and negative deviations from the
unconditional mean so the restriction is inappropriate. The effect of partial cycles in a
sample will diminish as the sample time series lengthens and includes more and more
complete cycles. In the limit, even if there are credit cycles in the default data, the
restriction 0
1
=
=
T
t
t
b will hold exactly as . T
The importance of using a long time series to estimate equation (12) can be
illustrated in the context of a simple autoregressive model. Let
t
R
~
represent a time series
generated by a stationary autoregressive process,
. 1 ), , 0 ( ~
~
,
~
~
1 1 1 0
< + + =
o o | o o
t t t t
u u R R (13)
The unconditional sample mean of the process is,
1
0
1 o
o
= =
+ + =
1
1
1
1 0
1
~
o o .
As the sample size T becomes large, , 0
~
.
1
=
s a
T
t
t
T
u
and
T
R
T
R
T
t
t
s a
T
t
t
=
=
1
.
1
1
. Thus, as
as , T
1
0
. .
1
1 o
o
=
s a
t
T
t
T
R
.
In equation (12), transformed default rates are equal to an unconditional mean
value plus a mean zero independent identically distributed innovation that includes the
effect of a macroeconomic factor. What happens if true transformed default rates follow
18
a stationary autoregressive process similar to equation (13) but we estimate equation (12)
without accounting for autocorrelation? Let
t
R represent the transformed autocorrelated
default rate series and define
t
v to be the realized deviations from the sample mean,
T
R
R v
T
t
t
t t
=
=
1
. In any sample, OLS estimation will ensure
=
=
T
t
t
v
1
0 by construction.
The prior discussion established that, in small samples,
T
R
T
t
t
=1
~
is biased estimator of
1
0
1 o
o
1
= =
=
j
b
T
b
e
T
t
t
j
j M
Figure 4 plots the mean and 90 percent probability bounds
22
of the sampling
distribution for the common factor realizations based on a bootstrap of 5000 paired
replications. Recall that, under the Vasicek-AIRB model, positive values of the common
22
The 90 percent probability bound is comprised of the 5- and the 95-percentile levels of
the of the common factors estimated sampling distribution.
38
factor are associated with low portfolio default rates whereas negative common factor
draws are associated with large default rates.
Estimates based on Moody's annual default rate data on rated corporate credits, 1920-2008. Zero default rate
observations are truncated to 10 basis points. The sampling distribution is calculated from 5000 paired bootstrap
replications. The pink line is the 95th percentile of the sampling distribution. The blue line is the mean of the
sampling distribution. The yellow line is the 5th percentile of the sampling distribution.
Figure 4: Sampling Distribution for the Vasicek Model
Common Factor Estimates, 1920-2008
-3
-2.5
-2
-1.5
-1
-0.5
0
0.5
1
1.5
2
1
9
2
0
1
9
2
3
1
9
2
6
1
9
2
9
1
9
3
2
1
9
3
5
1
9
3
8
1
9
4
1
1
9
4
4
1
9
4
7
1
9
5
0
1
9
5
3
1
9
5
6
1
9
5
9
1
9
6
2
1
9
6
5
1
9
6
8
1
9
7
1
1
9
7
4
1
9
7
7
1
9
8
0
1
9
8
3
1
9
8
6
1
9
8
9
1
9
9
2
1
9
9
5
1
9
9
8
2
0
0
1
2
0
0
4
2
0
0
7
The macro factor estimated plotted in Figure 4 show a clear pattern of credit
cycles in the default rate data. Realized defaults rates were above average for most of the
1920s and 1930s. Realized default rates were below average for all but one year over the
period 1941 to 1969. This long credit cycle was followed by two shorter credit cycles in
addition to the current downturn. Default rates were above average in both the late 1980s
and the late 1990s and rose again beginning in 2007.
The sampling distribution estimates clearly show temporal dependence among the
common factor realizations as realized values of the factors are statistically positive or
statistically negative for extended periods of time. The Wald and Wolfowitz (1940) runs
test statistic for independence is 9.84 when calculated using the mean values of the
39
common factor sampling distributions. The common factor estimates violate a formal
nonparametric runs test for independence at any commonly used level of significance
and thus there is strong statistical evidence that the temporal independence assumption of
the Vasicek-AIRB model is miss-specified.
Aa, A, Baa Ba, B, CaaC
rated credits rated credits
mean 0.205399794 0.324310576
median 0.20525 0.3234
mode 0.22236 0.29813
std deviation 0.023963989 0.032735678
1 percent 0.15115 0.24829
5 percent 0.1669 0.2695
95 percent 0.24511 0.37813
99 percent 0.26049 0.40091
Distribution Percentiles
Table 11: Sampling Distributions for Vasicek
Correlation Parameter when High- and Low-
Quality are Modeled Separetely
Sampling distribution are based on a sample size of 75, with a
default rate lower bound of 1 basis point, and 5000 paired
sample bootstrap replications.
The bootstrap technique can also be used to test other aspects of the model
specification. The estimates reported thus far have imposed the restriction of a common
Vasicek correlation parameter across credit rating classes and this may not be
appropriate. Table11 reports summary statistics for the sampling distributions for the
Vasicek correlation parameter when the correlation parameter is allowed to differ
between investment grade (Aa, A, Baa) and sub-investment grade (Ba, B, CaaC)
groupings of the rating class portfolios. Figure 5 plots the full sampling distributions for
these alternative correlation parameter estimates based on 5000 replications of sample
40
size 75.
23
The statistics reported in Table 11 and the distributions plotted in Figure 5
impose a lower bound of 1 basis point on any default rate observations that are reported
to be 0 in the data.
The statistics reported in Table 6 and plots in Figure 5 show that the data is
consistent with at least two different correlation parameters, one for highly-rated credits,
and a different correlation parameter for lower-quality credits. The reported statistics
show that the correlation parameter for lower-quality credits is clearly greater than the
correlation parameter estimate that provides the best fit for investment grade credits.
Recall that the Basle II AIRB model imposes a regulatory correlation function in which
the correlation parameter decreases as a credits unconditional probability of default
increases. The sampling distribution plotted in the lower panel of Figure 5 is the
distribution for the difference in the correlation parameter estimates for low quality
credits and high quality credits. The sampling distribution for the difference in these
correlation parameters clearly shows that the regulatory correlation structure is
inconsistent with the data. The evidence from this Moodys data suggests that the
Vasicek model default correlation parameter increases in magnitude as credit quality
declines.
23
The sample of 75 observations represents data from 1920-2006, excluding 12 years in
which no default observations were recorded in any rating category. This correlation
section has not yet been updated from an earlier draft to include 2008 data, GLS
estimates on the full sample including the 0 default years and higher truncation
thresholds for 0 default rate observations
41
Figure 5: Sampling Distributions for Correlation Parameter Estimates
when Aa, A, Baa Credits may have a Different Correlation than Ba, B,
CaaC Credits
Sampling distribution estimates are based on an estimation sample size of 75 years of data and
5000 paired sample bootstrap replications. The lower bound on portfolio default rates are
assumed to be 1 basis point and years with no defaults on any rated credit
Correlation Estimate for Aa, A, Baa Rated Credits
0
1
2
3
4
5
0
.
1
0
.
1
3
0
.
1
5
0
.
1
6
0
.
1
8
0
.
1
9
0
.
2
0
.
2
2
0
.
2
3
0
.
2
5
0
.
2
6
0
.
2
8
0
.
3
2
0
.
4
4
p
r
o
b
a
b
i
l
i
t
y
(
%
)
Correlation Estimate for Ba, B, CaaC Rated Credits
0
1
2
3
4
5
0
.
1
0
.
2
1
0
.
2
2
0
.
2
4
0
.
2
6
0
.
2
8
0
.
3
0
.
3
2
0
.
3
4
0
.
3
6
0
.
3
8
0
.
3
9
0
.
4
1
p
r
o
b
a
b
i
l
i
t
y
(
%
)
Ba, B, CaaC Correlation Estimate Less Correlation
Estimate for Aa, A, Baa Rated Credits
0
1
2
3
4
5
6
0
.
0
1
0
.
0
2
0
.
0
4
0
.
0
5
0
.
0
7
0
.
0
9
0
.
1
0
.
1
2
0
.
1
3
0
.
1
5
0
.
1
6
0
.
1
8
0
.
1
9
0
.
2
1
p
r
o
b
a
b
i
l
i
t
y
(
%
)
Figures 6 and 7 provide some additional visual evidence regarding model fit.
Figure 6 plots the actual and predicted portfolio default rates for Moodys investment
42
grade credits. The investment grade data includes a large number of zero default rate
observations. When the investment grade credit portfolios experience higher default rates
(in the 1920s and 1930s) the model produces elevated default rate predictions, but many
of these predictions fall far short of the actual recorded default rates. In a period
beginning in the late 1960s, the model predicts an elevated level of investment grade
defaults that only materializes for Baa-rated credits, and here the models default rate
predictions are small relative to the default rates recoded on Baa-rated credits.
Figure 7 provides the actual and predicted portfolio default rates for Moodys
sub-investment grade credits. As the plots show, the Vasicek-Basle AIRB model fit
deteriorates for sub-investment grade credits. For the lowest rated credits, B and CaaC,
errors are large and concentrated in the period since the 1970s. The large error rates from
sub-investment grade credits likely owes at least in part to an inappropriate restriction on
this groups correlation parameter. The model estimates in Figures 6 and 7 impose a
uniform correlation parameter for all credit rating classes, whereas the evidence reported
in Table 6 and Figure 5 show that sub-investment grade credits have a larger correlation
parameter.
The model prediction errors depend in part on the truncation value selected for
zero default rate observations. In general, within a reasonable range of truncation values,
as the truncation value for zero default rate observations is increased, the unconditional
default rate estimates increase and the model correlation parameter estimate decreases.
While I have not done an exhaustive analysis of alternative truncation values, and the
best truncation value may be grade specific and depend on the number of obligors in each
credit grade, a uniform 20 basis point truncation value seems to be reasonable
43
compromise relative to an objective of minimizing root mean-squared prediction errors
across the credit grades.
Rating 20 bps 50bps
Aa 24.4 34.6
A 25.1 35.4
Baa 30.4 36.2
Ba 112.6 112.5
B 328.3 317.9
CaaC 1593.1 1563.7
Table 12: Model Prediction Error Rates
for Alternative Zero Default Rate
Tuncation Choices
RMSE for truncation value
RMSE is the root mean-square Vasicek model
prediction error measured in basis points using
Moody's Corporate default rate data, 1920-2008.
VIII. VASICEK-BASEL II AIRB PARAMETER ESTIMATION FROM SMALL SAMPLES
Few if any financial institutions have data on the default rate performance of their
internal rating systems over 89 years. Indeed because of data limitations on banks own
internal ratings system performance, the Basel II AIRB approach requires as little as five
years of data as the minimum acceptable sample length that a bank may use for
estimation of some AIRB model parameters. An important issue of regulatory concern
has been the quality of the parameter estimates that may be generated from such small
samples. It has long been appreciated that short time series on ratings system
performance are unlikely to include data from a full credit cycle.
44
Figure 6: Predicted and Actual Default Rates for Moody's Aa-, A- and Baa-Rated
Corporate Credits, 1920-2008, 20 basis points truncation
Moody's Aa Credits
0
0.005
0.01
0.015
0.02
0.025
1
9
2
0
1
9
2
4
1
9
2
8
1
9
3
2
1
9
3
6
1
9
4
0
1
9
4
4
1
9
4
8
1
9
5
2
1
9
5
6
1
9
6
0
1
9
6
4
1
9
6
8
1
9
7
2
1
9
7
6
1
9
8
0
1
9
8
4
1
9
8
8
1
9
9
2
1
9
9
6
2
0
0
0
2
0
0
4
2
0
0
8
actual
predicted
Moody's A Credits
0
0.005
0.01
0.015
0.02
0.025
0.03
1
9
2
0
1
9
2
4
1
9
2
8
1
9
3
2
1
9
3
6
1
9
4
0
1
9
4
4
1
9
4
8
1
9
5
2
1
9
5
6
1
9
6
0
1
9
6
4
1
9
6
8
1
9
7
2
1
9
7
6
1
9
8
0
1
9
8
4
1
9
8
8
1
9
9
2
1
9
9
6
2
0
0
0
2
0
0
4
2
0
0
8
Moody's Baa Credits
0
0.005
0.01
0.015
0.02
0.025
0.03
0.035
0.04
1
9
2
0
1
9
2
4
1
9
2
8
1
9
3
2
1
9
3
6
1
9
4
0
1
9
4
4
1
9
4
8
1
9
5
2
1
9
5
6
1
9
6
0
1
9
6
4
1
9
6
8
1
9
7
2
1
9
7
6
1
9
8
0
1
9
8
4
1
9
8
8
1
9
9
2
1
9
9
6
2
0
0
0
2
0
0
4
2
0
0
8
45
Figure 7: Predicted and Actual Default Rates for Moody's Ba-, B- and CaaC-
Rated Corporate Credits, 1920-2008, 20 basis points truncation
Moody's Ba Credits
0
0.02
0.04
0.06
0.08
0.1
0.12
0.14
0.16
1
9
2
0
1
9
2
4
1
9
2
8
1
9
3
2
1
9
3
6
1
9
4
0
1
9
4
4
1
9
4
8
1
9
5
2
1
9
5
6
1
9
6
0
1
9
6
4
1
9
6
8
1
9
7
2
1
9
7
6
1
9
8
0
1
9
8
4
1
9
8
8
1
9
9
2
1
9
9
6
2
0
0
0
2
0
0
4
2
0
0
8
actual
predicted
Moody's B Credits
0
0.05
0.1
0.15
0.2
0.25
0.3
1
9
2
0
1
9
2
4
1
9
2
8
1
9
3
2
1
9
3
6
1
9
4
0
1
9
4
4
1
9
4
8
1
9
5
2
1
9
5
6
1
9
6
0
1
9
6
4
1
9
6
8
1
9
7
2
1
9
7
6
1
9
8
0
1
9
8
4
1
9
8
8
1
9
9
2
1
9
9
6
2
0
0
0
2
0
0
4
2
0
0
8
Moody's CaaC Credits
0
0.2
0.4
0.6
0.8
1
1.2
1
9
2
0
1
9
2
4
1
9
2
8
1
9
3
2
1
9
3
6
1
9
4
0
1
9
4
4
1
9
4
8
1
9
5
2
1
9
5
6
1
9
6
0
1
9
6
4
1
9
6
8
1
9
7
2
1
9
7
6
1
9
8
0
1
9
8
4
1
9
8
8
1
9
9
2
1
9
9
6
2
0
0
0
2
0
0
4
2
0
0
8
46
Small sample estimates of unconditional default rate inputs into AIRB regulatory
capital calculations may be downward biased in the samples do not include sufficient
data on a bust phase of the credit cycle. To correct for this potential shortcoming, The
Basel AIRB approach requires that data used to estimate unconditional default rates
either include data from a bust phase of the credit cycle or alternatively include some
other technique to adjust unconditional default rates so that they reflect downturn
conditions.
In this section, I present concrete evidence on the importance of the small sample
bias that may occur when the Vasicek-Basel II AIRB model parameters are estimated
from a short time series on model performance. I estimate the sampling distributions for
the parameter estimates of the Vasicek-Basel II AIRB model when the parameters are
estimated from sample sizes of 5- and 10 years of data. I compare these small sample
distributions to the sampling distribution for parameter estimates from the full sample of
89 years of data. I estimate the small sample distributions using the jackknife procedure
in which I resample 5000 replications using paired observations in order to preserve
heteroskedasticity characteristics in the data.
47
Figure 8: Sampling Distributions for the Unconditional Probability of Default on
Moodys Aa-Rated Credits for Alternative Sample Sizes Drawn from 1920-2008
Data, Zero Default Rates Truncated to 20 bps
23.5 24.5 25.5 26.5 27.5 28.5 29.5 30.5 31.5 32.5 33.5 34.5 35.5
0
2
4
6
8
10
12
P
e
r
c
e
n
t
Aaprob
20 60 100 140 180 220 260 300 340 380 420 460 500 540 580 620 660 700 740 780 820 860 900
0
10
20
30
40
50
60
70
80
P
e
r
c
e
n
t
Aaprob
48
22.5 52.5 82.5 112.5 142.5 172.5 202.5 232.5 262.5 292.5 322.5 352.5 382.5
0
10
20
30
40
50
60
70
P
e
r
c
e
n
t
Aaprob
The sampling distributions for Vasicek model unconditional default rate
parameter estimates for Moodys Aa-Rated credits derived from alternative sample sizes
are plotted in Figure 8. The plots in Figure 8 show a clear pattern in which small sample
parameter estimates are downward biased relative to the parameter estimates derived
from long time series. Small sample parameter estimate sampling distributions exhibit a
strong left skew with high probabilities that a sample estimates will indicate only a
minimal unconditional probability of default when true underlying default rates are far
higher. The magnitude of the bias is greater for smaller sample sizes. Unconditional
default rate estimates generated from samples even as long as 10 years are likely to
produce strongly downward biased estimates of the unconditional default rates.
49
Figure 9: Sampling Distributions for the Unconditional Probability of Default on
Moodys Baa-Rated Credits for Alternative Sample Sizes Drawn from 1920-2008
Data, Zero Default Rates Truncated to 20 bps
27.6 30 32.4 34.8 37.2 39.6 42 44.4 46.8 49.2 51.6 54
0
2
4
6
8
10
12
14
P
e
r
c
e
n
t
Baaprob
20 60 100 140 180 220 260 300 340 380 420 460 500 540 580 620 660 700 740 780 820 860 900 940 980 1020
0
10
20
30
40
50
60
P
e
r
c
e
n
t
Baaprob
50
15 30 45 60 75 90 105 120 135 150 165 180 195 210 225 240 255 270 285 300 315 330 345
0
5
10
15
20
25
30
35
40
45
P
e
r
c
e
n
t
Baaprob
Figure 9 plots the sampling distribution for the unconditional default rate on
Moodys Baa-rated credits for alternative sample sizes. Figure 9 shows a pattern that is
very similar to the pattern evident in Figure 8. Figures 8 and 9 clearly demonstrate that is
a very high probability that Vasicek model estimates based on small sample sizes will
significantly understate the unconditional probability of default associated with a
Moodys corporate credit grade. Figures 10 and 11 (in the Appendix) show that this
pattern also holds for Moodys A- and B-rated corporate credits.
The downward bias in small sample estimates is a consequence of at least two
important features of the Moodys corporate default rate data: (1) the prevalence of zero
reported default rates, and (2) the strong positive autocorrelation in the common macro
factor that drives default. The jackknife random sampling technique for drawing small
samples will potentially draw a large share of zero default rates given their weights in the
data. Since the macro factor is strongly autocorrelated, it will require a very long time
series before the macro factor is likely to have a sample average of zero. Because model
51
identification is achieved by imposing the zero mean condition, macro factor estimates
are likely to be biased in small samples. This bias will induce a bias in the unconditional
default rate estimates as well.
This biases that are demonstrated in the small sample results in Figures 8-11
likely understates the estimation issues associated with small sample estimates that are
likely to be derived in practice as true small samples will have positively temporally
correlated observations and so most of the macro factor draws in a small sample are
likely to be either positive or negative. In such a case, the model restriction that the
macro factor average draw be zero over the sample will induce a larger bias in the macro
factor estimates compared to those produced under the jackknife random sampling
techniques that underpin Figures 8-11. This additional issue does not necessarily arise in
the small sample parameter distributions plotted in Figures 8-11 because each
observation in a jackknife sample is chosen at random from the entire times series, so the
underlying autocorrelation structure that is evident in the raw data is not necessarily
preserved.
IX. A METHOD FOR CORRECTING THE BIAS IN SMALL SAMPLE ESTIMATES
This econometric implementation of the Vasicek model can be modified to derive
consistent estimates of the unconditional default rates associated with additional grades
or auxiliary rating systems that may not have a long time series of default rate realization.
The consistency of the estimator is conditional on the assumption that the default rate and
correlation factor associated with the credit rating class are identical to those that
characterize the credit grading system for which there exists long time series of dfault
rates--for example, like the Moodys 1920-2008 default data sample.
52
Recall that the Vasicek model implies,
( )
it Mt
A
jt
e
PD
P c
~
1 1
)
~
(
1
1
+
u
= u
(14)
In the econometric implementation of specification (14), the year fixed-effect estimates
are estimates of the macro factor realizations,
Mt t
e b
=
1
. If we use estimates of
the macro factors realizations that are constructed from a long time series, these estimates
are consistent estimates of the underlying macro factor realizations (assuming the
Vasicek model is true). We can use these macro factor estimates to estimate the
unconditional default rates associated with a different rating system that may not have a
long sample of default rate realizations.
Let
A
PD represent the unconditional probability of default on an auxiliary rating
category for which we have a data on default rates with a sample size of S where S is a
relative modest sample size. From expression (14), it is evident that,
( )
it
A
t jt
PD
b P c
~
1
)
~
(
1
1
+
u
= u
, (15)
Consequently, as size of the small sample on the alternative rating grade increases,
( )
( )
u
u
=
)
~
(
1
.. .
1 A
s a
S
q t
t jt
PD
b P . (16)
From expression (16), it is possible to construct a consistent estimate of the
unconditional default rate of the new rating class that controls for the macro factor
53
conditions that drive defaults, ( )
|
|
.
|
\
|
u u
=
S
q t
t jt
b P
)
~
( 1
1
is an estimator for the
unconditional default rate for the rating class,
( )
A s a
S
q t
t jt
PD b P
|
|
.
|
\
|
u u
=
. .
1
)
~
( 1 , (17)
where
A
PD is the unconditional probability of default on the auxiliary rating class.
Year
2001 -0.19447
2002 -0.27158
2003 -0.04959
2004 0.09941
2005 0.15243
2006 0.15585
2007 0.24216
2008 0.17973
macro factor
estimate
Table 13: Macro Factor
Estimates from Moody's
Corporate Default Data 1920-
2008
Estimates are based the Vasicek
model specification (Table 6),
with zero default rates
truncated to 20 basis points.
I apply this macro factor adjustment algorithm to estimate the unconditional
default rates for Moodys alpha-numeric rating scale. While Moodys published default
rate statistics on these grades from the early 1980s, I use data from 2001-2008 to
demonstrate the adjustment. I exclude rating grades which exhibit no defaults over this
sample period, and I truncate zero default rates to 20 basis points. The correlation and
macro factor adjustments used in (17) are taken from the Vasicek model estimates
derived from the Moodys 1920-2008 data on letter rating grades when zero default rates
are truncated to 20 basis points (Table 6). The macro factors are reported in Table 13.
54
Over this period, there are 4 years in which macro effects work to increase default rates
(2001-2003 and 2008) and 4 years in which the common factor reduced default rates
(2004-2007).
Moody's
alpha-
numeric
rating
Aa3 18.51 32.84
A1 12.36 31.01
A2 5.43 27.81
A3 5.56 27.89
Baa1 22.56 34.46
Baa2 25.05 37.21
Baa3 30.53 37.44
Ba1 33.51 39.31
Ba2 40.25 47.01
Ba3 117.98 102.50
B1 100.84 80.22
B2 252.71 157.20
B3 508.04 364.75
Caa1 921.98 758.48
Caa2* 1608.40 1460.38
Caa3 2556.39 2495.40
Sample
unconditional
probability of
default estimate
Model estimate of
unconditional
probability of default
Table 13: Alternative Estimates of the Unconditional
Default Rates Associated with Moody's Alpha-
Numeric Rating Grades, 2001-2008
Model estimates use estimates of the macro economic
factor realizations over the 2001-2008 period and the
model correlation paramter estimate derived from the
Moody's letter rating grade model estimated over the
sample period 1920-2008.
Table 14 reports the sample unconditional mean default rate by rating grade as
well as estimates of the rating grade unconditional default rates that adjust for the effect
of the macro factor using expression (17). The macro factor adjust has a nonlinear effect
on the unconditional probability of default estimates. Because of the non-linearity of the
cumulative normal distribution function, the macro factor adjustments have a much larger
55
impact when the reported default rates are small. Overall, the results suggest that the
simple sample mean default rate underestimates the unconditional default rates on higher
quality credits (Aa3-Ba2) and over estimates the unconditional default rate on the lower
quality credits (Ba3-Caa3).
X. CONCLUSIONS
This paper has developed a new approach model that uses standard panel
regression techniques to estimate the parameters of the asymptotic Vasicek portfolio
default rate model that is used as the basis for the Basel AIRB regulatory capital
framework. The approach produces consistent estimates of all the model parameters
using time series data on a cross section of the failures rates from a consistent credit
rating system. The approach is novel in that it produces consistent estimates of the
Vasicek/AIRB correlation parameter directly from the default rate data without any need
to use stock return or other data and methods of inference. The approach can be used to
estimate multiple correlation parameters for a rating system. Estimates suggest that, in
contrast to Basel II AIRB assumptions, lower grade (sub-investment grade) credits have a
substantially higher Vasicek correlation parameter compared to investment grade credits.
Because the new approach to parameter estimation is based on standard
econometric techniques, the approach introduces a new battery of test statistics and
model diagnostic tools into Basel AIRB calibration discussion. For example, bootstrap
methods are used to calculate sampling distributions and exact small sample test statistics
for Basel AIRB model parameters. The results suggest that the unconditional default
probability inputs (and the correlation parameter) into the Basel AIRB framework can be
estimated, but the accuracy of the estimated values hinges on a number of important
56
considerations. The method selected to handle 0 default rate observations in the
estimation process is a particularly important issue that may not have received adequate
attention in the literature. Sample size is also a very important issue albeit a well-known
issue but an issue that could benefit from additional study. Model parameters estimated
derived from small samples are biased and the analysis in this paper helps to quantify the
magnitudes of the potential bias.
This new econometric approach also provides consistent estimates of the
common macro factor that is assumed to drive credit defaults in the Vasicek-AIRB
model. Statistical analysis of the macro factor times series derived from the Moodys
data provides strong evidence of a credit cycle in corporate default rates. This macro
factor time series properties are inconsistent with Vasicek-Basel II AIRB model which
assumes that common factor realizations are independent. The macro factor estimates that
are generated can be used to correct small sample biases that arise when unconditional
default rates are estimated using only brief time series histories that may not include full
or balanced credit cycles. When this adjustment is applies to estimate the unconditional
default rates associated with selected grades from Moodys alpha-numeric rating scale
using data from 2001-2008, the results suggest that simple default arte sample averages
by credit grade understate the unconditional probabilities of default associated with high
quality grades and over-estimate these probabilities for the more risky credit classes.
57
XI. APPENDIX
P1 P2 P3 P4
P1 1.000 1.000 0.999 0.998
P2 1.000 1.000 0.999
P3 1.000 1.000
P4 1.000
P1 P2 P3 P4
P1 1.000 0.994 0.988 0.972
P2 1.000 0.999 0.992
P3 1.000 0.996
P4 1.000
P1 P2 P3 P4
P1 1.000 0.991 0.984 0.961
P2 1.000 0.999 0.989
P3 1.000 0.995
P4 1.000
P1 P2 P3 P4
P1 1.000 0.990 0.981 0.952
P2 1.000 0.999 0.986
P3 1.000 0.993
P4 1.000
Table A1: Simulated Asymptotic Portfolio Sample Default Rate Correlations Under
Different Asumptions for the Vasicek Correlation Parameter
(1000 observations)
01 . 0 =
10 . 0 =
20 . 0 =
40 . 0 =
58
Figure 10: Sampling Distributions for the Unconditional Probability of Default on
Moodys A-Rated Credits for Alternative Sample Sizes Drawn from 1920-2008
Data, Zero Default Rates Truncated to 20 bps
23.1 24.3 25.5 26.7 27.9 29.1 30.3 31.5 32.7 33.9 35.1 36.3 37.5
0
2
4
6
8
10
12
14
P
e
r
c
e
n
t
Aprob
20 60 100 140 180 220 260 300 340 380 420 460 500 540 580 620 660 700 740 780 820 860 900 940
0
10
20
30
40
50
60
70
80
P
e
r
c
e
n
t
Aprob
59
22.5 52.5 82.5 112.5 142.5 172.5 202.5 232.5 262.5 292.5 322.5 352.5
0
10
20
30
40
50
60
70
P
e
r
c
e
n
t
Aprob
60
Figure 11: Sampling Distributions for the Unconditional Probability of Default on
Moodys B-Rated Credits for Alternative Sample Sizes Drawn from 1920-2008
Data, Zero Default Rates Truncated to 20 bps
125 135 145 155 165 175 185 195 205 215 225 235 245 255 265 275 285 295 305 315 325 335 345
0
2
4
6
8
10
12
14
P
e
r
c
e
n
t
Bprob
40 200 360 520 680 840 1000 1160 1320 1480 1640 1800 1960
0
5
10
15
20
25
30
35
P
e
r
c
e
n
t
Bprob
61
25 125 225 325 425 525 625 725 825 925 1025 1125 1225
0
2.5
5.0
7.5
10.0
12.5
15.0
17.5
P
e
r
c
e
n
t
Bprob
62
References
Cameron and Trivedia (2005). Microeconometrics: Methods and Applications. New
York: Cambridge University Press.
Chernick, Michael R. (1999). Bootstrap Methods, A practitioner's guide. Wiley Series in
Probability and Statistics.
Davison, A. C.; Hinkley, D. Bootstrap Methods and their Applications. (1997). Bootstrap
Methods and their Applications. Cambridge: Cambridge Series in Statistical and
Probabilistic Mathematics.
Davison, A. C.; Hinkley, D. Bootstrap Methods and their Applications. (2006). Bootstrap
Methods and their Applications (8th ed.). Cambridge: Cambridge Series in Statistical and
Probabilistic Mathematics.
Diaconis, P. & Efron, B. (May 1983). "Computer-intensive methods in statistics".
Scientific American: 116130.
Efron, B. (1979). "Bootstrap Methods: Another Look at the Jackknife". The Annals of
Statistics 7 (1): 126.
Financial Stability Institute Occasional Paper No. 6 (September 2006). Implementation
of the new capital adequacy framework in non-Basel Committee member countries,
Bank for International Settlements, http://www.bis.org/fsi/fsipaper06.pdf
Gordy, Michael (2003). A Risk-Factor Model Foundation for Ratings-based Bank
Capital Rules. Journal of Financial Intermediation, Vol. 12, pp. 199232.
Kupiec, P. (2007). Financial Stability and Basel II, Annals of Finance, Vol. 3, pp. 107-
130.
Kupiec, P. (2007). Capital Allocation for Portfolio Credit Risk, The Journal of
Financial Services Research, Vol. 32,
No. 1-2, p. 103-122.
Kupiec, P. (2008). Basel II: A Case for Recalibration, in Handbook of Financial
Intermediation and Banking, Anjan Thakor and Arnoud Boot, editors. New York: North
Holland.
Kupiec, P. (2008). A Generalized Single Common Factor Model of Portfolio Credit
Risk, The Journal of Derivatives, Vol. 15, No. 3, pp. 25-40.
63
Lando, David and Torben Skdeberg, (2002). Analyzing rating transistions and rating
drift with continuous observations, Journal of Banking and Finance, Vol. 26, pp. 423-
444.
Moodys Investor Service, (2007). Corporate Default and Recovery Rates, 1920-2006.
(February).
Moodys Investor Service, (2008). Corporate Default and Recovery Rates, 1920-2007.
(February).
Schnbucher, P. (2001). Factor Models: Portfolio Credit Risks When Defaults Are
Correlated. Journal of Risk Finance, Vol. 3, No. 1, pp. 4556.
Vasicek, O. (1987). Probability of Loss on a Loan Portfolio. KMV, Working Paper.
Published (2003) as Loan Portfolio Value. Risk, December, pp. 160162.
Wald, A. and Wolfowitz, J. (1940), "On a test whether two samples are from the same
population," Ann. Math Statist. 11, 147-162.
The Basel Committee on Banking Supervision (2004). International Convergence of
Capital Measurement and Capital Standards. Bank for International Settlements, June.