Академический Документы
Профессиональный Документы
Культура Документы
0
( 15 )
P
2
=
1
2
+
1
n
_ Rc _
K
-q
-
(i)
i
_ J
0
( 16 )
() = E|S
1
q
] = S
t
q
exp|A(t; I, ) +B
1
(t; I; )b(t +A),
( 17 )
A(t; I, ) = A(t +A; I, ) +r +B
1
(t +A; I; )
1
2
ln(1 2o
1
B
1
(t +A; I; ) )
( 18 )
B
1
(t; I, ) = (z +y
1
)
1
2
y
1
2
+[
1
B
1
(t +A; I; )
+
1 2 / (
1
)
2
1 2o
1
B
1
(t +A; I; )
( 19 )
14
pricing literature
7
. As these studies have shown, it consistently competes with
other models. Another advantage of the ABS model is that, due to its theoretical
inconsistency, it does not make any assumptions on the nature of the variance
process. The time-varying trend observed for its error series is therefore
independent of any theoretical constraints. This is not the case for the HN model,
which, for example, ignores jumps and relies on the assumption of a continuous
variance process. The HN error time series might therefore depend on time
varying jump likelihood.
We also choose to estimate the time series of the error for the HN GARCH
model. We justify this choice by the fact that GARCH models allow us to price
options without implying volatilities from the Black-Scholes formula, as valuation
is based on observables, such as the discrete observations of the underlying asset
price. The main advantage of the model developed by Heston and Nandi over
other GARCH models is that it has a closed-form solution and therefore does not
require the use of Monte Carlo simulations. Option prices, as equation ( 14 )
shows in the previous section, can be directly obtained from the history of asset
prices, option specifications and a finite number of risk neutral parameters
0
-
= (
-
, o
-
, [
-
, y
-
). We mainly consider the HN GARCH in our analysis to
assess the generalizability of our ABS results. We want to determine whether our
results for the ABS model can be replicated using a more theory-based model.
Similar results for two diametrically different models would suggest the existence
of the common pattern we are looking for.
Calibration
We now turn to a more in-depth description of the methodology used in the
calibration of the two models on the sample of option data.
ABS procedure
On each Wednesday of the sample, the implied volatility surface o
t
(S K / , I) of
equation ( 5 ) is calibrated on market data. We implemented a non-linear least
square (NLS) procedure
8
that minimizes an objective function, defined as the
dollar root mean squared error between model option values and observed option
prices ($RMSE). This cross-sectional fitting gives us 754 sets of optimized
parameters o
0
t
, o
1
t
, o
2
t
, o
3
t
, o
4
t
, o
5
t
, one for each Wednesday t, which define 754
implied volatility surfaces. These surfaces, when plugged in the Black-Scholes
pricing formulae, allow the closest match of model prices to option prices on the
market. More importantly, the NLS procedure also yields a time series of the
minimized dollar root mean squared error, which we will use as input for our
empirical analysis detailed in the next chapter.
Figure 2 shows the calibration of the ABS model using NLS on the cross-section
of out-of-the-money SPX options on June 16
th
, 2010, as well as the smoothed
volatility surface on that day. The first plot shows that the option prices yielded by
the ABS model closely match the market prices over the range of maturities on
7
see for instance Barone-Adesi, Engle and Mancini (2008), Heston and Nandi (2000) or Brandt
and Wu (2002)
8
The optimization procedure is implemented in Matlab, using the active-set algorithm and the
fminunc function
15
June 16
th
, 2010. The second plot shows the smoothed implied volatility surface on
that day. Note that it has the expected shape: implied volatilities decrease as
moneyness increases, with a surface getting flatter as the options time to maturity
increases. Trend reversal occurs for long-maturity DOTM put options (K/S<0.4,
T>250 days), as the bottom-left portion of the surface shows.
Figure 2: Calibration of the ABS model on the cross-section of OTM options on June 16th 2010 and
smoothed volatility surface
HN-GARCH model
A similar method is used to calibrate the HN GARCH model on the option data,
although the optimization process calls for more precaution in this case. A first
important remark is that the NLS procedure is now more computationally
cumbersome, due to the intrinsic nature of the HN GARCH pricing formula:
estimation of the risk-neutral-probabilities in equation ( 14 ) requires numerical
integration and the handing of complex numbers, coefficients A(t; I; ) and
B
1
(t; I; ) need to be computed recursively, etc. In and out, the overall greater
number of operations makes the pricing of one cross section of options much
slower with the HN model than with the ABS procedure.
Second, the objective function ($RMSE) for the HN-GARCH model contains
jump discontinuities and its optimization often requires a large number of function
evaluations, which translate into prohibitively long running times. The results
generated by the NLS procedure are also very sensitive to the starting values for
0
-
=
-
, o
-
, [
-
, y
-
, provided as input for the optimizer. We therefore incur the
risk of running into local minima of the loss function ($RMSE), rather than the
global minimum we are looking for. A promising solution to alleviate such
concerns is to use algorithms such as the accelerated random search (ARS)
proposed by Mller et al (2013). Mller et al acknowledge the caveats of local
optimizers such as the one used in this research (active-set algorithms) for non-
smooth objective functions, and show that ARS leads to significantly better results
for the Heston and Nandi GARCH model. However, using ARS for our data
would require considerable computational power (several multi-core computers
for parallel processing) we did not have at our disposal for this study. We
therefore implement a middle-ground solution: the objective function is evaluated
16
at several starting points 0
-
and the optimization starts with the set of initial
parameters that yield the smallest objective function value.
Let t be any given Wednesday of our sample, with t = 1 corresponding to
January 17
th
1996 and t = 7S4 being June 30
th
2010. At each step of the NLS
procedure, prices for the cross-section of options at time t are calculated from
equation ( 14 ), and therefore depend on the values of 0
-
=
-
, o
-
, [
-
, y
-
and
b(t +A). The conditional variance b(t +A) is obtained from expression ( 13 ),
which depends on the GARCH parameters being optimized, the log-return of the
spot asset since last Wednesday t A, and the conditional variance b(t) between
t A and t. We decided to proxy for the variance b(t) by using an adjustment on
the CBOE volatility index (VIX) at time t A. This index uses real-time SPX
options bid and ask quotes to provide a daily forecast for the expected volatility of
the S&P 500 Index over the next 30 days. In other words, the VIX uses present
time information to form expectations on the SPX volatility for the near future.
This makes the lagged level of the VIX index a particularly good candidate to
proxy for b(t), defined as the conditional variance between the last period t and
the current period t, based on the information set available at time t A. Figure 4
shows the calibrated prices for the first day of our sample, Jan. 17th, 1996, and
Figure 3 shows the weekly VIX index level over our sample period.
Figure 4: Calibration of the HN GARCH model on the cross-section of OTM SPX
options on Jan. 17th, 1996
Figure 3: Weekly VIX Index level (Jan. 1996 - June 2010
17
Empirical Results
In previous sections, we provided an overall description of the sample of option
and index data used in this research, as well as an overview of the theory we rely
on to estimate the time series of the option pricing error. We now proceed to
present our empirical results, which we divide in two parts. We start with a section
comparing the in-sample and out-of-sample pricing performance of our two
models, ABS and HN, over our sample period 1996-2010. In a second part, we
then turn to regression analysis in an attempt to explain some of the variation of
the in-sample pricing error over time for our sample of out-of-the-money options.
Based on the existing literature on option pricing and stock return predictability,
we test the explanatory power of a series of potential predictor variables, for both
the ABS and the HN model. Next, we generalize our results to the whole spectrum
of moneyness, and we show, relying on the ABS model only, that our regression
results are robust to the inclusion of ITM and DITM options. Finally, we rely on
the time series of out-of-sample RMSEs for the ABS model and study the ability
of our explanatory variables to forecast future OOS pricing error.
I. Option pricing performance of the models
In-sample model performance
In this section, we describe the in-sample option pricing performance of the ad-
hoc Black and Scholes procedure and the Heston and Nandi GARCH model.
Figure 5 shows the time series of the in-sample dollar root mean squared error
($RMSE) for our two models. On the next page, Table 4 provides the yearly
means and standard deviations of the coefficient estimates for the two models, as
well as yearly descriptive statistics for the $RMSE.
Figure 5:
Time series of the in-sample option pricing performance (January 17
th
, 1996- June 30
th
, 2010)
As a first important remark, we note that for both models, pricing performance
does not appear to behave randomly over the sample period. For both the ABS and
HN, the time series of the $RMSE exhibits a marked trend in Figure 5, with
considerably higher and more volatile in-sample pricing errors around the two
crises in 2000 and 2008. After the dot-com bubble, the $RMSE seems to revert to
its original pre-crisis level. A similar trend is observed in the aftermath of the
Table 4: Calibration of the HN GARCH and ABS pricing models
HN
o
-
1u
-
[
-
y
-
1u
2
-
1u
-
Ann. Vol.
RMSE
Year Mean Std. dev. Mean Std. dev. Mean Std. dev. Mean Std. dev. Mean Std. dev. Mean Std. dev.
1996 2,21 1,89 0,04 0,08 7,99 3,25 0,08 0,17 0,120 0,011
1,37 0,39
1997 4,28 3,11 0,14 0,18 5,14 1,62 0,37 0,82 0,161 0,022
2,20 0,49
1998
4,79 1,80 0,01 0,04 4,80 1,29 0,04 0,19
0,206 0,039
3,76 1,33
1999 4,25 1,95 0,01 0,03 5,34 1,89 0,01 0,05 0,221 0,023
5,21 1,35
2000
4,66 2,37 0,03 0,07 4,98 1,61 0,04 0,25
0,175 0,020
5,19 1,03
2001 6,31 3,62 0,05 0,11 4,26 1,48 0,32 0,94 0,175 0,011
3,49 1,06
2002
8,07 6,20 0,14 0,20 3,61 1,23 0,39 0,90
0,187 0,027
2,22 0,53
2003 9,86 5,66 0,19 0,20 3,19 1,38 0,77 1,47 0,178 0,026
1,52 0,37
2004
3,38 1,71 0,07 0,14 5,96 2,16 0,10 0,31
0,143 0,013
2,10 0,55
2005 1,92 0,84 0,03 0,12 7,58 1,97 0,06 0,18 0,116 0,007
2,33 0,58
2006
1,57 0,66 0,02 0,07 8,31 1,92 0,04 0,17
0,110 0,009
2,60 0,52
2007 2,21 2,26 0,02 0,06 8,40 3,34 0,02 0,07 0,138 0,033
3,44 0,80
2008
8,46 7,00 0,09 0,14 4,03 1,85 0,37 0,84
0,225 0,064
3,30 1,11
2009
19,09 12,26 0,13 0,20 2,47 1,21 0,91 2,03
0,258 0,045
2,00 0,54
2010
7,81 4,38 0,05 0,16 3,89 1,30 0,43 0,80
0,199 0,024
2,49 0,87
ABS
o
0
o
1
o
2
o
3
o
4
o
5
RMSE
Year Mean Std. dev. Mean Std. dev. Mean Std. dev. Mean Std. dev. Mean Std. dev. Mean Std. dev. Mean Std. dev
1996 -0,878 0,190 1,33 0,31 -0,33 0,13 0,32 0,053 0,01 0,012 -0,33 0,049
1,24 0,25
1997 -0,610 0,148 1,01 0,22 -0,24 0,07 0,25 0,073 0,00 0,012 -0,26 0,078
1,81 0,42
1998
-0,715 0,107 1,17 0,20 -0,27 0,07 0,29 0,039 -0,01 0,012 -0,26 0,040
2,34 0,51
1999 -0,689 0,103 1,16 0,15 -0,30 0,06 0,23 0,046 -0,02 0,012 -0,17 0,026
3,16 0,59
2000
-0,465 0,100 0,82 0,14 -0,18 0,04 0,14 0,060 0,01 0,014 -0,14 0,049
3,59 0,73
2001 -0,325 0,098 0,61 0,16 -0,09 0,07 0,13 0,077 0,01 0,018 -0,15 0,055
2,35 1,07
2002
-0,294 0,117 0,58 0,13 -0,08 0,05 0,15 0,063 0,01 0,018 -0,19 0,040
1,34 0,44
2003 -0,262 0,086 0,47 0,09 -0,04 0,03 0,20 0,051 0,00 0,015 -0,20 0,023
0,85 0,19
2004
-0,472 0,098 0,69 0,15 -0,11 0,06 0,24 0,038 -0,01 0,008 -0,21 0,031
1,08 0,71
2005 -0,673 0,137 0,97 0,22 -0,20 0,08 0,28 0,042 -0,01 0,008 -0,25 0,034
1,44 0,24
2006
-0,921 0,160 1,35 0,26 -0,34 0,10 0,35 0,047 0,00 0,012 -0,33 0,048
1,96 0,30
2007 -0,947 0,131 1,47 0,18 -0,39 0,07 0,31 0,046 0,00 0,015 -0,31 0,041
2,43 0,52
2008
-0,509 0,199 1,00 0,20 -0,24 0,08 0,17 0,106 0,02 0,042 -0,21 0,038
2,00 0,74
2009
-0,293 0,073 0,61 0,09 -0,08 0,03 0,23 0,045 -0,01 0,018 -0,21 0,022
1,10 0,19
2010
-0,424 0,065 0,68 0,14 -0,09 0,04 0,27 0,026 0,00 0,016 -0,24 0,027
1,24 0,25
stock market crash in 2008, with $RMSEs reverting to their levels of 1996 or 2003.
The mean yearly coefficient estimates in Table 4 also hint at a non-random time series
behavior. For example, the HN coefficients y
-
and o
-
both exhibit extrema around the
climaxes of the credit crisis in 2009. Furthermore, the annualized GARCH volatility
in the last column, as given by 2S2(
-
+ o
-
)(1 [
-
o
-
y
-
2
) also exhibits two peaks
in 2000 and 2009. Altogether, these results suggest that the two series of pricing
errors share a common trend, and that this trend might be explained or even
forecasted by variables from our option sample (for example: average option price or
average trading volume), but also by index characteristics or other market variables
(such as fluctuations in investor sentiment or short-term interest rates). This will be
the focus of our regression analyses in the later sections of this paper.
Figure 5 also seems to indicate that, over our sample period, the ABS procedure
consistently prices options more accurately than the HN GARCH model. This directly
contradicts the results presented in Heston and Nandi (2000) and Barone-Adesi, Engel
and Mancini (2008). In both these studies, the HN-GARCH model is shown to
compete closely with the ABS model or even outperform it in terms of in-sample
pricing performance. Using the ABS model as their benchmark, Barone-Adesi et al
(BEM) show that, on a yearly basis, the HN-GARCH model persistently outperforms
over the years 2002-2004. Heston and Nandi (HN) report the same result for the in-
sample performance of the two models over the period 1992-1994. Surprisingly, our
calibration results, as shown in Figure 5 hardly confirm those results and even suggest
the opposite: the $RMSEs for the ABS procedure over 1996-2010 are consistently
lower compared to the $RMSEs of the HN GARCH model. This divergence in
findings is easily explained by differences in the specification of the ABS procedure.
Both BEM and HN use the original equations from DFW (1998) for their ad-hoc
Black and Scholes benchmark, which we will henceforth notate as ABS
DFW
to
differentiate it from our ABS procedure. In DFW, the implied volatilities are
smoothed across maturities and strike prices K, rather than across maturities and
moneyness ratio SK, as we do in equation ( 5 ). Although this may seem like a
benign difference, there is empirical evidence that input variable transformation in
implied volatility functions has a significant impact on pricing performance (see
Andreou et al. (2013)). To test for differences in performance across ABS model
specifications, we calibrate the original ABS
DFW
procedure on our sample of option
data and compare its pricing performance against that of the HN GARCH model.
Figure 6 shows that our results actually corroborate the findings in HN and BEM: the
HN model consistently outperforms the ABS
DFW
benchmark for 2002-2004.
Figure 6: Comparison between the original DFW procedure and the HN GARCH model
20
Figure 6 shows that the error for the ABS
DFW
benchmark especially deteriorates
during periods of financial turbulence. The errors for this model around the years
2000 and 2008 are roughly three times as high as those of the HN model and four
times as high as those of the modified ABS model. In and out, Figure 5 and Figure 6
suggest that the modified ABS procedure we use in this study, as described in Bollen
and Whaley (2004), is the most efficient at capturing the volatility smile over our
sample period. Not only does it largely outperform the original ABS
DFW
, it also
exhibits an overall lower level of pricing error than the HN-GARCH model. We
provide several reasons to explain that last result.
1) First, this might be due to liquidity biases. As pointed out in Berkowitz
(2009), the ABS procedure is able to price options of all maturities and moneyness
categories with a consistently impressive empirical performance, even for options
that are thinly traded. Since roughly 50% of the option contracts in our sample have
a daily volume inferior to ten trades, the performance gap between the HN-GARCH
model and the ABS procedure might be partially explained by the fact that the HN-
GARCH model prices illiquid options with less accuracy.
2) A review of the GARCH option pricing literature provides a second potential
explanation: the HN GARCH model is relatively inaccurate for certain moneyness
or maturity categories. Hsieh et al (2005) claim that the HN model underperforms
for deep-out-the-money options and Ferreira et al (2005) find that its pricing
performance deteriorates for options close to expiration. In fact, in their original
article, Heston and Nandi already report that for their GARCH model, short-term
OTM options are the most difficult to valuate. Since short-maturity OTM and
DOTM options are both represented in non-negligible numbers among our option
sample, it makes sense to investigate the pricing performance across our range of
moneyness and maturities. We investigate this matter in the next section of this
paper.
3) Finally, flaws in our calibration algorithm might also explain part of the
pricing performance gap. Indeed, even if the weekly $RMSEs in our two time series
are the outcomes of a minimization algorithm, it is important to acknowledge the
potential presence of local minima in our results, for both the HN and ABS model.
This might be more relevant for the HN-GARCH model, for which the objective
function minimized during the calibration process is a highly non-linear function
with jump discontinuities and a high sensitivity to starting values. We therefore
suspect that running the NLS procedure with a genetic algorithm might produce time
series of the pricing error that are slightly different than those we present in this
research, with larger differences for the HN-GARCH than for the ABS model.
21
In-sample performance across maturity and moneyness categories
In the previous section, we introduced the idea of comparing pricing performance of
our two models for different style of options. We suspect that the relative
underperformance of the HN GARCH model observed in Figure 5 might result from
the inability of the GARCH model to capture some parts of the volatility smile. For
example, the results presented in Hsieh et al (2005) and Ferreira et al (2005) tend to
suggest that DOTM options with short time to maturity might be more severely
mispriced by the HN GARCH model. To better understand the time series behaviour
of our two models over our aggregate sample of OTM options, we investigate the
pricing performance across several moneyness and maturity categories. As in the
description of the option data, we segregate our sample into three maturity categories:
short maturity ( I < 6u days), medium maturity ( 6u I 18u days) and long
maturity (I > 18u days). Building on Bollen and Whaley (2004), we also define
three moneyness categories based on the options delta: deep-out-of-the money
(u.u2 < A
C
u.12S for calls and u.12S < A
P
u.u2 for puts), out-of-the-money
(u.12S < A
C
u.S7S, u.S7S < A
P
u.12S) and near-the money (u.S7S < A
C
u.62S, u.62S < A
P
u.S7S). Figure 7 illustrates the results of our classification.
Note that due to our initial filtering of option data, we do not consider ITM or DITM
options. To generalize our results to the full sample of option data, we repeated the
analysis including ITM and DITM options, using only the ABS model. These results,
along with their interpretation can be found in Appendix 1. When calibration is
performed on the full sample of SPX options, the HN model requires prohibitively
long computation times, and it was therefore left out.
Figure 7 shows that the difference in valuation performance is not uniform across
the spectrum of maturities and moneyness. In fact, the two pricing models valuate
some option categories with almost identical accuracy. For example, the HN model
appears to compete on an equal basis with ABS for medium-maturity options and
only slightly underperforms for options with long time to expiration. For some other
categories, the performance gap between the models widens considerably. This is
especially observable for short-term options, for which the pricing error of the HN
consistently fluctuates above the level of its ABS counterpart. The HN model perform
worst for short-maturity DOTM and OTM options, for which the $RMSEs are more
than twice as high as for the ABS model. Figure 7 therefore confirms our intuition:
while the ABS model accurately valuates options across maturities and moneyness,
the HN model systematically fails to adequately capture some parts of the volatility
surface and therefore underperforms. Figure 7 also allows us to reject our previous
hypothesis that the larger errors of the HN model are due to its poor performance with
thinly traded options. Indeed, a look back at Table 3 reveals that the options with the
lowest average trading volumes are the ones with medium and long maturities, which
are also those for which the HN GARCH performs closest to the ABS model. We
therefore conclude from our results that from the three explanations we give in the
previous section, the second one is the most likely to explain the performance gap.
Figure 7: Comparison of the pricing error across moneyness and maturity categories for the HN and ABS models
Out-of-sample model performance
We now turn to out-of-sample (OOS) valuation performance for both our models. On
each Wednesday in our sample, the in-sample parameter estimates provided by the
calibration are used to value SPX options one week later. Figure 8 shows the time
series of the OOS dollar pricing errors for our two models.
Figure 8: Out-of-sample pricing errors of the HN and ABS models
As for the in-sample pricing errors, the two OOS time series exhibit similar trends,
with increasing levels of pricing error around periods of financial turmoil and
regression to pre-crisis levels afterwards. The ABS procedure also slightly
outperforms over our aggregate sample period, as was the case in-sample and
valuation errors seem to remain in the same range, with no sudden explosion in the
OOS $RMSE values. This comforts us in the idea that both models are flexible
enough to achieve good valuation performance. On any given day of our sample, both
ABS and HN appear to fit the dynamics of index returns as well as the shape of the
implied volatility surface quite accurately. Although the ABS model still outperforms
in general over the period 1996-2010, the difference in out-of-sample pricing errors
between the HN and the ABS model tends to be smaller than those observed in
sample in Figure 5.
Appendix 3 shows the time series of the differences in $RMSE between the ABS and
HN model in- and out-of-sample. The HN tends to outperform ABS more often when
out-of-sample valuation is considered. This partially corroborates the results of
Heston and Nandi (2000) and Barone-Adesi, Engle and Mancini (2008), who find that
the performance of the ABS model considerably deteriorates out-of-sample because it
captures pricing mechanisms by overfitting the data. However, even if the pricing
performance of the ABS model indeed deteriorates out-of-sample relative to that of
the HN model, our results suggest that the effect is only of moderate magnitude. This
indicates that even if the ABS model slightly overfits the data, its estimates remain
stable out of sample. Overall, and despite the small differences in performance
reported in this section, the pricing errors of our two option pricing models seem to
share a common component, which is the focus of our next section.
24
II. Factors of option pricing performance
Foreword
We mentioned in the beginning of this chapter that there were two major parts to our
empirical analysis. In the previous section, we covered the first part by documenting
the in-sample and out-of-sample performance of our two option pricing models. We
now turn to the second part of our study, which directly derives from our previous
findings. In our earlier description of the in and out-of-sample pricing errors for the
two models, shown in Figure 5 and Figure 8, we observed that the $RMSE time series
of the HN and ABS models seemed to exhibit more than just random fluctuations.
This intuition is further confirmed when we consider the time series behaviour of the
error across option categories in Figure 7. In all these charts, valuation errors seem to
behave over time in a synchronous pattern for both the ABS and HN models. This is
all the more striking considering that these models are diametrically different in
nature, which reinforces our belief in a common cyclicality. The sharp increases
around the two crisis periods, and the relatively stable errors in between, make the
error time series in Figure 5 resemble the S&P500 index curve shown in Figure 1.
This leads us to believe that valuation errors might vary along with fluctuations of the
index level, or even other market variables such as interest rates. Motivated by this
intuition, we decide to shift our focus: instead of comparing the differences in
performance between models, we try to detect common patterns in the two time
series. Moreover, if such a common trend exists, we want to identify its underlying
factors. In other words, our primary goal is to find factors able to explain why our
alternative models both perform well during some periods and why their accuracy
deteriorates in other periods.
Stationarity and unit root testing
As said above, the first step of our approach is to statistically prove that the time
series of the pricing error for the ABS and HN model share a common trend. One way
to back that claim would be to derive the cross correlation function the two time series
of errors, and look for high values for the correlations between the various lags of the
two series. However, we first need to consider the issue of stationarity. Indeed, before
making any inferences on common trends among our two time series, statistical
theory requires that our two $RMSE series be stationary, or in other words mean-
reverting. This condition will also need to be fulfilled when we use time series
regressions later in this chapter, in order to avoid spurious regressions. The outcome
of the unit root tests (see Appendix 4) for our two pricing error series seem to indicate
that both the ABS (p
= 12, p = u.S1,
= 1u,
p = u.14,
= 2.41) $RMSE time series have at least one unit root. In other
words, the error time series for ABS and HN are integrated processes of order , with
1. This means that at least one order of differencing is necessary to induce
stationary behaviour. We calculate the first-difference of the two non-stationary
$RMSE series, A
t
=
t
t-1
. Charts representing these differenced series over our
sample period can be found in Appendix 2. A second round of ADF tests on those
differenced series shows that neither have unit roots, and that one order of
differencing was sufficient to induce stationarity.
25
Common trends among ABS and HN pricing errors
In the last section, we derived two stationary pricing error series by using first-
differences. We now show that these two differenced series share a common trend, by
calculating their cross-correlation (XCF) function. Figure 9 shows the autocorrelation
function of the two differenced series, as well the XCF function over twenty lag
values. Figure 10 shows the same features for the original non-stationary series for
comparison purposes. The first two charts of each figure show that first-order
differencing removed the slow, decaying pattern for the autocorrelations in the
original series. For the first-differenced series, autocorrelations fluctuate around zero
across lag values. In each case, the autocorrelation for the first lag is the only one
significantly different than zero, with values around around -0.5. This is a sign that
stationarity for the two error series may have come at the price of slight
overdifferencing. We acknowledge that caveat and address it later in the section on
time series regressions.
Figure 9: Autocorrelations and XCF function of first-differenced $RMSE series for ABS and HN
Figure 10: Autocorrelations and XCF function of original $RMSE series for ABS and HN
Note that the first-differenced error series are moderately correlated for lag 0
(Pearsons r = 0.40, p<0.001), which indicates that both in levels and in first-
differences, the $RMSE for ABS and HN move together over time. Furthermore, both
an Engle-Granger and a Johansen test for cointegration on the two original non-
stationary series reveal that the pricing errors for our models are cointegrated. The
results for these tests are shown in Appendix 5. Altogether, these results provide
statistical evidence to back the intuition we formulated earlier when describing Figure
5: HN and ABS, albeit relying on diametrically different constitutive equations,
26
exhibit a common trend in their valuation performance. This encourages us to search
for factors that could explain this observed trend.
Error forecasting variables
In this section, we try to determine a set of variables likely to explain the common
pattern in pricing errors across alternative models. We distinguish two broad
categories among our set of potential underlying factors: options market variables and
other market variables. The first set relates to characteristics of our sample of SPX
options, such as option maturity or moneyness. The second regroups factors that are
specific to the spot market for the S&P500 index, but also comprises other broad
market variables that we expect to have some additional explanatory power. We now
give the rationale behind our choice of error forecasting variables.
Option market variables
We already mentioned repeatedly the important impact of certain option
characteristics on pricing performance when we were comparing the ABS and HN
models in the previous chapter. Factors such as moneyness, time to expiration or
trading volume have been shown to influence model error in the existing option
pricing literature (Hsieh et al. (2005), Ferreira et al. (2005), Heston and Nandi (2000),
Barone-Adesi et al. (2008)). Our empirical results also provided evidence to support
that claim, with Figure 7 showing considerably different levels of errors across
moneyness and maturity categories. We therefore suspect variables such as mean
moneyness (defined as K/S), mean trading volume and mean maturity to influence the
pricing error over time. To check for additional patterns, we add three complementary
variables: number of options under 1$, total number of options and number of
untraded options. By doing so, we want to observe if large numbers of options with
very low prices, or zero trading volume have an influence on the overall $RMSE. The
variable total number of options is added to account for the explosive growth in size
of the S&P500 equity option market over our sample period
9
. Furthermore, we also
include the average implied volatility, the put-call ratio and deviations of the put-call
parity as predictors. We define the latter as the daily put trading volume divided by
the total trading volume. The put-call ratio is often considered a proxy for investment
sentiment by both finance academics and practitioners, and studies have successfully
linked it to stock returns. For example, Pan and Poteshman (2006) show that their put-
call ratio measure has significant predictive power for the returns of individual stocks,
with high (low) ratios indicating short-term underperformance (outperformance).
Since both our RMSE time series exhibit higher levels of pricing error during times of
crises, which can be considered as low-sentiment periods, we expect the put-call ratio
to explain some of the variance in pricing errors. Finally, we add deviations from the
put-call parity as our last option market variable. The put-call parity relationship
imposes that put and and call options with the same strike price and maturity should
have the same Black-Scholes implied volatility. In practice, however, deviations do
exist, due to short-sales constraints, difficulties to borrow the underlying stock, or
9
For our sample of option data, the number of OTM options traded daily increased from 92 on January
17
th
, 1996 to 557 on June 30
th
, 2010. In terms of intraday volume, the increase is even more
spectacular: from 31,179 on January 17
th
, 1996 to 449,726 on June 30
th
, 2010.
27
information asymmetries. For example, Lamont and Thaler (2003) argue that short
sales restriction on the spot asset can prevent arbitrageurs from restoring the
equilibrium between stock and options prices, which often lead to puts that are more
expensive than the corresponding calls. The impact of deviations from the put-call
parity on both the HN and ABS pricing errors is immediate. For example, for the HN
model, deviations directly lead to the mispricing of put options, since the put prices
are derived from call prices using the put call parity. To derive our measure of
deviations from the put-call parity, we use a measure similar to that used by Cremers
and Weinbaum (2010). On each day of the sample, we build pairs of put and call
options with the same strike price and time to expiration. We then calculate the
difference in implied volatility between each of these pairs of put and call options and
use the average daily difference as our proxy. Note that put-call parity deviations can
only be observed when the whole sample of SPX options is considered. Indeed, a put
and a call option can only have identical strikes and expiration dates if (1) both these
options are ATM or if (2) one is OTM and the other is ITM. We will therefore solely
include this variable in the regressions for the full sample of SPX options, which we
only estimate for the ABS model due to prohibitively long calibration times for the
HN model.
Other variables
In rational, efficiently functioning and complete markets, returns on the SPX index
and SPX options would be perfectly correlated and options would be valuated with
perfect accuracy. In practice, of course, options are priced with a certain level of
error, as our non-zero pricing error series show in Figure 5. Academics have studied
feedback effects between the spot and options markets extensively
10
. This stream of
research has shown that spot and derivatives markets interact through lead-lag
relationships in returns and volatilities, price discovery and information spillovers,
leading us us to think that market variables influencing stock returns might also affect
option pricing performance. As a result, we consider including internal
11
spot market
variables, such as one-period SPX returns, but also other broad market variables in
our set of error forecasting variables. We justify our choice of variables by reviewing
the findings of the literature on stock returns predictability. Our first step is to include
variables proper to the underlying S&P 500 index. We select the following index-
related variables as predictors: the one- period return, the trading volume on the day
of the pricing, the average volume over the last trading week, the observed volatility
over the last 10 days, the VIX index level, the volatility of volatility, the S&P 500
SKEW index level, the price to book ratio, the price-earnings ratio and the index
dividend yield. Note that although the VIX index, the SKEW index and the volatility of
volatility are option-based, we arbitrarily include them as index-related variables.
Our choice to include the contemporaneous and past trading volume on the index
relies on the visibility hypothesis described in Gervais Kaniel and Mindelgrin (2001).
They argue that shocks to trading activity, or volume, carry information about the
direction of future stock price movements. They postulate that unusually high trading
activity for individual stock creates shocks in trader interest, which lead to the
10
See for instance Conover and Peterson (1999), Gwilym and Buckle (2001) or Pan and Poteshman
(2006)
11
By internal variables, we mean variables that are specifically used as input for the ABS and HN
pricing formulae.
28
existence of a high-volume return premium over short term holding periods. They
build three value-weighted porfolios of stocks: a high volume portfolio, a normal
volume portfolio and a low volume portfolio. They show that, for formation periods
of one day and one week, the high-volume portfolio earns abnormal risk-adjusted
returns for holding periods of up to a hundred days, without being rebalanced. In a
similar portfolio study, Huang and Heian (2010) use a sample including all firms
listed on the NYSE and AMEX and study the existence of the high-value return
premium over a sample period of fifty years. Their results also support the existence
of a high-volume premium for holding periods of one to four weeks. They show that
the premiums are mostly concentrated in the first two weeks after the formation
period, that they monotonically decrease for extended holding periods, and that they
even become negative for periods longer than eight weeks. Building on this empirical
research, we hypothesize that if shocks to the trading volume of the index potentially
impact stock returns, they might also influence the option pricing error. To account
for both immediate and gradual impact of volume on pricing performance, we include
a contemporaneous measure of trading volume, trading volume on the day of the
pricing as well as a lagged measure, average volume over the last trading week. Both
volumes are defined in number of contracts traded.
Next, we include two measures of volatility for the S&P 500 index. The impact of the
volatility of the spot asset on option valuation is more direct than what we just
described for the trading volume. Indeed, the valuation formulae for the HN and ABS
model both take some form of index volatility as an input. For the ABS model, the
volatility used in the Black-Scholes pricing formula is implied from option prices,
which obviously depend on expected stock market volatility. In the case of the HN
GARCH model, the call price formula relies on the conditional variance over the next
period, b(t +A), which measures the expected variance between the current and the
next period to impact the pricing error. As we can see, the pricing performance is
likely to be influenced by the expectations the market has for the index volatility in
the near future. We account for that fact by adding one forward-looking measure of
volatility, the well-accepted VIX index level to our set of error forecasting variables.
Inspired by an article by Baltussen, Van Bekkum and Van Der Grient (2013), we also
decide to add the volatility of volatility (vol-of-vol) as forecasting variable. We use
the authors original proxy for the vol-of-vol, defined as the standard deviation of the
implied volatilities of ATM put and call options over the last 20 trading days. In their
empirical study, Baltussen, Van Bekkum and Van Der Grient sort individual stocks
by vol-of-vol into value-weighted portfolio quintiles and show that stocks in the
lowest quintile outperform those in the highest quintile by roughly 0.85% in the first
month after portfolio formation. They also show that their results are robust to
controlling for an extensive list of other known drivers of stock returns, such as size,
book-to-market ratio, beta, momentum factor, stock turnover, put-minus-call implied
volatilities or leverage. In light of these findings, we suspect that, as a measure
capturing uncertainty of expected stock returns, vol-of-vol might explain some of the
variance in the two time series of option pricing errors. In addition to the VIX index
level, whose inclusion we discussed earlier, we add another widely used index
released by the Chicago Board Options Exchange, the SKEW index. The skew index
provides traders and portfolio managers with a measure of the perceived tail risk of
the distribution of the SPX log returns at a 30-day horizon. It acts as an indicator for
the expected skewness of the distribution of log returns: when the SKEW level is
around a value of 100, the distribution of log returns is expected to be almost normal.
When the SKEW level is at higher values, the expected skewness becomes more
29
negative and the probability of outlier negative returns increases. Accordingly, the
SKEW index complements our measures of volatility by accounting for the fact that
the distribution of log-returns is not normal. Next, we provide the rationale for adding
the log price to book ratio, the log cyclically adjusted price-earnings ratio (CAPE)
12
and the log index dividend yield as predictor variables of the pricing error.
Pioneering articles by Fama and French (1988a, 1988b) and Campbell and Shiller
(1989) or more recently, research by Campbell and Viceria (2005), show that
variables such as the dividend yield or the price earnings ratio stock have significant
predictive power for stock returns. Fama and French (1998a) examine the predictive
power of the dividend yield on stock returns for various holding periods. Their study
stresses the importance of the holding horizon in stock return predictability: while the
dividend yield only explains less than 5% of the return variance for holding periods of
a month, it explains up to 25% of the 3-5 year return variance. Campbell and Schiller
build on this research by incorporating stock prices and the dividend yield in a Vector
Autoregressive model (VAR), along with the price/earnings ratio. They show that the
P/E ratio is a powerful predictor of stock returns, especially as the holding period
increases. Campbell and Viceira (2005) extend the VAR framework presented in
Campbell and Schiller by including bond and T-bill returns in the VAR model. Their
set of return-forecasting variables includes the log dividend yield, the nominal interest
rate on the three-month Treasury bill and the yield spread between long and short
maturity government bonds. Their article makes an important contribution to the
literature by showing the term structure of the return variance and covariance for
stocks, bonds and T-bills. In the specific case of stocks, they conclude that as holding
horizon increases, predictability from the dividend yield induces mean reversion,
which make stocks less risky to hold over long horizons. For their two other return
forecasting variables, short-term interest rates and term spreads, Campbell and
Viceira (2005) find no conclusive evidence of stock return predictability. In similar
research, Ang and Bekaert (2007) document the opposite: they show that at short
horizons, short-term interest rates have significant predictive power for future excess
returns. Since results differ across studies for the short-term interest rates and the term
spread, we decide to include both variables as explanatory variables, because we
suspect they might influence the option pricing error. We define short-term interest
rates as the log nominal interest rate on the 90-days Treasury bill and the yield spread
as the log difference between the yield on the five-year Treasury note and the yield on
the 90-days T-bill.
Finally, we also account for the fact that variations in investor sentiment might affect
the price of S&P 500 index options and leave room for mispricing. Han (2007) studies
the impact of investor sentiment on the shape of the implied volatility surface and the
risk-neutral skewness of index returns for SPX options. His results show that bearish
sentiment on the market leads to a more negative risk-neutral skewness of index
returns and a steeper volatility smile. For bullish markets, implied volatility smiles
tend to be flatter. In a more recent study, Frijns, Lehnert and Zwinkels (2012) present
a stochastic volatility that differentiates between three groups of option traders:
traders that trade on long-term mean reversion, traders that trade on short-term
patterns and traders that form expectations about future volatility based on
fluctuations in investor sentiment. The authors find that when the third group of
12
The CAPE or PE10 (Schiller) is based on average inflation-adjusted earnings from the previous 10
years
30
sentimental traders is accounted for, in- and out-of-sample pricing errors are
significantly reduced. Their results are in line with those of Han, and show that
sentiment is a non-negligible determinant of the shape of the volatility smile and
option prices. In this research, we include four commonly used proxies for investor
sentiment: the Baker and Wurgler sentiment index, mutual fund flows, the American
Association of Individual Investors (AAII) sentiment survey, and the Investors
Intelligence (II) Bearish Sentiment Index. The sentiment measure of Baker and
Wurgler (2006) is based on six proxies of investor sentiment: market turnover,
number of IPOs, first day return on IPOs, new equity issuances, and difference in
book-to-market ratios between dividend payers and non-dividend payers. In our
analysis, we use the modified version of the index, for which each proxy has been
orthogonalized with respect to the NBER recession indicator, consumption growth
and industrial production. This allows the index to capture pure sentiment
variations, rather than business cycle fluctuations. We obtain the historical values of
the index from the authors website. Our second proxy, mutual fund flows, has also
been shown to be a good measure of investor sentiment (see for example Ben-Refael
et al (2010) and Chiu and Omesh (2013)). We use the net monthly cash flows for
domestic US equity funds from the Investment Company Institute (ICI) as our
measure of sentiment. Finally, we include the AAII survey and the II report to
account for differences in sentiment between institutional and individual investors.
The AAII sentiment survey polls a random sample of individual investors every week
and requires the respondents to formulate expectations on how they think the market
will evolve for the next six months. Based on the responses, the AAII then calculates
the proportion of bullish, bearish and neutral investors. Following Brown and Cliff
(2004), we use the spread between the fraction of bullish and bearish investors as a
measure of investment sentiment. The II report gathers over a hundred independent
market newsletters every week and categorizes their content as bullish, bearish or
neutral. A bull-bear spread is then calculated, in a similar manner to that of the AAII.
At first, it may seem redundant to use four different proxies for investment sentiment.
We justify our choice by the fact that the contemporaneous values of these four
measures are relatively moderately correlated over our sample period, as the table in
Appendix 6 shows. Since our proxies for sentiment hardly move together over time,
we fear that using merely one of them might keep us from observing the real effect of
investor sentiment on the pricing error.
The inclusion of the ten option market variables and the fifteen other variables into
our set of error predictors leaves us with a total of twenty-five dependent variables for
our time series regressions. We show the time series of these predictors in Appendix
8. As it is often the case when working with financial time series, we face the problem
that our variables are not all sampled at the same frequency. For example, values for
the Baker and Wurgler sentiment index or those for the CAPE are only released
monthly by their respective authors. In this case, we use spline interpolation to
transform financial time series with monthly frequency to weekly data. A second
issue is that some of our pricing error predictors are non-stationary. To meet the
assumptions required by OLS regressions on time series, we need to induce
stationarity, either by differencing, or by detrending our raw predictor variables. The
procedure for unit-root testing is identical to the one we described in our previous
section on stationarity: advanced Dickey-Fuller tests are carried out on each predictor,
using the appropriate model (AR, ARD or TS) and the adequate lag length provided
by our information criterion. Trend-stationary variables are detrended and all
31
variables that fail to reject the ADF null hypothesis of non-stationarity are first-
differenced. Appendix 7 provides the results of the ADF tests for each predictor, as
well as an overview of the transformations used to induce stationarity. The next
section presents the results of our in-sample timeseries regression.
In-sample time series regression for the ABS model
The previous section provided the rationale behind our choice of variables to include
in our time series regressions. We also gave an overview of our unit-root testing and
data transformation process. We now proceed to the central piece of our research: the
time series regressions of the option pricing errors on our twenty-five explanatory
variables. Regression results are presented for the ad hoc Black and Scholes model,
for our filtered sample of out-of-the-money S&P 500 index options. We also provide
the regression estimates for the Heston and Nandi GARCH model as confirmatory
evidence in the next section. We first focus on in-sample regressions, and then turn to
out-of-sample prediction.
Table 5 shows the results of the multiple regressions of the pricing error on our set of
predictors for the ABS model. In all four regression model specifications, the
dependent variable is the first-difference of the $RMSE, and the values of the
predictors are contemporaneous to those of the dependent variable. We do not report
regression results for the level of the error, since the level series is non-stationary, and
its use in time series regressions would lead to spurious results. Note that we use four
models in our regression table. Model 1 refers to a first-order autoregressive process,
where the only predictor is the first lag of the dependent variable, Model 2 adds
option sample characteristics to the set of predictors, and Model 3 includes all other
predictors in the regression. Model 4 is a modified version of Model 3, for which all
predictors have been first-differenced. Also note that the volatility of volatility has
been omitted in Model 3, and that VIX index level is absent from Model 4. We
exclude these variables because of their high correlations with other predictors. In
levels, contemporaneous values of VIX and vol-of-vol are strongly correlated
(Pearsons r= 0.80, p<0.001). In first-differences, VIX and the average implied
volatility are strongly correlated (Pearsons r= 0.87, p<0.001). To alleviate
multicollinearity concerns, we therefore arbitrarily eliminate one of the variables in
each correlated pair of predictors
13
.
Model 1
A first look at the four columns of Table 5 confirms our previous intuition: in every
specification, some of our predictor variables are able to explain some fraction of the
variance in pricing error for the ABS model. As Model 1 shows, the mere lagged
values of the error already explain roughly 17 percent of the variance in pricing
performance. The coefficient estimate on the lagged $RMSE is negative and highly
significant, which indicates that a decrease in pricing error in the previous period is
likely to be associated with an increase in the current period. Note that the coefficient
estimate (-0.412) is simply the value for the first lag of the ABS error autocorrelation
function shown in Figure 9.
13
Our results are robust to replacing the VIX level by the vol-of vol in Model 3, and using the VIX
rather than the average implied volatility in Model 4.
32
Table 5: In sample regressions for ABS pricing errors (filtered sample of OTM options)
Model 1 Model 2 Model 3 Model 4
Lagged error -.412**
(0,033)
-0,413***
(0,033)
-0,412***
(0,033)
-0,423***
(0,032)
Mean option price 0,042***
(0,011)
0,043***
(0,011)
0,014
(0,013)
Mean moneyness -0,191
(0,394)
0,075
(0,611)
-1,224
(1,085)
Mean maturity 0,002
(0,001)
0,001
(0,001)
0,007***
(0,002)
Total number of options 3,904***
(1,133)
3,777***
(1,151)
0,004***
(0,001)
Number of untraded options -1,083
(1,258)
-1,406
(1,279)
-0,002
(0,001)
Number of options under 1$ -2,554
(1,771)
-1,893
(1,849)
-0,001
(0,002)
Put-call ratio 0,025
(0,180)
-0,018
(0,185)
0,079
(0,134)
Mean volume per contract 0,009
(0,058)
0,018
(0,058)
0,000008
(0,00006)
Mean implied volatility -0,282
(1,467)
2,137
(1,869)
3,680
(1,870)
VIX level 0,018
(0,373)
-
CAPE 0,525
(2,180)
0,705
(2,004)
Log T-bill interest rate 51,600***
(17,952)
62,951***
(16,79)
Log yield spread 40,658***
(14,477)
42,782***
(13,776)
Log dividend yield -0,050
(3,899)
-122,916***
(18,78)
Bull-Bear spread institutions 0,101
(0,205)
0,0003
(0,004)
Bull-Bear spread individuals -0,081
(0,132)
0,003**
(0,001)
Baker and Wurgler index 0,010
(0,494)
0,072
(0,457)
Skew level -0,001
(0,004)
-0,004
(0,006)
SPX volume -0,020
(0,019)
-0,00002
(0,00002)
SPX volume over past week -0,080**
(0,0393)
-0,00008**
(0,00004)
One-period log returns 0,014
(0,0105)
0,772
(0,669)
Price to book ratio -0,406*
(0,225)
-0,446**
(0,213)
Mutual fund flow -0,003
(0,014)
-0,00005
(0,00006)
Volatility of volatility - 0,015
Constant 0.001
(0,018)
-0,081
(0,443)
-0,238
(0,628)
0,00034
(0,031)
Num. Obs. 752 752 752 752
AJ. R
2
.1684 .1999 .2037 .2716
Reports the unstandardized coefficient, standard errors (in parentheses) and adjusted R
2
for the multiple
regressions of the in sample $RMSE on contemporaneous values of the predictor variables for the ad hoc Black
and Scholes procedure. Model 1 refers to an AR(1) model for the $RMSE. Model 2 and Model 3 respectively add
option market factors and equity market factors to the set of predictor variables. Regression model 4 is equivalent
to Model 3, with all predictor variables first-differenced. . ***, ** and * denote significance at the 1%, 5% and
10% level respectively.
33
In order to explain a higher fraction of the variance, we could also consider higher
order autoregressive models. To illustrate that claim, Appendix 9 shows the partial
autocorrelation function for the first-differenced pricing error of the ABS model. The
chart suggests that adding the second or third lag of the dependent variable as
predictors in Model 1 might increase the fraction of variance explained by a
significant amount. This also raises concerns that the inclusion of additional lags
might weaken the significance of our results for Model 2, 3 and 4 of Table 5, or even
eliminate the significance altogether. In untabulated results
14
, we show that our
regression results are unaffected by the inclusion of additional lags. For the sake of
conciseness, we only report our results for the first lag of the pricing error.
Model 2
As the second column of Table 5 shows, adding option market-related variables to the
set of predictors increases the adjusted fraction of explained variance to 20 percent.
The coefficient on the lagged difference in pricing error remains highly significant
and negative. Additionally, the coefficient estimates for Model 2 show that two more
predictors have a significant influence on the pricing error: the mean option price and
the first-difference of the total number of index options available for trade. The effect
of the mean option price on our measure of dollar pricing error is best understood by
considering our previous results. A look back at Table 3 shows that a high mean
option price on a given Wednesday is probably associated with a tilt toward long-
maturity or near-the-money options, which are the most expensive on average. Figure
7 also reveals that the level of pricing error for the ABS model increases as (1)
relative moneyness approaches unity and as (3) time to maturity increases. Combining
these two results, we provide the following explanation for the positive effect of the
first predictor: a high average price is associated with a tilt of the cross-section of
options toward more expensive options (long-maturity NTM options), which also are
those that are priced with the lowest accuracy by the ABS model. Ultimately, this
results in higher dollar pricing errors.
The effect of the second significant predictor, the total number of options, is
straightforward. To price options, the ABS model relies on the smoothing of the
volatility surface implied by option prices. An increase in the number of options
results in a more complex implied volatility surface that needs to be fitted with a fixed
number of six parameters, or degrees of freedom by the ABS procedure. This
translates into higher interpolation errors during the smoothing process, and
ultimately in higher valuation errors when the smoothed implied volatility surface is
plugged into the Black-Scholes valuation formulae. Note that, as our dependent
variable and unlike the mean option price, the total number of options is first-
differenced in the regression model, which means that an increase from the previous
week in the number of options is associated with in an increase in valuation errors.
Model 3
The inclusion of the rest of the predictors in Model 3 increases the adjusted R
2
by an
additional 0.4 percent, and the effects observed in Model 1 and 2 remain significant.
In addition to the lagged pricing errors, the total number of options and the mean
14
These results are available from the author upon request
34
average option price, four additional predictors (all first-differenced in the regression
model) appear to have a significant effect on the current pricing error.
First, Model 3 shows that both an increase in short-term interest rates and a
steepening of the yield curve for government bonds result in an increase in valuation
errors relative to the previous week. The effect for both interest rates and the yield
spread is highly significant at the 1% level. For the effect of the yield spread and
short-term interest rates on valuation error, we provide two explanations. First, there
might be a mechanical effect due to the way we model interest rates in our ABS
model. Consider the Black and Scholes valuation formula in equation ( 2 ). To
calculate the option price, the expected payoff of the option is discounted at the zero-
coupon rate r corresponding to the options maturity. In our ABS model, we calculate
this rate by interpolating the zero curve on the day of the pricing with quadratic
polynomials. Figure 11 shows the real zero curves for June 9
th
, 1999 and January 17
th
,
1996, along with their interpolation polynomial. The curves are representative for the
two typical shapes observed for the zero curve over our sample period: normal and
inverted. In the first case, an increase in the yield spread results in a more pronounced
curvature for the yield curve, which leads to a poorer fit of our interpolation
polynomial, which is limited to quadratic terms. Over our range of option maturities
(10-360 days), interest rates for short maturities (<60 days) are therefore
overestimated and those for long maturities (>180 days) are underestimated, as the
first chart of Figure 11 shows.
Figure 11: Zero curve interpolation, yield spread and option pricing error
This translates into the use of inadequate discount rates in the Black Scholes pricing
formula and ultimately in higher valuation errors. In the second chart, the yield spread
is negative, and an increase in its value has the opposite effect, since it results in a
flatter yield curve, a better interpolation fit and lower valuation errors. We therefore
believe that part of the effect of the yield spread on the pricing error is mechanical,
and that its sign depends on the shape of the zero curve. Since the yield curve has a
normal, concave shape more than 75% of the time over our sample period, the effect
of the yield spread on pricing error should ultimately be positive, as our regression
results show in Table 5. Interpolation errors might also partly explain the significant
positive effect of the short-term interest rates on valuation error. For some days of our
35
sample, the zero curve exhibits a hump for short-maturity interest rates, which also
results in a poorer fit of our quadratic interpolant, and ultimately in higher option
pricing errors. A second explanation for the positive coefficients on the yield spread
and the short-term interest rate is that high interest rates usually go along with periods
of economic uncertainty. In these crisis periods, there is an abundant demand for
protective puts from traders and portfolio managers who use these derivatives as
insurance against the eventuality of a crash. This unusual demand drives the prices of
short and medium term OTM and DOTM put options way above those of equivalent
call options, thereby distorting the shape of the implied volatility surface. Since this
distorted surface is used as an input for the Black and Scholes pricing formula, it
ultimately results in increased mispricing, which explains our results. We illustrate
this phenomenon later in this section using Figure 13.
Coming back to Table 5, Model 3 also shows that a downward movement for the
average trading volume of the S&P 500 index during the previous week is associated
with simultaneous increases in valuation errors. We provide the following hypothesis
to explain how the trading activity on spot markets influences valuation error. One
channel through which spot liquidity interacts with option markets is option market
makers, who often use the underlying asset to hedge their put and call option
positions. For example, Cho and Engle (1999) argue that the liquidity and bid-ask
spread in the options market is determined by the liquidity in the spot market, rather
by the activities in the derivative market itself. This means that low trading activity on
the spot market could lead to higher bid ask-spreads on the options market. In turn,
these higher bid-ask spreads might influence the shape of the implied volatility
function and therefore option prices (see Chou et al (2012)). We therefore argue that
the negative coefficient observed in Table 5 is attributable to higher liquidity costs on
option markets. Finally, a negative effect on the $RMSE is also found for the first-
difference of the price to book ratio, but this effect is only marginally significant.
Model 4
Finally, Model 4 shows the results of a full regression model for which all twenty-
four predictors have been first-differenced. This allows us to check for additional
effects that may not have been captured in Model 3, for which some of the variables
are expressed in levels and other in first-differences, as we explained when we
considered stationarity issues in the previous section. The adjusted R
2
for this last
specification is the highest among the four models, with a value of 27%. We note that
in addition to the 17% explained by the autoregressive nature of the $RMSE in Model
1, we are able to explain an additional 11% of the variance from option sample
properties, S&P 500 index characteristics and general market conditions. Regarding
regression coefficient estimates, those of the total number of options, the short-term
interest rate, the yield spread, the price to book ratio and the past week average
volume all remain significant and of the same sign. In addition, whereas the first-
difference of the mean option price does not appear to have any significant influence
on the pricing error, the coefficient estimate of the log dividend yield becomes highly
significant and negative after differencing. A similar effect is found for the mean
option maturity, whose first-difference has a positive highly significant impact on the
ABS pricing error. Finally, the coefficient on the AAII bull-bear spread, one of our
sentiment proxies, becomes positive and significant.
36
The influence of the mean option maturity on valuation error is rather straightforward
to explain, as options with longer time to expiration are harder to price than their
short-maturity counterpart. We discussed this already when we interpreted the
influence of the mean option price, and refer to Figure 7 for a classification of the
valuation error by maturity categories.
Regarding the highly significant negative coefficient on the dividend yield, a unique
cause is difficult to pinpoint but we provide the following insights. The direct
influence of the dividend yield itself on option prices is obvious. In the first term of
the Black and Scholes valuation formula, the spot price is discounted for expected
dividends. All other things left unchanged, an increase in the dividend yield reduces
the value of the first term in equation ( 2 ), and leads to lower option prices. The
magnitude of this effect is asymmetrical across maturities: it is stronger for long-
maturity options, for which dividends are discounted over longer periods, and almost
negligible for options close to expiration. What causes fluctuations in the dividend
yield is less clear: it can either arise from a change in the index level, or from a cut in
the overall amount of dividends paid out on the index constituents. As the mirrored
curves in Figure 12 show, variations of the dividend yield are usually due to price
fluctuations: when price increases, the dividend yield decreases. Dividend payouts
themselves change rather smoothly over time, since sudden cuts send negative signals
about future performance, and usually lead market participants to punish companies
by moving their investment elsewhere. However, a closer look at the two crises in
2000 and 2008 shows that during these periods, variation in the dividend yield might
arise from sudden changes in the amount of paid dividends itself. In 2008 for
example, many large companies of the S&P 500 suspended or cut dividends, mostly
financial institutions such as Bank of America Corp., but also other firms such as
General Motors, resulting in $33 billion of unpaid dividends to stockholders
15
. In
2001, the fraction of S&P500 constituents paying dividends was down 15% relative to
that in 1996 (source: S&P indicated rate change spreadsheet). In light of this
information, we believe the negative coefficient on the dividend yield in Table 5
arises from specificities of our sample period, which contains two major recession
periods in fifteen years of data. We argue that the link between the dividend yield and
the valuation error might be coincidental, since period with sharp decreases in the
former correspond to periods of increases in the latter. If this is the case, we expect
the coefficient to become insignificant over longer sample periods. We also suggest a
second possibility, which is related to the assumptions of our pricing model. In the
ABS pricing model, the expected dividend yield over the options remaining life is
approximated by a constant, which is the current S&P 500 index dividend yield on the
day of the pricing. Using this constant yield, the ABS model calculates the present
value of all future dividends paid on the S&P500 index until the options expiration.
In reality, as we showed in Figure 12, the dividend yield is very unlikely to remain at
its present level over the options remaining life (especially for long-maturity options)
if it experienced a recent series of consecutive large drops over the past few weeks.
While the ABS model does not incorporate that information, we hypothesize that
option market participants do. We suggest that they are influenced by recent
consecutive decreases in the dividend yield (for example during crisis periods), which
leads them to use a different dividend yield, adjusted on their pessimistic expectations
for the future evolution of the index dividend yield. We suggest that this effect is
15
source: The Wall Street Journal
37
asymmetric: market participants are less affected by recent increases than decreases in
the dividend yield. This could would lead to a discrepancy between model and market
prices and provide a second possible explanation for why the level of ABS pricing
error increases when the dividend yield decreases. Determining which of the first or
the second theory is the most likely to explain our results in behind the scope of this
study. However, in both cases, we are fairly convinced that the negative coefficient on
the dividend yield is linked to the two crisis periods.
As conclusive remark before discussing the regressions results for the Heston and
Nandi GARCH model, we stress out that one must remain critical when interpreting
our regression results. Although great attention has been devoted to any possible
collinearity or non-stationarity issues among our set of variables, we insist on the fact
that many of our variables are weakly or moderately correlated. First differencing
alleviates most of these concerns, but the issue of the representativeness of our sample
remains. It may be that some of the relationships we discussed above are specific to
our sample period.
Figure 12: Dividend yield and pricing error
38
In sample regressions for the HN GARCH model
We present the results in this section as confirmatory evidence for the relationships
we just described for the ABS model. Table 6 shows the results of the same four
regression models used in the previous section for the HN model.
Model 1
The results for Model 1 globally corroborate what we found earlier: the first lag of the
$RMSE explains a non-negligible fraction of the variance in pricing error and the
coefficient estimate for this predictors is negative and highly significant at the 1%
level. Note that for this first regression model, the coefficient estimates and the
adjusted R
2
are almost identical to those for the ABS model in Table 5.
Model 2
Extending our set of predictors to option characteristics yields a significant 11%
increase in the percentage of variance explained. As for the ABS model, the total
number of options and the mean option price are both positive and highly significant.
A noteworthy difference between the two option pricing models is the lesser
explanatory power of the lagged $RMSE in Model 2, and the much higher adjusted
R
2
of Model 2 (0.295 vs 0.20 for the ABS model). This is easily explained by the
additional effect of the number of untraded options, the put-call ratio and the mean
implied volatility. The coefficient estimates on the number of untraded options and
the put-call ratio are both negative, whereas the coefficient on the mean implied
volatility is positive.
Model 3
The inclusion of all other predictors in Model 3 produces the largest change in terms
of variance explained and coefficient estimates. The adjusted R
2
jumps to 0.46 and
the coefficients of Model 2 experience radical changes when the additional variables
of Model 3 are added to the regression model. For example, the explanatory power of
the put-call ratio simply disappears, the coefficient on the mean implied volatility
changes sign and the coefficient on the mean moneyness becomes negative and highly
significant. This is mainly explained by the inclusion in Model 3 of one-period log
returns. For the HN GARCH the explanatory power of this specific variable is
considerable: in Model 3, log-returns are the second major explanatory variable for
the pricing error (t = 14.41, p = 1.42 1u
-41
), after the first lag of the error
(t = 14.47, p = 6.8 1u
-42
). The log-returns on the S&P 500 index alone explain
more than 15% of the variance in the dependent variable. We argue that this
divergence from our previous results for the ABS model is due to the fact that for the
HN model, option prices and therefore valuation errors are directly dependent on the
recent history of index returns. As for the ABS model, the coefficient on the total
number of options and the mean option price remain positive and highly significant.
In addition, the short-term interest rates and the yield spread both have the same sign
and level of significance as for the ABS model, which confirms their influence on
pricing performance. Table 6 also reveals that decreases in mean moneyness lead to
an increase in valuation error. We provide the following explanation for this effect.
We defined mean moneyness as the ratio K/S, with high values of this ratio (way
above 1) corresponding to DOTM calls and low values (way under 1) to DOTM puts
options. Since our filtered sample is restricted to OTM options, a decrease in mean
39
moneyness translates into a more pronounced tilt toward put options. The coefficient
for the mean moneyness therefore indicates that when the set of options becomes
gradually tilted toward put options, the valuation error increases. We believe that this
link between moneyness and error might be attributable to specific features of our
data during the two crisis periods, with period of very high valuation errors being
periods with high demand for DOTM crash-protective puts. To illustrate that claim
we show the $RMSE surface on two different days of our sample. In the first chart,
the pricing error is highest for long-maturity DOTM options, and decreases as
moneyness increases and the expiration date approaches. The shape of the first
surface is typical for a vast majority of the days in our sample. The second surface,
however, shows a distortion, which is almost entirely attributable to the mispricing of
DOTM and OTM put options with shorter time to maturity. This chart, shown in the
midst of the 2008 financial crisis, illustrates the effects of an abnormal demand in
DOTM and OTM puts on valuation error. We argue that this phenomenon might
explain to some extent the negative coefficient on mean moneyness in Table 6.
Figure 13: Illustration of the distortions created by abnormal demand for "crash-protective" puts
From Table 6, we also observe that both increases in mean implied volatility and the
number of untraded options result in lower valuation errors.
Model 4
Finally, Model 4 shows the results of a regression model for which all independent
variables have been first-differenced. Our main conclusions from Model 3 remain,
with the exception of the coefficient on the mean moneyness, which becomes
unsignificant for Model 4. Interestingly, a negative influence of the price to book ratio
(as for the ABS model) and a negative influence of the institutional bull-bear spread
are also found to be significant. From the results of Model 4, it appears that an
upward movement in institutional sentiment leads to an improvement in pricing
performance for the HN GARCH model. For the ABS, we found a relationship of the
opposite sign for the individual bull-bear spread, which renders interpretation difficult
for these measures of investor sentiment. Nevertheless, we argue that the major
decreases of institutional sentiment coincide with recession periods for our sample
period. This adds to our previous evidence in favor of a positive relationship between
financial market trouble and deterioration of pricing performance. This concludes our
discussion of the in-sample regression results. In the previous two sections, we
showed that as much as 11% (30%) of the variance in valuation error for the ABS
40
Table 6: In sample regressions for HN pricing error (filtered sample of OTM options)
Model 1 Model 2 Model 3 Model 4
Lagged error -.431**
(0,033)
-0,391***
(0,031)
-0,394***
(0,027)
-0,274***
(0,028)
Mean option price 0,130***
(0,017)
0,131***
(0,016)
0,132***
(0,019)
Mean moneyness -0,532
(0,617)
-2,407***
(0,837)
-1,540
(1,53)
Mean maturity 0,003
(0,002)
0,001
(0,002)
0,00011
(0,003)
Total number of options 9,381***
(1,779)
10,315***
(1,581)
0,009***
(0,002)
Number of untraded options -4,225**
(1,968)
-4,582***
(1,751)
-0,005***
(0,002)
Number of options under 1$ 1,118
(2,771)
0,155
(2,533)
0,003
(0,003)
Put-call ratio -0,691**
(0,281)
-0,019
(0,253)
-0,252
(0,19)
Mean volume per contract 0,098
(0,091)
0,055
(0,079)
0,00010
(0,00008)
Mean implied volatility 4,996**
(2,295)
-15,897***
(2,558)
-12,731***
(2,61)
VIX level 0,577
(0,511)
-
CAPE 4,692
(2,99)
-1,36
(2,790)
Log T-bill interest rate 84,72***
(24,585)
93,406***
(23,37)
Log yield spread 39,303***
(19,84)
52,74***
(19,193)
Log dividend yield -3,11
(5,341)
-183,682***
(26,124)
Bull-Bear spread institutions 0,188
(0,280)
-0,017***
(0,005)
Bull-Bear spread individuals 0,251
(0,181)
-0,002
(0,00191)
Baker and Wurgler index -0,396
(0,676)
-0,42835
(0,636)
Skew level -0,00019
(0,0055)
-0,01040
(0,00801)
SPX volume -0,01614
(0,0267)
-0,00002
(0,00003)
SPX volume over past week -0,00001
(0,054)
-0,00001
(0,00005)
One-period log returns -0,206***
(0,014)
-12,434***
(0,99)
Price to book ratio -0,467
(0,3079)
-0,60054***
(0,296)
Mutual fund flow 0,00288
(0,020)
0,00002
(0,00008)
Volatility of volatility
-
0,032
(0,043)
Constant 0.0064
(0,03)
0.556
(0,693)
2,052**
(0,860)
0,004
(0,024)
Num. Obs. 752 752 752 752
Adj. R
2
.1846 .2951 0.4628 0.4922
Reports the unstandardized coefficient, standard errors (in parentheses) and adjusted R
2
for the multiple
regressions of the in sample $RMSE on contemporaneous values of the predictor variables for the Heston and
Nandi GARCH. Model 1 refers to an AR(1) model for the $RMSE. Model 2 and Model 3 respectively add option
market factors and equity market factors to the set of predictor variables. Regression model 4 is equivalent to
Model 3, with all predictor variables first-differenced. . ***, ** and * denote significance at the 1%, 5% and 10%
level respectively.
41
model (HN model) could be explained by changes in economic conditions, option
sample characteristics and index properties. In Appendix 11, we provide the results of
the same regressions for the ABS model on the unrestricted sample of SPX options,
including ITM and DITM options, as additional evidence to support our previous
findings. Appendix 11 mainly confirms our previous results for the filtered sample of
OTM options: highly significant positive coefficients on the short- term interest rates,
yield spread, one-period log returns, total number of options and highly significant
negative coefficients on the price to book ratio and the dividend yield. Interestingly,
the adjusted R
2
of Model 4 also shows that our predictors have considerably more
explanatory power when the whole sample of options is considered, as their first-
differences explain roughly 30% of the variance for in-sample pricing errors.
Altogether our results confirm our initial intuition: pricing performance considerably
deteriorates in depressed financial markets. In the next section, we discuss our
findings for the out-of-sample prediction of option valuation errors for both models.
Out-of-sample time series regression for the ABS and HN model
The previous section aimed to explain the variation in option pricing performance for
our models over our 1996-2010 period. We showed that factors such as the dividend
yield, the total number of options, or the one-period log-returns were able to explain a
sizeable portion of the in sample variance in valuation error. Despite small
discrepancies between the regression results for the ABS and HN model, we showed
that an overall pattern emerges: periods of financial market trouble, characterized by
negative index returns, high demand for crash-protector puts, low institutional
sentiment, sudden decreases in dividend pay-out, low price to book ratios and
significant movement in interest rates were associated with periods of high valuation
errors. This prompts the question of whether our results hold for out-of-sample
pricing performance. In this section, our aim is twofold. First, we want to find out if
future pricing performance can be forecasted from the current vales of our economic
predictors. To that end, we use regression models 1,2 and 3 from the previous section,
with the OOS pricing error as the dependent variable and the lagged values of the
predictors as independent variables. Second, we also want to determine the proportion
of the OOS error due to changes in economic environment between the moment of
estimation and the time of evaluation. To calculate this amount of variance explained,
we use Model 4 of the previous section, with the OOS pricing error as the dependent
variable and the contemporaneous first-differences of the predictors as independent
variables. The results of these OOS regressions for the ABS and HN model are shown
in Table 7 and Table 8, respectively.
Forecasti ng the prici ng error one peri od ahead
As the results for the first three models in Table 7 show, almost all the forecasting
power for the ABS model is concentrated in the lagged first-difference of the error.
The coefficient estimate for this variable remains negative, which translates into the
following interpretation: a decrease in OOS pricing error over last week is likely to be
associated with an increase in OOS error over the coming week. Including the
characteristics of the option sample in our set of forecasting variables increases the
adjusted R
2
by a mere 1,8%, and adding the rest of the predictors explains an
additional 1% of the variance in ABS OOS errors. In terms of significance, the major
predictors of OOS error are the mean option price, the number of options worth less
42
than 1$, the CAPE ratio and the mean volume over past week. The individual bull-
bear spread is only marginally significant.
For the HN model, the results of Models 1 to 3 indicate that the lagged first-difference
of the OOS error has significantly less predictive power than for the ABS model: it
only explains 13% of the OOS error variance (compared to almost 22% for ABS).
However, the other predictors have considerably more forecasting power. Factors
such as the mean option price, the mean implied volatility, the one-period log returns,
the CAPE and the institutional bull-bear spread account for roughly 7% of the
variance. For both ABS and HN, the mean option price and the CAPE are significant
at the 5% level, and their coefficient estimates are respectively positive and negative.
Over our sample period, a higher mean option price is often associated with an overall
higher level of implied volatility or a concentration of the market demand in one
specific type of option, which is typical of periods of financial uncertainty, as she
showed earlier in Figure 13. The positive coefficient estimate therefore indicates that
when the mean option price is currently high, one can reasonably expect a higher
OOS valuation error in the coming week, because of expected changes on financial
markets. These changes can either be a rapidly climbing index level (in the wake of
financial bubbles), or a plummeting index price (following the burst of a bubble).
Once again, we stress that this interpretation is specific to our sample period.
Regarding the negative coefficient estimate on the CAPE, we provide a similar
explanation. Sudden decreases in the CAPE can either be due to increases in
cyclically adjusted earnings, or decreases in stock price. Since the CAPE is based on
the average inflation-adjusted earnings of 500 constituents over the previous 10 years,
the probability of a sharp increase in earnings is unlikely. We therefore argue that
sudden variations in CAPE are due to price fluctuations. The CAPE chart in
Appendix 8 seems to be in line with that claim: large changes in CAPE are
concentrated in the two recession periods. In light of that evidence, we postulate that
the negative coefficient on the CAPE is also attributable to the specificities of our
sample period. Large recent decreases in CAPE go along with worsening economic
conditions. This, in turn, leads to uncertainty on the stock and option markets, which
explains the predicted increase in option valuation error. Also note the positive
coefficient on the mean implied volatility for the HN model, which tends to
corroborate our previous arguments. Interestingly, the lagged log-returns appear to
have a negative, highly significant effect on future OOS performance. We argue that
this relationship is induced mechanically, since lagged log-returns are highly
negatively correlated with the lagged contemporaneous OOS error, which is itself
highly negatively correlated with its own lagged values.
Altogether, regression models one to three for both HN and ABS seem to indicate that
our set of predictors has a limited, yet significant, forecasting power for the expected
movements in pricing performance over the coming week. Over a one-week horizon,
variables such as the CAPE, the mean option price and the lagged error enable us to
predict a non -negligible fraction of the future variance in valuation errors. One must
however remain critical since we suspect some of the relationships discussed above to
be specific to the troubled nature of our sample period. Nevertheless, our regressions
for the ABS and HN models show similar results regarding the sign and significance
of the forecasting variable. Considering that the two time series of OOS errors rely on
radically different option model specifications, we are highly confident that our
results are in no way coincidental.
43
Table 7: Out of sample regressions for ABS pricing errors (filtered sample of OTM options)
Model 1 Model 2 Model 3 Model 4
Lagged error -.468**
(0,033)
-0,479***
(0,034)
-0,475***
(0,034)
-0,464***
(0,031)
Mean option price 0,071**
(0,034)
0,081**
(0,034)
0,005
(0,039)
Mean moneyness -0,651
(1,197)
0,496
(1,862)
5,908*
(3,219)
Mean maturity -0,002
(0,004)
-0,005
(0,004)
0,018***
(0,006)
Total number of options -2,342
(3,493)
-2,542
(3,555)
0,020***
(0,003)
Number of untraded options 4,158
(3,887)
4,071
(3,946)
-0,016***
(0,004)
Number of options under 1$ -11,199**
(5,376)
-11,999**
(5,624)
-0,018***
(0,006)
Put-call ratio -0,483
(0,545)
-0,457
(0,558)
0,590
(0,396)
Mean volume per contract 0,073
(0,176)
0,054
(0,176)
0,00041**
(0,0002)
Mean implied volatility 7,184
(4,457)
5,101
(5,649)
5,050
(5,54948)
VIX level -0,822
(1,128)
-
CAPE -14,769**
(6,583)
3,393
(5,956)
Log T-bill interest rate 53,554
(54,497)
-126,992**
(49,812)
Log yield spread 14,510
(43,735)
66,287
(40,881)
Log dividend yield 8,257
(11,774)
-179,807***
(55,739)
Bull-Bear spread institutions 0,844
(0,619)
-0,036***
(0,012)
Bull-Bear spread individuals -0,667*
(0,400)
0,005
(0,004)
Baker and Wurgler index -0,986
(1,491)
-0,230
(1,356)
Skew level -0,003
(0,012)
-0,009
(0,017)
SPX volume 0,022
(0,059)
-0,00008
(0,000)
SPX volume over past week 0,254**
(0,119)
0,00006
(0,00011)
One-period log returns 0,002
(0,032)
1,783
(1,98393)
Price to book ratio -0,794
(0,681)
1,275**
(0,632)
Mutual fund flow -0,023
(0,043)
0,00006
(0,00018)
Volatility of volatility
-
0,202**
(0,09277)
Constant 0.0087
(0,055)
1,2063
(1,346)
0,453
(1,909)
-0,008
(0,051)
Num. Obs. 751 751 751 751
Adj. R
2
.2164 .2342 0.2421 0.3304
Reports the unstandardized coefficient, standard errors (in parentheses) and adjusted R
2
for the multiple
regressions of the out-of-sample $RMSE on lagged values of the predictor variables for the ad hoc Black and
Scholes procedure. Model 1 refers to an AR(1) model for the $RMSE. Model 2 and Model 3 respectively add
option market factors and equity market factors to the set of predictor variables. Regression model 4 is similar to
Model 3, with the difference that all predictor variables are contemporaneous to the dependent variable and first-
differenced ***, ** and * denote significance at the 1%, 5% and 10% level respectively.
44
Table 8: Out of sample regressions for HN pricing errors (filtered sample of OTM options)
Model 1 Model 2 Model 3 Model 4
Lagged error
-.3693**
(0,034)
-0,399***
(0,035)
-0,327***
(0,037)
-0,302***
(0,032)
Mean option price 0,080***
(0,029)
0,063**
(0,029)
0,150***
(0,032)
Mean moneyness -0,991
(1,001)
1,975
(1,530)
-1,566
(2,601)
Mean maturity 0,00034
(0,003)
-0,00005
(0,0034)
-0,001
(0,005)
Total number of options 1,273
(2,895)
0,076
(2,892)
0,008***
(0,003)
Number of untraded options -5,001
(3,187)
-4,900
(3,170)
0,002
(0,003)
Number of options under 1$ -2,399
(4,485)
-3,185
(4,591)
0,001
(0,005)
Put-call ratio 0,0365
(0,4557)
-0,5279
(0,4581)
0,296
(0,321)
Mean volume per contract 0,093
(0,147)
0,108
(0,144)
0,0002***
(0,0001)
Mean implied volatility 10,331***
(3,726)
24,961***
(4,756)
-20,76***
(4,510)
VIX level 0,152
(0,926)
-
CAPE -13,955**
(5,410)
-0,878
(4,810)
Log T-bill interest rate -37,527
(44,536)
40,186
(40,270)
Log yield spread 2,349
(35,962)
60,952*
(33,055)
Log dividend yield 18,8415*
(9,6662)
-254,404***
(45,040)
Bull-Bear spread institutions 1,163**
(0,508)
-0,040***
(0,009)
Bull-Bear spread individuals 0,087
(0,330)
-0,0001
(0,0033)
Baker and Wurgler index -0,09207
(1,2248)
0,682
(1,096)
Skew level 0,00433
(0,01)
0,001
(0,014)
SPX volume -0,002
(0,048)
0,00002
(0,00004)
SPX volume over past week 0,027
(0,098)
0,00010
(0,00009)
One-period log returns 0,16651***
(0,0278)
-12,37***
(1,645)
Price to book ratio -0,4122
(0,5578)
-0,956*
(0,511)
Mutual fund flow -0,0081
(0,0354)
0,00006
(0,00014)
Volatility of volatility
-
0,26779***
(0,0747)
Constant 0.0075
(0,0462)
0,892
(1,125)
-2,155
(1,567)
0,0001
(0,0411)
Num. Obs. 751 751 751 751
Adj. R
2
.1352 .1649 0.2036 0.3179
Reports the unstandardized coefficient, standard errors (in parentheses) and adjusted R
2
for the multiple
regressions of the out-of-sample $RMSE on lagged values of the predictor variables for the Heston and Nandi
GARCH. Model 1 refers to an AR(1) model for the $RMSE. Model 2 and Model 3 respectively add option market
factors and equity market factors to the set of predictor variables. Regression model 4 is similar to Model 3, with
the difference that all predictor variables are contemporaneous to the dependent variable and first-differenced
***, ** and * denote significance at the 1%, 5% and 10% level respectively.
45
Infl uence of period-to-peri od market fluctuations on OOS error
The last column of Table 7 and Table 8 allows us to study another set of relationships.
In Model 4, we determine the amount of variance in OOS error that can be explained
from changes in the economic environment between the point in time at which the
parameters of the model are estimated, and the time at which options are priced
(exactly one week later). We stress out that for Model 4, the predictors are first-
differenced, but also contemporaneous to the dependent variable. Noteworthy features
shared by both models in Table 7 and Table 8 are the positive coefficients on the total
number of options, mean volume per contract, volatility of volatility and the negative
coefficients on the institutional bull-bear spread and the dividend yield.
To interpret those results, let us consider a hypothetical situation for which we
estimate the ABS (GARCH) parameters for a given day, with the intention of using
these parameters to price options in the next period. Our results indicate that if the
number of contracts available on the options market and the mean option price
increase between the current and the next period, higher OOS valuation errors are
likely to follow.
Similarly, an increase in the vol-of-vol between periods will lead to a higher level of
mispricing in the next week. We defined volatility of volatility as the standard
deviation of the implied volatility of ATM options over the last 20 days. First
interpretation is that an increase in vol-of-vol is typical for a more troubled market,
with gradually raising levels of uncertainty for the short-term future and an irregular
shape for the implied volatility surface. Although we expect the effect of vol-of-vol to
be stronger for the ABS model (for which the implied volatility surface is used as an
input), our results show that this is not the case. Out of curiosity, we narrowed the
estimation period for the vol-of-vol from 20 to 9 days and estimated Model 4 again.
The results for both models were unequivocal: the coefficient on the vol-of-vol
becomes larger and highly significant for both models, even more so for the ABS
model (t = 11.S4, p = 1.S 1u
-2
), for which the R
2
jumps from 33% to 43%. This
supports our intuition that the influence of vol-of-vol proxies for a deterioration of
economic conditions over the week between estimation and evaluation. As a second
interpretation, we suggest that high vol-of-vol might translate into a radical change in
the variance process between two subsequent weeks, due to some major event
between estimation and evaluation. The parameters estimated in the previous period,
which at that time might have led to good in-sample performance, would therefore
lead to a poor fit at the time of evaluation.
The negative coefficient on the institutional bull-bear spread is also in line with this
reasoning, since a decrease in institutional sentiment between estimation and
evaluation is associated with deterioration in OOS pricing performance. Finally, the
negative, highly significant coefficient on the dividend yield relates to the argument
we made previously using Figure 12: a large decrease in the dividend yield over short
periods is indicative of financial turmoil, and it is often associated with unusual
changes in the shape of the implied volatility function that lead to higher valuation
errors. Another way to look at this phenomenon is by considering the persistency of
the option pricing model parameters: o
0
o
1
o
2
o
3
o
4
o
5
for ABS and o
-
[
-
y
-
-
during crisis periods.
46
Appendix 10 shows the time series of the parameter o
3
(linear maturity term) for the
ABS model, as well as that the series y
-
(risk-neutral skewness) for the HN GARCH.
Both time series exhibit considerably more variations around 2000 and 2008, which
illustrates our point. Since OOS valuations relies on past week estimates for the
models parameter, a high week-to-week volatility for these parameters results in less
accurate valuation. As confirmatory evidence, we provide the results of these
regressions on our unrestricted sample in Appendix 12 for the ABS model. Without
discussing these results in depth, we argue that a similar pattern as that we just
described emerges when ITM and DITM options are included in the sample. Most of
the significant coefficient estimates for our filtered sample of OTM options remain
significant and of the same sign after including in-the-money options, with the
exception of the mean option price, for which the coefficient in Model 4 becomes
negative. Interestingly, mean maturity and moneyness both become highly significant
and positive in Model 4.
47
Conclusions
Throughout this study, we have tried to fill a void in the option pricing literature.
Since the introduction of the Black-Scholes model in 1973, much attention had been
devoted to finding the best performing option-pricing model among the multitude of
existing alternative models. From binomial trees to stochastic volatility models,
academics and practitioners have progressively developed an impressive amount of
ever more accurate models. Oddly enough, to the extent of our knowledge no study
has ever focused on common cyclical performance patterns across models. Could
diametrically different models share a common trend in pricing performance over
time, and if so, can this shared variation be explained to some extent? These
questions were the guiding thread of our research.
This study focuses on two widely accepted option-pricing models, the ad hoc Black
and Scholes, or practitioners model, and the GARCH model with Gaussian
innovations developed by Heston and Nandi (2000). As first step of our study, we
derived the times series of the pricing error for both models, using fourteen years of
weekly option and index data for the S&P 500 index. This allowed us to observe a
strong common pattern of cyclicality in option pricing performance over our sample
period. For our two models, both in-sample and out-of-sample valuation errors
reached considerably higher levels during periods of financial turmoil. Although it
was not the primary aim of our research, we were also able to compare our results
with those of the option pricing literature. We found that our findings are in line with
those presented in Heston and Nandi (2000) and Barone-Adesi, Engel and Mancini
(2008): the HN GARCH model systematically yields lower valuation errors than the
ad hoc Black and Scholes model of Dumas, Fleming and Whaley (1998). We also
documented that a modified version of the ABS model described in Bollen and
Whaley (2004) was able to perform even better over our sample period. In order to
fully explain the overall superior pricing accuracy of the ad hoc Black and Scholes
model, we derived the time series of the pricing error for nine option categories based
on option moneyness and time to expiration. This allowed us to observe that the HN
GARCH model systematically fails to deliver performance for short-maturity options,
even more so when these options are out-of-the-money.
Coming back to our original guiding thread, we then shifted our focus to finding
underlying factors that could explain the cyclicality observed for our time series of
valuation errors. An extensive review of the empirical finance literature yielded a set
of twenty-five predictors related either to our option sample, to characteristics of the
S&P 500 index, or associated with global economic conditions. A series of in-sample
and out-of-sample time series regressions then allowed us to determine the impact of
our various predictors on pricing performance. Our regression results for both models
are unequivocal and they lead to the same general conclusion: pricing performance
deteriorates along with worsening economic conditions. Our regression results
especially indicate that over our sample period, crisis periods characterized by
negative index returns, high demand for crash-protective put options, declining
institutional sentiment, sudden reductions in dividend pay-out, low price to book
ratios and significant movement in interest rates coincide with periods of high
valuation errors. This conclusion holds for both out-of-sample and in-sample pricing
performance. Our research contributes to the existing literature in several ways.
48
To the extent of our knowledge, this is the first attempt to study the time series
behaviour of the pricing error across option-pricing models. Over our fifteen-year
sample period, we were able to show that, no matter the model, performance has a
discernable time component. For two diametrically different models, we showed that
the level of valuation error varies considerably over longer periods of time and that it
is highly dependent on current economic conditions. We therefore advocate for taking
this time component into account in future research.
Moreover, we identify a set of economic indicators and option market characteristics
able to explain part of the cyclical nature of option pricing errors. We show that the
cycles in pricing performance are mainly affected by variables representative of
general financial conditions, such as short-term interest rates, the shape of the yield
curve for government bonds, movements in institutional sentiment as well as dividend
yields and price-earnings ratios for the S&P 500 index. Our results indicate that, as
for many other asset pricing models in empirical finance, the performance of option
pricing models is significantly worse and more volatile around market crashes, due to
the erratic behaviour of the stock and options market in these troubled times. We also
showed that the parameters used as input in the option pricing-models tend to be more
volatile during crises periods, and lead to deteriorating out-of-sample performance,
due to large fluctuations between subsequent periods. This underlines the importance
of frequent model recalibration to account for variations in economic indicators such
as interest rates or institutional demand for crash-protective put options. Our last
remark is of high relevance for practitioners who use these option-trading models on a
regular basis as a professional tool.
In addition, the time series for the valuation error prove to be relatively sticky. The
high levels of error observed in the midst of recession periods are not confined to the
peaks of the associated financial crises; they already appear in the preceding months
and remain present for a long time afterwards. As a result, our results show an
alternation of relatively long periods of high and low valuation errors. At any point in
time, it is therefore possible to make reasonable predictions for the level of option
pricing error in the near future, from the current values of the error.
Our results also clearly show that, although the general level of error increases for all
models in depressed financial markets, some models might perform better than others
in crisis periods. In light of that evidence, we argue that comparing model
performance over short periods of time of two years or three years, as is usually done
in the existing literature, only offers limited insight. In our opinion, a fair comparison
between option-pricing models should account for the fact that the relative
outperformance of a model over another might only be true for a specific time period.
Finally, although it was not our primary aim, we also indirectly contributed to the
option pricing literature by showing that the performance of deterministic volatility
models, such as the ad hoc Black and Scholes model, is highly dependent on model
specification. Our results indicate that variable transformation in the deterministic
volatility function, such as the replacement of strike price by relative moneyness, lead
to considerable improvements in pricing performance. We showed that the modified
ad hoc Black and Scholes presented in Bollen and Whaley (2004) significantly
outperforms the Heston and Nandi GARCH model during crisis periods. This is in
line with the compelling amount of evidence showing that the ad hoc Black and
49
Scholes model, although it is simple and theoretically inconsistent, is a remarkable
tool to price options. In addition to these contributions, we also note a few
shortcomings and suggestions for further research
First, we note that despite the attention devoted to colinearity and stationarity issues
in our time-series regressions, our results have to be considered with caution. Many of
the variables used as predictors of option pricing performance are weakly or
moderately correlated, which might slightly inflate the portion of variance explained.
We accept this limitation, and expect future research to elucidate this issue and
provide further evidence for the relationships we highlighted in our research.
We also discuss several minor issues regarding the numerical implementation of our
option pricing models. Despite the great care with which we implemented the
calibration procedure, we suspect that common optimization issues, such as local
minima for the root mean square pricing error, might slightly affect our results. To
alleviate this concern, we propose the implementation of a genetic algorithm. A
second issue relates to the quadratic interpolation polynomials we used to fit the yield
curve. Although they provide fairly accurate estimates most of the time, these
polynomials exhibit a poor fit when the yield curve deviates from its usual shape and
becomes inverted. We recommend the use of spline interpolants, which would
guarantee an excellent fit in all situations.
Finally, this research was carried out using two option pricing models and mostly
weekly data. An interesting extension of our work would be to use our approach with
daily data, or to derive the time-series of pricing error for other alternative option
pricing models. We are confident that our results are not confined to the specifics of
our empirical methodology, and we trust that future evidence will contribute to shed
more light on the global patterns in pricing performance we unveiled in this research.
50
References
Andreou, P. C., Charalambous, C., Martzoukos, S. H. Assessing the Performance of
Symmetric and Asymmetric Implied Volatility Functions. Review of Quantitative
Finance and Accounting, forthcoming. Retrieved online on January 16
th
, 2014, from
http://www.pandreou.com/pdf/Andreou_Charalambous_Martzoukos_RQFA_2014.pd
f
Ang, Andrew and Bekaert, Geert. Stock return predictability: is it there? Review of
Financial Studies, vol. 20, issue 3, pages 651-707.
Baker, Malcolm and Wurgler, Jeffrey. 2007. Investor Sentiment in the Stock Market.
Journal of Economic Perspectives, vol. 21, issue 2, pages 129151
Bakshi, Gurdip and Cao, Charles and Chen, Zhiwu. 1997. Empirical Performance of
Alternative Option Pricing Models. Journal of Finance, vol. 52, issue 5, pages 2003-
2049
Baltussen, Guido and van Bekkum, Sjoerd and van der Grient, Bart, Unknown
Unknowns: Vol-of-Vol and the Cross-Section of Stock Returns. 2013. AFA 2013 San
Diego Meetings Paper. Available at SSRN: http://ssrn.com/abstract=2023066 or
http://dx.doi.org/10.2139/ssrn.2023066
Barone-Adesi, Giovanni and Engle, Robert F. and Mancini, Loriano, A GARCH
Option Pricing Model with Filtered Historical Simulation (January 2008). Review of
Financial Studies, 2008. Review of Financial Studies, vol. 21, issue 3, pages 1223-
1258
Ben-Rephael, Azi and Kandel, Schmuel and Wohl, Avi. 2012. Measuring investor
sentiment with mutual fund flows. Journal of Financial Economics, vol. 104, issue 2,
pages 363382
Berkowitz, Jeremy. On Justifications for the ad hoc Black-Scholes Method of Option
Pricing (2009). Studies in Nonlinear Dynamics & Econometrics, 2009, vol. 14, issue
1, pages 1-27
Bollen, Nicolas P. B. and Whaley, Robert E. April 2004. Does Net Buying Pressure
Affect the Shape of Implied Volatility Functions? The Journal of Finance, vol. 59,
issue 2, pages 711753
Brown, Gregory W. and Cliff, Michael T. 2004. Investor sentiment and the near-term
stock market. Journal of Empirical Finance, vol. 11, pages 1 27
Campbell, John Y. and Schiller, Robert J. 1988. Stock Prices, Earnings, and Expected
Dividends. The Journal of Finance, vol. 43, issue 3, pages 661-676
Campbell, John Y. and Viceira, Luis. 2005. The Term Structure of the Risk-Return
Tradeoff. Financial Analysts Journal, vol. 61, pages 34-44.
51
Chiu, H. and Omesh K. 2013. Equity Issuances, Equity Mutual Fund Flows, and
Noise Trader Sentiment. Review of Finance, forthcoming.
Cho, Young-Hye and Engle, Robert F. 1999. Modeling the Impacts of Market
Activity on Bid-Ask Spreads in the Option Market. NBER Working Papers 7331,
National Bureau of Economic Research.
Chou, Robin K. and Chung, San-Lin and Hsiao, Yu-Jen and Wang, Yaw-Huei. 2011.
The Impact of Liquidity on Option Prices. Journal of Futures Markets,
vol. 31, issue 12, pages 11161141
Christoffersen, Peter and Heston, Steven L. and Jacobs, Kris. 2006. Option Valuation
with Conditional Skewness. Journal of Econometrics, vol. 131, issues 12, pages
253284
Conover, Mitchel C. and Peterson, David R. 1999. The lead-lag relationship between
the option and stock markets prior to substantial earnings surprises and the effect of
securities regulation. Journal of Financial and Strategic Decisions, vol. 12, issue 1,
pages 41-52
Cox, J. C. and Ross, S. A. and Rubinstein, M. 1979. Option pricing: A simplified
approach. Journal of Financial Economics, vol. 7, issue 3, pages 229263
Cremers, Martijn and Weinbaum, David. Deviations from Put-Call Parity and Stock
Return Predictability. April 2010. Journal of Financial and Quantitative Analysis,
vol. 45, issue 02, pages 335-367
Dumas, Bernard and Fleming, Jeff and Whaley, Robert E. 1998. Implied Volatility
Functions: Empirical Tests. Journal of Finance, vol. 53, issue 6, pages 2059-2106.
Fama, Eugene F. and French, Kenneth R. 1988a. Dividend yields and expected stock
returns. Journal of Financial Economics, vol. 22, pages 3-25
Fama, Eugene F. and French, Kenneth R. 1988b. Permanent and Temporary
Components of Stock Prices. Journal of Political Economy, vol. 96, issue 2, pp. 246-
273
Ferreira, Eva and Gago, Mnica and Len, Angel and Rubio, Gonzalo. 2005. An
empirical comparison of the performance of alternative option pricing models.
Investigaciones Economicas, vol. 29, issue 3, pages: 483-523
Frijns, Bart and Lehnert, Thorsten and Zwinkels, Remco. Sentiment Trades and
Option Prices. LSF Research Working Paper Series. May 2012
Gervais, S., Kaniel, R. & Mingelgrin, D.H. 2001. The high-volume return premium.
The Journal of Finance, vol. 51, 877-919.
Gwilym, Owain Ap and Buckle, Mike. 2001. The lead-lag relationship between the
FTSE100 stock index and its derivative contracts. Applied Financial Economics, vol.
11, issue 4, pages 385-393.
52
Hamilton, James D. 1994. Time Series Analysis. Princeton University Press
Han, Bing. 2008. Investor Sentiment and Option Prices. The Review of Financial
Studies, Vol. 21, issue 1, pages 387-414
Heston, Steven L. and Nandi, Saikat. 2000. A closed-form Garch option valuation
model. Review of Financial Studies, vol. 13, issue 3, pages 585-625.
Hsieh, K. C. and Ritchken, P. 2005. An empirical comparison of GARCH option
pricing models. Review of Derivatives Research, vol. 8, issue 3, pages 129-150
Huang, Z. & Heian, J.B. 2001. Trading-volume shocks and stock returns: An
empirical analysis. The Journal of Financial Research, vol. 38, pages 153-177.
Lamont, O., and R. Thaler, 2003, Can the market add and subtract? Mispricing in tech
stock carve-outs. Journal of Political Economy, vol. 111, pages 227-268.
Lettau, Martin and Ludvigson, Sidney. 2010. Measuring and Modeling Variation in
the Risk-Return Tradeoff. Handbook of Financial Econometrics, vol. 1, pages 617-
690
Lobb, A. 2008. Investors Lick Wounds From Dividend Cuts. The Wall Street Journal,
retrieved Tuesday January 28th, 2014 from
http://online.wsj.com/news/articles/SB122601848166007053
Moyaert, Thibault and Petitjean, Mikael. The performance of popular stochastic
volatility option pricing models during the subprime crisis. 2011. Applied Financial
Economics, vol. 21, issue 14, pages 1059-1068
Mller, Juliane and Kanniainen, Juho and Pich, Robert. 2013. Calibration of
GARCH models using concurrent accelerated random search. Applied Mathematics
and Computation, vol. 221, pages 522534.
Ng, Serena & Perron, Pierre. 1995. Unit root tests in ARMA models with data-
dependent methods for the selection of the truncation lag. Journal of the American
Statistical Association, vol. 90, issue 429, pages 268-281
Ng, Serena & Perron, Pierre. 2001. Lag length selection and the construction of unit
root tests with good size and power. Econometrica, Econometric Society, vol. 69,
issue 6, pages 1519-1554.
Pan, Jun and Poteshman, Allen M.. 2006. The Information in Option Volume for
Future Stock Price. The Review of Financial Studies, vol. 19, issue 3, pages 871-908
53
Appendix
Appendix 2: First-differenced time series of the dollar root mean square pricing error for the ABS and HN
model
Appendix 1: Average pricing error by option category for the ABS model (including ITM options)
Appendix 1 shows the yearly mean $RMSE obtained from the ad hoc Black and Scholes model across option
categories for the whole sample of SPX options (including in-the-money options). For concision, we only show
the results one every two year. Maturity and moneyness categories are as described in the section describing the
option data. In-the-money options are defined by the following bounds for the option delta: u.62S < A
C
u.87S
for calls and u.87S < A
P
u.62S for puts. The bounds for deep-in-the-money options are u.87S < A
C
u.8 for calls and u.8 < A
P
u.87S for puts. The mean yearly pricing error is highest for ATM long-
maturity options in the early years of our sample period. The $RMSE decreases asymptotically as time to
expiration decreases. It also decreases for options located on the far ends of the moneyness spectrum (DOTM or
DITM). Since the dot-com bubble, the highest level of mispricing achieved by the ABS model is for DOTM
options with long time to expiration.
54
Appendix 3: Out of sample differences in ABS and HN pricing errors
Appendix 4: Stationarity and unit root testing
In this research we restrict our focus to weak-sense stationarity (WSS). Formally, for
a time series variable
t
to be weak-sense stationary, the following conditions are
required:
In other words, WSS requires the first two moments of the variable
t
to stay constant
over time. Looking back at the pricing errors series in Figure 5 or the charts of the
error forecasting variables in Appendix 8, it is clear that they behave like stochastic
processes: their discrete variations between two consecutive weeks appear to be
random. What is unclear however, is whether these processes are stationary. In order
to shed light on the issue of stationarity, we carry out unit root tests on our two time
series. We rely on Augmented Dickey-Fuller (ADF) tests. The ADF unit root test fits
the time series tested for stationarity
t
to the general model in equation ( 21 ).
Depending on the values of the parameters o and [, equation ( 21 ) can either model a
trend stationary process ([ u), a random walk without drift ([ = u, o = u), or a
random walk with a drift ([ = u, o u).
The number of lags p defines the order of the autoregressive process. Supposing that
o, p and [ are provided, the parameters of the model are estimated for the series
t
,
and the null hypothesis of the presence of a unit root y = u is tested against the
alternative hypothesis y < 1. The outcome of the unit root test is highly dependent on
the choice of the values for o, p and [, and it is therefore of utmost importance to use
appropriate values. To choose these parameters adequately, we rely on the following
step-by-step methodology:
1) First, we choose the appropriate model for the series
t
tested for stationarity:
autoregressive (AR), autoregressive with drift (ARD), or trend stationary (TS).
We observe the time series chart of
t
and determine whether or not the series
is trending, and whether it fluctuates around a zero mean. For example, the
error series for the ABS model, shown in Figure 5, has a non-zero constant
(
t
) = p
ai(
t
) = o
2
(
t
,
t-
) = y
( 20 )
A
t
= o +[t +y
t-1
+
1
A
t-1
++
-1
t-+1
+
t
( 21 )
55
drift term and does not follow any definite upward or downward trend over
our sample period. We therefore categorize it as ARD.
2) The second step is to choose the appropriate number of lags to be considered
in equation ( 21 ). We rely on the method described in Ng and Perron (1995,
2001), and start by calculating the maximum lag length p
. Schwert (1989)
suggests the following rule of thumb for determining p
= Rc _12 _
I
1uu
]
0.25
_
3) ADF unit roots tests are carried out for integer values p ranging from 1 to
p
= aig min
<
mcx
(AIC +BIC +EC)
Although these individual information criteria usually point towards the same
optimal lag length p
Model
ADF test
statistic
P value Treatment
Mean option price 20 ARD -3,42 0,011 -
Mean option moneyness 3 ARD -4,17 0,001 -
Mean option maturity 20 ARD -3,88 0,003 -
Total number of options 20 TS -0,70 0,972 First-difference
Number of untraded option contracts 20 TS -1,40 0,861 First-difference
Number of options worth less than
1$
20 TS -1,16 0,916 First-difference
Put-call ratio 5 ARD -7,28 0,001 -
Average number of trades per
contract
20 TS -2,17 0,508 First-difference
Average option implied volatility 1 ARD -3,20 0,021 -
VIX level 13 ARD -3,01 0,035 -
CAPE 20 TS -3,16 0,095 First-difference
Log 90-days T-bill interest rate 1 TS -1,11 0,926 First-difference
Log yield spread 1 ARD -2,24 0,193 First-difference
Log dividend yield 1 ARD -4,96 0,001 -
Past 10 days SPX volatility 2 ARD -6,24 0,001 -
Bull-bear spread (institutions) 17 ARD -3,19 0,021 -
Bull-bear spread (individuals) 20 ARD -3,95 0,002 -
Baker and Wurgler sentiment index 14 ARD -2,29 0,174 First-difference
SKEW level 18 TS -3,87 0,014 Detrending
Number of SPX shares traded 20 TS -3,25 0,075 First-difference
Average SPX volume over the past
week
20 TS -2,53 0,329 First-difference
Log one-period returns 1 AR -20,25 0,001 -
Price to book ratio 1 TS -2,89 0,166 First-difference
Mutual fund net flows 20 ARD -3,39 0,012 -
Volatility of volatility 2 ARD -6,14 0,001 -
57
Appendix 8: Time series of error pricing predictors (untransformed series)
Appendix 8 shows the time series of the untransformed predictor variables over our sample period 1996-
2010. Predictors related to the characteristics of our option sample, such as mean option moneyness or
option contracts volume, are shown for our filtered sample of OTM options. The deviations from put-call
parity (last chart) are derived from the unrestricted sample, including ITM and ATM options. The data on
the CAPE ratio is obtained from Robert Schillers website and the interest rates and yield spreads are from
the Federal Reserve. The data on investors sentiment is retrieved from Investment Company Institute
(mutual fund flows), Investor Intelligence (institutional bull-bear spread), the American Association of
Individual Investors (individual bull-bear spread) and Jeffrey Wurglers website (Baker and Wurgler
sentiment index).
58
59
Appendix 9: Partial Autocorrelation functions of the first-differenced pricing errors for the ABS and HN
models
Appendix 10: Parameter volatility over the sample period 1996-2010
60
Appendix 11: In sample regressions for the ABS error (full spectrum of option moneyness)
Model 1 Model 2 Model 3 Model 4
Lagged error -.424**
(0,033)
-0,439***
(0,032)
-0,439***
(0,032)
-0,325***
(0,027)
Mean option price 0,0091***
(0,002)
-0,010***
(0,003)
-0,022***
(0,003)
Mean moneyness -4.476***
(0,861)
7,227***
(1,633)
17,535***
(1,296)
Mean maturity 0,0079***
(0,002)
0,006**
(0,003)
0,010***
(0,003)
Total number of options 1,911
(1,380)
2,260
(1,388)
0,003***
(0,001)
Number of untraded options --1,4521
(1,629)
-1,699
(1,631)
0,001
(0,001)
Number of options under 1$ -7,024**
(2,952)
-7,459**
(3,034)
-0,016***
(0,003)
Put-call ratio -0,435
(0,433)
0,582
(0,435)
0,057
(0,271)
Mean volume per contract -0,0017
(0,159)
0,053
(0,158)
0,00008
(0,0001)
Mean implied volatility 0,2738
(0,910)
0,942
(3,213)
0,929
(2,512)
VIX level 0,510
(2,488)
-
CAPE 3,415
(4,262)
3,074
(3,391)
Log T-bill interest rate 97,303***
(33,822
113,4***
(28,117)
Log yield spread 89,484***
(27,626)
88,60***
(23,1)
Log dividend yield 2,942
(7,360)
-279,70***
(31,97)
Bull-Bear spread institutions 1,106***
(0,394)
0,004
(0,01)
Bull-Bear spread individuals -0,189
(0,261)
0,005**
(0,002)
Baker and Wurgler index 0,876
(0,961)
0,454
(0,774)
Skew level 0,011
(0,008)
0,016*
(0,010)
SPX volume -0,043
(0,038)
-0,00002
(0,00003)
SPX volume over past week -0,127*
(0,076)
-0,00010
(0,00006)
One-period log returns 0,032*
(0,017)
6,131***
(1,083)
Price to book ratio -1,121**
(0,435)
-1,052***
(0,361)
Mutual fund flow 0,013
(0,028)
0,000
(0,0001)
Volatility of volatility
-
-0,039
(0,052)
Constant 0.0016
(0,036)
5.106***
(0,891)
-7,994***
(1,741)
0,005
(0,029)
Num. Obs. 751 751 751 751
Adj. R
2
.1782 .2293 0.2535 0.4772
Reports the unstandardized coefficient, standard errors (in parentheses) and adjusted R
2
for the multiple
regressions of the in sample $RMSE on contemporaneous values of the predictor variables for the ad hoc Black
and Scholes procedure. Model 1 refers to an AR(1) model for the $RMSE. Model 2 and Model 3 respectively add
option market factors and equity market factors to the set of predictor variables. Regression model 4 is equivalent
to Model 3, with all predictor variables first-differenced. . ***, ** and * denote significance at the 1%, 5% and
10% level respectively.
61
Appendix 12: Out-of-sample regressions for ABS pricing error (full spectrum of option moneyness)
Model 1 Model 2 Model 3 Model 4
Lagged error -.424**
(0,033)
-0,460***
(0,035)
-0,457***
(0,035)
-0,415***
(0,030)
Mean option price 0,0038
(0,0039)
0,0039
(0,0050)
-0,0247***
(0,0052)
Mean moneyness -1,53
(1,41)
-3,15
(2,70)
17,19***
(2,18)
Mean maturity -0,0011
(0,0038)
-0,0015
(0,0042)
0,021***
(0,005)
Total number of options -1,493
(2,287)
-2,083
(2,322)
0,0139***
(0,0021)
Number of untraded options 1,89
(2,71)
2,1871
(2,73)
-0,0102***
(0,0025)
Number of options under 1$ -17,23***
(4,86)
-18,1844***
(5,06)
-0,0272***
(0,0047)
Put-call ratio -0,0621
(0,7046)
-0,4838
(0,7122)
0,5281
(0,4649)
Mean volume per contract 0,0736
(0,2604)
-0,0206
(0,2604)
0,0006**
(0,0002)
Mean implied volatility -0,888
(1,481)
-0,209
(5,263)
4,327
(4,310)
VIX level -0,504
(4,075)
-
CAPE -16,54**
(6,97)
4,665
(5,826)
Log T-bill interest rate 19,69
(55,58)
-87,51*
(48,24)
Log yield spread -9,51
(45,33)
109,64***
(39,58)
Log dividend yield 16,1110
(12,059)
-293,31***
(54,84)
Bull-Bear spread institutions 0,6210
(0,6461)
-0,0224**
(0,0114)
Bull-Bear spread individuals -0,9091*
(0,4268)
0,0064
(0,0039)
Baker and Wurgler index -0,3285
(1,5732)
-0,0305
(1,3272)
Skew level -0,0120
(0,0129)
0,0100
(0,0169)
SPX volume -0,00979
(0,0616)
-0,0001
(0,0001)
SPX volume over past week 0,1980
(0,1243)
0,00005
(0,00011)
One-period log returns -0,0490*
(0,0271)
4,8702***
(1,8505)
Price to book ratio -0,7655
(0,7129)
0,6054
(0,6197)
Mutual fund flow -0,0248
(0,0452)
0,0001
(0,0002)
Volatility of volatility
-
0,1371
(0,0905)
Constant 0.0016
(0,036)
1.6402
(1,463)
3,224
(2.878)
-0,0045
(-0,0498)
Num. Obs. 751 751 751 751
Adj. R
2
.2052 .2164 0.2323 0.4097
Reports the unstandardized coefficient, standard errors (in parentheses) and adjusted R
2
for the multiple
regressions of the out of sample sample $RMSE on contemporaneous values of the predictor variables for the ad
hoc Black and Scholes procedure. Model 1 refers to an AR(1) model for the $RMSE. Model 2 and Model 3
respectively add option market factors and equity market factors to the set of predictor variables. Regression
model 4 is equivalent to Model 3, with all predictor variables first-differenced. . ***, ** and * denote significance
at the 1%, 5% and 10% level respectively.