Академический Документы
Профессиональный Документы
Культура Документы
Application of Financial
Econometrics
This term paper explores the applications of the various financial econometric techniques for
the analysis and modeling time series of financial data with the major focus on S&P 500 Index.
But to explore the power of financial econometrics tools I have also worked on some other
small data bases. In this term paper, I have tried to apply three broad categories of financial
econometric methods. First I have used the univariate time-series models for understanding the
basic structure of S&P 500 Index and simple as well as logarithmic returns of S&P 500 Index.
These are the class of models that attempts to model and forecast financial variables by using
the historical data of the same series rather than using structural relationships from financial
theories. The primary purpose of this class of models is to capture empirically important
characteristics of the observed data that might have arisen from a variety of different structural
models. This is discussed in the section II of this term paper. Next I have looked at the non-
linear financial econometric models especially ARCH/GARCH models for modeling the
volatility of S&P 500 Index returns. These models allow the behavior of a series to follow
different patterns at different points of times. It is presented in the section III. Last I have tried
to study the model of long term relationship in finance specifically cointegration relationship
to get some flavor of structural modelling. Generally, if we linearly combine two non-
stationary series, resultant series will also be non-stationary. But if two non-stationary series
are cointegrated, their linear combination be will stationary. I have studies the cointegration
relationship between spot Nifty Index and its futures using Engle-Granger two step method.
And I have also studies the cointegration relationship between real exchange rates i.e. relative
purchasing power parity (Relative PPP) using Johnsen technique. It is discussed in the section
IV.
series and series of its simple as well as logarithmic returns. I have obtained daily data of S&P
500 Index from January 3, 2007 to March 17, 2017 from Yahoo Finance. It has 2570 daily
observations. Figure 1 shows the time plots of daily closing price and trading volume of S&P
500 Index and Figure 2 depicts daily log returns of the same from January 3, 2007 to March
17, 2017.
Figure 1 Time plots of daily closing price and trading volume of S&P 500 Index from January 3, 2007 to March 17, 2017.
A. Return Distribution
Table 1 below presents basic summary statistics of S&P 500 Index returns. Figure 3 shows the
histogram of S&P 500 Index returns while Figure 4 gives the comparison between empirical
and normal density of S&P 500 Index returns. From observation of Table 1, Figure 3 and Figure
4, we can see that S&P 500 Index returns has meso kutic negatively skewed distribution. For
the final concluding evidence on normality we perform the Jarque - Bera Normality Test on
S&P 500 Index returns. We get chi-squared value of 11249.85 with corresponding p-value of
approximately zero and thus we reject the null hypothesis that S&P 500 Index returns follows
normal distribution.
No.
Mean Median Max Min SD Obs. 1st_Qu. 3rd_Qu. Skew. Kurtosis
10.38
0.01% 0.06% % -9.93% 1.31% 2570 -0.45% 0.58% -0.57 10.19
To understand the structure of the time series S&P 500 Index returns of I have calculated auto
correlation function (ACF) and Partial auto correlation function of S&P 500 Index returns.
Table 2 presents the values of ACF and PACF of the S&P 500 Index returns for five lags.
While the Figure 5 and Figure 6 presents the ACF and PACF for 50 lags.
Next to test the joint hypothesis that all five of the auto correlation coefficients are
simultaneously zero or not we perform Ljung-Box Test. I get chi-squared value of 44 with
corresponding p-value of approximately zero and thus we reject the null hypothesis that all
Table 2 ACF and PACF of S&P 500 Returns from March 3, 2007 to March 17, 2017.
5 -0.04681 -0.0479
Figure 5 of S&P 500 Returns from March 3, 2007 to March 17, 2017.
C. ARIMA Modelling
To identify correct lag of the AR process for SPR, I have used AIC criteria using maximum
likelihood estimate of variance. AIC values of for different lags is plotted in the Figure 7 and
we can see that lag of S&P 500 Index returns as this criterion is 12.
Figure 7 AIC Values for S&P 500 Returns from March 3, 2007 to March 17, 2017
It starts with testing for unit root using the Dickey-Fuller test and if null hypothesis of is not
rejected, testing for higher order. For unit root we get Dickey-Fuller test statistic value (-
13.5063) with corresponding p-value of 0.01. Thus, we conclude that series is stationary i.e.
Now the last thing needed to complete the ARIMA model estimation is the optimal lag
of MA structure. To get it we first fit ARIMA (12,0,0) to the S&P 500 Index returns as we
know the its AR (9) and I (0) series then we check the ACF of residual of these model and
perform Ljung-Box Test on the residuals of this model. Figure 8 gives the plot of ACF of
residuals of ARIMA (12,0,0) for 12 lags. We can see that ACF values for all initial lags is
approximately zero. I have also performed Ljung-Box test for significance of ACF of residual
of ARIMA (12,0,0).
thus we fail reject the null hypothesis that all twelve auto correlation coefficients are
simultaneously insignificant. So, our final estimated univariate time series model for the S&P
500 Index returns is ARIMA (12,0,0). And the coefficients for the estimated model are given
in the Table 3.
Lag 1 Lag 2 Lag 3 Lag 4 Lag 5 Lag 6 Lag 7 Lag 8 Lag 9 Lag 10 Lag 11 Lag 12
-0.103 -0.065 0.022 -0.023 -0.045 0.011 -0.020 0.031 -.011 0.049 -0.018 0.037
In this section I will discuss the non-linear models especially ARCH/GARCH for modeling
the volatility of S&P 500 Index returns. As earlier noted these models allow the behavior of a
series to follow different patterns at different points of times. Figure 13 presents the observed
First I check whether AR kind of structure is useful in volatility modelling using ACF plot and
Ljung-Box test for 12 lags. I have used 12 lags here as well as it was the lag identified for AR
model of S&P 500 Index returns. ACF plot for volatility of S&P 500 Index returns is given in
the Figure 9. For Ljung-Box test, I get chi-squared statistic value 2000 with corresponding p-
value of approximately zero and thus we reject the null hypothesis that all twelve auto
First simple model like basic AR model for volatility modelling is Auto Regressive Conditional
Heteroscedasticity (ARCH). The results of the ARCH (12) test of volatility of S&P 500 Index
returns from jaggo is given in the Table 4 and so we conclude that S&P 500 Index returns
Now I estimate the ARCH (12) model for the S&P 500 Index returns volatility. Summary of
the estimated model is given in the Table 5. Correlation between ARCH (12) estimated
volatility and realized volatility is almost 90%. Figure 11 presents the standardized residuals
for ARCH (12) model for the volatility of S&P 500 returns from jaggo. The plot of standardized
Table 5 Estimated Coefficients for ARCH (12) model for the volatility of S&P 500 returns from jaggo
A natural extension of ARCH (12) model that allows the conditional variance to depend upon
the previous values of own lags is GARCH (12, p). I will first try GARCH (12, 1). Figure 11
presents the standardized residuals for GARCH (12, 1) model for the volatility of S&P 500
returns and it is not significantly better than ARCH (12). And results do not change for different
Next I have used volatility estimated from GARCH (12,1) to create two sigma confidence
interval arounds. It is presented in the Figure 14. All the observations lie in the two-sigma
Figure 14 S&P 500 daily realized returns with two sigma confidence interval based on GARCH (12,1) forecasted volatility
integration are linearly combined, the combination will have an order of integration equal to
that of the largest. But there is an important exception to this rule, cointegrated series. If two-
time series are cointegrated, their linear combination be will stationary. I have investigated two
I have obtained 251 observations of spot and future values of Nifty Index from
www.nseindia.com. This data corresponds to period March, 2016 to December, 2016. First I
have done Augmented Dickey-Fuller test to check the order of integration of log of future and
spot values of Nifty Index. Results of it are presented in the Table 6. As expected both spot and
future log prices follows unit root process while returns of both spot and future Nifty Index are
stationary processes.
Data: Log Future Nifty Index Data: Log Spot Nifty Index
Dickey-Fuller = -0.718, Lag order = 5, p-value = 0.97 Dickey-Fuller = -0.853, Lag order = 5, p-value = 0.96
Data: Future Returns Nifty Index Data: Spot Returns Nifty Index
Dickey-Fuller = -6.53, Lag order = 5, p-value = 0.01 Dickey-Fuller = -6.67, Lag order = 5, p-value = 0.01
From theory cost of carry we know that if the markets are efficient, changes in the log of the
spot price of a Nifty Index and its corresponding changes in the log of the futures price of it
We will test these predictions using the error correction model. We will use Engle-Granger
two step method to estimate the error correction model. In step one we estimate the
potentially cointegration equation between log of Nifty spot and Nifty future. We do this by
regressing log of Nifty spot on the log Nifty future and we preform the ADF test on residuals
of this model to check whether cointegration relationship exists between these series. The
summary of it is presented in the Table 7 and from it we can infer that the residuals from
Table 7 Estimated Potentially Coingrating Relationship and ADF test on the residuals of this equation
Data: Residual
The next step in building an error correction model by Engle-Granger two step method is to
use the lag of first stage residuals as the correction term in the following general equation.
of the variables is statistically significant. The coefficient for the error correction term is
negative, indicating that difference between the logs of the spot and future prices is positive in
one period, the spot Nifty Index is likely to follow next period but it is not statistically
significant.
Table 8 Estimated error correction model for relationship between spot and future Nifty Index
Dependent variable:
Spot Returns
Correction Term -0.013
(0.175)
Future Return Lag -0.577
(0.514)
Spot Return Lag 0.609
(0.513)
Constant 0.0003
(0.001)
Observations 192
R2 0.008
Adjusted R2 -0.008
Residual Std. Error 0.008 (df = 188)
F Statistic 0.519 (df = 3; 188)
*
Note: p<0.10 **p<0.05 ***p<0.01
Relative PPP states that the long-term exchange rate between two countries is equal to the ratio
of their relative price levels i.e. real exchange is constant. Mathematically real exchange rate
is defined as below.
where Rt is the real exchange rate, Et is the nominal exchange rate in domestic currency per
unit of foreign currency, Pt is the domestic price level and *Pt is the foreign price level. Taking
logarithms of above equation and rearranging it we get the following testable equation
+ =
+ = 0
where the lower-case letters indicate logarithmic transforms of the corresponding upper case
letters earlier used. We will test this relationship using Engle-Granger Two Step Procedure
used earlier.
To test this relationship, I have collected consumer price index (CPI) and exchange rate data
for the Japan, Switzerland, United Kingdom, United States, China, and India from January
2007, to December 2016 from IMF website. First step is to check whether the log nominal
exchange rates have unit root. I have used ADF test to check it and results of it are presented
in the Table 9 below. We can clearly see the log nominal exchange rates has unit root.
Data: Log Nominal Rs-Dollar Exchange Rate Data: Log Nominal Rs-Pound Exchange Rate
Dickey-Fuller = -2.34, Lag order = 4, p-value = 0.44 Dickey-Fuller = -1.89, Lag order = 4, p-value = 0.62
Data: Log Nominal Rs-Swiss Franc Exchange Rate Data: Log Nominal Rs-Yen Exchange Rate
Dickey-Fuller = -2.07, Lag order = 4, p-value = 0.55 Dickey-Fuller = -2.11, Lag order = 4, p-value = 0.53
In step one we estimate the potentially cointegration equation between log of nominal exchange
rate, lop CPI of India, and CPI of foreign country. We do this by regressing log of nominal
exchange rate on lop CPI of India and CPI of foreign country and we preform the ADF test on
residuals of this model to check whether cointegration relationship between these series. The
summary of it is presented in the Table 10 and Table 11, from it we can infer that the residuals
from cointegrating regression are not stationary. There is no point in performing the next step
to estimate the error correction model when there is cointegrating relationship between the log
of nominal exchange rate and the log of CPI. The failure detects the cointegrating relationship
is not surprising as CPI and for that matter any of the available price indexes is not perfect
proxy for the overall price levels in the economy at different periods of time.
Dependent variable:
Rs_Dollar Rs_Yen Rs_Franc Rs_Pound
(1) (2) (3) (4)
CPI India 0.758*** 0.738*** 0.922*** 0.500**
(0.110) (0.040) (0.020) (0.223)
CPI US -0.613
(0.561)
CPI Japan -5.816***
(0.642)
CPI Switzerland 1.582***
(0.476)
CPI UK -0.610
(0.755)
Constant 3.230 22.748*** -7.649*** 4.915**
(2.100) (2.886) (2.202) (2.474)
*
Note: p<0.10 **p<0.05 ***p<0.01
Table 11 ADF test on the residuals of estimated cointegrating equation for Relative PPP
Dickey-Fuller = -2.02, Lag order = 4, p-value = 0.57 Dickey-Fuller = -2.97, Lag order = 4, p-value = 0.17
Dickey-Fuller = -3.38, Lag order = 4, p-value = 0.06 Dickey-Fuller = -1.92, Lag order = 4, p-value = 0.61
In this term paper, I have explored the applications of the various financial econometric
techniques for the analysis and modeling time series of financial data. I have tried to apply
three broad categories of financial econometric methods. First I have used univariate time-
series models for understanding returns of S&P 500 Index. It was useful in the understanding
the broad structure of the series but it has very limited forecasting powers. Next I have looked
modeling the volatility of S&P 500 Index returns. These models are easy to implement and
understand but again has very limited forecasting ability. Last I have tried to study the model
of long term relationship in finance specifically cointegrating relationship for spot Nifty Index
and its futures and nominal exchange rates and the price levels i.e. relative purchasing power
parity (Relative PPP) using Engle-Granger two step method. I could detect the coingrating
relationship between the spot Nifty Index and its futures but the error correction term was
insignificant. This might be since error term didnt represented the error in the relationship per
se but it was due to the carry as the model was estimated with low frequency data. I could not
find the coingrating relationship between the log of nominal exchange rate and the log of CPI.
The probable reason for this is that CPI is not a good proxy for the overall price levels in the
economy.
University (2008).
2. Tsay, Ruey S. An introduction to analysis of financial data with R. John Wiley &
Sons, 2014.
3. https://www.imf.org/external/np/fin/data/param_rms_mth.aspx
4. https://www.nseindia.com/products/content/equities/indices/historical_index_data.ht