Академический Документы
Профессиональный Документы
Культура Документы
Introduction
Park test
Goldfeld-Quandt test
White test
Breusch Pagan test
Tests of Heteroscedasticity
Introduction
Econometricians do not use always the same test to verify the presence of heteroscedastic
disturbances, because heteroscedasticity may assume different structures and sometimes it is not
easy to understand this structure.
There is not a uniform approach in the choice of tests, but generally it is preferable to answer to
some questions before the application of whichever test:
◮ are there specification errors in the chosen regression model?
◮ are there possibilities of heteroscedastic errors in the phenomenon analized?
◮ considering the graphical distribution of residuals versus each regressor, is there evidence of
heteroscedasticity? It is interesting to analyse the graphical distribution of residuals versus
some regressors if they are thought to generate heteroscedasticity; if residuals appear related
(positively or negatively) with a regressor Z there may be heteroscedasticity.
◮ the assumption assumption of constant error variance can be checked throught the residual
plot. A residual plot is a scatterplot of the standardised residuals against the fitted values.
Introduction
Introduction
If the residuals seem to increase or decrease in average magnitude with the fitted values, it is an
indication that the variance of the residuals is not constant.
If the points in the plot lie on a curve around zero, rather than fluctuating randomly, it is an
indication that linearity assumption is broken.
If a few points in the plot lie a long way from the rest of the points, they might be outliers, that is,
data points for which the model is not appropriate.
Park test
2
ln(ei ) = α0 + α1 lnZi + vi
Park test
The Park test is not used frequentely because is not easy to chose the r.v. Z .
When, in cross-section data, the observation units are regions, nations, provinces, etc. the r.v.
(proportionality factor) to be chosen is a size variable which measures indirectly the observational
units dimension.
Example
We use Park test to verify heteroscedasticity in the following data (n = 33):
◮ Y is the number of customers of sampled restaurants;
◮ C is the regressor measuring the number of competitive restaurants;
◮ P is the regressor measuring the resident population;
◮ I is the regressor measuring the average income of resident population.
Estimated equation is:
Park test
ˆ 2)
ln(e = 21, 05 + 0, 29 ln(Pi )
i
(0, 63)
−0, 46
Given the sample value of the t statistic we accept the null hypothesis (α1 = 0) then accept the
null hypothesis of omoscedasticity.
Goldfeld-Quandt test
This test is frequentely used because it is easy to apply when one of the regressors (or another r.v.)
is considered the proportionality factor of heteroscedasticity.
The test has two limits: its difficulty to reject the null hypothesis of omoscedasticity and the fact
that it do not allow to verify other forms of heteroscedasticity.
This test is based on the hypothesis that the error variance is related to a regressor X .
The test procedure is the following:
1 - the observations on Y and X are sorted following the ascending order of the regressor X
which is the proportionality factor;
2 - we divide the sample observations in three subsamples omitting the central one;
3 - we estimate throught OLS the regression models on the first and third subsample (then on
n−c
2 observations each; the number of observations considered has to be sufficiently large);
4 - we calculate the relative RSS, denoted as RSS1 and RSS2 ;
RSS2
5 - we derive the Goldfeld-Quandt test: GQ = R = RSS1 ;
n−c−2k
6 - the test R under the null hypothesis has F distribution with degrees of freedom 2
both for numerator and denominator.
Goldfeld-Quandt test
If the sample value of the test F is greater (in a.v.) than the critical value, at the chosen
significance level, we reject the null hypothesis of omoscedasticity.
Idea: if R is large then RSS2 is greater that RSS1 , which means that residuals increase with the
regressor.
The power of this test depends on the number of omitted observations (usually n3 observations
have to be omitted). If we exclude too much observations the RSS2 and RSS1 have too low
degrees of freedom, if we exclude to few observations the test power is low because the comparison
between RSS2 and RSS1 becomes less effective.
White test
Sometimes the researcher with to verify if more than one variable is proportionality factor in the
heteroscedasticity process: in these situations it is preferable to consider the Breush Pagan test or
the White test.
The White test has the advantage that it does not assume a specific form of heteroscedasticity.
It is based on a auxiliary regression with suqred residuals as dependent variable and regressors
given by: the regressors of the initial model,, their squares and their cross-products.
The White test procedure is as follows:
1 - we estimate the regression model throught OLS obtaining the OLS residuals, ei . For
instance we estimate: ŷi = b0 + b1 x1i + b2 x2i , then ei = yi − ŷi ;
2 - we estimate an auxiliary regression model with ei2 as dependent variable and initial
regressors, their squares and cross-products as covariates. For instance, we estimate:
ei2 = α0 + α1 x1i + α2 x2i + α3 x1i2 + α4 x2i2 + α5 x1i x2i .
3 - we verify the significance of the auxiliary regression throught the test nR 2 , which, under
the null hypothesis (omoscedasticity) has χ2 (q), where the degrees of freedom q are equal
to the number of regressors in the auxiliary model. In the example q = 5.
4 - if the sample value of the χ2 (q) is greater than the critical one we reject the null
hypothesis of omoscedasticity.
White test
This test may have some problems when the number of regressors in the initial model is high.
In these situation the cross-products of regressors may be omitted in the auxiliary model.
When in the initial model there are dummy variables their squares are not included in the auxiliary
regression to avoid multicollinearity problems.
H0 : α2 = α3 = · · · = αp = 0
This test needs the knowledge of the regressors ~z but not the knowledge of the functional form
h(·). Sometimes the regressors in ~z may be some regressors included in the original model, in such
case this test becomes an ad hoc version on the White test.