Академический Документы
Профессиональный Документы
Культура Документы
of Econometrics
47 (1991) 67-84.
North-Holland
1. Introduction
Most of the many diagnostic tests for serial correlation
(SC) and dynamic
conditional
heteroskedasticity
(DCH) of regression
disturbances
have good
asymptotic local power against finite autoregressive
(AR) or moving average
(MA) alternatives.
In testing for SC, statistics typically depend on finitely
many, p, least squares (LS) residual sample autocorrelations.
One test, that
rejects when sample size N times the sum of squares of the first p LS
residual sample autocorrelations
lies in the upper tail of the x, distribution
[e.g., Box and Pierce (1970)1, has a Lagrange multiplier
(LM) interpretation
relative to Gaussian AR(p) and MA(p) alternatives,
and can be justified as
asymptotically
locally most powerful unbiased (ALMPU) against these. Tests
that are optimal
against one-sided
AR(l)
alternatives
[e.g., Durbin
and
Watson (195011 have long been popular,
and other alternatives,
such as
constrained
ARC41 and ARC51 models, have also featured in empirical work.
*This article is based on research
funded by the Economic
(ESRC). reference numbers BOO232156 and R000231441.
0304-4076/91/$03.50
0 1991-Elsevier
Science
Publishers
and
Social
B.V. (North-Holland)
Research
Council
A popular LM test for DCH is ALMPU against both an AR(/>) and MAt [I)
alternative
(with intercept) for the squared disturbances
[that is an ARCH(p)
or GARCH(p)
model], and is based on the first p sample autocorrelations
of
the squared LS residuals [e.g.. Engle (1982). Weiss (1986)]: this is often called
an ARCH test.
In some applications
of tests for SC it has seemed sufficient to take 11= I
and in some others a natural finite AR parameterization
may be indicated.
for example when seasonal effects seem likely. Otherwise no alternative
may
dominate
on prior grounds.
and applied workers calculate
statistics
for
several, possibly many. values of p. It is not uncommon
for some of these
statistics to be significant. and some insignificant.
Drawing conclusions
from
such a scquencc
can be a delicate business, and unless the statistics take
account of sample autocorrelations
at long lags there is always the possibility
that relevant information
is being neglected, leading perhaps to unwarranted
confidence
in the usual standard
errors of LS estimates
of the regression
coefficients, while in models containing
lagged dependent
variables there will
be a failure to detect a source of inconsistency
of the LS estimates.
Long memory covariance
stationary
time series have been characterized
by the property
that autocorrelations
decay, but so slowly as not to he
summable.
or similarly that a spectral density exists but is unbounded.
They
represent
an alternative
direction of approach to unit root behaviour
compared with the usual AR one. Many series are too short, or span too brief a
period. to enable reliable distinction
of long memory autocorrclation
from
AR autocorrelation
with a near-unit
root. However. some evidence of such
long-memory
behaviour in economic and other time series has been reported.
Unlike tests against ARt ~1) or MA( 11) SC for specific /I, tests against long
memory bchaviour
use all the sample autocorrelations,
and might be expected to have good power against AR or MA alternatives
of long, but
unknown
order. Tests against AR or MA SC will generally
be consistent
against long-memory
behaviour, but inefficient.
In ARCH tests, dependence
on 1, causes similar difficulty. To avoid this
we can resort to concepts of long-memory
DCH, in which the Wold dccomposition innovations
have conditional
variances whose dependence
on remote
innovations
decays very slowly. Standard
ARCH tests are often consistent
against such alternatives,
but more efficient tests are available.
This paper considers classes of statistics that allow account to be taken of
possible long-memory
behaviour in moments of order two, three, and four in
deciding whether SC or DCH are present. We find it convenient
to index
members
of the classes by conditionally
Gaussian
parametric
alternatives,
deriving
statistics
via the LM principle.
The classes thus include
some
currently
used statistics, with good power against the short-memory
AR,
MA, ARCH, and GARCH
models, as well as new ones with good power
against long-memory
models. not to mention many in which autocorrelations
69
are summable
but decay slower than exponentially
[see, e.g., Robinson
(197811. For the usual asymptotic distribution
theory to hold, our conditions
on the dependence
structure of the alternative
models seem relatively weak.
Realistically,
an applied worker may have no grounds for believing
in a
particular
SC or DCH alternative,
and nor will he or she necessarily wish to
construct an array of statistics, each designed to detect a different type of
departure.
Thus a particular practical goal of this paper is to suggest specific
diagnostics for SC and for DCH that are of relatively simple form and appear
to complement
the usual ones. These emerge in relation
to one of two
popular
long-memory
time-series
models. Generally
the use of the LM
principle leads to computationally
convenient
statistics, with use of the fast
Fourier transform
(FFT) recommended
when N is large.
Throughout
we focus on the static linear regression model with stochastic
or nonstochastic
regressors,
y, = Px, + u f,
t=
1,2,...,
(1)
Although the LM-type tests are derived from an objective function that is the
likelihood when the disturbances
are conditionally
Gaussian,
of course no
such precise distributional
assumptions
are employed in asymptotically
justifying the tests, indeed the moment conditions
we require seem minimal and
null models are described
in terms of martingale
difference.
not independence, assumptions.
We aim for conditions
that are not only mild but
primitive, separating
different features of the problem in as comprehensible
way as possible. Future workers might wish to investigate
the use of our
statistics within a conditionally
Gaussian environment,
in which computationally more complicated
rules with stronger
finite-sample
justification
arc
available.
The following section introduces
two statistics for testing against disturbance SC, and derives their null asymptotic distributions.
One test assumes
there is no DCH, the other is robust to DCH of unknown form. Section 3
introduces
statistics for testing against disturbance
DCH of two types. and
derives null asymptotic distributions.
Versions which do and do not assume
constancy
of conditional
dynamic
skewness and kurtosis are considcrcd.
Corresponding
simultaneous
tests against both SC and DCH arc presented in
section 4. In section 5 we discuss the precise specification
of test statistics
when good power against long memory alternatives
is desired.
E(u,lF,)
against
= 0
of events generated
as.
WC
(2)
the alternative
E(u,lF,)
cp,Wq,
(3)
L1.S.
1-l
Concerning
we introduce
the following
assumption
A.l.
A.2.
The matrix
71
e = 0.
(4)
The assumptions on the cpj(f3) seem necessary and sufficient both for the
construction of our test statistics and for the properties derived. A related
specification was used by Robinson et al. (198.5) in testing for disturbance SC
in the Tobit model, though the actual alternatives discussed there were the
AR(p) and MA(p), familiar special cases of our specification. Condition A.2
is a kind of identifiability condition at the null model, ruling out, for example,
standard mixed autoregressive moving-average alternatives. Versions of the
cpj(e) suitable for long-memory alternatives are discussed in section 5.
The first version of our test is based also on the following assumption,
which implies no DCH, as was standard in the earlier literature on testing
for SC:
A.3.
(5)
based on
~ci$&+,,
ni = h>Ai,
(6)
+d
(7)
X,S
It is easy to see that a LM statistic for testing (2) against (3) based on a
Gaussian likelihood is of form A, with
N-l
A
I =N*@-I*
cpicj/c,,,
(8)
j=l
(2). it follows
thut
(7)
c, = T, + N- c
I
where
( [,u, +, + ~,j,
I
El &4,1',+,1
. ,, Now
2 El z14fh,,,+,
51J2(Ch,,)
?(Ch ,,,_, ,,)I 1Iktr-.
( 10)
where h,,, =_~,,(x;_ ,x,x,!K x, , and we use h,, 2 0, c>= ,b,, = k. A.3. and (2).
Thus EIc, - , 1 = O( N- 1 uniformly in ,j. Next, using A. I.
choosing
M + X, M/N + 0. It follows that N ,c;
F,(c, -- 2,) = o,J I).
From A.2 and the martingale
central
limit theorem
of Brown (I971 ).
)- /N/?~;P,~,/rr?
+<,
N(0, I ) for fixed ~1. As m + x.
(CYV, p;
cz,cp,cp; 4 0. Because N EC;; I CJ' and NET,?, = 0, 0 <,j < k, it follows that
NEIC~.-~~,~,~I(~~NC:~~I(P,I~
+ 0 as m + x. Finally
are stationary integrable
martingale
differences.
Co +,,
because
(TV. o
the uf - CT
When U, is Gaussian,
standard
methods can be employed to derive the
exact finite-sample
distribution
of /\, and ,l,, because
A, is a ratio of
13
quadratic forms in the u,. Using Pitmans (1937) result, exact moments of A,
and A, can be obtained under a Gaussian null model, and used to improve
the asymptotic approximations.
Unless the pi are eventually
all 0 [as in case of AR(p)
or MA(p)
alternatives]
computation
of A, or A, entails 0(N2> operations.
This is
unlikely to pose a serious computational
drawback nowadays unless N is very
large, in which case a double use of the FFT enables all c, to be computed in
something like the order of N 1ogN operations.
Similar savings are available
if A, is approximated
in the frequency
domain. Let h(o) = CT= rcpj cos Jo,
W(W) = N-/2C~Z,Li,
eitw and, with no effect on the asymptotic distributional
properties we discuss, replace A, by
i 1 =Q)-2N~2~h(o,~)lw(w,)12/~,~,
where o, = 27rs/N
and the primed sum omits from s = 1,. . . , N any terms
for which h is unbounded.
The w(w,~> can be computed
via the FFT. By
Parsevals equality @ = $-jTvh(w)h(-~)dw,
which can be approximated
by +N-- c; h(w,)h( -w,).
The behaviour of A, when (2) is false is also of interest. For fixed 0 # 0,
long-memory
parameterizations
of (3) may result in the N1/2(Cj - rj) not
being asymptotically
normal, where yj = E(u,u,+j),
and thus a nonnormal
limiting distribution
for the correspondingly
centred A, is a possibility [see,
e.g., Fox and Taqqu (1985)]. A condition for consistency
of a test based on
rejecting (4) when A, is large, against a general fixed alternative
autocovariante sequence (~~1, is x7= ,p, y, + 0. For a sequence of Pitman-type
alternatives, J = z/N I/, where E(u,lF,) = ~~=,T~(~>u,_~ and the s x 1 vector ( is 0
if and only if To
= 0, for all j 2 1, the asymptotic relative efficiency of our
test is <c~~jcpi~-c~(pip,)/c~~pi2,
where pi = z(a/aJ)~~Tj(O).
Theorem 2.1 will not be robust to departures
from the no-DCH assumption A.3. To justify a robust test, replace A.3 by
A.4.
E(u,u,+~u;+J
Ecu;)
= 0,
(11)
(12)
< cc,
infjZ rE(u:uf+,)
k>j21,
> 0.
(13)
We can check A.4 for versions of ARCH models, for example, such as
Of course (11) and
UrIF, - N(0, LY()+ (~ru:_r> when (Y> 0 and 0 <(Y, < 6.
(13) would be implied by (2) and (51, but because of (121, A.4 is not strictly
weaker than A.3, which required existence of only second moments.
74
Our robustified
LM statistic
A.I.
Proof.
The proof that Elc, - ?, 1 = 0( N ) uniformly, differs from that for
Theorem 2.1 only in that ElCu,~,,+,l
i .3k(E(u;))
replaces (9). For fixed
m. using (11) (cyqj~;d;)
We can bound IC(l;f;l+,
~IId,I%,+,I7rl~-11~,+,/.
N. h,c,d 2 0,
Ll + h + c + d = 4.
C+ d > 0.
The fourth
+, ).
( 13)
by
(C4)( Cu:+,)(
c~~;)i(
CCL)
=O,(N
U-th)(
-&)?((
cl,;_,)-
(15)
= o,,( N).
from
(10) and
NPC+;+,.
easily
as m
Eld,l
as m
A.4. Thus
By ergodicity
nz, where
d, =
tn. We
have
+, 0 for fixed
NE?,!<E(u:),
NE(?,Fx)=O,
O<j<k,
so that NEIC~~~~,T,/+O
+ r, as in Theorem 2.1s proof. Because of (12) and thus Eld,l r O( 1).
= O(1) uniformly.
it follows that C~pcp,cp,!(Eld,I + Eld,l + ld,l) + 0
+ x. Finally note that A.2 and (13) imply C;q,q;d, is nonsingular.
0
conditional
heteroskedasticity
A.6.
The matrix
Two distinct
75
classes of alternatives
V(u,l&) = 1+ t $jw
j=l
I12
u+
$j;.(e>ut-j
a.s.
(16)
j=l
and
V(u,lF,)
= a2 + 5
$J~(ti)(~f_~ - a2)
a.s.
( 17)
j=l
E(ujlF,)
= F~, E(uylF,)
= p4, where
sequence
so far as
(18)
where
e =
cl/{
N-l g (ii: - cc,)),
ej = N-
0
I=
CZ?~(&~+~ - co).
(19)
Our test for DCH, like those of Weiss (1986), is robust to departures
from
Gaussianity,
though in case of Gaussian kurtosis of u,, puq = 3a4, e can be
3.1.
Under
( 5 1. it follow,s
rhrrr
( 7) holds for i = 3.
Proof.
Write.
f214,
for .j 2 1.
_ ,I,
where
C, = N &4,(ufA,
Cl1 ,+,L, +,[I, are uniformly
where
r,+, -+/y+
!C, +
(l,
--
(f,, -
,P)li,}.
cr).
The proofs
that
x(14;,,
O,,( I ) follow to), using A.7. We have
tr)l,,
and
in ( IO) and
cuz+,c
= ~~~:+,h;+,, + 2&4,',,h,
+,,, c
il,h,,
\+,A,
has bounded
expectation.
We can handle ~j,~+,~, in the same way. while
1CI.:~,I, 1 I cl,:+,(c/,f
) ~ = (I,,( I) using (IO). The proof that CC,,,~ CT)cfi, =
O,,( 1) uses previous arguments
and the fact that now F,, - (7 myO,,( N ).
by A.7. It follows that C, ~ 2, = O,,( N ) uniformly. Because the 14,(14f,,
- w)
arc stationary square-integrable
martingale
differences the remainder
of the
proof follows that in section 2. and we only riced demonstrate
that C+,,
( /A, - cr4)z cr. We have
E(u,u,+,u;+~)
E(u:)
infi,
to departures
77
and
0
k>j>l,
= 0,
(20)
< cc,
(21)
(22)
N*
c
i
- ]/2 N_ ]
N-- 1
A, =
the
cp.@f
I /I
9jej9
where
A.5,
A.8,
=N12@-.2
C
I
Cpig,/g]],
alternatives
(17). A LM statistic
is A,
78
where
c (~2;-
g, = N--
The statistic _ti can also be regarded as the LM statistic when the alternatives are given by the model II, = a( 1 + C;cp,tH)tt-, , - l)}) t.,. for a standard
normal white noise sequence F,.
Theorem 3.1.
Under (2), A.S.
that (7) holds for i = 5.
(5), itjbllows
Proof.
This is very similar to previous ones. We can expand g, much as
before and demonstrate
that, for j 2 1, g, = g, + O,,(N ) uniformly, where
holds under
the fourthS, = N-I&;
~ CJ%I;+, - rr). This property
moment condition
of A.4 for much the same reason that second moments
suffked in section 2. By way of a specific illustration,
C(U~ - CT)/.,+, dcpcnds
on the L(:. but we have already seen that this is O,,( 1). We omit remaining
details.
0
Finally,
WC robustitj
,I,. replacing
where
h, = N
We introduce
A.!?.
q,)(if+, - c,,).
condition:
E((uf ~cr%q+,
E(ur)
(1;; -
- a)~, a +,, ) = 0,
I\>;2
(711)
I.
< cc,
inf ,>,E((uf
(24)
-v~)(u;~,
- CT-))) > 0.
Theorem 3.4.
Under (2), A.4,
follows that ( 7) holds for i = 6.
A.5,
A.Y.
(Sk
it
Proof.
79
Omitted.
A frequency-domain
possible, along similar
representation
of our tests against
lines to those mentioned
in section 2.
DCH
is also
and dynamic
and
(16),
(25)
(3)
and
(17).
(26)
These models extend the notion of AR models with ARCH errors. Considering (2.5) first, we initially impose the independence-up-to-fourth-moments
condition A.7. We assume also:
A.lO.
@ + P is nonsingular.
Condition
A.10 is implied by either A.2 or A.6, but if the ~~(0) and t,!,,(e)
are disjointly parameterized,
as is likely, A.2 or A.6 would not hold. A LM
statistic for alternative
(25), based on a likelihood with conditionally
Gaussian u,, is of form A, given by (6) with
N-l
A,
=N12b!-2
{pjCj/C
l+Qj/C;:},
j=l
where
d = @ + (e&2)(
r + r) + e*/c,:?P,
(2)
plus
fr0of.
Omitted.
A robust version
A.7 will be justified
A. 1I.
The conditions
E(rr,~r, ,,&)=O.
k 2; 2 I.
to departures
and
extends
(22). Let
where
Clearly
i is always
i+00f.
Omitted.
nonnegative
where
dctinite
from
Omitted.
A robustified
A.12.
81
version
The conditions
E(u,(u:+~
is justified
under:
-a%:+,)
14jZk21,
= 0,
and
2a*u,u,+j
inf min aE
jr 1 /lull= I
(uf - cqu:+,
-a*>
2~2%% +,
(u:-~T*)(u:+;-c?)
a>O,
Let
N-l
where
Omitted.
5. Long-memory
examples
Two parameterizations
of long-memory
process that have proved successful in modelling a variety of time-series
data, including economic series, will
be shown to generate alternatives
which satisfy the requirement
that @Jor 1v
be finite. For a real even sequence indexed by 13, r,(0), j = 0. + 1,. , with
y,,(B) = I, define
f(w; 0) = (27i)PcT=
_,-y,(0)e-Jw,
which need not be
bounded.
We can interpret
r,(0) and f(~; 0) as the autocovariance
and
spectral density of a covariance-stationary
process with zero mean and unit
variance.
The first model is most simply described by
Model
1:
(27)
where the scalar 0 < +. Model 1 is a fractional ARIMA (O,H, 0); Adenstedt
(1974) gave an expression
for y,(B). The second model is most simply
described by
Model
2:
r,(e)
{{l,j-
I/+ -
(28)
aY,u))
t,=-=
ae
TT
/ -Ti
af(w;O)
%3
eldw.
Routine
calculation
ModelI:
tj=i,
Model 2:
tj =
Z=G,
.I -2m
as
j-+m.
Grenander.
U.. 1954. On the estimation of regression coefficients in the case of an autocorrelated disturbance, Annals of Mathematical
Statistics 2.5. 252-272.
Pitman, E.J.G.,
1937. The closest estimates of statistical parameters.
Proceedings of the
Cambridge Philosophical Society 3.3. 212-222.
Robinson, P.M., 1978. Alternative
modelr for ;tationary stochastic procrsxs,
Stochastic Prtrcesses and Their Applications X, 141~ 152.
Robinson, P.M., AK.
Bera. and C.M. Jarque. 19X5. Texts tar \erlal dependence
in limited
dependent variable models, International
Economic Review 26. 629-f%.
Samarov. A. and MS. Taqqu. 1988. On the efficiency of the sample mean in long memo? noise.
Journal of Time Series Analysis 0. IY I-200.
Sinai. Y.G.. lY76, Self-vmilar
prohahilit). distributions. Theory of Probability and Its Appllc+
tions 21, h4-XII.
Weiss, A.A., 1986. Asymptotic theory for ARCH
models estimation and testing. Econometric
Theory 2. 107-131.
Whistler, D.E.N..
1990. Semiparametric
models of daily and intra-daily ertchangr rate volatility.
Ph.D. thesis submitted to the University of London.
Yajima. Y.. 1988, On estimation of a rcgresaion model with long-memory $tationar). error\.
Annals of Statistics 16. 79-807.