Академический Документы
Профессиональный Документы
Культура Документы
1 Introduction
Willa W. Chen
Department of Statistics, Texas A&M University, College Station, TX 77843, USA, e-mail:
wchen@stat.tamu.edu
Cliord M. Hurvich
New York University, 44 W. 4th Street, New York, NY 10012, USA, e-mail:
churvich@stern.nyu.edu
There are a variety of ways in which long memory can be dened. We start
by considering univariate series, and then move on to consider vector series.
Most denitions of long memory for a stationary series involve an asymp-
totic power law for some quantity, e.g., the spectral density near zero fre-
quency, the autocorrelation at large lags, the variance of partial sums with
increasing aggregation, the moving average coecients in an M A() repre-
sentation of the process. If the process {xt } is weakly stationary and invert-
ible with spectral density f , autocovariance sequence {cr }
r= , and moving
Fractional Cointegration 711
average representation xt = k=0 ak tk where {t } are white noise, the
long-memory assumption may be written in terms of the memory parameter
d (1/2, 1/2) {0} in the following generally non-equivalent ways:
f () K1 2d as 0+ ,
cr K2 r2d1 as r ,
n
var xt K3 n2d+1 as n ,
t=1
ak K4 k d1 as k ,
where K1 > 0, K2 , K3 > 0 and K4 are constants. Connections between these
properties are given in Robinson (1995b) and Taqqu (2003). All of these
properties hold for fractional ARIMA models (Adenstedt 1974, Granger and
Joyeux 1980, Hosking 1981), with d (1/2, 1/2) {0}, though semipara-
metric specications of long memory typically assume one of these properties
at the potential expense of the others. We will say that a weakly-stationary
series has short memory if its spectral density satises the property given
above with d = 0, i.e., the spectral density tends to a positive constant as
the frequency tends to zero.
We will say that a process is integrated of order d, denoted by I(d), if it
has memory parameter d. Thus, for a stationary and invertible series, I(d)
can be dened by any of the asymptotic power-law relations given above.
Since in fractional cointegration the original series may be nonstationary, it
is essential to be able to dene I(d) even for d > 1/2. It is also convenient to
be able to dene I(d) in the non-invertible case, d 1/2.
Hurvich and Ray (1995) worked with the spectral denition of long mem-
ory as given above, which remains valid in the non-invertible case. For all
d < 1/2, they dene a weakly stationary process to be I(d) if its spec-
tral density satises f () K1 2d as 0+ , where K1 > 0. In the
non-stationary case d > 1/2, they dene a process to be I(d) if there ex-
ists a positive integer k such that the kth ordinary dierence of the series
is I(d k), where d k (1/2, 1/2). For example, if {xt } is a random
walk, xt = xt1 + t where {t } is white noise, the ordinary rst dierence
is xt xt1 = t , which is I(0), so {xt } is I(1). In a semiparametric context,
the denition (henceforth referred to as the Type I denition) has been used
in papers on estimation of the memory parameter by Velasco (1999a, 1999b),
and Hurvich and Chen (2000) and in papers on fractional cointegration by
Chen and Hurvich (2003a, 2003b, 2006), and Velasco (2003).
Marinucci and Robinson (1999) present an alternative denition of I(d)
based on truncation, which we will call the Type II denition. Given the
choice of a time origin, t = 0 (which plays an important role here), for any
d > 1/2, the (Type II) I(d) series is represented for t 1 as
712 W.W. Chen and C.M. Hurvich
t1
xt = k tk , (1)
k=0
It follows that for d > 1/2, the Type-II I(d) process, which is neither station-
ary nor asymptotically stationary in this case, can be written as the partial
sum
xt = u1 + u2 + + ut t 1,
where
t1
ut = (1 B)1d t = ak tk ,
k=0
Limit theory has also been developed for statistics based on I(d) processes
under the Type I denition. For example, Sowell (1990) obtained the limiting
distribution of the OLS estimator in a regression of xt on xt1 for a process
{xt } such that (1 B)d xt = t , where t is iid with zero mean, with d
(0.5, 1.5). The limit distribution is a functional of Type-I fractional Brownian
motion. (See the remark on Sowells results in Marinucci and Robinson 1999).
Implications of the Type I and Type II denitions of I(d) on properties
of the discrete Fourier transform (DFT), semiparametric log-periodogram re-
gression estimates of the memory parameter, and other statistics of interest
were studied by Velasco (2007). There are noticeable dierences in the prop-
erties of the DFTs even for d < 1/2, but these do not have any important
impact on the properties of the memory parameter estimates unless d 1/2.
It is notable in this regard that, for the region d [1/2, 3/4), assuming a
Gaussian process, the (suitably normalized) log-periodogram regression esti-
mate of d has been shown to be asymptotically normal in the Type-I case
(Velasco 1999a), but no such result has been established as of yet in the
Type-II case, owing perhaps to the much stronger asymptotic correlations of
the normalized DFTs under Type-II compared to Type I I(d) when d 1/2.
f () G , 0+ , (3)
yt = Azt , (5)
(0) (s)
where zt = vec ut , . . . , ut and A = A0 As . Chen and Hurvich
(2006) make additional assumptions on the spectral density of {zt }. These
assumptions guarantee that {zt } is not cointegrated, and that the spectral
density of {yt } satises (3) with G singular, which is in turn a necessary
and sucient condition for cointegration in a stationary process such that all
components have the same memory parameter. The methodology presented
in Chen and Hurvich (2006) does not require either r or s to be known.
Robinson and Marinucci (2003) considered a Type-II model with cointe-
grating rank 1 for a non-dierenced q 1 series {Zt } which is partitioned
as {Zt } = {(Yt , Xt ) } with {Yt } 1 1 and {Xt } (q 1) 1 such that
Yt = Xt + Ut , where is an unknown (q 1) 1 cointegration param-
eter, and {Ut } has a smaller memory parameter than the minimum memory
parameter of the entries of {Zt }. Thus, (1, ) is a cointegrating vector for
{Zt }. The need to specify one of the entries of {Zt } as the response variable
may cause diculties when q 3 since there is no guarantee in general that
all entries of {Zt } appear in a cointegrating relationship with at least one of
the other entries. Thus if a randomly chosen component of {Zt } is labeled as
the response variable, there is no guarantee that the regression model above
will hold, and in any case regression-type estimators of the parameter will
not be invariant to this choice.
The models for cointegration considered above are all semiparametric, in that
the spectral density is only specied in a neighborhood of zero frequency.
Parametric models are also of interest, in which the time-series dynamics
of the series is fully determined by a nite set of xed, unknown parame-
ters, although the distribution of the innovations may not be parametrically
Fractional Cointegration 717
4 Tapering
yt = xt + ut ,
+ cov(xt , ut )/var(xt ).
where fx is the spectral density of {xt } and fxu is the cross-spectral density
of {(xt , ut ) }. By the Cauchy-Schwartz inequality, fxu () = O( (d+du ) ) as
0+ , so that fxu ()/fx () 0 as 0+ . Thus, if lies in a neighbor-
hood which shrinks to zero as the sample size n increases, the contribution
to cov(xt , ut ) from frequency is negligible compared to the contribution
to var(xt ) from frequency . This motivates the narrowband least-squares
estimator of , given by Robinson (1994) as
mn
mn
N BLS = Re{Jx,j Jy,j }/ |Jx,j |2 ,
j=1 j=1
1
n
Jz,j = zt exp(ij t)
2n t=1
for any series {zt }, and j = 2j/n. It then follows from Robinson (1994)
that N BLS as n , under an additional assumption on the errors
p
m0
m0
CH = T T
Re{Jx,j Jy,j }/ |Jx,j
T 2
| ,
j=1 j=1
and
ht = (1/2)[1 exp{i2(t 1/2)/n}]. (6)
Chen and Hurvich (2003a) showed that subject to suitable regularity condi-
tions the rate of convergence of CH is always nddu (the same as ndX dU ),
in the sense that nddu (CH ) converges in distribution. This uniformity
of the rate of convergence of CH (i.e. the dependence on d du alone), as
contrasted with the nonuniformity of the rate of OLS or N BLS , is due to
the combination of tapering and the use of a xed bandwidth m0 in CH .
720 W.W. Chen and C.M. Hurvich
Though it may at rst seem surprising that CH can be consistent for
even though m0 is xed (after all, the use of a xed number of frequencies
in semiparametric estimation of the memory parameter would result in an
inconsistent estimator), the consistency of CH follows since it can be shown
that each term of
m0
m0
CH = T T
Re{Jx,j Ju,j }/ |Jx,j
T 2
|
j=1 j=1
For a given value of the dierence of the memory parameters (i.e., the strength
of the cointegrating relationship), CH , which is based on the tapered dier-
ences of order p 1 (and is invariant to additive polynomial trends of order
p 1 in the levels) will be ndX dU -consistent. The comparison is separated
into several cases, due to the non-uniform rates of convergence for OLS
and N BLS . All three estimators converge at the same rate when dU > 0,
dU + dX > 1, and also when dU = 0, dX = 1. The estimators CH and
OLS converge at the same rate when dU = 0, dX > 1, though Robinson
and Marinucci (2001) do not present a rate of convergence for N BLS in this
case. In the remaining cases, CH has a faster rate of convergence than the
other two estimators. We present the comparisons in terms of the improve-
ment factor, given as the ratio of the rates of convergence. For example, if
another estimator is n -consistent with < dX dU then the improvement
factor for CH relative to the other estimator is ndX dU . For the case
dU > 0, dU + dX = 1, the improvement factor for CH relative to OLS is
log n, and the improvement factor for CH relative to N BLS is log mn . For
the case dU 0, dU + dX < 1, the improvement factor for CH relative to
OLS is n1dU dX , and the improvement factor for CH relative to N BLS
U dX
is m1d
n . In the latter two cases, the slower the rate of increase of mn ,
Fractional Cointegration 721
the less inferior is the performance of N BLS compared to that of CH . This
helps to justify the use of a xed bandwidth m0 in CH . It can be seen that
the spectral density and periodogram matrices at the very low frequencies
(e.g., 1 , . . . , m0 with m0 xed) play a key role in fractional cointegration.
For a multivariate series, Robinson and Marinucci (2003) considered prop-
erties of NBLS and OLS estimators of the cointegrating parameter in the
multiple regression model
Yt = Xt + Ut
described above. The estimators are
and OLS given by a similar formula with (1, mn ) replaced by (1, "n/2#) or
(0, "n/2#), where
k
Fab (, k) =
Re(Ja,j Jb,j ),
j=
the superscript denotes conjugate transpose, and for any time series of
column vectors {ct }nt=1 ,
1
n
Jc,j = ct exp(ij t).
2n t=1
for i = 1, . . . , q 1, where iN BLS is the ith entry of N BLS , di is the memory
parameter of the ith entry of {Xt }, and dU is the memory parameter of
{Ut }. They next considered a Type-II model in which the observable series
are nonstationary. Here, the convergence rates (also reported in Marinucci
and Robinson 2001) for iN BLS i and iN BLS i are analogous to those
obtained for bivariate series in Robinson and Marinucci (2001).
Chen and Hurvich (2006) considered estimation in the Type-I fractional
common components model (4), (5), based on the averaged periodogram
matrix,
m0
T
Im0 = Re{Jy,jT
Jy,j },
j=1
T
where m0 is a xed positive integer, Jy,j is the (q 1) tapered DFT vector
n
T
Jy,j = hp1
t yt exp(ij t),
t=1
722 W.W. Chen and C.M. Hurvich
and the taper {ht } is given by (6). Note that Im0 , and statistics based on
it, are equivariant to permutation of the entries of {yt }. The eigenvalues
1 2 q of Im0 satisfy j = Op (n2dk ), for j Nk , where
N0 = {1, . . . , a0 } and Nk = {(a0 + + ak1 ) + 1, . . . , (a0 + + ak )}
for k = 1, . . . , s. The eigenvectors corresponding to the jth largest eigenval-
ues for j Nk converge in distribution to (random) vectors lying in the kth
cointegrating subspace, k = 0, . . . , s. Although these eigenvectors do not con-
sistently estimate xed population vectors, the estimated space nevertheless
converges to the kth cointegrating subspace in the sense that the norm of
the sine of the angle between the true and estimated cointegrating subspaces
is Op (nk ) where k is the shortest gap between the memory parameters
corresponding to the given and adjacent subspaces. Gaussian semiparametric
estimators constructed from the residuals, i.e., the contemporaneous linear
combination of yt with weights given by an eigenvector in the estimated
kth cointegrating subspace, are asymptotically normal and consistent for dk ,
where the bandwidth mn , with an upper bound that becomes more
restrictive when min = mink (k ) = min(d0 d1 , . . . , ds1 ds ) decreases.
If the short-memory component of the spectral density of {zt } is suciently
smooth, then the upper bound is determined by (m2+1 n /n2 ) log2 mn 0,
1/2
where = min(min , 2). If min > 1/2, then the estimator is mn -consistent.
1/2
If min 1/2 then the estimator is no longer mn -consistent. The restriction
on the rate of increase of mn arises because a linear combination of series with
slightly dierent memory parameters will typically have an irregular short-
memory component in its spectral density. Given an a priori lower bound
on min it is possible to use the estimates of the dk to consistently estimate
the dimensions a0 , . . . , as of the cointegrating subspaces, as well as s itself
and the cointegrating rank r = a1 + + as . This can be accomplished by
setting the group boundaries at the points where the sorted estimates of the
dk dier by a sucient amount. While the need for a lower bound on min
is unfortunate since such a quantity would rarely be known in practice, we
note that such lower bounds (assuming s = 1) arise implicitly or explicitly
in other works on semiparametric fractional cointegration. See Robinson and
Yajima (2002), Assumption D, and Velasco (2003), Theorems 2 and 4, as well
as Nielsen and Shimotsu (2007).
The estimators for the cointegrating parameter considered above are all
direct in the sense that they do not require estimation of memory param-
eters or other nuisance parameters. An alternative promising approach was
proposed by Robinson (2006), in the context of a Type I stationary bivariate
system. Under this approach, a bivariate local Whittle estimator is used to
jointly estimate the memory parameters du1 < du2 < 1/2, the parameter
(which is a cointegrating parameter if it is nonzero) and the phase pa-
rameter in the bivariate system (yt , xt ) where yt = xt + u1,t , xt = u2,t
and the spectral density of (u1,t , u2,t ) satises (3) with phase parameters
1 = 0, 2 = (/2 )(du2 du1 ). If = 0, then both {xt } and {yt } have
memory parameter du2 but the linear combination yt xt has memory pa-
Fractional Cointegration 723
Although this chapter has focused mainly on models for fractional cointegra-
tion and estimation of the cointegrating parameter, there remains the more
basic question of testing for the existence of fractional cointegration. There
has been some progress in this direction, which we describe briey here. A
method proposed by Robinson (2006) based on joint local Whittle estima-
tion of all parameters was described in Section 5, although the bivariate
model considered there rules out the possibility that the two series have the
same memory parameter if there is no cointegration. Marinucci and Robin-
son (2001) proposed a Hausman-type test in which the memory parameters of
the observed series are estimated by two dierent methods: (1) a multivariate
Gaussian semiparametric estimator, imposing the constraint that the mem-
ory parameters are the same, and (2) a univariate Gaussian semiparametric
estimator of a particular entry. The component of (1) for this entry would
724 W.W. Chen and C.M. Hurvich
References