Академический Документы
Профессиональный Документы
Культура Документы
Hankel Matrices
Alok Bakshi
May 11, 2011
Abstract
Using method of moments, Bryc, Dembo, and Jiang (2006) showed
that for both Toeplitz and Hankel matrices (denoted by T and H re-
spectively), the corresponding empirical spectral distribution (n
1
2
T
n
)
and (n
1
2
H
n
) weakly converges to non-random symmetric probabil-
ity measures
T
and
H
with unbounded support, and moreover
T
and
H
dont depend upon the distribution of input sequences. They
conjectured that both measures are absolutely continuous and in par-
ticular
T
has a smooth density. Thereafter Sen, and Virag (2011)
gave a partial solution to the conjecture about Toeplitz matrix, prov-
ing that
T
is absolutely continuous with bounded density. In their
proof, rather than using method of moments, they realized the spec-
trum of Gaussian Toeplitz matrix as diagonal matrix (consisting of
independent Gaussian random variables) conjugated by the determin-
istic projection matrix. The next key step then was to use the spectral
averaging technique, which was developed by Combes, Hislop, and
Mourre (1996) in dierent context.
In this project, we undertook the task of adapting the technique
to Hankel matrix. Building upon the work of Bose and Hazra, where
using reverse circulant matrices they realized the spectrum of Gaussian
Hankel matrix as random Diagonal matrix (consisting of independent
Gaussian and Rayleigh distributed random variables) conjugated by
(nondeterministic) projection matrix we seek out to nd a suitable
variant of spectral averaging technique to apply in this particular case.
1
1 Empirical and limiting spectral distribu-
tions
Given an n n matrix A
n
, its empirical spectral measure
n
is dened as
n
=
1
n
n
i=1
i
where (
i
)
n
i=1
are the eigenvalues of A
n
and is the dirac delta function.
For symmetric real matrices, with which we will be concerned here, all the
eigenvalues are real and so
n
is a measure on real line R. The corresponding
empirical distribution function (ESD) is given by
F
A
n
(x) =
1
n
n
i=1
I
i
x
For a sequence of matrices (A
n
)
n=1
, the limiting spectral distribution
(LSD) is dened as the weak limit of F
A
n
with the implicit assumption that
such a limit exists. In real symmetric case, all ESD and therefore LSD too
is a distribution on R.
2 Spectrum for random symmetric Toeplitz
Matrix
A random symmetric Toeplitz matrix is dened as
T
n
= ((x
|ij|
))
0i,jn1
where the input sequence (x
i
)
i=0
consists of independent identically dis-
tributed random variables with Var(x
0
) = 1.
Since
T
doesn not depend upon the distribution of individual entries,
so without loss of generality we can assume x
i
to be independent standard
Gaussian distributed random variables. For the sake of convenience in com-
putation, we will work with T
o
n
, which has
2x
0
on its diagonal entries and
2
rest of the entries are same as that of T. Using Mallows metric we see
W
2
2
(F
n
1/2
T
n
, F
n
1/2
T
o
n
)
1
n
n
i=1
(
i
(n
1
2
T
n
)
i
(n
1
2
T
o
n
)))
2
1
n
Tr(n
1
2
T
n
n
1
2
T
o
n
)
2
1
n
2
n1
i=0
(
2 1)
2
x
2
0
0 as n
so that T
o
n
and T
n
will have same limiting spectral distribution.
A nn random symmetric Toeplitz matrix T
o
n
is nn principal submatrix
of 2n 2n symmetric circulant matrix, which is dened as
C
2n
= ((y
n|n|ij||
))
0i,j2n1
, where y
j
=
_
_
_
x
j
0 < j < n
x
2nj
n < j < 2n j
2x
j
j 0, n
Thus, we do have
Q
2n
C
2n
Q
2n
=
_
T
o
n
0
n
0
n
0
n
_
, where Q
2n
=
_
I
n
0
n
0
n
0
n
_
(1)
The circulant matrix C
2n
can be diagonalized as (2n)
1
2
C
2n
= U
2n
D
2n
U
2n
,
the unitary matrix U
2n
being given by
U
2n
(j, k) =
1
2n
exp
_
2ijk
2n
_
, where 0 j, k 2n 1
and the diagonal matrix D
2n
= diag(d
0
, d
1
, . . . , d
2n1
), where
d
j
=
1
2n
2n1
k=0
y
k
exp
_
2ijk
2n
_
=
1
2n
_
2x
0
+ (1)
j
2x
n
+ 2
n1
k=1
x
k
cos
_
2jk
2n
_
_
Since y
j
= y
2nj
, and all the d
j
are real so for n < j < 2n we observe
that d
j
= d
2nj
. Furthermore it can be seen that (d
j
)
0jn
are independent
mean zero Gaussian random variables, where Var(d
j
) =
_
1 0 < j < n
2 j 0, n
Dening P
2n
= U
2n
Q
2n
U
2n
, we see that P
2
2n
= P
2n
= P
2n
or P
2n
is a
Hermitian projection matrix. Moreover P
2n
is a deterministic project matrix
(since its entries dont depend upon (x
i
)) with P
2n
(j, j) =
1
2
j. Thus
(2n)
1
2
U
2n
Q
2n
C
2n
Q
2n
U
2n
= P
2n
D
2n
P
2n
(2)
3
so that spectrum of Q
2n
C
2n
Q
2n
is realized as the diagonal matrix D
2n
,
conjugated by the projection matrix P
2n
. But from equation (1), we see
that (counting multiplicities) eigenvalues of Q
2n
C
2n
Q
2n
precisely consist of
all eigenvalues of T
o
n
and zero (n times). Thus the above procedure gives us
a handle on the spectrum of the Toeplitz matrix.
3 Spectrum for random Hankel Matrix
A random Hankel matrix is dened as
H
n
= ((x
i+j
))
0i,jn1
where the input sequence (x
i
)
i=0
consists of independent identically dis-
tributed random variables with Var(x
0
) = 1.
Since
H
doesn not depend upon the distribution of individual entries, so
again without loss of generality we can assume x
i
to be independent standard
Gaussian distributed random variables. Furthermore Hankel matrix can be
realized as the principal minor of reverse circulant matrix. Reverse circulant
matrix is related to circulant matrix via the following relation
R
2n
= T
2n
C
2n
, where T
2n
=
_
_
_
_
_
_
_
_
_
1 0 0 . . . 0 0
0 0 0 . . . 0 1
0 0 0 . . . 1 0
.
.
.
0 0 1 . . . 0 0
0 1 0 . . . 0 0
_
_
_
_
_
_
_
_
_
In words, in reverse circulant matrix, shifting each row to the left instead
of right, we get the next row. Thus we have
Q
2n
R
2n
Q
2n
=
_
H
n
0
n
0
n
0
n
_
, where Q
2n
=
_
I
n
0
n
0
n
0
n
_
(3)
So as in the case of Toeplitz matrix, we need to analyze the eigenvalues of
R
2n
. But unlike its relative C
2n
, the description is bit complicated. Similar
to the case of symmetric Circulant matrix, for circulant matrix we have the
spectral decomposition namely, (2n)
1
2
C
2n
= U
2n
D
2n
U
2n
, where (denoting
4
primitive 2n root of unity by ) U
2n
is the following matrix
U
2n
=
_
_
1 1 1 . . . 1 1
1
2
. . .
2n2
2n1
1
2
4
. . .
2(2n2)
2(2n1)
.
.
.
1
2n2
(2n2)2
. . .
(2n2)(2n2)
(2n2)(2n1)
1
2n1
(2n1)2
. . .
(2n1)(2n2)
(2n1)(2n1)
_
_
Interestingly, we have the following relation
U
2n
U
2n
= U
2n
U
2n
= T
2n
where R
2n
= T
2n
C
2n
Therefore, we have
(2n)
1
2
R
2n
= (2n)
1
2
T
2n
C
2n
= T
2n
U
2n
D
2n
U
2n
= U
2n
T
2n
D
2n
U
2n
= U
2n
M
2n
U
2n
where M
2n
= T
2n
D
2n
Now to get hold of eigenvalues of reverse circulant matrix (2n)
1
2
R
2n
, it
suces to get the spectral decomposition of M
2n
. Concretely, we have
M
2n
=
_
_
d
0
0 0 . . . 0 0
0 0 0 . . . 0 d
2n1
0 0 0 . . . d
2n2
0
.
.
.
0 0 d
2
. . . 0 0
0 d
1
0 . . . 0 0
_
_
Here d
j
are the eigenvalues of the circulant matrix C
2n
. Denoting the rst
row of R
2n
as (y
k
)
0k2n1
we have
d
j
=
1
2n
2n1
k=0
y
k
exp
_
2ijk
2n
_
=
1
2n
2n1
k=0
y
k
cos
_
2jk
2n
_
+i
1
2n
2n1
k=0
y
k
sin
_
2jk
2n
_
= A
j,n
+iB
j,n
= r
j
exp(i
j
)
5
Where r
j
=
_
A
2
j,n
+B
2
j,n
and
j
= tan
1
(
B
j,n
A
j,n
) When (y
k
)
02n1
are inde-
pendent standard Gaussian random variables then (r
j
)
1jn1
, (
j
)
1jn1
,
d
0
and d
n
are independent random variables, where r
j
follow Rayleigh dis-
tribution and
j
follow uniform distribution over [, ].
Let I
j
=
_
e
j
j 0, n
1
2
[sgn(n j) exp(i
j
)e
j
+e
2nj
] otherwise
We dene L
2n
= [I
0
, I
1
, I
2
, . . . , I
2n1
]. Concretely we have
L
2n
=
_
_
1 0 0 . . . 0 0
0
1
2
exp(i
1
) 0 . . . 0
1
2
exp(i
1
)
0 0
1
2
exp(i
2
) . . .
1
2
exp(i
2
) 0
.
.
.
0 0
1
2
. . .
1
2
0
0
1
2
0 . . . 0
1
2
_
_
Moreover let D
2n
= [d
0
, r
1
, r
2
, . . . , r
n1
, d
n
, r
n1
, . . . , r
1
]. By result of Bose
and Mitra(2002), D
2n
consist of all the eigenvalues of reverse circulant matrix
R
2n
and in fact M
2n
= L
2n
D
2n
L
2n
So we have
(2n)
1
2
R
2n
= U
2n
T
2n
D
2n
U
2n
= U
2n
M
2n
U
2n
= U
2n
L
2n
D
2n
L
2n
U
2n
= V
2n
D
2n
V
2n
where V
2n
= U
2n
L
2n
Being a product of two unitary matrices, V
2n
is also a unitary matrix.
Dening hermitian projection matrix P
H
2n
= V
2n
Q
2n
V
2n
, we get the relation
(2n)
1
2
Q
2n
R
2n
Q
2n
= Q
2n
V
2n
D
2n
V
2n
Q
2n
= V
2n
P
H
2n
D
2n
P
H
2n
V
2n
From equation (3), we see that (counting multiplicities) eigenvalues of
(2n)
1
2
R
2n
precisely consists of eigenvalues of (2n)
1
2
H
2n
and zero (n times).
Thus spectrum of (2n)
1
2
H
2n
is realized as the diagonal matrix D
2n
conju-
gated by the non-deterministic projection matrix P
H
2n
.
6
4 Spectral Properties
Given a nite measure on R, its Cauchy-Stieltjes transformation is
o(z, ) =
_
R
1
x z
(dx), where z C, (z) > 0
In the special case, when is the empirical measure of a nn symmetric
matrix A, we have
o(z, (A)) =
1
n
n
i=1
1
i
z
=
1
n
Tr(A zI)
1
=
1
n
n1
j=0
e
j
, (A zI)
1
e
j
_
y
x
(o(E +i; ))dE
where x < y are the continuity points of .
Moreover putting a bound on (o(z; )) in the upper half complex plane
will entail that is absolutely continuous with respect to the Lebesgue mea-
sure. This follows from the inversion formula, as
sup
z:(z)>0
(o(z; ) K = (x, y)
K
(x, y) =
d
d
K
Sen and Virag(2011) used the same approach for proving the absolute
continuity of
T
. To put a bound on Stieltjes transform, rst we need to
rewrite it in suitable format.
Let (e
j
)
0j2n1
be the coordinate vectors of R
2n
, then for z C and
(z) > 0, we next prove the following two results
o(z; E(n
1
2
T
0
n
)) =
2
n
2n1
j=0
EPe
j
, (PDP zI)
1
Pe
j
o(z; E(n
1
2
H
n
)) =
2
n
2n1
j=0
EP
H
e
j
, (P
H
D
P
H
zI)
1
P
H
e
j
7
and these thus give us relation between the Stieltjes transformation of Toeplitz/Hankel
matrix and the corresponding representation of it as diagonal matrix conju-
gated by the projection matrix. We prove the rst of these results, the second
one is proved by following the same procedure. So, we have
o(z, (n
1
2
T
o
n
)) =
1
n
n1
j=0
e
j
, (n
1
2
QCQzI)
1
e
j
2
n
n1
j=0
e
j
, (QUDU
QzI)
1
e
j
2
n
n1
j=0
U
e
j
, (PDP zI)
1
U
e
j
Moreover we have
2n1
j=0
P e
j
e
j
P = P
_
2n1
j=0
e
j
e
j
_
P = P
2
= P = U
QU
= U
_
n1
j=0
e
j
e
j
_
U =
n1
j=0
U
e
j
e
j
U
8
Thus we do the following manipulation
o(z, (n
1
2
T
o
n
)) =
2
n
n1
j=0
U
e
j
, (PDP zI)
1
U
e
j
=
n1
j=0
e
j
, U(PDP zI)
1
U
e
j
=
n1
j=0
U(PDP zI)
1
U
(j, j)
=
n1
j=0
2n1
j
1
,j
2
=0
U(j, j
1
)(PDP zI)
1
(j
1
, j
2
)U
(j
2
, j)
=
2n1
j
1
,j
2
=0
(PDP zI)
1
(j
1
, j
2
)
_
n1
j=0
U
(j
2
, j)U(j, j
1
)
_
=
2n1
j
1
,j
2
=0
(PDP zI)
1
(j
1
, j
2
)
_
n1
j=0
U
e
j
e
j
U
_
(j
2
, j
1
)
=
2n1
j
1
,j
2
=0
(PDP zI)
1
(j
1
, j
2
)
_
2n1
j=0
Pe
j
e
j
P
_
(j
2
, j
1
)
=
2n1
j
1
,j
2
=0
(PDP zI)
1
(j
1
, j
2
)
_
2n1
j=0
P(j
2
, j)P(j, j
1
)
_
=
2n1
j=0
2n1
j
1
,j
2
=0
P(j, j
1
)(PDP zI)
1
(j
1
, j
2
)P(j
2
, j)
=
2n1
j=0
P(PDP zI)
1
P(j, j) =
2n1
j=0
Pe
j
, (PDP zI)
1
Pe
j
1
2
T
0
n
)) =
2
n
2n1
j=0
EPe
j
, (PDP zI)
1
Pe
j
Next we state and prove the spectral averaging technique by Combes et.
al. (1996) and afterwards a simple application of the above theorem will give
us a bound on the Stieltjes transformation of
T
, thus proving the absolute
continuity of
T
. Later we will see the diculties in using the same approach
for Hankel matrix.
9
5 Spectral averaging technique by Combes
et. al. (1996)
Let H
, R be a C
2
family of self adjoint operators such that D(H
) =
D
0
1 R, and such that (H
zI)
1
is twice strongly dierentiable
in for all z, (z) ,= 0. Assume that there exists a nite positive constant
c
0
, and a positive bounded self adjoint operator B such that,on D
0
=
dH
d
c
0
B
2
Also assume that H
is linear in , i.e.
H
=
d
2
H
d
2
= 0. Then for all E R
and twice continuously dierentiable function g such that g, g/, g// /
1
(R)
and for all 1, we have
sup
>0
_
R
g(), B(H
(E +i)I)
1
Bd
c
1
0
(|g|
1
+|g/|
1
+|g//|
1
) ||
2
(4)
The vanishing second derivative hypothesis can be relaxed by just requir-
ing that [
H
[ c
1
H
(E+i)I)
1
B = , B(H
(E i)I)
1
B, so prov-
ing the above inequality (where the supremum is taken in the upper half
complex plane) will imply the same inequality where the supremum is taken
in the lower half complex plane and vice versa. In particular we do have
sup
<0
_
R
g(), B(H
(E +i)I)
1
Bd
c
1
0
(|g|
1
+|g/|
1
+|g//|
1
) ||
2
sup
>0
_
R
g(), B(H
(E +i)I)
1
Bd
c
1
0
(|g|
1
+|g/|
1
+|g//|
1
) ||
2
In the proof, we will take supremum over the negative half complex plane.
10
Finally we observe that we dont need to have supremum over the whole
half plane but taking supremum over the strip : 0 < < 1, will suce. This
follows since in the case of when 1, we can let G
=
H
and D =
B
, so
that
G
c
0
D
2
and thefore
sup
0<<1
_
R
g(), D(G
(E +i)I)
1
Gd
c
1
0
(|g|
1
+|g/|
1
+|g//|
1
) ||
2
sup
>0
_
R
g(), B(H
(E +i)I)
1
Bd
c
1
0
(|g|
1
+|g/|
1
+|g//|
1
) ||
2
Now, for > 0 and 0 < < 1, we make the following denitions
R(, , ) = (H
(E i)I +i
H
)
1
and K(, , ) = BR(, , )B
Now, keeping in mind the inequality
H
c
0
B
2
, we have
, K(, , ) = , BR(, , )
(I +
H
)R(, , )B c
0
|K(, , )|
2
But by Cauchy Schwartz inequality and the fact that || = 1, we also have
, K(, , ) |K(, , )|. So we get the following inequality
|K(, , )|
1
c
0
(5)
Now we make another denition
F(, ) =
_
R
g(), K(, , )d (6)
Inequality (5) gives us the following bound
F(, )
1
c
0
|g|
1
(7)
Dierentiating F(, ) with respect to , and using integration by parts and
the fact that
H
= 0, we get
i
dF(, )
d
=
_
R
g()BR(, , )
H
R(, , )Bd
=
_
R
g()
d
d
, K(, , )d
11
Therefore from (7) and using integration by parts, we have
dF(, )
d
_
R
g/(), K(, , )d
1
c
0
|g/|
1
(8)
Integration of the above inequality, yields the following improved bound on
F(, )
[F(, )[ c
1
0
|g/|
1
[ log()[ +[F(1, )[ c
1
0
|g/|
1
[ log()[ +c
1
0
|g|
1
(9)
If we dene the function
F(, ) =
_
R
g/(), K(, , )d then following
the same procedure for
F(, ) instead of F(, ), we get
[
F(, )[ c
1
0
|g/|
1
[ log()[ +[F(1, )[ c
1
0
|g//|
1
[ log()[ +c
1
0
|g/|
1
which implies that
dF(, )
d
c
1
0
|g//|
1
[ log()[ +c
1
0
|g/|
1
Finally integrating the above inequality again, we get
[F(, )[ c
1
0
(|g//|
1
+|g/|
1
) +[F(1, )[ c
1
0
(|g//|
1
+|g/|
1
+|g|
1
)
(10)
We note that R(, , ) (H
(E i)I)
1
as 0
+
, therefore by
dominated convergence theorem we get the following result
sup
0<<1
_
R
g(), B(H
(E i)I)
1
Bd
c
1
0
(|g|
1
+|g/|
1
+|g//|
1
)
(11)
and as discussed earlier, the general result follows from the aforementioned
inequality.
6 Absolute Continuity for
T
Lets take
E
j
=
_
e
j
e
j
+e
2nj
e
2nj
1 j < n
e
j
e
j
j 0, n
B
j
=
_
Pe
j
e
j
P or Pe
2nj
e
2nj
P 1 j < n
Pe
j
e
j
P j 0, n
12
Now we take H
= P(D+(d
j
)E
j
)P, so that H
= PE
j
P B
j
=
P(j, j)
1
B
2
j
= 2B
2
j
, so that in theorem we can use c
0
= 2 and P(j, j) =
1
2
.
Now for the choice of g, let Z denote the standard Gaussian random variable.
Then we take
g =
_
density of Z 0 < j < n
density of
2Z j 0, n
From the denition, we have |g|
1
= 1, |g/|
1
_
2
, |g//|
1
2.
Thus plugging in = e
j
or e
2nj
and taking expectation with respect to
d
0
, d
1
, . . . , d
j1
, d
j+1
, . . . , d
n
(noticing that integral is equivalent to taking
expectation with respect to d
j
, and by virtue of independence of (d
k
) we can
break up expectation in this form) we nally get
sup
z:(z)>0
P(j, j)
2
EPe
j
, (PDP zI)
1
Pe
j
c
1
0
(|g//|
1
+|g/|
1
+|g|
1
)
Therefore, we get the following bound on o(z; E(n
1
2
T
0
n
))
o(z; E(n
1
2
T
0
n
))
2
n
2n1
j=0
EPe
j
, (PDP zI)
1
Pe
j
2
n
2n 8
16
2
So as a consequence of bound, we know that
T
is absolutely continuous
with respect to Lebesgue measure.
7 Absolute Continuity for
H
The same approach can be used for Hankel matrix. Though projection matrix
in this case P
H
is non-deterministic, still it does not poses major problems
as we still have P(j, j) =
1
2
. However because of the structure of diagonal
matrix D
j
e
2nj
e
2nj
for 1 j < n, and
13
that means
H
= PE
j
P is not even a
postive semidenite matrix.
Our task in the project was to study the proof of spectral averaging
technique and to check whether it could be suitably modied so that we no
longer need the condition on lower bound of
H