Академический Документы
Профессиональный Документы
Культура Документы
Shane M. Haas
Laboratory for Information and Decisions Systems, and Research Laboratory of Electronics, Massachusetts Institute of Technology,
77 Massachusetts Avenue, Cambridge, MA 02139, USA
Email: shaas@mit.edu
Jeffrey H. Shapiro
Laboratory for Information and Decisions Systems, and Research Laboratory of Electronics, Massachusetts Institute of Technology,
77 Massachusetts Avenue, Cambridge, MA 02139, USA
Email: jhs@mit.edu
Vahid Tarokh
Laboratory for Information and Decisions Systems, Massachusetts Institute of Technology, 77 Massachusetts Avenue,
Cambridge, MA 02139, USA
Email: vahid@mit.edu
A space-time channel coding technique is presented for overcoming turbulence-induced fading in an atmospheric optical hetero-
dyne communication system that uses multiple transmit and receive apertures. In particular, a design criterion for minimizing the
pairwise probability of codeword error in a space-time code (STC) is developed from a central limit theorem approximation. This
design criterion maximizes the mean-to-standard-deviation ratio of the received energy difference between codewords. It leads
to STCs that are a subset of the previously reported STCs for Rayleigh channels, namely those created from orthogonal designs.
This approach also extends to other fading channels with independent, zero-mean path gains. Consequently, for large numbers of
transmit and receive antennas, STCs created from orthogonal designs minimize the pairwise codeword error probability for this
larger class of fading channels.
Keywords and phrases: space-time codes, optical communication, orthogonal designs.
α11
α12
Information Space-time
source encoder Receiver
...
...
α1M
αNM
source to a codeword of the STC. The codeword c, where c = the codeword c that minimizes
(c1(1), c1(2), . . . , c1 (T), c2(1), . . . , c2(T), . . . , cN (1), . . . , cN (T)) 2
M T N
is sent over T nonoverlapping adjacent discrete-time slots
rm (t) − αnm cn (t) . (2)
using N transmit apertures. During time slot t, transmit m=1 t =1
n=1
aperture n sends cn (t), a symbol from a quadrature-
amplitude-modulation (QAM) signal constellation. The exact probability of error is difficult to calculate for
A laser beam propagating over a clear-weather, line-of- an STC with more than two codewords. An upper bound on
sight atmospheric path from transmitter exit optics to re- this probability of error comes from the union bound
ceiver entrance optics experiences amplitude and phase fluc-
tuations due to refractive-index turbulence [3]. A propa- Pe ≤ Pr(c −→ e), (3)
c,e
gation model, established in [4], based on the extended
Huygens-Fresnel principle [5] characterizes this fading as a where Pr(c → e) is the probability of decoding codeword c
complex lognormal process with correlation times on the or- as codeword e in the absence of all other codewords. This
der of 10−3 to 10−2 seconds. At high data rates, a single fade sum is usually dominated by the terms of the closest, or min-
can obliterate several message packets. imum distance, codeword pairs. The union bound estimate
Because the duration of a fade is usually much longer of the codeword error probability is the sum of pairwise error
than the length of a message packet, we will assume that probabilities of the minimum distance codeword pairs
the fades are constant during a codeword transmission. We
will model the path gain from transmit aperture n to re- Pe ≈ Kmin Pr(c −→ e)min , (4)
ceive aperture m as αnm = exp(χnm + jφnm ). Here χnm ,
φnm are independent Gaussian random variables with mo- where Kmin is the average number of minimum-distance
ments var(χnm ) = σχ2 , E(χnm ) = −σχ2 , var(φnm ) = σφ2 1, codeword neighbors and Pr(c → e)min is the pairwise prob-
and E(φnm ) = 0. The log-amplitude variance, σχ2 , typically ability of erroneously decoding a pair of minimum distance
lies within the range 0.01 (mild fading) to 0.35 (severe fad- codewords.
ing). We also assume that the spacing between elements of Given knowledge of the path gains and assuming equally-
the receiver aperture array is large enough to ensure that likely codewords, the pairwise probability of incorrectly de-
the path gains for different (n, m) values are approximately coding transmitted codeword c as codeword e is
independent.
We will assume optical heterodyne reception, for which d2 (c, e)
Pr(c −→ e | α) = Q , (5)
the detector output is known to consist of a frequency down- 2N0
shifted version of the incident optical field plus an additive
white Gaussian noise [6]. We will use wm (t) to denote this ad- where
ditive Gaussian noise for receive aperture m during time slot 2
M T N
t; it is a complex-valued, zero-mean, white Gaussian random d (c, e) =
2
αnm en (t) − cn (t) (6)
process with variance N0 /2 per real dimension. m=1 t =1 n=1
Combining the fading and additive noise fluctuations,
the signal at receive aperture m ∈ {1, . . . , M } during time is the distance between codewords at the receiver, and Q(x) is
slot t ∈ {1, . . . , T } is the area under the tail of the standard normal density func-
tion. Averaging over α, the unconditional probability of in-
N correctly decoding c as e is therefore,
rm (t) = αnm cn (t) + wm (t). (1)
n=1
Pr(c −→ e) = Pr(c −→ e | α)pα (α)dα, (7)
Given a received sequence {rm (t) : 1 ≤ m ≤ M, 1 ≤ t ≤ T }
and knowledge of the path gains α = {αnm : 1 ≤ n ≤ N, 1 ≤ where pα (α) is the joint probability density function of the
m ≤ M }, the minimum probability of error receiver chooses lognormal path gains.
Space-Time Codes for Wireless Optical Communications 213
Our ultimate objective is to construct a space-time code matrix, minimizes the CLT approximation to the pairwise
that minimizes the exact probability of error, Pe . In this pa- probability of error.
per, however, we will focus on minimizing Pr(c → e)min in
the union bound estimate of this probability. 3.1. Normalized parameters
Our first step in minimizing (10) is to rewrite Pr(c → e)
3. DESIGN CRITERION in terms of normalized parameters. The first normalized pa-
rameter measures the strength of the fading. Define the nor-
The integral in (7) is very difficult to evaluate analytically be-
malized fading strength, η, to be the standard-deviation-to-
cause of the lognormal density function. We will attempt to
mean ratio of the energy difference between the codewords
simplify its evaluation using a central limit theorem (CLT)
at the receiver, that is, η = σ/µ. The second normalized pa-
approximation.
rameter measures the total received signal-to-noise ratio and
Rewriting (6) as
is defined as ρ = µ/N0 .
M
N
N With the change of variables z = x/µ, (10) becomes
d2 (c, e) = αnm α∗km Ank , (8)
m=1 n=1 k=1 ∞
1
Pr c −→ e; η , ρ ≈2
Q ρz pZ |Z ≥0 (z)dz, (12)
where 0 2
T ∗ where
Ank = en (t) − cn (t) ek (t) − ck (t) (9)
t =1
1/ 2πη2 e−(1/2η )(z−1)
2 2
For m1
= m2 , we have equality, is
N N n−1 2
E αn1 m1 α∗k1 m1 αn2 m2 α∗k2 m2 M
2
e4σχ − 1 2 Ank
n=1 Ann +2 n=1 k=1
η2 =
2
= E αn1 m1 α∗k1 m1 E αn2 m2 α∗k2 m2 M2 N
n=1 Ann
(17) (21)
N N n−1 2
1 if n1 = k1 and n2 = k2 , e 4σχ2
−1 2 Ank
n=1 Ann +2 n=1 k=1
= ≥ N .
0 otherwise. MN 2
n=1 Ann
When m1 = m2 = m, we find that Equality holds in (21) when Ann = β, n = 1, . . . , N, for some
positive real number β. Furthermore, setting Ank = 0 for n
=
k minimizes the numerator in (21). Thus we get the bound
E αn1 m1 α∗k1 m1 αn2 m2 α∗k2 m2 = E αn1 m α∗k1 m αn2 m α∗k2 m
4σ 2 e4σχ − 1
2
e χ if n1 = k1 = n2 = k2 , η ≥
2
, (22)
MN
1 if n1 = k1
= n2 = k2 ,
= with equality when A = βI, where I is the N × N identity
1 if n2 = k1
= n1 = k2 ,
matrix. Also, Ann = Tt=1 |cn (t) − en (t)|2 = β, n = 1, . . . , N
0 otherwise. implies that β = Ed /N. Orthogonal designs [2] provide a
(18) method to construct STCs that satisfy the design criterion
A = (Ed /N)I and provide easy decoding at the receiver.
From these results it follows that the second moment of Therefore, STCs created from orthogonal designs maximize
d2 (c, e) is the mean-to-standard-deviation ratio of the received energy
difference between codewords.
E d4 (c, e) We start the upper bound derivation by noticing that A is
positive semi-definite [7] because it has an N × T square-root
M
4σχ2 2
N
N
N matrix B with ntth element cn (t) − en (t) such that A = BB†
= e Ann + Ann Akk [1]. Let λ1 , . . . , λN denote the nonnegative eigenvalues of A.
m1 =1 n=1 n=1 k=1 2
k
=n For e4σχ − 2 ≥ 0, an upper bound on η is found as follows:
N
N
M
N
N 2 N N N 2
e4σχ − 2 2
n=1 Ann + n=1 k=1 Ank
+ Akn Ank + Ann Akk η2 =
2
n=1 k=1 m2 =1 n=1 k=1 N
k
=n m2
=m1
M n=1 Ann
N N (23)
N
N
N
2
e4σχ − 2 Ank 2 + N N Ank 2
= M e 4σχ2
−2 A2nn + Ank + M tr(A) ,
2 2
≤
n=1 k=1 n=1 k=1
,
2
n=1 n=1 k=1 N
M n=1 Ann
(19)
with equality when
N N
A is a diagonal matrix. Using tr(A2 ) =
whence N
k=1 |Ank | =
2 2
n=1 n=1 λn , this upper bound becomes
2
σ 2 = var d2 (c, e) e4σχ − 1
2
N
λn
2
e4σχ − 1
η2 ≤ N ≤ , (24)
N
N
n−1 (20) M n=1 k=1 λk
M
= M e 4σχ2
−1 A2nn +2 Ank 2 .
n=1 n=1 k=1
with equality when A is a diagonal matrix of rank one. The
last inequality follows from
Notice that we have assumed that the path gains are log-
normally distributed, but we have only used the fact that 2
N
λn
N
λn
they are independent and identically distributed with zero N ≤ N = 1, (25)
mean, unit variance, and finite fourth moment. Therefore, n=1 k=1 λk n=1 k=1 λk
our method and results extend to all fading distributions that
satisfy these weaker conditions. which is met with equality when exactly one of the eigenval-
ues is nonzero.
2
3.3. Bounds on the normalized fading fluctuation For e4σχ − 2 < 0, an upper bound on η is found by sup-
pressing the first term in σ 2 :
The µ and σ that we have found are tied to the design matrix
N N 2 2
A by (14) and (20), respectively. We will now derive bounds 1 N
n=1 k=1 Ank λn 1
on their ratio η = σ/µ expressed in terms of A. η2 ≤ = ≤
M n=1 N λk
2 . (26)
N M
A lower bound, obtained via the Cauchy-Schwarz in- M n=1 Ann k=1
Space-Time Codes for Wireless Optical Communications 215
Equality in (26) requires A to be rank one with all its diagonal 10−1
terms equal to zero, an impossibility if A is positive semi-
definite. 10−2
The bounds on η are then 10−3
2 2
e4σχ − 1 max 1, e4σχ − 1 10−4
≤ η2 ≤ . (27)
Pr(c → e)
MN M
10−5
2
The lower bound is achieved when A = (Ed /N)I. If e4σχ −
1 ≥ 1, the upper bound is achieved when A has only one 10−6
nonzero diagonal element. The upper bound is unachieved 10−7
2
when e4σχ − 1 < 1.
10−8
3.4. Minimizing the probability of codeword error
10−9
To our knowledge, the codeword probability of error in (12) 10−4 10−3 10−2 10−1 100 101 102
does not have a closed-form solution. In this section, we will 2 2
var(d )/mean(d ) 2
= 1, 2
e4σχ − 1
ρ
2πη 1 − Q(1/η)
2 Pr c −→ e; , MSNR
MN
(32)
∞ max 1, e4σχ − 1
2
10−2 10
SNR = 4 dB
9 SNR = 5 dB
10−3
6
10−6
5
10−7
4
10−8 3
10−9 2
10−4 10−3 10−2 10−1 100 101 102 1 2 3 4 5 6 7 8
2 2
var(d )/mean(d ) 2 Number of receive antennas (M)
Pr(c → e)for M ∗ SNR = 18 dB
Limit as var(d 2 )/mean(d 2 )2 → 0 Figure 5: The smallest values of M and N such that (32) holds in
Lower bound = (exp(4σχ2 ) − 1)/MN moderate fading (σχ2 = 0.1).
Upper bound = max(exp(4σχ2 ) − 1, 1)/M
10
6
9 5
8 4
7 3
2
6 1 2 3 4 5 6 7 8
5 Number of receive antennas (M)
4. PERFORMANCE
evaluation that the value of η that maximizes Pr(c → e; η2 , ρ) In this section, we address the validity of the central limit the-
increases with increasing ρ. As a result, increasing N and M orem approximation and the performance of STCs on log-
cause the bounds on η given in (27) to become tighter, and normal channels.
the mode of Pr(c → e; η2 , ρ) to increase until (32) eventu-
ally holds. These threshold values of M and N are plotted 4.1. Performance bounds for orthogonal design STCs
in Figures 4, 5, and 6 for different values of SNR and fading
environments. For a given SNR, these plots show the small- We will now derive the pairwise probability of decoding
est number of transmit and receive apertures required for codeword c as codeword e assuming that the space-time code
A = (Ed /N)I to be the optimal design matrix. From these satisfies the design criterion A = (Ed /N)I, but without using
Space-Time Codes for Wireless Optical Communications 217
A closed form evaluation of the frustration function 4.2. A lower bound on the probability
does not exist; therefore, we use a saddle-point integration of codeword error
method developed by Halme in [8] to numerically evalu- In Section 4.1, we derived lower and upper bounds on the
ate it. For the design criterion A = (Ed /N)I, Figure 7 com- probability of incorrectly decoding codeword c as codeword
pares the probability of codeword error in (34), the central e under the design criterion A = (Ed /N)I without using the
218 EURASIP Journal on Applied Signal Processing
100
These limits appear as circles in Figure 2 for MSNR = 8,
13, 15, 18 dB.
One can view this limit as the error probability of a one
10−1 transmit, M receive aperture system with no fading. In other
words, the large number of transmit apertures mitigates the
fading, and the only uncertainty in the decision process arises
10−2 σχ2 = 0.01 from the additive white Gaussian noise.
σχ2 = 0.1
σχ2 = 0.35 4.4. An orthogonal design example:
10−3
100 101 102 the Alamouti scheme
Aperture product Alamouti in [9] proposed a simple transmit diversity tech-
nique using two transmit apertures (N = 2), two time-slots
Figure 8: Difference in SNR required to achieve a 10−6 pairwise (T = 2), M receive apertures, and a complex QAM signal
codeword error probability between the actual lognormal error ex- constellation of size 2b . During the first time-slot, 2b bits
pression in (34) and its central limit theorem approximation in arrive, determining two signal constellation points, s1 and s2
(12). that are transmitted simultaneously on the first and second
apertures, respectively. During the second time-slot, the first
aperture transmits −s∗2 , while the second sends s∗1 . In other
central limit theorem approximation for d2 (c, e). In this sec- words, this STC consists of all the codewords of the form
tion, we derive a lower bound on this probability of error c = (c1 (1), c1 (2), c2 (1), c2 (2)) = (s1 , −s∗2 , s2 , s∗1 ) where s1 and
without using the central limit theorem approximation that s2 range over all possible signal constellation points. Tarokh
is valid for an arbitrary design matrix A. Using the Cauchy- in [2] showed that the Alamouti scheme is an example of a
Schwarz inequality on (6) gives STC created from a complex orthogonal design.
The design matrix of this STC for two codewords c =
M
T
N N (c1 , −c2∗ , c2 , c1∗ ) and e = (e1 , −e2∗ , e2 , e1∗ ) satisfies our design cri-
d2 (c, e) ≤ αnm 2 ek (t) − ck (t)2
teria A = (Ed /2)I, where Ed = 2|c1 − e1 |2 + 2|c2 − e2 |2 is the en-
m=1 t =1 n=1 k=1
ergy difference between the codewords. The performance of
M
N 2 this code for pairs of codewords is shown in Figure 7 for eight
= Ed αnm (38) receive apertures (M = 8) in moderate fading (σχ2 = 0.1).
m=1 n=1 Orthogonal designs have the property that the sym-
MN bol
T
sequences on each aperture are orthogonal, that is,
= Ed e2χk , ∗
t =1 cn (t)ck (t) = 0 for n
= k. As a result, space-time
k=1 codes created from orthogonal designs, such as the Alamouti
scheme, have a simple decoding algorithm. Rewriting the de-
where we have renumbered the sum of the MN indepen-
cision metric in (2) as
dent lognormal random variables as in Section 4.1. Follow-
ing a similar derivation to that in Section 4.1, a lower bound
M
T N
N
on the probability of error for any design matrix is ĉ = argmin rm (t)2 + αnm α∗km cn (t)ck∗ (t)
c∈NT m=1 t =1 n=1 k=1
MN
1 SNR
Pr(c −→ e) ≥ Fr , 0; σχ . (39)
N
4 2 − 2 Re rm (t)∗
αnm cn (t)
n=1
For a large number of transmit apertures, N, this bound can
N
T
M 2
be quite loose, confer the orthogonal design bound in (37). = argmin αnm cn (t)2
c∈NT n=1 t =1 m=1
4.3. Infinite transmit diversity performance limit
M
If we fix the energy difference between codewords, Ed , and − 2 Re rm∗ (t)αnm cn (t)
have enough receive apertures, M, such that (32) holds, then m=1
A = (Ed /N)I minimizes the pairwise error probability, and (41)
this design matrix gives η2 = (exp(4σχ2 ) − 1)/MN. As we in-
crease the number of transmit apertures, N, we see that η ap- shows that joint detection of (c1 (1), . . . , cN (T)) is equivalent
proaches zero, and hence (30) provides a performance limit to decoding each individual symbol, cn (t), separately. The
Space-Time Codes for Wireless Optical Communications 219
structure of the Alamouti STC allows for further simplifica- [8] S. J. Halme, B. K. Levitt, and R. S. Orr, “Bounds and ap-
tion, and the decision rules become proximations for some integral expression involving lognormal
statistics,” Technical report, MIT Res. Lab. Electron. Quart.
2
M 2 Prog. Rept., 1969.
ŝ1 = argmin −1+ αnm |s|2 [9] S. M. Alamouti, “A simple transmit diversity technique for
s∈ n=1 m=1
wireless communications,” IEEE Journal on Selected Areas in
Communications, vol. 16, no. 8, pp. 1451–1458, 1998.
M
2
+ s − r (1)α∗1m + rm∗ (2)α2m ,
m=1 m
(42) Shane M. Haas is a Ph.D. candidate in
Electrical Engineering at the Massachusetts
2 M 2
ŝ2 = argmin − 1 + αnm |s|2 Institute of Technology. He received B.S.
s∈ n=1 m=1 and M.S. degrees in Electrical Engineer-
ing in 1998, 1999 and B.S. and M.A. de-
M
2
grees in Mathematics from the University of
+ s − r (1)α∗2m + rm∗ (2)α1m .
m=1 m Kansas with honors and highest distinction
in 1998, 1999, respectively. His doctoral re-
search concerns the theoretical and experi-
5. CONCLUSIONS mental study of coding for and the capacity
of optical communication through the turbulent atmosphere. Mr.
In this paper, we presented a framework for developing Haas is a National Defense Science and Engineering Graduate Fel-
space-time codes for an atmospheric optical heterodyne low, National Science Foundation Fellow, and a Goldwater Scholar.
communication system. Through a central limit theorem ap- In 1998, he was named the Outstanding Senior in both the Univer-
proximation, we found that the design criterion A = (Ed /N)I sity of Kansas School of Engineering and the Department of Elec-
trical Engineering and Computer Science. As an undergraduate, he
minimized the pairwise probability of codeword error for
was supported by the National Science Foundation Research Ex-
large numbers of apertures. Although developed for lognor- perience for Undergraduates program for four consecutive years.
mal fading, this method generalizes to other fading distribu- From 1997 to 2000, he worked at Flint Hills Scientific, L.L.C., de-
tions in which the fades are zero-mean and independent. veloping an adaptive digital signal processing algorithm to predict
Our design criterion also satisfies the rank and determi- and detect the onset of epileptic seizures. He is currently a graduate
nant criteria presented in [1] for Rayleigh channels. Further- research and teaching assistant at the Laboratory of Information
more, orthogonal designs provide a method of constructing and Decision Systems and the Research Laboratory of Electronics
STCs that satisfy our criterion, and require only linear pro- at MIT.
cessing at the receiver [2]. Jeffrey H. Shapiro received the S.B., S.M.,
E.E., and Ph.D. degrees in Electrical Engi-
ACKNOWLEDGMENT neering from the Massachusetts Institute of
Technology in 1967, 1968, 1969, and 1970,
This research was supported in part by Defense Advanced respectively. As a graduate student he was
Research Projects Agency Grant MDA972-00-1-0012. a National Science Foundation Fellow, a
Teaching Assistant, and a Fannie and John
REFERENCES Hertz Foundation Fellow. His doctoral re-
search was a theoretical study of adaptive
[1] V. Tarokh, N. Seshadri, and A. R. Calderbank, “Space-time techniques for improved optical communi-
codes for high data rate wireless communication: performance cation through atmospheric turbulence. From 1970 to 1973, Dr.
criterion and code construction,” IEEE Transactions on Infor- Shapiro was an Assistant Professor of Electrical Sciences and Ap-
mation Theory, vol. 44, no. 2, pp. 744–765, 1998. plied Physics at Case Western Reserve University. From 1973 to
[2] V. Tarokh, H. Jafarkhani, and A. R. Calderbank, “Space-time 1985, he was an Associate Professor of Electrical Engineering at
block codes from orthogonal designs,” IEEE Transactions on MIT, and in 1985, he was promoted to Professor of Electrical En-
Information Theory, vol. 45, no. 5, pp. 1456–1467, 1999. gineering. From 1989 until 1999 Dr. Shapiro served as Associate
[3] J. H. Shapiro and R. C. Harney, “Simple algorithms for calculat-
Department Head. In 1999, he became the Julius A. Stratton Pro-
ing optical communication performance through turbulence,”
fessor of Electrical Engineering. In 2001, Dr. Shapiro was appointed
in Proc. SPIE Contr. and Commun. Tech. in Laser Sys., vol. 295,
pp. 41–54, 1981. director of MIT’s Research Laboratory of Electronics. Dr. Shapiro’s
[4] J. H. Shapiro and R. C. Harney, “Burst-mode atmospheric op- research interests have centered on the application of communica-
tical communication,” in Proc. Nat. Telecommun. Conf. Record, tion theory to optical systems. He is best known for his work on
pp. 27.5.1–27.5.7, New York, NY, USA, 1980. the generation, detection, and application of squeezed-state light
[5] J. H. Shapiro, “Imaging and optical communication through beams, but he has also published extensively in the areas of at-
atmospheric turbulence,” in Laser Beam Propagation in the At- mospheric optical communication and coherent laser radar. Dr.
mosphere, J. W. Strohbehn, Ed., chapter 6, Springer, Berlin, Ger- Shapiro is a fellow of the Institute of Electrical and Electronics En-
many, 1978. gineers, of the Optical Society of America, and of the Institute of
[6] R. M. Gagliardi and S. Karp, Optical Communications, John Physics, and he is a member of SPIE (The International Society for
Wiley & Sons, New York, NY, USA, 1976. Optical Engineering). He has been an Associate Editor of the IEEE
[7] R. A. Horn and C. R. Johnson, Matrix Analysis, Cambridge Transactions on Information Theory and the Journal of the Optical
University Press, New York, NY, USA, 1988. Society of America.
220 EURASIP Journal on Applied Signal Processing