Академический Документы
Профессиональный Документы
Культура Документы
Fischer
Copyright 02002 John Wiley & Sons, Inc.
ISBN: 0-471-22410-3
Appendix A
Wirtinger Calculus
405
406 WIRTINGER CALCULUS
The point xopt,for which f(x) is maximum’ is obtained by taking the derivative of
f with respect to x and setting it to zero. For xoptthe following equation has to be
valid:
(A. 1.2)
Here we assume f(x) to be continuous in some region R,and the derivative to exist.
Whether the solution of the above equation actually gives a minimum or maximum
point has to be checked via additional considerations or by inspecting higher-order
derivatives.
Analogous to real functions, a derivative can be defined for complex functions of
a complex variable
f :C 3 zt+ w = f ( z ) E c (A.1.3)
as well:
(A.1.4)
The above limit has to exist for the infinitely many series {zn} which approach
ZO, i.e.,lim z , = 20. If f’(z)exists in a region R C C,the function f ( z ) is called
n+ 03
analytic, holomorphic, or regular in R. In the following, the relations between real
and complex derivatives are discussed.
A complex function can be decomposed into two real functions, each depending
on two real variables x and y, the real and imaginary parts of z :
(A.1.6a)
(A. 1.6b)
(A.1.7)
As already stated, we have to treat real functions of one or more complex variables.
Thus, let us now consider functions
(A.2.3)
where, for the moment, a1 and a2 are arbitrary real and nonzero constants. Equations
(A.2.2) and (A.2.3) are equivalent (and hence, of course, result in the same solution)
because real and imaginary part are orthogonal. As already stated, this procedure is
mainly intended to get a compact representation.
408 WlRTlNGER CALCULUS
Writing real part and imaginary part of z = x + j y as the tuple (z,y), we can
define the following differential operator:
(A.2.4)
This operator can, of course, also be applied to complex functions (A. 1.5). This is
reasonable, because real cost functions are often composed of complex components,
, an obvious definition of fl,z(z)E C.
e.g., f ( z ) = 1zI2 = z . z* fl(z) . f ~ ( z )with
Note, z* 2 x - j y denotes the complex conjugate of z = x j y. +
The remaining task is to chose suitable constants a1 and u2. The main aim is
to obtain a calculus that is easily remembered and easy to apply. As will be shown
later, the choice a1 = $ and a2 = -$ meets all requirements. To honor the work
of the Austrian mathematician Wilhelrn Wirtinger (1865-1945) who established this
differential calculus, we call it Wirtinger Calculus.
af a 1. a.f .a.f
-
dz - 2 (ax By)
--,- (A.2.5)
and
(A.2.6)
A.2.1 Examples
We now study some important examples. First, let f ( z ) = cz, where c 6 C is a
constant. Derivation of f ( z ) yields
and
az* 2
+ j "(>J; ") = 1(c +j (jc)) = 0 .
2
(A.2.8)
and
+ j d c ( x d ~ J y ) ) = -1( c + j ( - j c ) ) =c. (A.2.10)
dz* 2 dX 2
Next, we consider the function f = zz* = 1zI2 = x2 + y2. Here the derivatives
a
read:
-
az
a 1
zz* = -
2 (d(x2+ y2)
dX
-j
aY
+ y2)) = (22 - j 2 y ) = z* , (A.2.11)
and
d
- zz* = -
dz* 2
(
1 d(X2+Y2)
dX +j dy
y2)) = f (2x +j 2y) = z . (A.2.12)
CZ C 0
CZ* 0 c
ZZ* z* z
Note that using the Wirtinger Calculus differentiation is formally done as with
real functions. Moreover, and somewhat unexpected, z* is formally considered
as a constant when derivating with respect to z and vice versa. It is also easy
to show that the sum, product, and quotient rules still hold. For example, given
f ( z ) = f l ( z ) . f2(z),we obtain
a
-f1(z) .f2(z) =
az
-j afl(.)f2(z) -j j l ( z ) ~ )
dY
(A.2.13)
410 WIRTINGER CALCULUS
A.2.2 Discussion
The Wirtinger derivative can be considered to lie inbetween the real derivative of a
real function and the complex derivative of a complex function. Rewriting (A.2.5)
and (A.2.6),we arrive at:
(A.2.15a)
a f -= o (A.2.15b)
dz*
On the one hand, equation (A.2.15a) states that for holomorphic functions the
Wirtinger derivative with respect to z agrees with the ordinary derivativeof a complex
function (cf. (A.l.7)). On the other hand, (A.2.15b) can be interpreted in the way
that holomorphic functions do not formally depend on z * .
Contrary to the usual complex derivative, the Wirtinger derivative exists for all
functions, in particular nonholomorphic ones, such as real functions. Since both
operators and & are merely a compact notation incorporating two real differential
quotients, they can be applied to arbitrary functions of complex variables. For
nonholomorphic functions, $ # 0 usually holds, and thus either the derivative with
respect to z or z* can be used for optimization. The actual cost functions determines
GRADIENTS 411
which one is more advantageous; if quadratic forms are considered, the operator
is preferable.
To summarize, it should again be emphasized, that, because of its compact no-
tation, Wirtinger Calculus is very well suited for optimization in engineering. It
circumvents a separate inspection of real part and imaginary part of the cost function.
Because of the simple arithmetic rules-mostly it can be calculated as known from
real functions-the Wirtinger Calculus is very clear.
A.3 GRADIENTS
For the majority of applications the cost function does not only depend on one, but
on many variables, e.g., we have f : C" 3 z = [ z l ,z2,. . . ,znIT c-) w = f(z)E IR.
For optimization, all n partial derivatives with respect to the complex variables
z1 = 2 1 + jy1 through z, = 2, +
jy, have to be calculated. Usually, these
derivatives are again combined into a vector, the so called gradient:
A
which, in the optimum, has to equal the zero vector 0 = [0, 0, . . . , 0IT. Wirtinger
Calculus is especially well suited for such multidimensional functions, because here
only with a great effort can the real part and the imaginary part be separated and in-
spected independently. Using the above definitions of the partial derivatives ((A.2.5)
and (A.2.6)), we arrive at simple arithmetic rules, now expressed using vectors and
matrices.
A.3.1 Examples
We now again study some important examples. First, let f(z)= cTz = c:=l
c,z, or
f(r) = cTz* = c,=l
n
czz:, respectively, with c = [ c ~c2,.
, . . , c,IT and c, constant.
It is easy to prove the following properties for gradients:
-
' Tc z=c, -' c T
z*=o,
az 8% (A.3.2)
L cT z =o, Z cT z * = c .
az* az*
*Here we use column vectors, but the same considerations also apply to row vectors.
4f2 WlRTfNGER CALCULUS
A.3.2 Discussion
The gradient with respect to the Wirtinger derivatives, is related to the gradient
(A.3.5)
(Vf(Z))* = 2-
af ( z ) . (A.3.7)
8%
The first disadvantage of definition (A.3.6) compared to Wirtinger Calculus is that
an undesired factor of 2 occurs. In particular, if the chain rule is applied 1 times,
the result is artificially multiplied by 2l. Second, the calculus is not very elegant,
because the gradient of, e.g., f ( z ) = cTz is V(cTz) = 0, but that of f ( z ) = cTz*
calculates to V(cTz*) = 2%. Hence, because of its much clearer arithmetic rules,
we exclusively apply the Wirtinger Calculus.
GRADIENTS 413
REFERENCES