Академический Документы
Профессиональный Документы
Культура Документы
1 Introduction
Stator
Bearings
Stator
Conductors
Rotor
Air
Gap
Rotor
Shaft End Windings Conductors
This section of notes discusses some of the fundamental processes involved in electric machinery.
In the section on energy conversion processes we examine the two major ways of estimating elec-
tromagnetic forces: those involving thermodynamic arguments (conservation of energy) and field
methods (Maxwell’s Stress Tensor). But first it is appropriate to introduce the topic by describing
a notional rotating electric machine.
Electric machinery comes in many different types and a strikingly broad range of sizes, from
those little machines that cause cell ’phones and pagers to vibrate (yes, those are rotating electric
machines) to turbine generators with ratings upwards of a Gigawatt. Most of the machines with
which we are familiar are rotating, but linear electric motors are widely used, from shuttle drives in
weaving machines to equipment handling and amusement park rides. Currently under development
are large linear induction machines to be used to launch aircraft. It is our purpose in this subject
to develop an analytical basis for understanding how all of these different machines work. We start,
however, with a picture of perhaps the most common of electric machines.
a fashion which is the same as the induction machine or which differ in ways which are easy to
reference to the induction machine.
Most (but not all!) machines we will be studying have essentially this morphology. The rotor
of the machine is mounted on a shaft which is supported on some sort of bearing(s). Usually, but
not always, the rotor is inside. I have drawn a rotor which is round, but this does not need to be
the case. I have also indicated rotor conductors, but sometimes the rotor has permanent magnets
either fastened to it or inside, and sometimes (as in Variable Reluctance Machines) it is just an
oddly shaped piece of steel. The stator is, in this drawing, on the outside and has windings. With
most of the machines we will be dealing with, the stator winding is the armature, or electrical
power input element. (In DC and Universal motors this is reversed, with the armature contained
on the rotor: we will deal with these later).
In most electrical machines the rotor and the stator are made of highly magnetically permeable
materials: steel or magnetic iron. In many common machines such as induction motors the rotor
and stator are both made up of thin sheets of silicon steel. Punched into those sheets are slots
which contain the rotor and stator conductors.
Figure 2 is a picture of part of an induction machine distorted so that the air-gap is straightened
out (as if the machine had infinite radius). This is actually a convenient way of drawing the machine
and, we will find, leads to useful methods of analysis.
Stator Core
Stator Conductors
In Slots
Air Gap
Rotor Conductors
In Slots
What is important to note for now is that the machine has an air gap g which is relatively
small (that is, the gap dimension is much less than the machine radius r). The air-gap also has a
physical length l. The electric machine works by producing a shear stress in the air-gap (with of
course side effects such as production of “back voltage”). It is possible to define the average air-
gap shear stress, which we will refer to as τ . Total developed torque is force over the surface area
times moment (which is rotor radius):
Power transferred by this device is just torque times speed, which is the same as force times
If we note that active rotor volume is , the ratio of torque to volume is just:
T
=2<τ >
Vr
Now, determining what can be done in a volume of machine involves two things. First, it is
clear that the volume we have calculated here is not the whole machine volume, since it does not
include the stator. The actual estimate of total machine volume from the rotor volume is actually
quite complex and detailed and we will leave that one for later. Second, we need to estimate the
value of the useful average shear stress. Suppose both the radial flux density Br and the stator
surface current density Kz are sinusoidal flux waves of the form:
√
Br = 2B0 cos (pθ − ωt)
√
Kz = 2K0 cos (pθ − ωt)
Note that this assumes these two quantities are exactly in phase, or oriented to ideally produce
torque, so we are going to get an “optimistic” bound here. Then the average value of surface
traction is:
1 2π
�
< τ >= Br Kz dθ = B0 K0
2π 0
The magnetic flux density that can be developed is limited by the characteristics of the magnetic
materials (iron) used. Current densities are a function of technology and are typically limited by
how much effort can be put into cooling and the temperature limits of insulating materials. In
practice, the range of shear stress encountered in electric machinery technology is not terribly
broad: ranging from a few kPa in smaller machines to about 100 kPa in very large, well cooled
machines.
It is usually said that electric machines are torque producing devices, meaning tht they are
defined by this shear stress mechanism and by physical dimensions. Since power is torque times
rotational speed, high power density machines necessarily will have high shaft speeds. Of course
there are limits on rotational speed as well, arising from centrifugal forces which limit tip velocity.
Our first step in understanding how electric machinery works is to understand the mechanisms
which produce forces of electromagnetic origin.
Pm = T Ω
3
Electro-
Pd = Pe − Pm
It will sometimes be convenient to employ the fact that, in most machines, dissipation is small
enough to approximate mechanical power with electrical power. In fact, there are many situations in
which the loss mechanism is known well enough that it can be idealized away. The “thermodynamic”
arguments for force density take advantage of this and employ a “conservative” or lossless energy
conversion system.
f
+ Magnetic Field
v x
- System
To start, consider some electromechanical system which has two sets of “terminals”, electrical
and mechanical, as shown in Figure 4. If the system stores energy in magnetic fields, the energy
stored depends on the state of the system, defined by (in this case) two of the identifiable variables:
flux (λ), current (i) and mechanical position (x). In fact, with only a little reflection, you should
be able to convince yourself that this state is a single-valued function of two variables and that the
energy stored is independent of how the system was brought to this state.
Now, all electromechanical converters have loss mechanisms and so are not themselves conser-
vative. However, the magnetic field system that produces force is, in principle, conservative in the
sense that its state and stored energy can be described by only two variables. The “history” of the
system is not important.
It is possible to chose the variables in such a way that electrical power into this conservative
system is:
dλ
P e = vi = i
dt
Similarly, mechanical power out of the system is:
dx
Pm = fe
dt
The difference between these two is the rate of change of energy stored in the system:
dWm
= Pe − Pm
dt
It is then possible to compute the change in energy required to take the system from one state to
another by: � a
Wm (a) − Wm (b) = idλ − f e dx
b
where the two states of the system are described by a = (λa , xa ) and b = (λb , xb )
If the energy stored in the system is described by two state variables, λ and x, the total
differential of stored energy is:
∂Wm ∂Wm
dWm = dλ + dx
∂λ ∂x
and it is also:
dWm = idλ − f e dx
So that we can make a direct equivalence between the derivatives and:
∂Wm
fe = −
∂x
In the case of rotary, as opposed to linear, motion, torque T e takes the place of force f e and
angular displacement θ takes the place of linear displacement x. Note that the product of torque
and angle has the same units as the product of force and distance (both have units of work, which
in the International System of units is Newton-meters or Joules.
In many cases we might consider a system which is electricaly linear, in which case inductance
is a function only of the mechanical position x.
λ(x) = L(x)i
In this case, assuming that the energy integral is carried out from λ = 0 (so that the part of the
integral carried out over x is zero),
λ 1 1 λ2
�
Wm = λdλ =
0 L(x) 2 L(x)
This makes
1 ∂ 1
f e = − λ2
2 ∂x L(x)
5
Note that this is numerically equivalent to
1 ∂
f e = − i2 L(x)
2 ∂x
This is true only in the case of a linear system. Note that substituting L(x)i = λ too early in the
derivation produces erroneous results: in the case of a linear system it produces a sign error, but
in the case of a nonlinear system it is just wrong.
R/2
g
x R CL
µ
N turns
µ0 πR2
P1 =
x
and the permeance of the radial clearance gap is, if the gap dimension is small compared with the
radius:
2µ0 πRℓ µ0 πR2
P2 = =
g g
The inductance of the coil system is:
N2 P1 P2 µ0 πR2 N 2
L= = N2 =
R1 + R2 P2 + P2 x+g
Magnetic energy is:
6
λ0 1 λ2 λ2 x + g
�
Wm = idλ = = 0
0 2 L(x) 2 µ0 πR2 N 2
∂Wm λ2 d 1
fe = − =− 0
∂x 2 dx L(x)
µ0 πR2 N i
λ = L(x)i =
x+g
and then total force may be seen to be:
µ0 πR2 N 2 i2
fe = −
(x + g)2 2
The force is ‘negative’ in the sense that it tends to reduce x, or to close the gap.
ik dλk − f e dx
�
dWm =
k
which simply suggests that electrical input to the magnetic field energy storage is the sum (in this
case over the index k) of inputs from each of the coils. To find the total energy stored in the
system it is necessary to integrate over all of the coils (which may and in general will have mutual
inductance).
�
Wm = i · dλ
Of course, if the system is conservative, Wm (λ1 , λ2 , . . . , x) is uniquely specified and so the actual
path taken in carrying out this integral will not affect the value of the resulting energy.
7
3.1.3 Coenergy
We often will describe systems in terms of inductance rather than its reciprocal, so that current,
rather than flux, appears to be the relevant variable. It is convenient to derive a new energy
variable, which we will call co-energy, by:
�
′
Wm = λi ii − Wm
i
and in this case it is quite easy to show that the energy differential is (for a single mechanical
variable) simply:
λk dik + f e dx
�
′
dWm =
k
so that force produced is:
∂Wm′
fe =
∂x
Stator
A’
Gap C B
θ
B’
Rotor F’ C’
F
A
A µ
F F’
C’ B’
B C
µ A’
Consider a simple electric machine as pictured in Figure 6 in which there is a single winding
on a rotor (call it the field winding and a polyphase armature with three identical coils spaced at
uniform locations about the periphery. We can describe the flux linkages as:
if this is the case, so that techniques have been developed to make those flux linkages very nearly
sinusoidal. We will see some of these techniques in later chapters of these notes. For the moment,
we will simply assume these dependencies. In addition, we assume that the rotor is magnetically
’round’, which means the stator self inductances and the stator phase to phase mutual inductances
are not functions of rotor position. Note that if the phase windings are identical (except for their
angular position), they will have identical self inductances. If there are three uniformly spaced
windings the phase-phase mutual inductances will all be the same.
Now, this system can be simply described in terms of coenergy. With multiple excitation it
is important to exercise some care in taking the coenergy integral (to ensure that it is taken over
a valid path in the multi-dimensional space). In our case there are actually five dimensions, but
only four are important since we can position the rotor with all currents at zero so there is no
contribution to coenergy from setting rotor position. Suppose the rotor is at some angle θ and that
the four currents have values ia0 , ib0 , ic0 and if 0 . One of many correct path integrals to take would
be:
� ia0
′
Wm = La ia dia
0
� ib0
+ (Lab ia0 + La ib ) dib
0
� ic0
+ (Lab ia0 + Lab ib0 + La ic ) dic
0
� if 0 �
2π 2π
�
+ M cos(pθ)ia0 + M cos(pθ − )ib0 + M cos(pθ + )ic0 + Lf if dif
0 3 3
The result is:
1 �2 �
′
Wm = La ia0 + i2b0 + i2co + Lab (iao ib0 + iao ic0 + ico ib0 )
2
2π 2π 1
� �
+M if 0 ia0 cos(pθ) + ib0 cos(pθ − ) + ic0 cos(pθ + ) + Lf i2f 0
3 3 2
Since there are no variations of the stator inductances with rotor position θ, torque is easily
given by:
∂Wm ′ �
2π 2π
�
Te = = −pM if 0 ia0 sin(pθ) + ib0 sin(pθ − ) + ico sin(pθ + )
∂θ 3 3
ia0 = Ia cos ωt
2π
� �
ib0 = Ia cos ω t −
3
2π
� �
ic0 = Ia cos ω t +
3
if 0 = If
9
and assume the rotor is turning at synchronous speed:
pθ = ωt + δi
1
Noting that cos x sin y
= 2 sin(x − y) +
12 sin(x + y), we find the torque expression above to be:
1 1
� �
Te = −pM Ia If sin δi + sin (2ωt + δi )
2 2
1 1 4π
� � ��
+ sin δi + sin 2ωt + δi −
2 2 3
1 1 4π
� � ��
+ sin δi + sin 2ωt + δi +
2 2 3
The sine functions on the left add and the ones on the right cancel, leaving:
3
Te = − pM Ia If sin δi
2
And this is indeed one way of looking at a synchronous machine, which produces steady torque
if the rotor speed and currents all agree on frequency. Torque is related to the current torque angle
δi . As it turns out such machines are not generally run against current sources, but we will take
up actual operation of such machines later.
�
� · ∂B − E
� �
� ·∇×E
H � −E
� ·∇×H
� =∇· E� ×H
� = −H � · J�
dt
On the left of this expression is the divergence of electromagnetic energy flow:
�=E
S � ×H
�
10
Here, S � is the celebrated Poynting flow which describes power in an electromagnetic field
sysstem. (The units of this quantity is watts per square meter in the International System). On
the right hand side are two terms: H � · ∂ B~ is rate of change of magnetic stored energy. The second
dt
term, E� · J� looks a lot like power dissipation. We will discuss each of these in more detail. For the
moment, however, note that the divergence theorem of vector calculus yields:
� ��
� =
S
∇ · Sdv � · �nda
volume
that is, the volume integral of the divergence of the Poynting energy flow is the same as the Poynting
energy flow over the surface of the volume in question. This integral becomes:
�
� �
� · ∂B
�� �
� · �nda = −
S � · J� + H
E dv
volume ∂t
which is simply a realization that the total energy flow into a region of space is the same as the
volume integral over that region of the rate of change of energy stored plus the term that looks like
dissipation. Before we close this, note that, if there is motion of any material within the system, we
can use the empirical expression for transformation of electric field between observers moving with
respect to each other. Here the ’primed’ frame is moving with respeect to the ’unprimed’ frame
with the velocity �v
�′ = E
E � + �v × B
�
This transformation describes, for example, the motion of a charged particle such as an electron
under the influence of both electric and magnetic fields. Now, if we assume that there is material
motion in the system we are observing and if we assign �v to be the velocity of that material, so that
E� ′ is measured in a frame in which thre is no material motion (that is the frame of the material
itself), the product of electric field and current density becomes:
� � � � � �
� · J� = E
E � · J� = E
� ′ − �v × B � ′ · J� − �v × B � ′ · J� + �v · J� × B
� · J� = E �
In the last step we used the fact that in a scalar triple product the order of the scalar (dot)
and vector (cross) products can be interchanged and that reversing the order of terms in a vector
(cross) product simply changes the sign of that product. Now we have a ready interpretation for
what we have calculated:
If the ’primed’ coordinate system is actually the frame of material motion,
� ′ · J� = 1 |J�|2
E
σ
which is easily seen to be dissipation and is positive definite if material conductivity σ is positive.
The last term is obviously conversion of energy from electromagnetic to mechanical form:
� �
�v · J� × B
� = �v · F�
� = J� × B
F �
11
This is the Lorentz Force Law, which describes the interaction of current with magnetic field
to produce force. It is not, however, the complete story of force production in electromechanical
systems. As we learned earlier, changes in geometry which affect magnetic stored energy can also
produce force. Fortunately, a complete description of electromechanical force is possible using only
magnetic fields and that is the topic of our next section.
�−1 H
� �
F� = J� × B � ·H
� ∇µ
2
� is the magnetic field intensity and µ is the permeability.
where H
Now, note that current density is the curl of magnetic field intensity, so that:
� −1 H
�� � �
�
F = � × µH
∇×H � ·H
� ∇µ
2
� −1 H
� � � �
= µ ∇×H� ×H � ·H
� ∇µ
2
And, since:
� − 1∇ H
� � � � � �
� ×H
∇×H � = H� ·∇ H � ·H
�
2
force density is:
� − 1 µ∇ H � −1 H
� � � � � �
F� = µ H� ·∇ H � ·H � ·H
� ∇µ
2� 2
� �
� ·∇ H� −∇ 1 �
� ·H
�
��
= µ H µ H
2
This expression can be written by components: the component of force in the i’th dimension is:
� �
�� ∂ ∂ 1 � 2
�
Fi = µ Hk Hi − µ H
k
∂xk ∂xi 2 k k
12
The last term in this expression is easily shown to be divergence of magnetic flux density, which is
zero:
� ∂
∇·B � = µHk = 0
k
∂xk
Using this, we can write force density in a more compact form as:
� �
∂ µ � 2
Fk = µHi Hk − δik Hn
∂xi 2 n
The interpretation of all of this is less difficult than the notation suggests. This field description
of forces gives us a simple picture of surface traction, the force per unit area on a surface. If we
just integrate this traction over the area of some body we get the whole force on the body.
Note one more thing about this notation. Sometimes when subscripts are repeated as they are
here the summation symbol is omitted. Thus we would write τi = k Tik nk = Tik nk .
�
13
K
g z
111111111111111111111
000000000000000000000
µ y
000000000000000000000
111111111111111111111
000000000000000000000
111111111111111111111
000000000000000000000
111111111111111111111
u
x
µ
σ K
s s
Note that we are ignoring some important effects, such as those arising from finite length of the
stator and of the shuttle. Such effects can be quite important, but we will leave those until later,
as they are what make linear motors interesting.
Viewed from the shuttle for which the dimension in the direction of motion is x′ − x − ut′ , the
relative frequency is:
ωt − kx = (ω − ku) t − kx′ = ωs t − kx′
Now, since the shuttle surface can support a surface current and is excited by magnetic fields
which are in turn excited by the stator currents, it is reasonable to assume that the form of rotor
current is the same as that of the stator:
n ′
o
Ks = Re K s ej(ωs t−kx )
−jωs B y = jkE ′z
or
ωs
E ′z = − µ0 H y
k
A bit of algebraic manipulation yields expressions for the complex amplitudes of rotor surface
current and gap magnetic field:
−j µ0kω2sgσs
Ks = Kz
1 + j muk02ωgs σs
j Kz
Hy =
kg 1 + j muk02ωgs σs
14
To find surface traction, the Maxwell Stress Tensor can be evaluated at a surface just below the
stator (on this surface the x- directed magnetic field is simply H x = K z . Thus the traction is
τx = Txy = µ0 Hx Hy
and the average of this is:
µ0 � �
< τx >= Re H x H y ∗
2
This is:
2 µ0 ωs σs
µ0 1 |K z | k2 g
< τx >=
2 kg 1 + µ0 ωs σs 2
� �
k2 g
Now, if we consider electromagnetic power flow (Poynting’s Theorem): in the y- direction:
Sy = Ez Hx
And since in the frame of the shuttle E ′z = − ωks µ0 H y
µ0 ωs σs
1 ω s µ0 k2 g 2 ωs
< Sy′ >= − �2 |K z | = − < τx >
2 k kg 1 + µ0 ωs σs k
�
k2 g
τ θ = µ0 H r H θ
Assuming that there are no fluxes inside the surface of the rotor, simply integrating this over
the surface gives azimuthal force. In principal this is the same as surrounding the surface of the
rotor by a continuum of infinitely small boxes, one surface just outside the rotor and with a normal
facing outward, the other surface just inside with normal facing inward. (Of course the MST is
zero on this inner surface). Then multiplying by radius (moment arm) gives torque. The last step
is to note that, if the rotor is made of highly permeable material, the azimuthal magnetic field just
outside the rotor is equal to surface current density.
15
5 Generalization to Continuous Media
Now, consider a system with not just a multiplicity of circuits but a continuum of current-carrying
paths. In that case we could identify the co-energy as:
� �
′
Wm = λ(�a)dJ� · d�a
area
where that area is chosen to cut all of the current carrying conductors. This area can be picked to
be perpedicular to each of the current filaments since the divergence of current is zero. The flux λ
is calculated over a path that coincides with each current filament (such paths exist since current
has zero divergence). Then the flux is:
�
λ(�a) = � · d�n
B
where d�ℓ is the path around the current filament. This implies directly that the coenergy is:
� � �
′
Wm = � · d�ℓdJ� · d�a
A
area J
Now: it is possible to make d�ℓ coincide with d�a and be parallel to the current filaments, so that:
�
′
Wm = � · dJdv
A �
vol
Now, the effect of the magnetization is to act as if there were a current (called an amperian current)
with density:
J�∗ = ∇ × M �
Note that this amperian current “acts” just like ordinary current in making magnetic flux density.
Magnetic co-energy is: �
Wm ′
= A� · ∇ × dM
� dv
vol
16
� � � � � �
� ×D
Next, note the vector identity ∇ · C � =D
� · ∇×C
� −C
� · ∇×D
� Now,
� � � � � �
′
Wm = � × dM
−∇ · A � dv + � · dM
∇×A � dv
vol vol
� = ∇ × A:
Then, noting that B �
�� �
′
Wm � × dM
=−
A � d�s + � · dM
B � dv
vol
The first of these integrals (closed surface) vanishes if it is taken over a surface just outside the
magnet, where M� is zero. Thus the magnetic co-energy in a system with only a permanent magnet
source is �
′
Wm = � · dM
B � dv
vol
Adding current carrying coils to such a system is done in the obvious way.
17
1 Introduction
Magnetic Circuits offer, as do electric circuits, a way of simplifying the analysis of magnetic field
systems which can be represented as having a collection of discrete elements. In electric circuits
the elements are sources, resistors and so forth which are represented as having discrete currents
and voltages. These elements are connected together with ‘wires’ and their behavior is described
by network constraints (Kirkhoff’s voltage and current laws) and by constitutive relationships such
as Ohm’s Law. In magnetic circuits the lumped parameters are called ‘Reluctances’ (the inverse
of ‘Reluctance’ is called ‘Permeance’). The analog to a ‘wire’ is referred to as a high permeance
magnetic circuit element. Of course high permeability is the analog of high conductivity.
By organizing magnetic field systems into lumped parameter elements and using network con-
straints and constitutive relationships we can simplify the analysis of such systems.
2 Electric Circuits
First, let us review how Electric Circuits are defined. We start with two conservation laws: conser-
vation of charge and Faraday’s Law. From these we can, with appropriate simplifying assumptions,
derive the two fundamental circiut constraints embodied in Kirkhoff’s laws.
2.1 KCL
Conservation of charge could be written in integral form as:
dρf
�� �
J~ · ~nda + dv = 0 (1)
volume dt
This simply states that the sum of current out of some volume of space and rate of change of
free charge in that space must be zero.
Now, if we define a discrete current to be the integral of current density crossing through a part
of the surface:
��
ik = − J~ · ~nda (2)
surfacek
and if we assume that there is no accumulation of charge within the volume (in ordinary circuit
theory the nodes are small and do not accumulate charge), we have:
��
J~ · ~nda = −
�
ik = 0 (3)
k
1
which holds if the sum over the index k includes all current paths into the node. This is, of
course, KCL.
2.2 KVL
Faraday’s Law is, in integral form:
~ · d~ℓ = − d
� ��
E B~ · ~nda (4)
dt
where the closed loop in the left hand side of the equation is the edge of the surface of the
integral on the right hand side.
Now if we define voltage in the usual way, between points a and b for element k:
� bk
vk = ~ · d~ℓ
E (5)
ak
Then, if we assume that the right-hand side of Faraday’s Law (that is, magnetic induction) is
zero, the loop equation becomes:
�
vk = 0 (6)
k
This works for circuit analysis because most circuits do not involve magnetic induction in the
loops. However, it does form the basis for much head scratching over voltages encountered by
‘ground loops’.
J~ = σ E
~ (7)
Suppose, to start, we can identify a piece of stuff which has constant area and which is carrying
current over some finite length, as shown in Figure 1. Assume this rod is carrying current density
J~ (We won’t say anything about how this current density managed to get into the rod, but assume
that it is connected to something that can carry current (perhaps a wire....). Total current carried
by the rod is simply
I = |J|A
and then voltage across the element is:
~ · dℓ = ℓ I
�
v= E
σA
from which we conclude the resistance is
V ℓ
R= =
I σA
2
J E
Of course we can still employ the lumped parameter picture even with elements that are more
complex. Consider the annular resistor shown in Figure 2. This is an end-on view of something
which is uniform in cross-section and has depth D in the direction you can’t see. Assume that
the inner and outer elements are very good conductors, relative to the annular element in be-
tween. Assume further that this element has conductivity σ and inner and outer radii Ri and Ro ,
respectively.
Electrodes
+
Resistive Material
Now, if the thing is carrying current from the inner to the outer electrode, current density would
be:
I
J~ = ~ir Jr (r) =
2πDr
Electric field is
Jr I
Er = =
σ 2πDrσ
Then voltage is
� Ro I Ro
v= Er (r) = log
Ri 2πσD Ri
so that we conclude the resistance of this element is
Ro
log Ri
R=
2πσD
3
3 Magnetic Circuit Analogs
In the electric circuit, elements for which voltage and current are defined are connected together
by elements thought of as ‘wires’, or elements with zero or negligible voltage drop. The intercon-
nection points are ‘nodes’. In magnetic circuits the analogous thing occurs: elements for which
magnetomotive force and flux can be defined are connected together by high permeability magnetic
circuit elements (usually iron) which are the analog of wires in electric circuits.
which means that the total amount of flux coming out of a region of space is always zero.
Now, we will define a quantity which is sometimes called simply ‘flux’ or a ‘flux tube’. This
might be thought to be a collection of flux lines that can somehow be bundled together. Generally
it is the flux that is identified with a magnetic circuit element. Mathematically it is:
��
Φk = ~ · ~nda
B (9)
In most cases, flux as defined above is carried in magnetic circuit elements which are made of high
permeability material, analogous to the ‘wires’ of high conductivity material which carry current
in electric circuits. It is possible to show that flux is largely contained in such high permeability
materials.
If all of the flux tubes out of some region of space (’node’) are considered in the sum, they must
add to zero: �
Φk = 0 (10)
k
Where, as for Faraday’s Law, the closed contour on the left is the periphery of the (open) surface
on the right. Now we define what we call Magnetomotive Force, in direct analog to ‘Electromotive
Force’, (voltage).
� bk
Fk = ~ · d~ℓ
H (12)
ak
Further, define the current enclosed by a loop to be:
��
F0 = J~ · ~nda (13)
4
Note that the analog is not exact as there is a source term on the right hand side whereas KVL
has no source term. Note also that sign counts here. The closed integral is taken in such direction
so that the positive sense of the surface enclosed is positive (upwards) when the surface is to the
left of the contour. (This is another way of stating the celebrated ‘right hand rule’: if you wrap
your right hand around the contour with your fingers pointing in the direction of the closed contour
integration, your thumb is pointing in the positive direction for the surface).
Φ = BA
where B is the flux density crossing the gap and A is the gap area. Note that we are ignoring
‘fringing’ fields in this simplified analysis. This neglect often requires correction in practice. Since
the permeability of free space is µ0 , (assuming the gap is indeed filled with ’free space’), magnetic
field intensity is
B
H=
µ0
and gap MMF is just magnetic field intensity times gap dimension. This, of course, assumes that
the gap is uniform and that so is the magnetic field intensity:
B
F = g
µ0
Which means that the reluctance of the gap is the ratio of MMF to flux:
F g
R= =
Φ µ0 A
Φ
y
Area A
x
µ
g
5
3.4 Simple Case
Consider the magnetic circuit situation shown in Figure 4. Here there is a piece of highly permeable
material shaped to carry flux across a single air-gap. A coil is wound through the window in the
magnetic material (this shape is usually referred to as a ‘core’). The equivalent circuit is shown in
Figure 5.
Region 1
Region 2
I
Note that in Figure 4, if we take as the positive sense of the closed loop a direction which goes
vertically upwards through the leg of the core through the coil and then downwards through the
gap, the current crosses the surface surrounded by the contour in the positive sense direction.
F = N I
Φ
−
6
In this picture, note that magnetic field H ~ parallel to the surface must be the same inside the
material as it is outside. Consider Ampere’s Law carried out about a very thin loop consisting of
the two arrows drawn at the top boundary of the material in Figure 6 with very short vertical paths
joining them. If there is no current singularity inside that loop, the integral around it must be zero
which means the magnetic field just inside must be the same as the magnetic field outside. Since
the material is very highly permeable and B ~ = µH,~ and ’highly permeable’ means µ is very large,
~ must be quite small. Thus the magnetic circuit has small magnetic field
unless B is really large, H
H and therfore flux densities parallel to and just outside its boundaries aer also small.
B is perpendicular
At the surface of the magnetic material, since the magnetic field parallel to the surface must be
very small, any flux lines that emerge from the core element must be perpendicular to the surface
as shown for the gap region in Figure 7. This is true for region 1 as well as for region 2, but note
that the total MMF available to drive fields across the gap is the same as would produce field
lines from the area of region 1. Since any lines emerging from the magnetic material in region 1
would have very long magnetic paths, they must be very weak. Thus the magnetic circuit material
largely confines flux, with only the relatively high permeance (low reluctance) gaps carrying any
substantive amount of flux.
F0 = N I = F1 + F2 = 2RΦ
and then
NI µ0 AN I
Φ= =
2R 2g
7
µ g
Area A
N
Turns
Φ
I
8
− F +
N
Turns
Φ
I
1 Introduction
Losses in electric machines arise from conduction and magnetic hysteresis. Conduction losses are
attributed to straightforward transport conduction and to eddy currents. Transport losses are
relatively easy to calculate so we will not pay them much attention. Eddy currents are more
interesting and result in frequency dependent conduction losses in machines.
Eddy currents in linear materials can often be handled rigorously, but eddy currents in saturat-
ing material are more difficult and are often handled in a heuristic fashion. We present here both
analytical and semi-emiprical ways of dealing with such losses.
We start with surface impedance: the ratio of electric field to surface current. This is important
not just in calculating machine losses, but also in describing how some machines operate.
∂H
∇ 2 H = µ0 σ
∂t
1
Conductive Region
Hx Kz
y
x
Permeable
Surface
In view of the boundary condition at the back surface of the material, taking that point to be
y = 0, a general solution for the magnetic field in the material is:
� �
Hx = Re A sinh αyej(ωt−kx)
k
� �
Hy = Re j A cosh αyej(ωt−kx)
α
where the coefficient α satisfies:
α2 = jωµ0 σ + k2
and note that the coefficients above are chosen so that H has no divergence.
Note that if k is small (that is, if the wavelength of the excitation is large), this spatial coefficient
α becomes
1+j
α=
δ
where the skin depth is: �
2
δ=
ωµ0 σ
Faraday’s law:
∂B
∇×E =−
∂t
gives:
ω
E z = −µ0 H y
k
Now: the “surface current” is just
K s = −H x
so that the equivalent surface impedance is:
Ez ω
Z= = jµ0 coth αT
−H x α
A pair of limits are interesting here. Assuming that the wavelength is long so that k is negligible,
then if αT is small (i.e. thin material),
ω 1
Z → jµ0 2
=
α T σT
2
On the other hand as αT → ∞,
1+j
Z→
σδ
Next it is necessary to transfer this surface impedance across the air-gap of a machine. So,
assume a new coordinate system in which the surface of impedance Z s is located at y = 0, and we
wish to determine the impedance Z = −E z /H x at y = g.
In the gap there is no current, so magnetic field can be expressed as the gradient of a scalar
potential which obeys Laplace’s equation:
H = −∇ψ
and
∇2 ψ = 0
Ignoring a common factor of ej(ωt−kx) , we can express H in the gap as:
� �
H x = jk ψ + eky + ψ − e−ky
� �
H y = −k ψ + eky − ψ − e−ky
It is useful to note that, in the limit of Z s → ∞, this expression approaches the gap impedance
ωµ0
Zg = j
k2 g
and, if the gap is small enough that kg → 0,
Z → Z g ||Z s
3 Iron
Electric machines employ ferromagnetic materials to carry magnetic flux from and to appropriate
places within the machine. Such materials have properties which are interesting, useful and prob-
lematical, and the designers of electric machines must deal with this stuff. The purpose of this
note is to introduce the most salient properties of the kinds of magnetic materials used in electric
machines.
We will be concerned here with materials which exhibit magnetization: flux density is something
other than B ~ = µ0 H.
~ Generally, we will speak of hard and soft magnetic materials. Hard materials
are those in which the magnetization tends to be permanent, while soft materials are used in
magnetic circuits of electric machines and transformers. Since they are related we will find ourselves
talking about them either at the same time or in close proximity, even though their uses are widely
disparite.
3.1 Magnetization:
It is possible to relate, in all materials, magnetic flux density to magnetic field intensity with a
consitutive relationship of the form:
� �
~ = µ0 H
B ~
~ +M
where magnetic field intensity H and magnetization M are the two important properties. Now,
in linear magnetic material magnetization is a simple linear function of magnetic field:
~ = χm H
M ~
~ = µ0 (1 + χm ) H
B ~
Note that in the most general case the magnetic susceptibility χm might be a tensor, leading
to flux density being non-colinear with magnetic field intensity. But such a relationship would still
be linear. Generally this sort of complexity does not have a major effect on electric machines.
Flux
Remanent Density
Flux Density B r
Saturation Flux
Density Bs
Magnetic
Coercive Field Field
Hc
Saturation
Field H s
which is the same as a volt-second. In SI the unit of flux density is the tesla (T), but many people
refer to the gauss (G), which has its origin in CGS. 10,000 G = 1 T. Now it gets worse, because
there is an English system measure of flux density generally called kilo-lines per square inch. This
is because in the English system the unit of flux is the line. 108 lines is equal to a weber. Thus a
Tesla is 64.5 kilolines per square inch.
The SI and CGS units of flux density are easy to reconcile, but the units of magnetic field
are a bit harder. In SI we generally measure H in amperes/meter (or ampere-turns per meter).
Often, however, you will see magnetic field represented as Oersteds (Oe). One Oe is the same as
the magnetic field required to produce one gauss in free space. So 79.577 A/m is one Oe.
In most useful magnetic materials the magnetic domaines tend to be somewhat “sticky”, and a
more-than-incremental magnetic field is required to get them to move. This leads to the property
called “hysteresis”, both useful and problematical in many magnetic systems.
Hysteresis loops take many forms; a generalized picture of one is shown in Figure 3. Salient
features of the hysteresis curve are the remanent magnetization Br and the coercive field Hc . Note
that the actual loop that will be traced out is a function of field amplitude and history. Thus there
are many other “minor loops” that might be traced out by the B-H characteristic of a piece of
material, depending on just what the fields and fluxes have done and are doing.
Hysteresis is important for two reasons. First, it represents the mechanism for “trapping”
magnetic flux in a piece of material to form a permanent magnet. We will have more to say about
that anon. Second, hysteresis is a loss mechanism. To show this, consider some arbitrary chunk of
material for which we can characterize an MMF and a flux:
�
F = NI = ~ · d~ℓ
H
V
� ��
Φ = dt = ~ · dA
B ~
N Area
6
Energy input to the chunk of material over some period of time is
� � � � ��
w= V Idt = F dΦ = ~ · d~ℓ
H ~ · dA
dB ~ dt
t
Now, imagine carrying out the second (double) integral over a continuous set of surfaces which
are perpendicular to the magnetic field H. (This IS possible!). The energy becomes:
� ���
w= ~ · dBdvol
H ~ dt
t
and, done over a complete cycle of some input waveform, that is:
���
w = Wm dvol
� vol
Wm = ~ · dB
H ~
t
That last expression simply expresses the area of the hysteresis loop for the particular cycle.
Generally, for most electric machine applications we will use magnetic material characterized
as “soft”, having as narrow a hysteresis loop (and therefore as low a hysteretic loss) as possible. At
the other end of the spectrum are “hard” magnetic materials which are used to make permanent
magnets. The terminology comes from steel, in which soft, annealed steel material tends to have
narrow loops and hardened steel tends to have wider loops. However permanent magnet technology
has advanced to the point where the coercive forces possible in even cheap ceramic magnets far
exceed those of the hardest steels.
7
y
t x
z
Now we can use Faraday’s law to determine the electric field and therefore current density in
the sheet. If the problem is uniform in the x- and z- directions,
∂E x
= −jω0 B0
∂y
Note also that, unless there is some net transport current in the x- direction, E must be anti-
symmetric about the center of the sheet. Thus if we take the origin of y to be in the center, electric
field and current are:
E x = −jωB0 y
J x = −jωB0 σy
Λ = N Φ = N AB0
and voltage is of course just V = jwL. Total power dissipated in this core would be:
1 2 2 2 V2
Pc = Aℓ ω B0 t σ =
12 Rc
A 12N2
Rc =
ℓ σt2
8
B
B0
Bx = B0 sign(Hx )
9
y
x
Bs
Separating Surface
Bs
Penetration
Depth
That is, in the region between the separating surface and the top of the material, electric field
Ez is uniform and magnetic field Hx is a linear function of depth, falling from its impressed value at
the surface to zero at the separating surface. Now: electric field is produced by the rate of change
of flux which is:
∂Φ ∂ζ
Ez = = 2Bx
∂t ∂t
Eliminating E, we have:
∂ζ Hs
2ζ =
∂t σBx
and then, if the impressed magnetic field is sinusoidal, this becomes:
dζ 2 H0
= | sin ωt|
dt σB0
This is easy to solve, assuming that ζ = 0 at t = 0,
�
2H0 ωt
ζ= sin
ωσB0 2
Now: the surface always moves in the downward direction (as we have drawn it), so at each half
cycle a new surface is created: the old one just stops moving at a maximum position, or penetration
depth: �
2H0
δ=
ωσB0
This penetration depth is analogous to the “skin depth” of the linear theory. However, it is an
absolute penetration depth.
The resulting electric field is:
2H0 ωt
Ez = cos 0 < ωt < π
σδ 2
This may be Fourier analyzed: noting that if the impressed magnetic field is sinusoidal, only the
time fundamental component of electric field is important, leading to:
8 H0
Ez = (cos ωt + 2 sin ωt + . . .)
3π σδ
10
Complex surface impedance is the ratio between the complex amplitude of electric and magnetic
field, which becomes:
E 8 1
Zs = z = (2 + j)
Hx 3π σδ
Thus, in practical applications, we can handle this surface much as we handle linear conductive
surfaces, by establishing a skin depth and assuming that current flows within that skin depth of
16
the surface. The resistance is modified by the factor of 3π and the “power factor” of this surface is
about 89 % (as opposed to a linear surface where the “power factor” is about 71 %.
Agarwal suggests using a value for B0 of about 75 % of the saturation flux density of the steel.
11
12
100
10
Loss, W/Lb
Flux Density
1
0.1 T
0.3 T
0.5 T
0.1 0.7 T
1.0 T
0.01
10 100 1000 10000
Frequency, Hz
Figure 8: Steel Sheet Core Loss Fit vs. Flux Density and Frequency
�
Q = Pa2 − P 2
Where q is the number of machine phases and V is phase voltage. Note that this picture is, strictly
speaking, only valid for the voltage and frequency for which the flux density was calculated. But
it will be approximately true for small excursions in either voltage or frequency and therefore
useful for estimating voltage drop due to exciting current and such matters. In design program
applications these parameters can be re-calculated repeatedly if necessary.
“Looking up” this data is a bit awkward for design studies, so it is often convenient to do a
“curve fit” to the published data. There are a large number of possible ways of doing this. One
method that has been found to work reasonably well for silicon iron is an “exponential fit”:
�ǫB � �ǫF
B f
�
P ≈ P0
B0 f0
This fit is appropriate if the data appears on a log-log plot to lie in approximately straight lines.
Figure 8 shows such a fit for the same steel sheet as the other figures.
For “apparent power” the same sort of method can be used. It appears, however, that the simple
exponential fit which works well for real power is inadequate, at least if relatively high inductions
are to be used. This is because, as the steel saturates, the reactive component of exciting current
rises rapidly. I have had some success with a “double exponential” fit:
�ǫ0 �ǫ1
B B
� �
VA ≈ VA0 + VA1
B0 B0
13
Table 1: Exponential Fit Parameters for Two Steel Sheets
To first order the reactive component of exciting current will be linear in frequency.
References
[1] W. MacLean, “Theory of Strong Electromagnetic Waves in Massive Iron”, Journal of Applied
Physics, V.25, No 10, October, 1954
[2] P.D. Agarwal, “Eddy-Current Losses in Solid and Laminated Iron”, Trans. AIEE, V. 78, pp
169-171, 1959
14
1 Introduction
The objective here is to develop a simple but physically meaningful model of the synchronous
machine, one of the major classes of electric machine. We can look at this model from several
different directions. This will help develop an understanding of analysis of machines, particularly
in cases where one or another analytical picture is more appropriate than others. Both operation
and sizing will be of interest here.
Along the way we will approach machine windings from two points of view. On the one hand,
we will approximate windings as sinusoidal distributions of current and flux linkage. Then we will
take a concentrated coil point of view and generalize that into a more realistic and useful winding
model.
Stator
µ→∞
R
K zs θ
µ→∞
r
K zr
g z
Rotor
The ‘rotor’ and ‘stator’ bodies are made of highly permeable material (we approximate this as
being infinite for the time being, but this is something that needs to be looked at carefully later).
We also assume that the rotor and stator have current distributions that are axially (z) directed
and sinusoidal:
KzS = KS cos pθ
KzR = KR cos p (θ − φ)
Here, the angle φ is the physical angle of the rotor. The current distribution on the rotor is
fixed with respect to the rotor. Now: assume that the air-gap dimension g is much less than the
radius: g << R. It is not difficult to show that with this assumption the radial flux density Br is
nearly uniform across the gap (i.e. not a function of radius) and obeys:
∂Br K S + KzR
= −µ0 z
∂Rθ g
Then the radial magnetic flux density for this case is simply:
µ0 R
Br = − (KS sin pθ + KR sin p (θ − φ))
pg
Now it is possible to compute the traction on rotor and stator surfaces by recognizing that
the surface current distributions are the azimuthal magnetic fields: at the surface of the stator,
Hθ = −KzS , and at the surface of the rotor, Hθ = KzR . So at the surface of the rotor, traction is:
µ0 R
τθ = Trθ = − (KS sin pθ + KR sin p (θ − φ)) KR cos p (θ − φ)
pg
The average of that is simply:
µ0 R
< τθ >= − KS KR sin pφ
2pg
The same exercise done at the surface of the stator yields the same results (with opposite sign).
To find torque, use:
µ0 πR3 ℓ
T = 2πR2 ℓ < τθ >= KS KR sin pφ
pg
We pause here to make a few observations:
1. For a given value of surface currents Ks and Kr, torque goes as the third power of linear
dimension. That implies that the the achieved shear stres is constant with machine size. And
the ratio of machine torque density to machine volume is constant.
2. If, on the other hand, gap is held constant, torque goes as the fourth power of machine volume.
Since the volume of the machine goes as the third power, this implies that torque capability
goes as the 4/3 power of machine volume.
3. Actually, this understates the situation since the assumed surface current densities are the
products of volume current densities and winding depth, which one would expect to increase
with machine size. As machine radius grows one would expect both stator and rotor sur-
face current densities to grow. Thus machine torque (and power) densities tend to increase
somewhat faster than linearly with machine volume.
2
4. The current distributions want to align with each other. In actual practice what is done is to
generate a stator current distribution which is not static as implied here but which rotates in
space:
KzS = KS cos (pθ − ωt)
and this pulls the rotor along.
5. For a given pair of current distributions there is a maximum torque that can be sustained,
but as long as the torque that is applied to the rotor is less than that value the rotor will
adjust to the correct angle.
So, if only the stator winding is excited, radial magnetic field is:
N S IS
Hr = − sin pθ
2gp
3
and thus the elementary coil flux is:
µ0 NS IS ℓR
Φi (θ) = cos pθ
p2 g
Now, this is flux linked by an elementary coil. To get flux linked by a whole winding we must
‘add up’ the flux linkages of all of the elementary coils. In our continuous approximation to the
real coil this is the same as integrating over the coil distribution:
� π
2p
λS = p Φi (θ)nS (θ)Rdθ
π
− 2p
π ℓRNS2
LS = µ 0
4 gp2
The same process can be used to find self-inductance of the rotor winding (with appropriate
changes of spatial variables), and the answer is:
π ℓRNR2
LR = µ 0
4 gp2
To find the mutual inductance between the two windings, excite one and compute flux linked
by the other. All of the expressions here can be used, and the answer is:
π ℓRNS NR
M (φ) = µ0 cos pφ
4 gp2
Now it is fairly easy to compute torque using conventional methods. Assuming both windings
are excited, magnetic coenergy is:
1 1
′
Wm = LS IS2 + LR IR
2
+ M (φ)IS IR
2 2
∂Wm′ π ℓRNS NR
T = = −µ0 IS IR sin pφ
∂φ 4 gp
NS IS = 2RKS
NR IR = 2RKR
we get the same answer for torque as with the field approach:
µ0 πR3 ℓ
T = 2πR2 ℓ < τθ >= − KS KR sin pφ
pg
4
4 Classical, Lumped-Parameter Synchronous Machine:
Now we are in a position to examine the simplest model of a polyphase synchronous machine.
Suppose we have a machine in which the rotor is the same as the one we were considering, but the
stator has three separate windings, identical but with spatial orientation separated by an electrical
angle of 120◦ = 2π/3. The three stator windings will have the same self- inductance (La ).
With a little bit of examination it can be seen that the three stator windings will have mutual
inductance, and that inductance will be characterized by the cosine of 120◦ . Since the physical
angle between any pair of stator windings is the same,
1
Lab = Lac = Lbc = − La
2
There will also be a mutual inductance between the rotor and each phase of the stator. Using
M to denote the magnitude of that inductance:
π ℓRNa Nf
M = µ0
4 gp2
Maf = M cos (pφ)
2π
� �
Mbf = M cos pφ −
3
2π
� �
Mcf = M cos pφ +
3
We show in Chapter 1 of these notes that torque for this system is:
2π 2π
� � � �
T = −pM ia if sin (pφ) − pM ib if sin pφ − − pM ic if sin pφ +
3 3
5 Balanced Operation:
Now, suppose the machine is operated in this fashion: the rotor turns at a constant velocity, the
field current is held constant, and the three stator currents are sinusoids in time, with the same
amplitude and with phases that differ by 120 degrees.
pφ = ωt + δi
if = If
ia = I cos (ωt)
2π
� �
ib = I cos ωt −
3
2π
� �
ic = I cos ωt +
3
5
Operated in this way, with balanced currents and with the mechanical speed consistent with
the electrical frequency (pΩ = ω), the machine exhibits a constant torque. The phase angle δi is
called the torque angle, but it is important to use some caution, as there is more than one torque
angle.
Now, look at the machine from the electrical terminals. Flux linked by Phase A will be:
Noting that the sum of phase currents is, under balanced conditions, zero and that the mutual
phase-phase inductances are equal, this simplifies to:
I = Iejθi
If = If ejθm
jXd
I
+ +
E af V
-
-
6
Now, if that is connected to a voltage source (i.e. if V is fixed), terminal current is:
V − Eaf ejδ
I=
jXd
where Xd = ωLd is the synchronous reactance.
Then real and reactive power (in phase A) are:
1 ∗
P + jQ = VI
2 � �∗
1 V − Eaf ejδ
= V
2 jXd
1 |V |2 1 V Eaf ejδ
= −
2 −jXd 2 −jXd
This makes real and reactive power:
1 V Eaf
Pa = − sin δ
2 Xd
1V2 1 V Eaf
Qa = − cos δ
2 Xd 2 Xd
If we consider all three phases, real power is
3 V Eaf
P =− sin δ
2 Xd
Now, at last we need to look at actual operation of these machines, which can serve either as
motors or as generators.
Vector diagrams that describe operation as a motor and as a generator are shown in Figures 3
and 4, respectively.
Ia
V
V
δ δ Ia
jXdIa
jXdIa
Eaf
Eaf
Operation as a generator is not much different from operation as a motor, but it is common to
make notations with the terminal current given the opposite (“generator”) sign.
7
Eaf Eaf
jXdIg jXdIg
Ig
V
Ig V
6 Reconciliation of Models
We have determined that we can predict its power and/or torque characteristics from two points
of view : first, by knowing currents in the rotor and stator we could derive an expression for torque
vs. a power angle:
3
T = − pM IIf sin δi
2
From a circuit point of view, it is possible to derive an expression for power:
3 V Eaf
P =− sin δ
2 Xd
and of course since power is torque times speed, this implies that:
3 V Eaf 3 pV Eaf
T =− sin δ = − sin δ
2 ΩXd 2 ωXd
In this section of the notes we will, first of all, reconcile these notions, look a bit more at what
they mean, and then generalize our simple theory to salient pole machines as an introduction to
two-axis theory of electric machines.
8
LdI
t
O
MIf
So the current- and voltage- based pictures do give the same result for torque.
7 Per-Unit Systems:
Before going on, we should take a short detour to look into per-unit systems, a notational device
that, in addition to being convenient, will sometimes be conceptually helpful. The basic notion is
quite simple: for most variables we will note a base quantity and then, by dividing the variable by
the base we have a per-unit version of that variable. Generally we will want to tie the base quantity
to some aspect of normal operation. So, for example, we might make the base voltage and current
correspond with machine rating. If that is the case, then power base becomes:
PB = 3VB IB
impedance 3Z is equivalent to a wye connection of Z), the per-unit impedances of a given system
are not dependent on the particular connection. In fact one of the major advantages of using a
per-unit system is that per-unit values are uniquely determined, while ordinary variables can be
line-line, line-neutral, RMS, peak, etc., for a large number of variations.
Perhaps unfortunate is the fact that base quantities are usually given as line-line voltage and
base power. So that:
PB VB 1 VBΔ V2
IB = √ ZB = = = BΔ
3VBΔ IB 3 IBΔ PB
Now, we will usually write per-unit variables as lower-case versions of the ordinary variables:
V P
v= p= etc.
VB PB
Thus, written in per-unit notation, real and reactive power for a synchronous machine operating
in steady state are:
veaf v2 veaf
p=− sin δ q= − cos δ
xd xd xd
These are, of course, in motor reference coordinates, and represent real and reactive power into
the terminals of the machine.
8 Normal Operation:
The synchronous machine is used, essentially interchangeably, as a motor and as a generator. Note
that, as a motor, this type of machine produces torque only when it is running at synchronous
speed. This is not, of course, a problem for a turbogenerator which is started by its prime mover
(e.g. a steam turbine). Many synchronous motors are started as induction machines on their
damper cages (sometimes called starting cages). And of course with power electronic drives the
machine can often be considered to be “in synchronism” even down to zero speed.
As either a motor or as a generator, the synchronous machine can either produce or consume
reactive power. In normal operation real power is dictated by the load (if a motor) or the prime
mover (if a generator), and reactive power is determined by the real power and by field current.
Figure 6 shows one way of representing the capability of a synchronous machine. This picture
represents operation as a generator, so the signs of p and q are reversed, but all of the other elements
of operation are as we ordinarily would expect. If we plot p and q (calculated in the normal way)
against each other, we see the construction at the right. If we start at a location q = −v 2 /xd , (and
remember that normally v = 1 per-unit) , then the locus of p and q is what would be obtained by
swinging a vector of length veaf /xd over an angle δ. This is called a capability chart because it is
an easy way of visualizing what the synchronous machine (in this case generator) can do. There
are three easily noted limits to capability. The upper limit is a circle (the one traced out by that
vector) which is referred to as field capability. The second limit is a circle that describes constant
|p + jq |. This is, of course, related to the magnitude of armature current and so this limit is called
armature capability. The final limit is related to machine stability, since the torque angle cannot
go beyond 90 degrees. In actuality there are often other limits that can be represented on this type
of a chart. For example, large synchronous generators typically have a problem with heating of the
stator iron when they attempt to operate in highly underexcited conditions (q strongly negative),
so that one will often see another limit that prevents the operation of the machine near its stability
10
Field Limit
Stator
Limit
1
X Stability Limit
d
limit. In very large machines with more than one cooling state (e.g. different values of cooling
hydrogen pressure) there may be multiple curves for some or all of the limits.
Another way of describing the limitations of a synchronous machine is embodied in the Vee
Curve. An example is shown in Figure 7 . This is a cross-plot of magnitude of armature current
with field current. Note that the field and armature current limits are straightforward (and are the
right-hand and upper boundaries, respectively, of the chart). The machine stability limit is what
terminates each of the curves at the upper left-hand edge. Note that each curve has a minimum at
unity power factor. In fact, there is yet another cross-plot possible, called a compounding curve, in
which field current is plotted against real power for fixed power factor.
11
Vee Curves
1.2
0.8
Per-Unit Ia
0.6
0.4
0.2
and q- axis components is straightforward and will appear in Chapter 8 of these notes.
The key here is to separate MMF and flux into two orthogonal components and to pretend that
each can be treated as sinusoidal. The two components are aligned with the direct axis and with
the quadrature axis of the machine. The direct axis is aligned with the field winding, while the
quadrature axis leads the direct by 90 degrees. Then, if φ is the angle between the direct axis and
the axis of phase a, we can write for flux linking phase a:
λs = λd cos φ − λq sin φ
dλa
Then, in steady state operation, if Va = dt and φ = ωt + δ ,
Vd = −ωλq
Vq = ωλd
one might think of the ‘voltage’ vector as leading the ‘flux’ vector by 90 degrees.
Now, if the machine is linear, those fluxes are given by:
λd = Ld Id + M If
λ q = L q Iq
12
d axis
Iq
q axis I Id
f
Vq V
V
d
Figure 9: Resolution of Terminal Voltage
Referring to Figure 9, one can resolve terminal voltage into these components:
Vd = V sin δ
Vq = V cos δ
or:
13
where
Xd = ωLd Xq = ωLq Eaf = ωM If
Now, we are working in ordinary variables (this discussion should help motivate the use of per-
unit!), and each of these variables is peak amplitude. Then, if we take up a complex frame of
reference:
V = Vd + jVq
I = Id + jIq
I
d
ψ
I
V
I δ
q
d axix j X Id
d
jX I
q
q axis
E
1
jX I
q q
E
af
A phasor diagram for a salient pole machine is shown in Figure 10. This is a little different
from the equivalent picture for a round-rotor machine, in that stator current has been separated
into its d- and q- axis components, and the voltage drops associated with those components have
been drawn separately. It is interesting and helpful to recognize that the internal voltage Eaf can
be expressed as:
Eaf = E1 + (Xd − Xq ) Id
14
where the voltage E1 is on the quadrature axis. In fact, E1 would be the internal voltage of a
round rotor machine with reactance Xq and the same stator current and terminal voltage. Then
the operating point is found fairly easily:
� �
−1 Xq I cos ψ
δ = − tan
V + Xq I sin ψ
�
E1 = (V + Xq I sin ψ)2 + (Xq I cos ψ)2
Power-Angle Curves
1.5
xd=2.2
1
xq = 1.6
0.5
Per-Unit
Round Rotor
0 Salient Rotor
-4 -3 -2 -1 0 1 2 3 4
-0.5
-1
-1.5
Torque Angle
Figure 11: Torque-Angle Curves: Round Rotor and Salient Pole Machines
A comparison of torque-angle curves for a pair of machines, one with a round, one with a salient
rotor is shown in Figure 11 . It is not too difficult to see why power systems analysts often neglect
saliency in doing things like transient stability calculations.
15
10.1 Voltage
Assume that our sinusoidal approximation for turns density is valid:
Na
na (θ) = cos pθ
2R
And suppose that working flux density is:
Now, to compute flux linked by the winding (and consequently to compute voltage), we first
compute flux linked by an incremental coil:
� θ+ π
p
λi (θ) = ℓBr (θ ′ )Rdθ ′
θ
This is instantaneous flux linked when the rotor is at angle φ. If the machine is operating at some
electrical frequency ω with a phase angle so that pφ = ωt + δ, the RMS magnitude of terminal
voltage is:
ωπ B0
Va = 2ℓRNa √
p4 2
Finally, note that the useful peak current density that can be used is limited by the fraction of
machine periphery used for slots:
B0 = Bs (1 − λs )
where Bs is the flux density in the teeth, limited by saturation of the magnetic material.
10.2 Current
The (RMS) magnitude of the current sheet produced by a current of (RMS) magnitude I is:
q Na I
Kz =
2 2R
And then the current is, in terms of the current sheet magnitude:
2
I = 2RKz
qNa
Note that the surface current density is, in terms of area current density Js , slot space factor λs
and slot depth hs :
Kz = λs Js hs
This gives terminal current in terms of dimensions and useful current density:
4R
I= λs hs Js
qNa
16
10.3 Rating
Assembling these expressions, machine rating becomes:
ω Bs
|P + jQ| = qV I = 2πR2 ℓ √ λs (1 − λs ) hs Js
p 2
This expression is actually fairly easily interpreted. The product of slot factor times one minus
slot factor optimizes rather quickly to 1/4 (when λs = 1). We could interpret this as:
|P = jQ| = As us τ ∗
where the interaction area is:
As = 2πRℓ
The surface velocity of interaction is:
ω
us = R = ΩR
p
and the fragment of expression which “looks like” traction is:
Bs
τ ∗ = hs Js √ λs (1 − λs )
2
Note that this is not quite traction since the current and magnetic flux may not be ideally aligned,
and this is why the expression incorporates reactive as well as real power.
This is not quite yet the whole story. The limit on Bs is easily understood to be caused by
saturation of magnetic material. The other important element on shear stress density, hs Js is a
little more involved.
17
10.5 Field Winding
Thus, given a value for xd and ψ, per- unit internal voltage eaf is also fixed. Then field current
required can be calculated by first estimating field winding current for “no-load operation”.
µ0 Nf If nl
Br =
2gp
and rated field current is:
If = If nl eaf
or, required rated field current is:
2gp(1 − λs )Bs
N f If = eaf
µ0
Next, If can be related to a field current density:
NRS
Nf If = ARS Jf
2
where NRS is the number of rotor slots and the rotor slot area ARS is
ARS = wR hR
We will not have a lot more to say about this. Note that the ratio of xd /eaf can be quite small
(if the per-unit reactance is small), will never be a very large number for any practical machine,
and is generally less than one. As a practical matter it is unusual for the per-unit synchronous
reatance of a machine to be larger than about 2 or 2.25 per-unit. What this tells us should be
obvious: either the rotor or the stator of a machine can produce the dominant limitation on shear
stress density (and so on rating). The best designs are “balanced”, with both limits being reached
at the same time.
18
Massachusetts Institute of Technology
Department of Electrical Engineering and Computer Science
6.685 Electric Machinery
1 Introduction
The purpose of this document is to show how the inductances of windings in round- rotor machines
with narrow air gaps may be calculated. We deal only with the idealized air- gap magnetic fields,
and do not consider slot, end winding, peripheral or skew reactances. We do, however, consider
the space harmonics of winding magneto-motive force (MMF).
2 Description of Stators
Back Iron
Slots
Slot
Depression
Teeth
Figure 1 shows a cartoon view of an axial cross-section of a twelve-slot stator. Actually, what
is shown is the shape of a thin sheet of steel, or lamination that is used to make up the magnetic
circuit. The iron is made of thin sheets to control eddy current losses. Thickness varies according
to freuqency of operation, but in machines for 60 Hz (the vast bulk of machines made for industrial
use), lamination thickness is typically .014” (.355 mm). These are stacked to make the magnetic
circuit of the appropriate length. Windings are carried in the slots of this structure.
Figure 1 shows trapezoidal slots with teeth of approximately uniform cross-section over most of
their length but wider extent near the air-gap. The tooth ends, in combination with the relatively
narrow slot depression region, help control certain parasitic losses in the rotor of many machines
by improving uniformity of the air-gap fields, increase the air-gap permeance and help hold the
windings in the slots. It should be noted that large machines, with what are called “form wound”
coils, have straight-sided rectangular slots and consequently teeth of non-uniform cross-section.
The description that follows will hold for both types of machine.
A A C’ C’ B B A’ A’ C C B’ B’
1 2 3 4 5 6 7 8 9 10 11 12
To simplify the discussion, imagine the slot/tooth region to be “straightened out” as shown in
Figure 2. This shows a three-phase, two-pole winding in the twelve slots. Such a winding would
have two slots per pole per phase. One of the two coils of phase A would be wound in slots 1 and 7
(six slots apart).
A A C’ C’ B B A’ A’ C C B’ B’
A C’ C’ B B A’ A’ C C B’ B’ A
1 2 3 4 5 6 7 8 9 10 11 12
Machines are seldom wound as shown in Figure 2 for a variety of reasons. It is usually advanta-
geous in reducing the length of the end turns and to reducing space harmonic effects in the machine
(usually bad effects!) to wind the machine with “short-pitched” windings as shown in Figure 3.
Each phase in this case consists of four coils (two per slot). The four coils of Phase A would span
between slots 1 and 6, slots 2 and 7, slots 7 and 12 and slots 8 and 1. Each of these coil spans is
five slots, so this choice of winding pattern is referred to as “Five-Sixths” pitch.
So this cartoon-figure machine stator (which could represent either a synchronous or induction
motor or generator) has both breadth because there are more than one slots per pole per phase,
and it may have the need for accounting for winding pitch. What follows in this note is a simple
protocol for estimating the important air-gap fields and inductances.
3 Winding MMF
To start, consider the MMF of a full- pitch, concentrated winding as shown in schematic form in
Figure 4. Assuming that the winding has a total of N turns over p pole- pairs, and is carrying
current I the MMF is: ∞
� 4 NI
F =− sin npθ (1)
nπ 2p
n=1
nodd
This distribution is shown, as a function of angle θ in Figure 5.
This leads directly to magnetic flux density in the air- gap:
∞
� µ0 4 N I
Br = − sin npθ (2)
g nπ 2p
n=1
nodd
Note that a real winding, which will most likely not be full- pitched and concentrated, will have a
winding factor which is the product of pitch and breadth factors, to be discussed later.
Magnetic
Circuit:
Stator
Rotor
NI
µ p θ
r
R z
g
Air-Gap
Now, suppose that there is a polyphase winding, consisting of more than one phase (we will use
three phases), driven with one of two types of current. The first of these is balanced, current:
Ia = I cos(ωt)
2π
Ib = I cos(ωt − )
3
2π
Ic = I cos(ωt + ) (3)
3
Conversely, we might consider Zero Sequence currents:
Ia = Ib = Ic = I cos ωt (4)
F( θ )
NI
p
π π 3π
2
π θ
2p p 2p p
Then it is possible to express magnetic flux density for the two distinct cases. For the balanced
case: ∞ �
Br = Brn sin(npθ ∓ ωt) (5)
n=1
where
• The upper sign holds for n = 1, 7, ...
• The lower sign holds for n = 5, 11, ...
• all other terms are zero
and
3 µ0 4 N I
Brn = (6)
2 g nπ 2p
The zero- sequence case is simpler: it is nonzero only for the triplen harmonics:
∞
� µ0 4 N I 3
Br = (sin(npθ − ωt) + sin(npθ + ωt)) (7)
n=3,9,...
g nπ 2p 2
Next, consider the flux from a winding on the rotor: that will have the same form as the flux
produced by a single armature winding, but will be referred to the rotor position:
∞
� µ0 4 N I
Brf = sin npθ ′ (8)
g nπ 2p
n=1
nodd
ωt
which is, substituting θ ′ = θ − p ,
∞
� µ0 4 N I
Brf = sin n(pθ − ωt) (9)
g nπ 2p
n=1
nodd
The next step here is to find the flux linked if we have some air- gap flux density of the form:
∞
�
Br = Brn sin(npθ ± ωt) (10)
n=1
� π
p
Φ= Br Rldθ (11)
0
λ = NΦ (13)
The end of this is a set of expressions for various inductances. It should be noted that, in the
real world, most windings are not full-pitched nor concentrated. Fortunately, these shortcomings
can be accommodated by the use of winding factors.
The simplest and perhaps best definition of a winding factor is the ratio of flux linked by an
actual winding to flux that would have been linked by a full- pitch, concentrated winding with the
same number of turns. That is:
λactual
kw = (14)
λf ull−pitch
It is relatively easy to show, using reciprocity arguments, that the winding factors are also
the ratio of effective MMF produced by an actual winding to the MMF that would have been
produced by the same winding were it to be full- pitched and concentrated. The argument goes
as follows: mutual inductance between any pair of windings is reciprocal. That is, if the windings
are designated one and two, the mutual inductance is flux induced in winding one by current in
winding two, and it is also flux induced in winding two by current in winding one. Since each
winding has a winding factor that influences its linking flux, and since the mutual inductance must
be reciprocal, the same winding factor must influence the MMF produced by the winding.
The winding factors are often expressed for each space harmonic, although sometimes when a
winding factor is referred to without reference to a harmonic number, what is meant is the space
factor for the space fundamental.
Two winding factors are commonly specified for ordinary, regular windings. These are usually
called pitch and breadth factors, reflecting the fact that often windings are not full pitched, which
means that individual turns do not span a full π electrical radians and that the windings occupy a
range or breadth of slots within a phase belt. The breadth factors are ratios of flux linked by a given
winding to the flux that would be linked by that winding were it full- pitched and concentrated.
These two winding factors are discussed in a little more detail below. What is interesting to note,
although we do not prove it here, is that the winding factor of any given winding is the product of
the pitch and breadth factors:
kw = kp kb (15)
With winding factors as defined by (14) and the sections below, it is possible to define winding
inductances. For example, the synchronous inductance of a winding will be the apparent inductance
of one phase when the polyphase winding is driven by a balanced set of currents as in (3). This is,
approximately:
�∞
3 4 µ0 N 2 Rlkwn
2
Ld = (16)
n=1,5,7,...
2π p2 gn2
5
This expression is approximate because it ignores the asynchronous interactions between higher
order harmonics and the rotor of the machine. These are beyond the scope of this note.
Zero- sequence inductance is the ratio of flux to current if a winding is excited by zero sequence
currents, as in (4):
∞
� 4 µ0 N 2 Rlkwn
2
L0 = 3 (17)
n=3,9,...
π p2 gn2
And then mutual inductance, as between a field winding (f ) and an armature winding (a), is:
∞
� 4 µ0 Nf Na kf n kan Rl
M (φ) = cos(npφ) (18)
π p2 gn2
n=1
nodd
4 Winding Factors
Now we turn our attention to computing the winding factors for simple, regular winding patterns.
We do not prove but only state that the winding factor can, for regular winding patterns, be
expressed as the product of a pitch factor and a breadth factor, each of which can be estimated
separately.
α
θ
r
z
Pitch factor is found by considering the flux linked by a less- than- full pitched winding. Consider
the situation in which radial magnetic flux density is:
Br = Bn sin(npθ − ωt) (19)
A winding with pitch α will link flux (see Figure 6:
π
�
2p
+ 2αp
λ = Nl Bn sin(npθ − ωt)Rdθ (20)
π
2p
− 2αp
6
Pitch α refers to the angular displacement between sides of the coil, expressed in electrical
radians. For a full- pitch coil α = π.
The flux linked is:
2N lRBn nπ nα
λ= sin( ) sin( ) (21)
np 2 2
Using the definition (14), the pitch factor is seen to be:
nα
kpn = sin (22)
2
θ
γ
r
z
A full- pitch coil with one side at angle ξ will, in the presence of magnetic flux density as
described by (19), link flux:
� π
p
− pξ
λ = Nl ξ
Bn sin(npθ − ωt)Rdθ (23)
p
where in (24), complex number notation has been used for convenience in carrying out the rest of
this derivation.
What happens here is that the coils link fluxes that differ in phase, so the addition of flux is as
shown in vector form in Figure 8.
7
Individual
Flux Linkages Total Flux
Linkage
Now: if the winding is distributed into m sets of slots and the slots are evenly spaced, the
angular position of each slot will be:
m−1
ξi = iγ − γ (25)
2
N
and the number of turns in each slot will be mp , so that actual flux linked will be:
2N lRBn 1 m−1 � �
Re ej(ωt−nξi )
�
λ= (26)
np m i=0
The breadth factor is then simply:
1 m−1 m−1
e−jn(iγ− 2 γ)
�
kb = (27)
m i=0
Now, focus on that sum. We know that any coverging geometric sum has a simple sum:
∞
1
xi =
�
(29)
i=0
1−x
and that a truncated sum is:
m−1
� ∞
� ∞
�
= − (30)
i=0 i=0 i=m
Then the sum in (28) can be written as:
m−1 ∞
� �� 1 − ejnmγ
e−jniγ = 1 − ejnmγ
�
e−jniγ = (31)
i=0 i=0
1 − e−jnγ
Now, inserting the results of (31) into (28), and using the definitions for sine, the breadth factor is
found:
sin nmγ
2
kbn = (32)
m sin nγ
2
8
4.3 Alternate Derivation of Breadth Factor
Most textbooks, if they bother to prove the Breadth Factor, use a geometric proof as shown in
Figure 9.
A
B
γ
2
mγ
2
The short vectors (e.g. AC) represent the voltages induced in individual coils. In fact, what is
shown in this figure is the same as is shown in Figure 8, but spread out to show the actual addition.
Now, note that if each of the vectors is bisected by a line segment at right angles, all of those
line segments meet at point O. The line segment that includes OB is one of these. Line segments
that run from O to the ends of the vectors will have an angle γ2 from the bisectors of the vectors.
Similarly, the line segment OA has an angle of mγ 2 with respect to the bisector of the resultant
voltage vector.
Now, if we note F1 as the length of each of the individual coil voltage vectors and F as the
length of the resultant sum, the length of half of the bisector is:
F mγ
AB = = OA sin (33)
2 2
but then
1 1 γ
AC = F1 = OA sin (34)
2 2 2
Then the resultant vector is:
sin m γ2
F
= 2AB
= mF1 (35)
m sin γ2
9
Massachusetts Institute of Technology
Department of Electrical Engineering and Computer Science
1 Introduction
Virtually all electric machines, and all practical electric machines employ some form of rotating
or alternating field/current system to produce torque. While it is possible to produce a “true
DC” machine (e.g. the “Faraday Disk”), for practical reasons such machines have not reached
application and are not likely to. In the machines we have examined so far the machine is operated
from an alternating voltage source. Indeed, this is one of the principal reasons for employing AC
in power systems.
The first electric machines employed a mechanical switch, in the form of a carbon brush/commutator
system, to produce this rotating field. While the widespread use of power electronics is making
“brushless” motors (which are really just synchronous machines) more popular and common, com-
mutator machines are still economically very important. They are relatively cheap, particularly in
small sizes, and they tend to be rugged and simple.
You will find commutator machines in a very wide range of applications. The starting motor
on all automobiles is a series-connected commutator machine. Many of the other electric motors in
automobiles, from the little motors that drive the outside rear-view mirrors to the motors that drive
the windshield wipers are permanent magnet commutator machines. The large traction motors
that drive subway trains and diesel/electric locomotives are DC commutator machines (although
induction machines are making some inroads here). And many common appliances use “universal”
motors: series connected commutator motors adapted to AC.
1.1 Geometry:
Stator Yoke
Field Poles
Rotor Ω
Armature Winding
Field Winding
1
A schematic picture (“cartoon”) of a commutator type machine is shown in 1. The armature
of this machine is on the rotor (this is the part that handles the electric power), and current is fed
to the armature through the brush/commutator system. The interaction magnetic field is provided
(in this picture) by a field winding. A permanent magnet field is applicable here, and we will have
quite a lot more to say about such arrangements below.
Now, if we assume that the interaction magnetic flux density averages Br , and if there are Ca
conductors underneath the poles at any one time, and if there are m parallel paths, then we may
estimate torque produced by the machine by:
Ca
Te = RℓBr Ia
m
where R and ℓ are rotor radius and length, respectively and Ia is terminal current. Note that Ca
is not necessarily the total number of conductors, but rather the total number of active conductors
(that is, conductors underneath the pole and therefore subject to the interaction field). Now, if we
note Nf as the number of field turns per pole, the interaction field is just:
N f If
Br = µ 0
g
leading to a simple expression for torque in terms of the two currents:
Te = GIa If
Now, note that the armature conductors are moving through the magnetic field produced by
the stator (field) poles, and we can ascribe to them an axially directed electric field:
Ez = −RΩBr
2
H H
B dl
H
v
Figure 2: Motion of a contour through a magnetic field produces flux change and electric field in
the moving contour
If the armature conductors are arranged as described above, with Ca conductors in m parallel
paths underneath the poles and with a mean active radial magnetic field of Br , we can compute a
voltage induced in the stator conductors:
Ca
Eb = RΩBr
m
Note that this is only the voltage induced by motion of the armature conductors through the
field and does not include brush or conductor resistance. If we include the expression for effective
magnetic field, we find that the back voltage is:
Eb = GΩIf
which leads us to the conclusion that newton-meters per ampere squared equals volt seconds per
ampere. This stands to reason if we examine electric power into the interaction and mechanical
power out:
Pem = Eb Ia = Te Ω
Now, a more complete model of this machine would include the effects of armature, brush and
lead resistance, so that in steady state operation:
Va = Ra Ia + GΩIf
Now, consider this machine with its armatucre connected to a voltage source and its field
operating at steady current, so that:
Va − GΩIf
Ia =
Ra
Then torque, electric power in and mechanical power out are:
Va − GΩIf
Te = GIf
Ra
Va − GΩIf
Pe = Va
Ra
Va − GΩIf
Pm = GΩIf
Ra
Now, note that these expressions define three regimes defined by rotational speed. The two
“break points” are at zero speed and at the “zero torque” speed:
Va
Ω0 =
GIf
3
Ra
+
+
Va G ΩI f
-
Electrical
Mechanical
For 0 < Ω < Ω0 , the machine is a motor: electric power in and mechanical power out are both
positive. For higher speeds: Ω0 < Ω , the machine is a generator, with electrical power in and
mechanical power out being both negative. For speeds less than zero, electrical power in is positive
and mechanical power out is negative. There are few needs to operate machines in this regime,
short of some types of “plugging” or emergency braking in tractions systems.
1.2 Hookups:
We have just described a mode of operation of a commutator machine usually called “separately
excited”, in which field and armature circuits are controlled separately. This mode of operation is
used in some types of traction applications in which the flexibility it affords is useful. For example,
some traction applications apply voltage control in the form of “choppers” to separately excited
machines.
Note that the “zero torque speed” is dependent on armature voltage and on field current.
For high torque at low speed one would operate the machine with high field current and enough
armature voltage to produce the requisite current. As speed increases so does back voltage, and
field current may need to be reduced. At any steady operating speed there will be some optimum
mix of field and armature currents to produced the required torque. For braking one could (and this
is often done) re-connect the armature of the machine to a braking resistor and turn the machine
into a generator. Braking torque is controlled by field current.
A subset of the separately excited machine is the shunt connection in which armature and field
are supplied by the same source, in parallel. This connection is not widely used any more: it does
Ra
GΩ I f
+
V
not yield any meaningful ability to control speed and the simple applications to which it used to
be used are handled by induction machines.
Another connection which is still widely used is the series connection, in which the field winding
is sized so that its normal operating current level is the same as normal armature current and the
two windings are connected in series. Then:
V
Ia = If =
Ra + Rf + GΩ
5
A variation on this class of machine is the very widely used “universal motor”, in which the stator
and rotor (field and armature) of the machine are both constructed to operate with alternating
current. This means that both the field and armature are made of laminated steel. Note that such
a machine will operate just as it would have with direct current, with the only addition being the
reactive impedance of the two windings. Working with RMS quantities:
V
I =
Ra + Rf + GΩ + jω (La + Lf )
G|V |2
Te =
(Ra + Rf + GΩ)2 + (ωLa + ωLf )2
where ω is the electrical supply frequency. Note that, unlike other AC machines, the universal
motor is not limited in speed to the supply frequency. Appliance motors typically turn substantially
faster than the 3,600 RPM limit of AC motors, and this is one reason why they are so widely used:
with the high rotational speeds it is possible to produce more power per unit mass (and more power
per dollar).
1.3 Commutator:
The commutator is what makes this machine work. The brush and commutator system of this
class of motor involves quite a lot of “black art”, and there are still aspects of how they work
which are poorly understood. However, we can make some attempt to show a bit of what the
brush/commutator system does.
To start, take a look at the picture shown in Figure 7. Represented are a pair of poles (shaded)
and a pair of brushes. Conductors make a group of closed paths. Current from one of the brushes
takes two parallel paths. You can follow one of those paths around a closed loop, under each of
the two poles (remember that the poles are of opposite polarity) to the opposite brush. Open
commutator segments (most of them) do not carry current into or out of the machine.
A commutation interval occurs when the current in one coil must be reversed. (See Figure 8
In the simplest form this involves a brush bridging between two commutator segments, shorting
out that coil. The resistance of the brush causes the current to decay. When the brush leaves the
leading segment the current in the leading coil must reverse.
We will not attempt to fully understand the commutation process in this type of machine, but
we can note a few things. Resistive commutation is the process relied upon in small machines.
6
Figure 8: Commutator at Commutation
When the current in one coil must be reversed (because it has left one pole and is approaching the
other), that coil is shorted by one of the brushes. The brush resistance causes the current in the
coil to decay. Then the leading commutator segment leaves the brush the current MUST reverse
(the trailing coil has current in it), and there is often sparking.
1.4 Commutation
Stator Yoke
Field Poles
Rotor Ω
Armature Winding
Field Winding
Commutation
Interpoles
In larger machines the commutation process would involve too much sparking, which causes
brush wear, noxious gases (ozone) that promote corrosion, etc. In these cases it is common to use
separate commutation interpoles. These are separate, usually narrow or seemingly vestigal pole
pieces which carry armature current. They are arranged in such a way that the flux from the
interpole drives current in the commutated coil in the proper direction. Remember that the coil
being commutated is located physically between the active poles and the interpole is therefore in the
right spot to influence commutation. The interpole is wound with armature current (it is in series
with the main brushes). It is easy to see that the interpole must have a flux density proportional
to the current to be commutated. Since the speed with which the coil must be commutated is
proportional to rotational velocity and so is the voltage induced by the interpole, if the right
7
number of turns are put around the interpole, commutation can be made to be quite accurate.
1.5 Compensation:
Field Poles
Pole−Face
Compensation
Winding
Rotor Ω
Armature Winding
Field Winding
Commutation
Interpoles
The analysis of commutator machines often ignores armature reaction flux. Obviously these
machines DO produce armature reaction flux, in quadrature with the main field. Normally, com-
mutator machines are highly salient and the quadrature inductance is lower than direct-axis in-
ductance, but there is still flux produced. This adds to the flux density on one side of the main
poles (possibly leading to saturation). To make the flux distribution more uniform and therefore
to avoid this saturation effect of quadrature axis flux, it is common in very highly rated machines
to wind compensation coils: essentially mirror-images of the armature coils, but this time wound
in slots in the surface of the field poles. Such coils will have the same number of ampere-turns as
the armature. Normally they have the same number of turns and are connected directly in series
with the armature brushes. What they do is to almost exactly cancel the flux produced by the
armature coils, leaving only the main flux produced by the field winding. One might think of these
coils as providing a reaction torque, produced in exactly the same way as main torque is produced
by the armature. A cartoon view of this is shown in Figure 10.
force densities which permit vastly larger air gaps than conventional field windings, and this in turn
permits design flexibility which can result in even better electric machines.
These advantages come not without cost. Permanent magnet materials have special character-
istics which must be taken into account in machine design. The highest performance permanent
magnets are brittle ceramics, some have chemical sensitivities, all are sensitive to high temperatures,
most have sensitivity to demagnetizing fields, and proper machine design requires understanding
the materials well. These notes will not make you into seasoned permanent magnet machine de-
signers. They are, however, an attempt to get started, to develop some of the mathematical skills
required and to point to some of the important issues involved.
1
0.8
0.6
0.4
0.2
Tesla
0
-400 -300 -200 -100-0.2 0 100 200 300 400
-0.4
-0.6
-0.8
-1
Kilo Am peres/Meter
Permanent magnet materials are, at core, just materials with very wide hysteresis loops. Fig-
ure 11 is an example of something close to one of the more popular ceramic magnet materials. Note
that this hysteresis loop is so wide that you can see the effect of the permeability of free space.
It is usual to display only part of the magnetic characteristic of permanent magnet materials
(see Figure 12), the third quadrant of this picture, because that is where the material is normally
operated. Note a few important characteristics of what is called the “demagnetization curve”. The
remanent flux density Br , is the value of flux density in the material with zero magnetic field H.
The coercive field Hc is the magnetic field at which the flux density falls to zero. Shown also on
the curve are loci of constant energy product. This quantity is unfortunately named, for although
it has the same units as energy it represents real energy in only a fairly general sense. It is the
product of flux density and field intensity. As you already know, there are three commonly used
systems of units for magnetic field quantities, and these systems are often mixed up to form very
confusing units. We will try to stay away from the English system of units in which field intensity
H is measured in amperes per inchand flux density B in lines (actually, usually kilolines) per
square inch. In CGS units flux density is measured in Gauss (or kilogauss) and magnetic field
intensity in Oersteds. And in SI the unit of flux density is the Tesla, which is one Weber per square
meter, and the unit of field intensity is the Ampere per meter . Of these, only the last one, A/m is
obvious. A Weber is a volt-second. A Gauss is 10−4 Tesla. And, finally, an Oersted is that field
Demagnetization Curve
0.5
0.45
0.4 Br
Energy Product Loci
0.35
0.3
B, Tesla
0.25
0.2
0.15
0.1
Hc 0.05
0
-250 -200 -150 -100 -50 0
H, kA/m
intensity required to produce one Gauss in the permeability of free space. Since the permeability
of free space µ0 = 4π × 10−7 Hy/m, this means that one Oe is about 79.58 A/m. Commonly, the
energy product is cited in MgOe (Mega-Gauss-Oersted)s. One MgOe is equal to 7.958kJ/m3 . A
commonly used measure for the performance of a permanent magnet material is the maximum
energy product, the largest value of this product along the demagnetization curve.
To start to understand how these materials might be useful, consider the situation shown in
Figure 13: A piece of permanent magnet material is wrapped in a magnetic circuit with effectively
infinite permeability. Assume the thing has some (finite) depth in the direction you can’t see. Now,
if we take Ampere’s law around the path described by the dotted line,
I
~ · d~ℓ = 0
H
10
Permanent Magnet
Permanent Magnet
hm
This defines the unit permeance, essentially the ratio of the permeance facing the permanent
magnet to the internal permeance of the magnet. The problem can be, if necessary, solved graph-
ically, since the relationship between Bm and Hm is inherently nonlinear, as shown in Figure 15
“load line” analysis of a nonlinear electronic circuit.
Now, one more ‘cut’ at this problem. Note that, at least for fairly large unit permeances the
slope of the magnet characteristic is fairly constant. In fact, for most of the permanent magnets
used in machines (the one important exception is the now rarely used ALNICO alloy magnet), it
is generally acceptable to approximate the demagnitization curve with:
B ~m + M
~ m = µm H ~0
Here, the magnetization M0 is fixed. Further, for almost all of the practical magnet materials
the magnet permeability is nearly the same as that of free space (µm ≈ µ0 ). With that in mind,
consider the problem shown in Figure 16, in which the magnet fills only part of a gap in a magnetic
circuit. But here the magnet and gap areas are essentially the same. We could regard the magnet
as simply a magnetization.
11
− µ 0℘u
Permanent Magnet
In the region of the magnet and the air-gap, Ampere’s Law and Gauss’ law can be written:
∇×H ~ = 0
∇ · µ0 ~m + M
H ~0 = 0
~g = 0
∇ · µ0 H
Now, if in the magnet the magnetization is constant, the divergence of H in the magnet is zero.
Because there is no current here, H is curl free, so that everywhere:
~ = −∇ψ
H
∇2 ψ = 0
That is, magnetic field can be expressed as the gradient of a scalar potential which satisfies
Laplace’s equation. It is also pretty clear that, if we can assign the scalar potential to have a value
of zero anywhere on the surface of the magnetic circuit it will be zero over all of the magnetic
circuit (i.e. at both the top of the gap and the bottom of the magnet). Finally, note that we can’t
actually assume that the scalar potential satisfies Laplace’s equation everywhere in the problem.
In fact the divergence of M is zero everywhere except at the top surface of the magnet where it is
singular! In fact, we can note that there is a (some would say fictitious) magnetic charge density:
~
ρm = −∇ · M
12
At the top of the magnet there is a discontinuous change in M and so the equivalent of a
magnetic surface charge. Using Hg to note the magnetic field above the magnet and Hm to note
the magnetic field in the magnet,
µ0 Hg = µ0 (Hm + M0 )
σm = M0 = Hg − Hm
and then to satisfy the potential condition, if hm is the height of the magnet and g is the gap:
gHg = −hm Hm
Solving,
hm
Hg = M0
hm + g
Now, one more observation could be made. We would produce the same air-gap flux density
if we regard the permanent magnet as having a surface current around the periphery equal to the
magnetization intensity. That is, if the surface current runs around the magnet:
Kφ = M0
F = Kφ hm
and then since the magnetic field is just the MMF divided by the total gap:
F hm
Hg = = M0
hm + g hm + g
The real utility of permanent magnets comes about from the relatively large magnetizations:
numbers of a few to several thousand amperes per meter are common, and these would translate
into enormous current densities in magnets of ordinary size.
13
Back Iron
h
m
g
Rotor
Permanent
Magnets
A book on this topic by James Ireland suggests values for the two “fudge factors”:
We may further estimate the ratio of areas of the gap and magnet by:
Ag R + g2
=
Am R + g + h2m
Now, there are a bunch of approximations and hand wavings in this expression, but it seems to
work, at least for the kind of machines contemplated.
A second correction is required to correct the effective length for electrical interaction. The
reason for this is that the magnets produce fringing fields, as if they were longer than the actual
”stack length” of the rotor (sometimes they actually are). This is purely empirical, and Ireland
gives a value for effective length for voltage generation of:
ℓ∗
ℓeff =
fℓ
where ℓ∗ = ℓ + 2N R , and the empirical coefficient
A hm
� �
N ≈ log 1 + B
B R
where
hm
B = 7.4 − 9.0
R
A = 0.9
14
3.1.1 Voltage:
It is, in this case, simplest to consider voltage generated in a single wire first. If the machine is
running at angular velocity Ω, speed voltage is, while the wire is under a magnet,
vs = ΩRℓBr
Now, if the magnets have angular extent θm the voltage induced in a wire will have a waveform
as shown in Figure 18: It is pulse-like and has the same shape as the magnetic field of the magnets.
vs
π
Ωt
θm
The voltage produced by a coil is actually made up of two waveforms of exactly this form, but
separated in time by the ”coil throw” angle. Then the total voltage waveform produced will be
the sum of the two waveforms. If the coil thrown angle is larger than the magnet angle, the two
voltage waveforms add to look like this: There are actually two coil-side waveforms that add with
a slight phase shift.
vc
0m
0m
Figure 19: Voltage Induced in a Coil
If, on the other hand, the coil thrown is smaller than the magnet angle, the picture is the same,
only the width of the pulses is that of the coil rather than the magnet. In either case the average
voltage generated by a coil is:
θ∗
v = ΩRℓNs Bd
π
where θ ∗ is the lesser of the coil throw or magnet angles and Ns is the number of series turns in
the coil. This gives us the opportunity to develop the number of “active” turns:
Ca θ∗ C θ∗
= Ns = tot
m π m π
15
Here, Ca is the number of active conductors, Ctot is the total number of conductors and m is the
number of parallel paths. The motor coefficient is then:
Rℓeff Ctot Bd θ ∗
K=
m π
(dw is wire diameter, σw is wire conductivity and ℓt is length of one half-turn). This length depends
on how the machine is wound, but a good first-order guess might be something like this:
ℓt ≈ ℓ + πR
16
1 Introduction
This document is a brief introduction to the design evaluation of permanent magnet motors, with
an eye toward servo and drive applications. It is organized in the following manner: First, we
describe three different geometrical arrangements for permanent magnet motors:
After a qualitative discussion of these geometries, we will discuss the elementary rating param-
eters of the machine and show how to arrive at a rating and how to estimate the torque and power
vs. speed capability of the motor. Then we will discuss how the machine geometry can be used to
estimate both the elementary rating parameters and the parameters used to make more detailed
estimates of the machine performance.
Some of the more involved mathematical derivations are contained in appendices to this note.
2 Motor Morphologies
There are, of course, many ways of building permanent magnet motors, but we will consider only a
few in this note. Actually, once these are understood, rating evaluations of most other geometrical
arrangements should be fairly straightforward. It should be understood that the “rotor inside” vs.
“rotor outside” distinction is in fact trivial, with very few exceptions, which we will note.
1
Rotor Core
Stator Winding (Shaft)
in Slots
Stator Core
Air−Gap
Rotor
Magnets
The stator winding of this machine is “conventional”, very much like that of an induction motor,
consisting of wires located in slots in the surface of the stator core. The stator core itself is made of
laminated ferromagnetic material (probably silicon iron sheets), the character and thickness of the
sheets determined by operating frequency and efficiency requirements. They are required to carry
alternating magnetic fields, so must be laminated to reduce eddy current losses.
This sort of machine is simple in construction. Note that the operating magnetic flux density in
the air-gap is nearly the same as in the magnets, so that this sort of machine cannot have air-gap
flux densities higher than that of the remanent flux density of the magnets. If low cost ferrite
magnets are used, this means relatively low induction and consequently relatively low efficiency
and power density. (Note the qualifier “relatively” here!). Note, however, that with modern, high
performance permanent magnet materials in which remanent flux densities can be on the order of
1.2 T, air-gap working flux densities can be on the order of 1 T. With the requirement for slots to
carry the armature current, this may be a practical limit for air-gap flux density anyway.
It is also important to note that the magnets in this design are really in the “air gap” of
the machine, and therefore are exposed to all of the time- and space- harmonics of the stator
winding MMF. Because some permanent magnets have electrical conductivity (particularly the
higher performance magnets), any asynchronous fields will tend to produce eddy currents and
consequent losses in the magnets.
• Flux concentrating designs allow the flux density in the air-gap to be higher than the flux
density in the magnets themselves.
2
• In interior magnet designs there is some degree of shielding of the magnets from high order
space harmonic fields by the pole pieces.
• There are control advantages to some types of interior magnet motors, as we will show anon.
Essentially, they have relatively large negative saliency which enhances “flux weakening” for
high speed operation, in rather direct analogy to what is done in DC machines.
• Some types of internal magnet designs have (or claim) structural advantages over surface
mount magnet designs.
Rotor Pole
Pieces
Armature in
Slots
Rotor
Magnets
Non−magnetic
Rotor Core
Stator
(shaft)
Core
The geometry of one type of internal magnet motor is shown (crudely) in Figure 2. The
permanent magnets are oriented so that their magnetization is azimuthal. They are located between
wedges of magnetic material (the pole pieces) in the rotor. Flux passes through these wedges,
going radially at the air- gap, then azimuthally through the magnets. The central core of the rotor
must be non-magnetic, to prevent “shorting out” the magnets. No structure is shown at all in
this drawing, but quite obviously this sort of rotor is a structural challenge. Shown is a six-pole
machine. Typically, one does not expect flux concentrating machines to have small pole numbers,
because it is difficult to get more area inside the rotor than around the periphery. On the other
hand, a machine built in this way but without substantial flux concentration will still have saliency
and magnet shielding properties.
A second morphology for an internal magnet motor is shown in Figure 3. This geometry
has been proposed for highly salient synchronous machines without permanent magnets: such
machines would run on the saliency torque and are called synchronous reluctance motors. however,
the saliency slots may be filled with permanent magnet material, giving them some internally
generated flux as well. The rotor iron tends to short out the magnets, so that the ’bridges’ around
the ends of the permanent magnets must be relatively thin. They are normally saturated.
3
Stator Core
Stator Slots
Air Gap
Rotor
Saliency Slots
At first sight, these machines appear to be quite complicated to analyze, and that judgement
seems to hold up.
4
Rotor Core
Stator Winding (Shaft)
111111111111111111111111
000000000000000000000000
000000000000000000000000
111111111111111111111111
000000000000000000000000
111111111111111111111111
Stator Core
000000000000000000000000
111111111111111111111111
000000000000000000000000
111111111111111111111111
000000000000000000000000
111111111111111111111111
000000000000000000000000
111111111111111111111111
000000000000000000000000
111111111111111111111111
000000000000000000000000
111111111111111111111111
000000000000000000000000
111111111111111111111111
000000000000000000000000
111111111111111111111111
111111111111111111111111
000000000000000000000000
000000000000000000000000
111111111111111111111111
000000000000000000000000
111111111111111111111111
000000000000000000000000
111111111111111111111111
000000000000000000000000
111111111111111111111111
000000000000000000000000
111111111111111111111111
000000000000000000000000
111111111111111111111111
000000000000000000000000
111111111111111111111111
000000000000000000000000
111111111111111111111111
000000000000000000000000
111111111111111111111111 Rotor
000000000000000000000000
111111111111111111111111
000000000000000000000000
111111111111111111111111
Magnets
111111111111111111111111
000000000000000000000000
X Ia
�
∩∩∩∩
�� +
+
Ea Vt
��
−
-
What is shown here is the equivalent circuit of a single phase of the machine. Most motors
are three-phase, but it is not difficult to carry out most of the analysis for an arbitrary number
of phases. The circuit shows an internal voltage Ea and a reactance X which together with the
terminal current I determine the terminal voltage V . In this picture armature resistance is ignored.
If the machine is running in the sinusoidal steady state, the major quantities are of the form:
5
I �
��
��� �
�� ψ V
�� � �
�� � �
�� �
�� δ �
��� �
�� � jXI
�� �
�� �
Ea ���
Ia = I cos (ωt − ψ)
The machine is in synchronous operation if the internal and external voltages are at the same
frequency and have a constant (or slowly changing) phase relationship (δ). The relationship between
the major variables may be visualized by the phasor diagram shown in Figure 3.1. The internal
voltage is just the time derivative of the internal flux from the permanent magnets, and the voltage
drop in the machine reactance is also the time derivative of flux produced by armature current in
the air-gap and in the “leakage” inductances of the machine. By convention, the angle ψ is positive
when current I lags voltage V and the angle δ is positive then internal voltage Ea leads terminal
voltage V . So both of these angles have negative sign in the situation shown in Figure 3.1.
If there are q phases, the time average power produced by this machine is simply:
q
P = V I cos ψ
2
For most polyphase machines operating in what is called “balanced” operation (all phases doing
the same thing with uniform phase differences between phases), torque (and consequently power)
are approximately constant. Since we have ignored power dissipated in the machine armature, it
must be true that power absorbed by the internal voltage source is the same as terminal power, or:
q
P = Ea I cos (ψ − δ)
2
ω
P = T
p
where T is torque and ω/p is mechanical rotational speed, torque can be derived from the terminal
quantities by simply:
q
T = p λa I cos (ψ − δ)
2
In principal, then, to determine the torque and hence power rating of a machine it is only
necessary to determine the internal flux, the terminal current capability, and the speed capability
of the rotor. In fact it is almost that simple. Unfortunately, the model shown in Figure 5 is not
quite complete for some of the motors we will be dealing with, and we must go one more level into
machine theory.
6
3.2 A Little Two-Reaction Theory
The material in this subsection is framed in terms of three-phase (q = 3) machine theory, but
it is actually generalizable to an arbitrary number of phases. Suppose we have a machine whose
three-phase armature can be characterized by internal fluxes and inductance which may, in general,
not be constant but is a function of rotor position. Note that the simple model we presented in
the previous subsection does not conform to this picture, because it assumes a constant terminal
inductance. In that case, we have:
Ia = I cos ωt
2π
Ib = I cos(ωt − )
3
2π
Ic = I cos(ωt + )
3
θ = ωt + θ0
maps to:
Id = I cos θ0
Iq = −I sin θ0
Now, if θ = ωt + θ0 , the transformation coordinate system is chosen correctly and the “d-” axis
will correspond with the axis on which the rotor magnets are making positive flux. That happens
if, when θ = 0, phase A is linking maximum positive flux from the permanent magnets. If this is
the case, the internal fluxes are:
λaa = λf cos θ
2π
λab = λf cos(θ − )
3
2π
λac = λf cos(θ + )
3
Now, if we compute the fluxes in the d-q frame, we have:
Now: two things should be noted here. The first is that, if the coordinate system has been chosen
as described above, the flux induced by the rotor is, in the d-q frame, simply:
λf
λR = 0 (6)
0
The assertion that inductances in the d-q frame are constant is actually questionable, but it is
close enough to being true and analyses that use it have proven to be close enough to being correct
that it (the assertion) has held up to the test of time. In fact the deviations from independence
on rotor position are small. Independence of axes (that is, absence of mutual inductances in the
d-q frame) is correct because the two axes are physically orthogonal. We tend to ignore the third,
or “zero” axis in this analysis. It doesn’t couple to anything else and has neither flux nor current
anyway. Note that the direct- and quadrature- axis inductances are in principle straightforward to
compute. They are
direct axis the inductance of one of the armature phases (corrected for the fact of multiple phases)
with the rotor aligned with the axis of the phase, and
quadrature axis the inductance of one of the phases with the rotor aligned 90 electrical degrees
away from the axis of that phase.
d d
V ph = λph = T −1 λdq (8)
dt dt
and that the transformed armature voltage must be:
8
V dq = T V ph
d
= T (T −1 λdq )
dt
d d
= λdq + (T T −1 )λdq (9)
dt dt
The second term in this expresses “speed voltage”. A good deal of straightforward but tedious
manipulation yields:
0 − dθ
dt 0
d −1 dθ
T T = 0 0 (10)
dt dt
0 0 0
dλd
Vd = − ωλq (11)
dt
dλq
Vq = + ωλd (12)
dt
where
dθ
ω=
dt
Instantaneous power is given by:
P = Va Ia + Vb Ib + Vc Ic (13)
Using the transformations given above, this can be shown to be:
3 3
P = Vd Id + Vq Iq + 3V0 I0 (14)
2 2
which, in turn, is:
3 3 dλd dλq dλ0
P = ω (λd Iq − λq Id ) + ( Id + Iq ) + 3 I0 (15)
2 2 dt dt dt
Then, noting that ω = pΩ and that (15) describes electrical terminal power as the sum of shaft
power and rate of change of stored energy, we may deduce that torque is given by:
q
T = p(λd Iq − λq Id ) (16)
2
Note that we have stated a generalization to a q- phase machine even though the derivation
given here was carried out for the q = 3 case. Of course three phase machines are by far the
most common case. Machines with higher numbers of phases behave in the same way (and this
generalization is valid for all purposes to which we put it), but there are more rotor variables
analogous to “zero axis”.
Now, noting that, in general, Ld and Lq are not necessarily equal,
λd = Ld Id + λf (17)
λq = Lq Iq (18)
then torque is given by:
q
T = p (λf + (Ld − Lq ) Id ) Iq (19)
2
9
3.3 Finding Torque Capability
For high performance drives, we will generally assume that the power supply, generally an inverter,
can supply currents in the correct spatial relationship to the rotor to produce torque in some
reasonably effective fashion. We will show in this section how to determine, given a required torque
(or if the torque is limited by either voltage or current which we will discuss anon), what the
values of Id and Iq must be. Then the power supply, given some means of determining where the
rotor is (the instantaneous value of θ), will use the inverse Park’s transformation to determine the
instantaneous valued required for phase currents. This is the essence of what is known as “field
oriented control”, or putting stator currents in the correct location in space to produce the required
torque.
Our objective in this section is, given the elementary parameters of the motor, find the capability
of the motor to produce torque. There are three things to consider here:
• Armature current is limited, generally by heating,
• A second limit is the voltage capability of the supply, particularly at high speed, and
• If the machine is operating within these two limits, we should consider the optimal placement
of currents (that is, how to get the most torque per unit of current to minimize losses).
Often the discussion of current placement is carried out using, as a tool to visualize what is
going on, the Id , Iq plane. Operation in the steady state implies a single point on this plane.
A simple illustration is shown in Figure 7. The thermally limited armature current capability is
represented as a circle around the origin, since the magnitude of armature current is just the length
of a vector from the origin in this space. In general, for permanent magnet machines with buried
magnets, Ld < Lq , so the optimal operation of the machine will be with negative Id . We will show
how to determine this optimum operation anon, but it will in general follow a curve in the Id , Iq
plane as shown.
Finally, an ellipse describes the voltage limit. To start, consider what would happen if the
terminals of the machine were to be short-circuited so that V = 0. If the machine is operating at
sufficiently high speed so that armature resistance is negligible, armature current would be simply:
λf
Id = −
Ld
Iq = 0
Now, loci of constant flux turn out to be ellipses around this point on the plane. Since terminal
flux is proportional to voltage and inversely proportional to frequency, if the machine is operating
with a given terminal voltage, the ability of that voltage to command current in the Id , Iq plane is
an ellipse whose size “shrinks” as speed increases.
To simplify the mathematics involved in this estimation, we normalize reactances, fluxes, cur-
rents and torques. First, let us define the base flux to be simply λb = λf and the base current Ib to
be the armature capability. Then we define two per-unit reactances:
L d Ib
xd = (20)
λb
L q Ib
xq = (21)
λb
10
Voltage Limit
i
Loci Optimal Torque q
Speed = Base Locus Armature Current Limit
Speed > Base
id
Short Circuit
Point
te = (1 − (xq − xd ) id ) iq (22)
It is fairly straightforward (but a bit tedious) to show that the locus of current-optimal operation
(that is, the largest torque for a given current magnitude or the smallest current magnitude for a
given torque) is along the curve:
v v
u !2 u !2
u 2
ui 1 1 u 1 i2a
id = −t a + 2 − t + (23)
2 4 (xq − xd ) 2 (xq − xd ) 4 (xq − xd ) 2
v v
u !2 u !2
u 2
ui 1 1 u 1 i2a
iq = −t a − 2 + t + (24)
2 4 (xq − xd ) 2 (xq − xd ) 4 (xq − xd ) 2
The “rating point” will be the point along this curve when ia = 1, or where this curve crosses the
armature capability circle in the id , iq plane. It should be noted that this set of expressions only
works for salient machines. For non-salient machines, of course, torque-optimal current is on the
q-axis. In general, for machines with saliency, the “per-unit” torque will not be unity at the rating,
so that the rated, or “Base Speed” torque is not the “Base” torque, but:
Tr = Tb × te (25)
where te is calculated at the rating point (that is, ia = 1 and id and iq as per (23) and (24)).
11
For sufficiently low speeds, the power electronic drive can command the optimal current to
produce torque up to rated. However, for speeds higher than the “Base Speed”, this is no longer
true. Define a per-unit terminal flux:
V
ψ=
ωλb
Operation at a given flux magnitude implies:
ψ 2 = (1 + xd id )2 + (xq iq )2
which is an ellipse in the id , iq plane. The Base Speed is that speed at which this ellipse crosses the
point where the optimal current curve crosses the armature capability. Operation at the highest
attainable torque (for a given speed) generally implies d-axis currents that are higher than those
on the optimal current locus. What is happening here is the (negative) d-axis current serves to
reduce effective machine flux and hence voltage which is limiting q-axis current. Thus operation
above the base speed is often referred to as “flux weakening”.
The strategy for picking the correct trajectory for current in the id , iq plane depends on the
value of the per-unit reactance xd . For values of xd > 1, it is possible to produce some torque at any
speed. For values of xd < 1, there is a speed for which no point in the armature current capability is
within the voltage limiting ellipse, so that useful torque has gone to zero. Generally, the maximum
torque operating point is the intersection of the armature current limit and the voltage limiting
ellipse:
v !2
x2q − ψ 2 + 1
u
xd u xd
id = 2 − t + (26)
2
xq − x d x
q −
x
2d
2 x2q −
x
2d
q
iq = 1 − i2d (27)
It may be that there is no intersection between the armature capability and the voltage limiting
ellipse. If this is the case and if xd < 1, torque capability at the given speed is zero.
If, on the other hand, xd > 1, it may be that the intersection between the voltage limiting
ellipse and the armature current limit is not the maximum torque point. To find out, we calculate
the maximum torque point on the voltage limiting ellipse. This is done in the usual way by
differentiating torque with respect to id while holding the relationship between id and iq to be on
the ellipse. The algebra is a bit messy, and results in:
v
u 3x (xq − x ) − x2 2
!
x2d
u
3xd (xq − xd ) − d d d (xq − xd ) (ψ 2 − 1) + xd
id = − − t + (28)
4x2d (xq −
xd ) 4x2d (xq −
xd ) 2 (xq − xd ) x2d
1
q
iq = ψ 2 − (1 + xd id )2 (29)
xq
Ordinarily, it is probably easiest to compute (28) and (29) first, then test to see if the currents
are outside the armature capability, and if they are, use (26) and (27).
These expressions give us the capability to estimate the torque-speed curve for a machine. As
an example, the machine described by the parameters cited in Table 1 is a (nominal) 3 HP, 4-pole,
3000 RPM machine.
The rated operating point turns out to have the following attributes:
12
Table 1: Example Machine
The loci of operation in the Id , Iq plane is shown in Figure 8. The armature current limit
is shown only in the second and third quadrants, so shows up as a semicircle. The two ellipses
correspond with the rated point (the larger ellipse) and with a speed that is three times rated
(9000 RPM). The torque-optimal current locus can be seen running from the origin to the rating
point, and the higher speed operating locus follows the armature current limit. Figure 9 shows the
torque/speed and power/speed curves. Note that this sort of machine only approximates “constant
power” operation at speeds above the “base” or rating point speed.
4 Parameter Estimation
We are now at the point of estimating the major parameters of the motors. Because we have a
number of different motor geometries to consider, and because they share parameters in not too
orderly a fashion, this section will have a number of sub-parts. First, we calculate flux linkage,
then reactance.
where Br is the radial flux through the coil. And, if Br is sinusoidally distributed this will have
a peak value of
2RlBr
φp =
p
13
PM Brushless Machine Current Loci
60
40
20
Q-Axis Current (A)
-20
-40
-60
-80 -60 -40 -20 0 20
PM Brushless Machine
10
8
Torque, N-m
0
0 1000 2000 3000 4000 5000 6000 7000 8000 9000
4000
3000
Power, Watts
2000
1000
0
0 1000 2000 3000 4000 5000 6000 7000 8000 9000
Speed, RPM
14
Now, if the actual winding has Na turns, and using the pitch and breadth factors derived in
Appendix 1, the total flux linked is simply:
2RlB1 Na kw
λf = (30)
p
where
kw = kp kb
α
kp = sin
2
sin m γ2
kb =
m sin γ2
B1 = µ0 M1 kg (31)
where the space-fundamental magnetization is:
Br 4 pθm
M1 = sin
µ0 π 2
where Br is remanent flux density of the permanent magnets and θm is the magnet angle.
and where the factor that describes the geometry of the magnetic gap depends on the case. For
magnets inside and p =
� 1,
Rsp−1 p p+1 p
kg = R2 − R1p+1 + Ri2p R11−p − R21−p
Rs2p − Ri2p p+1 p−1
Rip−1 p p+1 p
kg = R2 − R1p+1 + Rs2p R11−p − R21−p
Rs2p − Ri2p p+1 p−1
15
1 1 R2
kg = 2 R22 − R12 + Rs2 log
Rs − Ri2 2 R1
Where Rs and Ri are the outer and inner magnetic boundaries, respectively, and R2 and R1
are the outer and inner boundaries of the magnets.
Note that for the case of a small gap, in which both the physical gap g and the magnet thickness
hm are both much less than rotor radius, it is straightforward to show that all of the above expres-
sions approach what one would calculate using a simple, one-dimensional model for the permanent
magnet:
hm
kg →
g + hm
This is the whole story for the winding-in-slot, narrow air-gap, surface magnet machine. For air-
gap armature windings, it is necessary to take into account the radial dependence of the magnetic
field.
This just expresses the fact that there is more azimuthal room at larger radii, so with uniform
density the number of turns as a function of radius is linearly dependent on radius. Here, Rwo and
Rwi are the outer and inner radii, respectively, of the winding.
Now it is possible to compute the flux linked due to a magnetic field distribution:
Z Rwo 2lNa kw r 2r
λf = 2 2 µ0 Hr (r)dr (32)
Rwi p Rwo − Rwi
Note the form of the magnetic field as a function of radius expressed in 80 and 81 of the second
appendix. For the “winding outside” case it is:
Hr = A r p−1 + Rs2p r −p−1
Then a winding with all its turns concentrated at the outer radius r = Rwo would link flux:
2lRwo kw 2lRwo kw
λc = µ0 Hr (Rwo ) = p−1
µ0 A Rwo + Rs2p Rwo
−p−1
p p
16
Carrying out (32), it is possible, then, to express the flux linked by a thick winding to the flux that
would have been linked by a radially concentrated winding at its outer surface by:
λf
kt =
λc
where, for the winding outside, p �= 2 case:
!
1 − x2+p ξ 2p 1 − x2−p
2
kt = + (33)
(1 − x ) (1 + ξ 2p )
2 2+p 2−p
where we have used the definitions ξ = Rwo /Rs and x = Rwi /Rwo . In the case of winding outside,
p = 2, !
1 − x4 ξ 4
2
kt = − log x (34)
(1 − x2 ) (1 + ξ 2p ) 4
In a very similar way, we can define a winding factor for a thick winding in which the reference
radius is at the inner surface. (Note: this is done because the inner surface of the inside winding
is likely to be coincident with the inner ferromagnetic surface, as the outer surface of the outer
winding ls likely to be coincident with the outer ferromagnetic surface). For p �= 2:
!
2x−p 1 − x2+p 1 − x2−p
kt = + (ηx)2p (35)
(1 − x2 ) (1 + η 2p ) 2+p 2−p
and for p = 2: !
2x−2 1 − x4
kt = − (ηx)4 log x (36)
(1 − x2 ) (1 + η 2p ) 4
where η = Ri /Rwi
So, in summary, the flux linked by an air-gap armature is given by:
2RlB1 Na kw kt
λf = (37)
p
where B1 is the flux density at the outer radius of the physical winding (for outside winding
machines) or at the inner radius of the physical winding (for inside winding machines). Note that
the additional factor kt is a bit more than one (it approaches unity for thin windings), so that,
for small pole numbers and windings that are not too thick, it is almost correct and in any case
“conservative” to take it to be one.
17
And the incremental permeance of a magnet is:
hm l
℘m = µ0
wm
The magnet sees a unit permeance consisting of its own permeance in series with one half of
each of two pole pieces (in series) :
℘ag Rθp wm
℘u = =
℘m 4g hm
Magnetic flux density in the magnet is:
℘u
Bm = B 0
1 + ℘u
And then flux density in the air gap is:
2hm 2hm wm
Bg = Bm = B 0
Rθp 4ghm + Rθp wm
The space fundamental of that can be written as:
4 pθp wm
B1 = sin B0 γm
π 2 2g
γm = w m θp R
1+ g 4 hm
2RlB1 Na kw
λf = (38)
p
18
Here, g is the magnetic gap, including the physical rotational gap and any magnet retaining means
that might be used. hm is the magnet thickness.
Since the magnet thickness is included in the air-gap, the air-gap permeance may not be very
large, so that slot leakage inductance may be important. To estimate this, assume that the slot
shape is rectangular, characterized by the following dimensions:
hs height of the main portion of the slot
ws width of the top of the main portion of the slot
hd height of the slot depression
wd slot depression opening
Of course not all slots are rectangular: in fact in most machines the slots are trapezoidal in
shape to maintain teeth cross-sections that are radially uniform. However, only a very small error
(a few percent) is incurred in calculating slot permeance if the slot is assumed to be rectangular
and the top width is used (that is the width closest to the air-gap). Then the slot permeance is,
per unit length:
1 hs hd
P = µ0 +
3 ws wd
Assume for the rest of this discussion a standard winding, with m slots in each phase belt
(this assumes, then, that the total number of slots is Ns = 2pqm), and each slot holds two half-
coils. (A half-coil is one side of a coil which, of course, is wound in two slots). If each coil
has Nc turns (meaning Na = 2pmNc ) , then the contribution to phase self-inductance of one
slot is, if both half-coils are from the same phase, 4lPNc2 . If the half-coils are from different
phases, then the contribution to self inductance is lPNc2 and the magnitude of the contribution to
mutual inductance is lPNc2 . (Some caution is required here. For three phase windings the mutual
inductance is negative, so are the senses of the currents in the two other phases, so the impact of
“mutual leakage” is to increase the reactance. This will be true for other numbers of phases as
well, even if the algebraic sign of the mutual leakage inductance is positive, in which case so will
be the sense of the other- phase current.)
We will make two other assumptions here. The standard one is that the winding “coil throw”,
or span between sides of a coil, is N 2p − Nsp . Nsp is the coil “short pitch”. The other is that each
s
phase belt will overlap with, at most two other phases: the ones on either side in sequence. This
last assumption is immediately true for three- phase windings (because there are only two other
phases. It is also likely to be true for any reasonable number of phases.
Noting that each phase occupies 2p(m − Nsp ) slots with both coil halves in the same slot and
2pNsp slots in which one coil half shares a slot with each of two different phases, we can write down
the two components of slot leakage inductance, self- and mutual:
h i
Las = 2pl (m − Nsp ) (2Nc )2 + 2Nsp Nc2
Lam = −2plNsp Nc2
For a three- phase machine, then, the total slot leakage inductance is:
La = Las − Lam = 2plPNc2 (4m − Nsp )
For a uniform, symmetric winding with an odd number of phases, it is possible to show that the
effective slot leakage inductance is:
2π
La = Las − 2Lam cos
q
19
Total synchronous inductance is the sum of air-gap and leakage components: so far this is:
Ld = Ld1 + La
20
and k = np and g = Rs − Ri is the conventionally defined “air gap”. If the aspect ratio Ri /Rs is
not too far from unity, neither is ka . In the case of p = 2, the fundamental component of ka is:
" 2 #
1 − x4 2γ 4 + x4 1 − γ 4 γ4 ξ 4 1 − x4
2gk 1 2
ka = 2 − log x + (log x) +
Rwo (1 − x2 ) 8 4 (1 − γ 4 ) ξ 4 (1 − γ 4 ) 16 (1 − γ 4 )
For a q-phase winding, a good approximation to the inductance is given by just the first space
harmonic term, or:
q 4 µ0 Na2 Rs Lkwn
2
Ld = ka
2π n 2 p2 g
Note that only the pole area is carrying useful flux, so that the space fundamental of radial flux
density is:
µ0 Na I 4 sin pθ2m
B1 =
2g π 1 + wm Rθp
hm 4g
The d- axis inductance, including mutual phase coupling, is (for a q- phase machine):
2
q 4 µ0 Na2 Rlkw pθp
Ld = 2
γm sin
2π p g 2
The quadrature axis is quite different. On that axis, the armature does not tend to push flux
through the magnets, so they have only a minor effect. What effect they do have is due to the fact
that the magnets produce a space in the active air- gap. Thus, while a full- pitch coil aligned with
the quadrature axis will produce an air- gap flux density:
µ0 N I
Br =
g
the space fundamental of that will be:
21
µ0 N I 4 pθt
B1 = 1 − sin
g π 2
where θt is the angular width taken out of the pole by the magnets.
So that the expression for quadrature axis inductance is:
q 4 µ0 Na2 Rlkw
2 pθt
Lq = 1 − sin
2π p2 g 2
where Rwi = R+hd and Rwo = Rwi +hs in a normal, stator outside winding. In this case, Ja = Js λs
and the two types of machines can be evaluated in the same way.
It would seem apparent that one would want to make λs as large as possible, to permit high
currents. The limit on this is that the magnetic teeth between the conductors must be able to carry
the air-gap flux, and making them too narrow would cause them to saturate. The peak of the time
fundamental magnetic field in the teeth is, for example,
2πR
Bt = B 1
Ns wt
where wt is the width of a stator tooth:
2π(R + hd )
wt = − ws
Ns
so that
B1
Bt ≈
1 − λs
22
5.1 Resistance
Winding resistance may be estimated as the length of the stator conductor divided by its area and
its conductivity. The length of the stator conductor is:
lc = 2lNa fe
where the “end winding factor” fe is used to take into account the extra length of the end turns
(which is usually not negligible). The area of each turn of wire is, for an air-gap winding :
2 − R2
θwe Rwo wi
Aw = λw
2 Na
where λw , the “packing factor” relates the area of conductor to the total area of the winding. The
resistance is then just:
4lNa2
Ra = 2 − R2 λ σ
θwe Rwo wi w
and, of course, σ is the conductivity of the conductor.
For windings in slots the expression is almost the same, simply substituting the total slot area:
2qlNa2
Ra =
Ns hs ws λw σ
The end turn allowance depends strongly on how the machine is made. One way of estimating
what it might be is to assume that the end turns follow a roughly circular path from one side of
the machine to the other. The radius of this circle would be, very roughly, Rw /p, where Rw is the
average radius of the winding: Rw ≈ (Rwo + Rwi )/2
Then the end-turn allowance would be:
πRw
fe = 1 +
pl
23
11111111111111111111111111111
00000000000000000000000000000
Outer Magnetic Boundary
11111111111111111111111111111
00000000000000000000000000000
11111111111111111111111111111
00000000000000000000000000000
R
11111111111111111111111111111
00000000000000000000000000000
s
11111111111111111111111111111
00000000000000000000000000000
11111111111111111111111111111
00000000000000000000000000000
Winding
11111111111111111111111111111
00000000000000000000000000000
11111111111111111111111111111
00000000000000000000000000000
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
R
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
i
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
111111111111
000000000000 R
11111111111111111111111111111
00000000000000000000000000000
1
11111111111111111111111111111
00000000000000000000000000000
R Inner Magnetic
11111111111111111111111111111
00000000000000000000000000000
2
11111111111111111111111111111
Boundary
00000000000000000000000000000
11111111111111111111111111111
00000000000000000000000000000
11111111111111111111111111111
00000000000000000000000000000
11111111111111111111111111111
00000000000000000000000000000
11111111111111111111111111111
00000000000000000000000000000
00000000000000000000000000000
11111111111111111111111111111
Figure 10: Coordinate System for Inductance Calculation
over the annular wedge occupied by the phase. The resulting distribution can be fourier analyzed,
and the n-th harmonic component of this will be (assuming the coordinate system has been chosen
appropriately):
4 θwe 4 Na Ia
Jzn = Jz0 sin n = kwn
nπ 2 π R22 − R12
where the n-th harmonic winding factor is:
sin n θwe
2
kwn =
n θwe
2
24
For this problem, all variables will be varying sinusoidally with angle, so we will assume that
angular dependence ejkθ . Thus:
1 ∂ ∂Az k2
r − 2 Az = −µ0 Jz (40)
r ∂r ∂r r
This is a three-region problem. Note the regions as:
i Ri < r < R1
w R1 < r < R2
o R2 < r < Rs
For i and o, the current density is zero and an appropriate solution to (40) is:
Az = A+ r k + A− r −k
In the region of the winding, w, a particular solution must be used in addition to the homoge-
neous solution, and
Az = A+ r k + A− r −k + Ap
where, for k �= 2,
µ0 Jz r 2
Ap = −
4 − k2
or, if k = 2,
µ0 Jz r 2 1
Ap = − log r −
4 4
And, of course, the two pertinent components of the magnetic flux density are:
1 ∂Az
Br =
r ∂θ
∂Az
Bθ = −
∂r
Next, it is necessary to match boundary conditions. There are six free variables and corre-
spondingly there must be six of these boundary conditions. They are the following:
• At the inner and outer magnetic boundaries, r = Ri and r = Rs , the azimuthal magnetic
field must vanish.
• At the inner and outer radii of the winding itself, r = R1 and r = R2 , both radial and
azimuthal magnetic field must be continuous.
25
2µ0 Jz R2
−kAw k−1
+ R2
w −k−1
+ kA− R2 + = −kAo+ R2k−1 + kAo− R2−k−1
4 − k2
k−1 µ0 Jz R1
Aw+ R1 + Aw
− R1
−k−1
− i
= A+ R1k−1 + Ai− R1−k−1
4 − k2
k−1 2µ0 Jz R1
−kAw
+ R1 + kAw −k−1
− R1 + = −kAi+ R1k−1 + kAi− R1−k−1
4 − k2
Note that we are carrying this out here only for the case of k �= 2. The k = 2 case may be obtained
by substituting its particular solution in at the beginning or by using L’Hopital’s rule on the final
solution. This set may be solved (it is a bit tedious but quite straightforward) to yield, for the
winding region:
" !
µ0 Jz
Az = + r k
Now, the inductance linked by any single, full-pitched loop of wire located with one side at
azimuthal position θ and radius r is:
λi = 2lAz (r, θ)
To extend this to the whole winding, we integrate over the area of the winding the incremental flux
linked by each element times the turns density. This is, for the n-th harmonic of flux linked:
4lkwn Na
Z R2
λn = 2 Az (r)rdr
R2 − R12 R1
Making the appropriate substitutions for current into the expression for vector potential, this
becomes:
26
8 Layout
The assumed geometry is shown in Figure 11. Assumed iron (highly permeable) boundaries are
at radii Ri and Rs . The permanent magnets, assumed to be polarized radially and alternately
(i.e. North-South ...), are located between radii R1 and R2 . We assume there are p pole pairs (2p
magnets) and that each magnet subsumes an electrical angle of θme . The electrical angle is just p
times the physical angle, so that if the magnet angle were θme = π, the magnets would be touching.
11111111111111111111111111111
00000000000000000000000000000
Outer Magnetic Boundary
11111111111111111111111111111
00000000000000000000000000000
11111111111111111111111111111
00000000000000000000000000000
R
11111111111111111111111111111
00000000000000000000000000000
s
11111111111111111111111111111
00000000000000000000000000000
11111111111111111111111111111
00000000000000000000000000000
Magnets
11111111111111111111111111111
00000000000000000000000000000
11111111111111111111111111111
00000000000000000000000000000
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
R
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
i
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111
11111111111111111111111111111
00000000000000000000000000000
000000000000
111111111111 R
11111111111111111111111111111
00000000000000000000000000000
1
11111111111111111111111111111
00000000000000000000000000000
R Inner Magnetic
11111111111111111111111111111
00000000000000000000000000000
2
Boundary
11111111111111111111111111111
00000000000000000000000000000
11111111111111111111111111111
00000000000000000000000000000
11111111111111111111111111111
00000000000000000000000000000
11111111111111111111111111111
00000000000000000000000000000
11111111111111111111111111111
00000000000000000000000000000
00000000000000000000000000000
11111111111111111111111111111
Figure 11: Axial View of Magnetic Field Problem
If the magnets are arranged so that the radially polarized magnets are located around the
azimuthal origin (θ = 0), the space fundamental of magnetization is:
M = ir M0 cos pθ (41)
H = −∇ψ (43)
The divergence of this is:
∇2 ψ = −∇ · H (44)
27
Since magnetic flux density is divergence-free,
∇·B =0 (45)
we have:
∇ · H = −∇ · M (46)
or:
1
∇2 ψ = ∇ · M = M0 cos pθ (47)
r
Now, if we let the magnetic scalar potential be the sum of particular and homogeneous parts:
ψ = ψp + ψh (48)
where ∇2 ψh = 0, then:
1
∇2 ψp = M0 cos pθ (49)
r
We can find a suitable solution to the particular part of this in the region of magnetization by
trying:
ψp = Cr γ cos pθ (50)
Carrying out the Laplacian on this:
1
∇2 ψp = Cr γ−2 γ 2 − p2 cos pθ = M0 cos pθ (51)
r
which works if γ = 1, in which case:
M0 r
ψp = cos pθ (52)
1 − p2
Of course this solution holds only for the region of the magnets: R1 < r < R2 , and is zero for the
regions outside of the magnets.
A suitable homogeneous solution satisfies Laplace’s equation, ∇2 ψh = 0, and is in general of the
form:
ψh = Ar p cos pθ + Br −p cos pθ (53)
Then we may write a trial total solution for the flux density as:
A1 r p + B1 r −p cos pθ
Ri < r < R1 ψ = (54)
M0 r
R1 < r < R2 ψ= A2 r p + B2 r −p + cos pθ (55)
1 − p2
ψ = A3 r p + B3 r −p cos pθ
R2 < r < Rs (56)
The boundary conditions at the inner and outer (assumed infinitely permeable) boundaries at
r = Ri and r = Rs require that the azimuthal field vanish, or ∂ψ
∂θ = 0, leading to:
B1 = −Ri2p A1 (57)
B3 = −Rs2p A3 (58)
28
At the magnet inner and outer radii, Hθ and Br must be continuous. These are:
1 ∂ψ
Hθ = − (59)
r∂θ
∂ψ
Br = µ0 − + Mr (60)
∂r
These become, at r = R1 :
M0
−pA1 R1p−1 − Ri2p R1−p−1 = −p A2 R1p−1 + B2 R1−p−1 − p (61)
1 − p2
M0
−pA1 R1p−1 + Ri2p R1−p−1 = −p A2 R1p−1 − B2 R1−p−1 − + M0 (62)
1 − p2
and at r = R2 :
M0
−pA3 R2p−1 − Rs2p R2−p−1 = −p A2 R2p−1 + B2 R2−p−1 − p (63)
1 − p2
M0
−pA3 R2p−1 + Rs2p R2−p−1 = −p A2 R2p−1 − B2 R2−p−1 − + M0 (64)
1 − p2
Some small-time manipulation of these yields:
M0
A1 R1p − Ri2p R1−p = A2 R1p + B2 R1−p + R1 (65)
1 − p2
M0
A1 R1p + Ri2p R1−p = A2 R1p − B2 R1−p + pR1 (66)
1 − p2
M0
A3 R2p − Rs2p R2−p = A2 R2p + B2 R2−p + R2 (67)
1 − p2
M0
A3 R2p + Rs2p R2−p = A2 R2p − B2 R2−p + pR2 (68)
1 − p2
Taking sums and differences of the first and second and then third and fourth of these we obtain:
1+p
2A1 R1p = 2A2 R1p + R1 M0 (69)
1 − p2
p−1
2A1 Ri2p R1−p = −2B2 R1−p + R1 M0 (70)
1 − p2
1+p
2A3 R2p = 2A2 R2p + R2 M0 (71)
1 − p2
p−1
2A3 Rs2p R2−p = −2B2 R2−p + R2 M0 (72)
1 − p2
and then multiplying through by appropriate factors (R2p and R1p ) and then taking sums and
differences of these,
M0 p + 1
(A1 − A3 ) R1p R2p = (R1 R2p − R2 R1p ) (73)
2 1 − p2
M p−1
0
A1 Ri2p − A3 Rs2p R1−p R2−p = R1 R2−p − R2 R1−p (74)
2 1 − p2
29
Dividing through by the appropriate groups:
R1 R2p − R2 R1p M0 1 + p
A1 − A3 = (75)
R1p R2p 2 1 − p2
R1 R2−p − R2 R1−p M0 p − 1
A1 Ri2p − A3 Rs2p = (76)
R1−p R2−p 2 1 − p2
R1 R2p − R2 R1p M0 1 + p
!
R1 R2−p − R2 R1−p M0 p − 1
A1 Rs2p − Ri2p = Rs2p − (77)
R1p R2p 2 1 − p2 R1−p R2−p 2 1 − p2
Now, noting that the scalar potential is, in region 1 (radii less than the magnet),
r < R1 (80)
M0 p p
Hr = R11−p − R21−p Rs2p + R21+p − R11+p r p−1 + Ri2p r −p−1 cos pθ
2 Rs2p − Ri2p p−1 p+1
r > R2 (81)
M0 p p
Hr = R11−p − R21−p Ri2p + R21+p − R11+p r p−1 + Rs2p r −p−1 cos pθ
2 Rs2p − Ri2p p−1 p+1
The case of p = 1 appears to be a bit troublesome here, but is easily handled by noting that:
p 1−p R2
lim R1 − R21−p = log
p→1 p − 1 R1
Now: there are a number of special cases to consider.
For the iron-free case, Ri → 0 and R2 → ∞, this becomes, simply, for r < R1 :
M0 p 1−p
Hr = R1 − R21−p r p−1 cos pθ (82)
2 p−1
30
Note that for the case of p = 1, the limit of this is
M0 R2
Hr = log cos θ
2 R1
M0 p p+1
Hr = R2 − R1p+1 r −(p+1) cos pθ
2 p+1
For the case of a machine with iron boundaries and windings in slots, we are interested in the
fields at the boundaries. In such a case, usually, either Ri = R1 or Rs = R2 . The fields are:
at the outer boundary: r = Rs :
Rsp−1 p p+1 p
Hr = M0 R2 − R1p+1 + Ri2p R1
1−p
− R21−p cos pθ
Rs2p − Ri2p p+1 p−1
Rip−1 p p+1 p
Hr = M0 R2 − R1p+1 + Rs2p R11−p − R21−p cos pθ
Rs2p − Ri2p p+1 p−1
31
1 Introduction
Induction machines are perhaps the most widely used of all electric motors. They are generally
simple to build and rugged, offer reasonable asynchronous performance: a manageable torque-speed
curve, stable operation under load, and generally satisfactory efficiency. Because they are so widely
used, they are worth understanding.
In addition to their current economic importance, induction motors and generators may find
application in some new applications with designs that are not similar to motors currently in
commerce. An example is very high speed motors for gas compressors, perhaps with squirrel cage
rotors, perhaps with solid iron (or perhaps with both).
Because it is possible that future, high performance induction machines will be required to
have characteristics different from those of existing machines, it is necessary to understand them
from first principles, and that is the objective of this document. It starts with a circuit theoretical
view of the induction machine. This analysis is strictly appropriate only for wound-rotor machines,
but leads to an understanding of more complex machines. This model will be used to explain the
basic operation of induction machines. Then we will derive a model for squirrel-cage machines.
Finally, we will show how models for solid rotor and mixed solid rotor/squirrel cage machines can
be constructed.
The view that we will take in this document is relentlessly classical. All of the elements that
we will use are calculated from first principles, and we do not resort to numerical analysis or
empirical methods unless we have no choice. While this may seem to be seriously limiting, it serves
our basic objective here, which is to achieve an understanding of how these machines work. It is
our feeling that once that understanding exists, it will be possible to employ more sophisticated
methods of analysis to get more accurate results for those elements of the machines which do not
lend themselves to simple analysis.
An elementary picture of the induction machine is shown in Figure 1. The rotor and stator are
coaxial. The stator has a polyphase winding in slots. The rotor has either a winding or a cage, also
in slots. This picture will be modified slightly when we get to talking of “solid rotor” machines,
anon. Generally, this analysis is carried out assuming three phases. As with many systems, this
generalizes to different numbers of phases with little difficulty.
1
Stator Core Stator Winding
in Slots
wave to produce torque. To start the analysis of this machine, assume that both the rotor and the
stator can be described by balanced, three – phase windings. The two sets are, of course, coupled
by mutual inductances which are dependent on rotor position. Stator fluxes are (λa , λb , λc ) and
rotor fluxes are (λA , λB , λC ). The flux vs. current relationship is given by:
λa ia
λb
L
ib
S M SR
λc ic
= (1)
λA iA
M TSR LR
λB iB
λC iC
To carry the analysis further, it is necessary to make some assumptions regarding operation.
To start, assume balanced currents in both the stator and rotor:
ia = IS cos(ωt)
2π
ib = IS cos(ωt − 3 ) (5)
2π
ic = IS cos(ωt + 3 )
iA = IR cos(ωR t + ξR )
2π
iB = IR cos(ωR t + ξR − 3 ) (6)
2π
iC = IR cos(ωR t + ξR + 3 )
θ = ωm t + θ0 (7)
Under these assumptions, we may calculate the form of stator fluxes. As it turns out, we need
only write out the expressions for λa and λA to see what is going on:
3
λa = (La − Lab )Is cos(ωt) + M IR cos((pωm + ωR )t + ξR + pθ0 ) (9)
2
2π 2π
λA = M Is (cos p(ωm t + θ0 ) cos(ωt)) + cos(p(ωm t + θ0 ) − ) cos(ωt − ) (10)
3 3
2π 2π
+ cos(p(ωm t + θ0 ) + ) cos(ωt + ) + (LA − LAB )IR cos(ωR t + ξR )
3 3
ω − pωm = ωR (12)
It is important to keep straight the different frequencies here:
ω is stator electrical frequency
ωR is rotor electrical frequency
ωm is mechanical rotation speed
3
so that pωm is electrical rotation speed.
To refer rotor quantities to the stator frame (i.e. non- rotating), and to work in complex
amplitudes, the following definitions are made:
V a = jωΛa + Ra I a (22)
4
3 4 µ0 RlNS2 kS2
LS = La − Lab = + LSl (26)
2π p2 g
3 4 µ0 RlNR2 kR
2
LR = LA − LAB = + LRl (27)
2π p2 g
Where the normal set of machine parameters holds:
R is rotor radius
l is active length
Ll is “leakage” inductance
The two leakage terms LSl and LRl contain higher order harmonic stator and rotor inductances,
slot inducances, end- winding inductances and, if necessary, a provision for rotor skew. Essentially,
they are used to represent all flux in the rotor and stator that is not mutually coupled.
In the same terms, the stator- to- rotor mutual inductance, which is taken to comprise only a
space fundamental term, is:
4 µ0 RlNS NR kS kR
M= (28)
π p2 g
Note that there are, of course, space harmonic mutual flux linkages. If they were to be included,
they would hair up the analysis substantially. We ignore them here and note that they do have an
effect on machine behavior, but that effect is second- order.
Air- gap permeance is defined as:
4 µ0 Rl
℘ag = (29)
π p2 g
so that the inductances are:
3
LS = ℘ag kS2 NS2 + LSl (30)
2
3 2
LR = ℘ag kR NR2 + LRl (31)
2
M = ℘ag NS NR kS kR (32)
Here we define “slip” s by:
ωR = sω (33)
5
so that
pωm
s=1− (34)
ω
Then the voltage balance equations become:
3 3
V a = jω ℘ag kS2 NS2 + LSl I a + jω ℘ag NS NR kS kR I AR + Ra I a (35)
2 2
3 3
2
V AR = jsω ℘ag NS NR kS kR I a + jsω ℘ag kR NR2 + LRl I AR + RA I AR (36)
2 2
At this point, we are ready to define rotor current referred to the stator. This is done by
assuming an effective turns ratio which, in turn, defines an equivalent stator current to produce
the same fundamental MMF as a given rotor current:
NR kR
I2 = I (37)
NS kS AR
Now, if we assume that the rotor of the machine is shorted so that V AR = 0 and do some
manipulation we obtain:
R2
0 = jXM I a + j(XM + X2 )I 2 + I (39)
s 2
where the following definitions have been made:
3
XM = ω℘ag NS2 kS2 (40)
2
X1 = ωLSl (41)
2
NS kS
X2 = ωLRl (42)
NR kR
2
NS kS
R2 = RA (43)
NR kR
These expressions describe a simple equivalent circuit for the induction motor shown in Figure 2.
We will amplify on this equivalent circuit anon.
a Ra
I� X1 X2 I2
∧∧∧
∨∨
∩∩∩∩ ∩∩∩∩ �
⊃ <
⊃ Xm <> R2
⊃ >
< s
⊃
7
(Note that, since both Pag and s will always have the same sign, dissipated power is positive.)
The rest of this discussion is framed in terms of motor operation, but the conversion to generator
operation is simple. The difference between power crossing the air-gap and power dissipated in the
rotor resistance must be converted from mechanical form:
Pm = Pag − Ps (50)
3.2 Example
That MATLAB script has been run for a standard motor with parameters given in Table 1.
Torque vs. speed and power vs. speed are plotted for this motor in Figure 3. These curves were
generated by the MATLAB script shown above.
8
% ------------------------------------------------------
% -------------------------------------------------------
Rating 300 kw
Voltage 440 VRMS, l-l
254 VRMS, l-n
Stator Resistance R1 .73 Ω
Rotor Resistance R2 .64 Ω
Stator Reactance X1 .06 Ω
Rotor Reactance X2 .06 Ω
Magnetizing Reactance Xm 2.5 Ω
Synchronous Speed Ns 1200 RPM
9
Induction Motor
300
250
200
N−m
150
100
50
4
x 10
3
2.5
Watts
1.5
0.5
Note that in 56 we have made use of the simple transformation between rotor and stator
coordinates:
φ′ = φ − ωm t (57)
and that
pωm = ω − ωr = ω(1 − s) (58)
Here, we have used the following symbols:
The rotor current will produce an air- gap flux density of the form:
Br = Re B r ej(ωt−pφ) (59)
where
R
B r = −jµ0 K (60)
pg r
10
Note that this describes only radial magnetic flux density produced by the space fundamental
of rotor current. Flux linked by the armature winding due to this flux density is:
Z 0
λAR = lNS kS Br (φ)Rdφ (61)
−π
p
where
2lµ0 R2 NS kS
ΛAR = Kr (63)
p2 g
Adding this to flux produced by the stator currents, we have an expression for total stator flux:
!
3 4 µ0 NS2 RlkS2 2lµ0 R2 NS kS
Λa = + L Sl I a + Kr (64)
2π p2 g p2 g
Expression 64 motivates a definiton of an equivalent rotor current I2 in terms of the space
fundamental of rotor surface current density:
π R
I2 = K (65)
3 NS kS z
Then we have the simple expression for stator flux:
3 4 µ0 NS2 kS2 Rl
Lad = (67)
2π p2 g
11
−j 2πp
I k = Ie NR (71)
and I is the complex amplitude of current in slot number zero. Expression 71 shows a uniform
progression of rotor current phase about the rotor. All rotor slots carry the same current, but that
current is phase retarded (delayed) from slot to slot because of relative rotation of the current wave
at slip frequency.
The rotor current density can then be expressed as a sum of impulses:
NX
R −1
1 j(ωr t−k 2πp ) 2πk
Kz = Re Ie NR
δ(φ′ − ) (72)
k=0
R NR
The unit impulse function δ() is our way of approximating the rotor current as a series of
impulsive currents around the rotor.
This rotor surface current may be expressed as a fourier series of traveling waves:
∞
!
j(ωr t−npφ′ )
X
Kz = Re K ne (73)
n=−∞
Note that in 73, we are allowing for negative values of the space harmonic index n to allow for
reverse- rotating waves. This is really part of an expansion in both time and space, although we are
considering only the time fundamental part. We may recover the nth space harmonic component
of 73 by employing the following formula:
1
Z 2π
K n =< Kr (φ, t)e−j(ωr t−npφ) dφ > (74)
π 0
Here the brackets <> denote time average and are here beause of the two- dimensional nature
of the expansion. To carry out 74 on 72, first expand 72 into its complex conjugate parts:
NR −1
1 X I j(ωr t−k 2πp I ∗ −j(ωr t−k 2πp 2πk
) )
Kr = e NR
+ e NR
δ(φ′ − ) (75)
2 k=0 R R NR
If 75 is used in 74, the second half of 75 results in a sum of terms which time average to zero.
The first half of the expression results in:
12
I
Z 2π NX
R −1
2πk
−j 2πpk jnpφ
Kn = e NR
e δ(φ − )dφ (76)
2πR 0 k=0
NR
The impulse function turns the integral into an evaluation of the rest of the integrand at the
impulse. What remains is the sum:
NR −1
I X j(n−1) 2πkp
Kn = e NR
(77)
2πR k=0
The sum in 77 is easily evaluated. It is:
NX
(
R −1
j
2πkp(n−1)
NR if (n − 1) NPR = integer
e NR
= (78)
k=0
0 otherwise
The integer in 78 may be positive, negative or zero. As it turns out, only the first three of these
(zero, plus and minus one) are important, because these produce the largest magnetic fields and
therefore fluxes. These are:
p
(n − 1) = −1 or n = − NRp−p
NR
=0 or n = 1
NR +p
=1 or n = p (79)
Note that 79 appears to produce space harmonic orders that may be of non- integer order. This
is not really true: is is necessary that np be an integer, and 79 will always satisfy that condition.
So, the harmonic orders of interest to us are one and
NR
n+ = +1 (80)
p
NR
n− = − −1 (81)
p
Each of the space harmonics of the squirrel- cage current will produce radial flux density. A
surface current of the form:
NR I j(ωr t−npφ′ )
Kn = Re e (82)
2πR
′
Brn = Re B rn ej(ωr t−npφ ) (83)
where
µ 0 NR I
B rn = −j (84)
2πnpg
In turn, each of the components of radial flux density will produce a component of induced
voltage. To calculate that, we must invoke Faraday’s law:
13
∂B
∇×E =− (85)
∂t
The radial component of 85, assuming that the fields do not vary with z, is:
1 ∂ ∂Br
Ez = − (86)
R ∂φ ∂t
Or, assuming an electric field component of the form:
Ezn = Re E n ej(ωr t−npφ) (87)
Using 84 and 87 in 86, we obtain an expression for electric field induced by components of air-
gap flux:
ωr R
En = B (88)
np n
µ 0 NR ω r R
E n = −j I (89)
2πg(np)2
Now, the total voltage induced in a slot pushes current through the conductors in that slot. We
may express this by:
E 1 + E n− + E n+ = Z slot I (90)
Now: in 90, there are three components of air- gap field. E1 is the space fundamental field,
produced by the space fundamental of rotor current as well as by the space fundamental of stator
current. The other two components on the left of 90 are produced only by rotor currents and
actually represent additional reactive impedance to the rotor. This is often called zigzag leakage
inductance. The parameter Zslot represents impedance of the slot itself: resistance and reactance
associated with cross- slot magnetic fields. Then 90 can be re-written as:
µ 0 NR ω r R 1 1
E 1 = Z slot I + j 2
+ I (91)
2πg (n+ p) (n− p)2
To finish this model, it is necessary to translate 91 back to the stator. See that 65 and 77 make
the link between I and I 2 :
NR
I2 = I (92)
6NS kS
Then the electric field at the surface of the rotor is:
6NS kS 3 µ0 NS kS R 1 1
E1 = Z slot + jωr 2
+ I2 (93)
NR π g (n+ p) (n− p)2
This must be translated into an equivalent stator voltage. To do so, we use 88 to translate 93
into a statement of radial magnetic field, then find the flux liked and hence stator voltage from
that. Magnetic flux density is:
14
pE 1
Br =
ω R
r
6NS kS p Rslot 3 µ0 NS kS p 1 1
= + jLslot + j + I2 (94)
NR R ωr π g (n+ p)2 (n− p)2
where the slot impedance has been expressed by its real and imaginary parts:
Which becomes:
λag = Re Λag ejωt (97)
where:
2NS kS lR
Λag = j B r (98)
p
Then “air- gap” voltage is:
2ωNS kS lR
V ag = jωΛag = − Br
p
" #
12lNS2 kS2 R2 6 µ0 RlNS2 kS2 1 1
= −I 2 jωLslot + + jω + (99)
NR s π g (n+ p)2 (n− p)2
Expression 99 describes the relationship between the space fundamental air- gap voltage V ag
and rotor current I 2 . This expression fits the equivalent circuit of Figure 4 if the definitions made
below hold:
X2 I2
∩∩∩∩ �
<>
<> R2
< s
15
The first term in 100 expresses slot leakage inductance for the rotor. Similarly, 101 expresses
rotor resistance in terms of slot resistance. Note that Lslot and Rslot are both expressed per unit
length. The second term in 100 expresses the “zigzag” leakage inductance resulting from harmonics
on the order of rotor slot pitch.
Next, see that armature flux is just equal to air- gap flux plus armature leakage inductance.
That is, 66 could be written as:
3 4 µ0 NS2 k52 Rl
Xag5 = ω (103)
2π 52 p2 g
3 4 µ0 NS2 k72 Rl
Xag7 = ω (104)
2π 72 p2 g
The belt harmonics link to the rotor, however, and actually appear to be in parallel with
components of rotor impedance appropriate to 5p and 7p pole- pair machines. At these harmonic
orders we can usually ignore rotor resistance so that rotor impedance is purely inductive. Those
components are:
16
4.3.2 Zigzag Leakage
Stator zigzag leakage is from those harmonics of the orders pns = Nslots ± p where Nslots .
Here, the skew in the rotor is ς electrical radians from one end of the machine to the other.
Evaluation of this yields:
2B1 Rl sin 2ς
λ= ς
p 2
Now, the difference between what would have been linked by a non-skewed rotor and what is
linked by the skewed rotor is the skew leakage flux, now expressible as:
!2
sin 2ς
Xk = Xag 1 − ς
2
17
1
wm = Lℓ I 2
2
If there are N conductors in the slot, each carrying current I, the current density in the slot is:
NI
J=
hw
Using Ampere’s Law around a loop (shown dotted in the figure), magnetic field in the x direction
at height y from the bottom of the slot is:
NI y
Hx = −
w h
In the slot depression that field is:
NI
Hxd = −
u
Magnetic energy stored in the slot and slot depression are then conveniently calculated as:
1
Z h 1 1 2 1h d
wm = Lℓ I 2 = wℓ µ0 Hx2 dy + udℓ Hxd = ℓµ0 + N 2I 2
2 0 2 2 3w u
Noting the slot permeance as:
1h d
+P = ℓµ0
3w u
We have the total inductance of the slot to be:
Lℓ = PN 2
For the purpose of this estimate we will assume an ordinary winding consisting of coils of Nc
turns each. For such a winding if there are m slots per pole per phase and p pole pairs and if the
winding is short-pitched by Nsp slots, there will be 2p(m − Nsp ) slots per phase with two coils from
the same phase and 2pNsp slots per phase sharing another phase. (We assume here a three phase
machine). Then the ’self’ slot leakage inductance must be:
Lsℓ = P 4Nc2 2p(m − Nsp ) + Nc2 2pNsp
Since there are a total of pNsp ’mutual’ slots between each pair of phases, and the sense of the
windings is opposite, the mutual component of slot leakage is:
18
u
d
1111
0000
000
10111
0000
1111
10111
1011111
00000
10000
000
10111 1011111
00000
10
000
111
0
1
10 J 1010
1010 1010
10 1010
10 y
1010 1010
w 000000
111111
10111111
000000 10 x
14 q µ0 RNa2
Xe = (p − 0.3)
4π 2 2 p2
As with all such formulae, extreme care is required here, since we can give little guidance as to
when this expression is correct or even close. And we will admit that a more complete treatment
of this element of machine parameter construction would be an improvement.
ℓw = Na 2 (ℓ + ℓe )
Estimating ℓe , the length of one end winding, requires knowing how the winding is laid out and
is beyond our scope here. (But once you see it you will know that length.)
The area of the winding may be estimated by knowing wire diameter and how many strands
are in parallel:
π 2
Aw = d Ninh
4 w
The area of the winding is related to slot area by a winding factor:
2Nc Aw
λa =
Aslot
19
Winding resistance, per phase, is simply
ℓw
Ra =
σAw
where σ is wire conductivity. Note that conductivity of the materials used in induction machines
is a function of temperature and so will be winding resistance (and rotor resistance for that matter).
The Fitzgerald, Kingsley and Umans textbook[2] gives the following correction for resistance of
copper:
T0 + T
RT = R t
T0 + t
where RT and Rt are resistances at temperatures T and t. T0 = 234.5 for copper with basic
conductivity of IACS (5.8 × 107 S/m)[3]. For aluminum with conductivity of 63% of IACS, T0 ≈
212.9 Temperatures are given in Celcius.
12lNs2 k72
R2,7 = Rslot,7 (111)
NR
The higher-order slot harmonics will have relative frequencies (slips) that are:
( )
n = 6k + 1
sn = 1 ∓ (1 − s)n k an integer (112)
n = 6k − 1
The induction motor electromagnetic interaction can now be described by an augmented mag-
netic circuit as shown in Figure 20. Note that the terminal flux of the machine is the sum of all
of the harmonic fluxes, and each space harmonic is excited by the same current so the individual
harmonic components are in series.
Each of the space harmonics will have an electromagnetic interaction similar to the fundamental:
power transferred across the air-gap is:
2 R2,n
Pem,n = 3I2,n
sn
Of course dissipation in each circuit is:
20
leaving
2 R2,n
Pm,n = 3I2,n (1 − sn )
sn
Note that this equivalent circuit has provision for two sets of circuits which look like “cages”.
In fact one of these sets is for the solid rotor body if that exists. We will discuss that anon. There
is also a provision (rc ) for loss in the stator core iron.
Power deposited in the rotor harmonic resistance elements is characterized as “stray load” loss
because it is not easily computed from the simple machine equivalent circuit.
1
Rslot = (113)
ws hs σ
hs
Lslot = µ0 (114)
3ws
The slot resistance is obvious, the slot inductance may be estimated by recognizing that if the
current in the slot is uniform, magnetic field crossing the slot must be:
I x
Hy =
ws hs
Then energy stored in the field in the slot is simply:
hs 2
1 µ0 Ix 1 µ0 hs 2
Z
L I 2 = ws dx = I
2 slot 0 2 ws hs 6 ws
21
wd
hd x
hs
ws
sinh(1 + j) xδ
Hy = Htop
sinh(1 + j) hδs
Of course the impedance (purely reactive) due to the slot depression must be added to this. It
is possible to extract the real and imaginary parts of this impedance (the process is algebraically a
bit messy) to yield:
22
4.8 Arbitrary Slot Shape Model
It is possible to obtain a better model of the behavior of rotor conductor slots by using simple
numerical methods. In many cases rotor slots are shaped with the following objectives in mind:
1. A substantial part of the periphery of the rotor should be devoted to active conductor, for
good running performance.
2. The magnetic iron of the rotor must occupy a certain fraction of the periphery, to avoid
saturation.
3. For good starting performance, some means of forcing current to flow only in the top part of
the rotor bar should be devised.
Generally the rotor teeth, which make up part of the machine’s magnetic circuit, are of roughly
constant width to avoid flux concentration. The rotor conductor bars are therefore tapered, with
their narrow ends towards the center of the rotor. To provide for current concentration on starting
they often have a ’starting bar’ at the outer periphery of the rotor with a much narrower region
which has high inductance just below. The bulk of the rotor bar occupies the tapered region allowed
between the teeth.
This geometry is quite a bit more complicated than that described in the previous section. Note
that, if we can describe the slot impedance per unit length as a function of frequency: Zs (ω) =
Rs (ω) + jXs (ω), we can carry out the analysis of the machine as described previously. Thus our
analysis is directed toward frequency response modeling of the rotor slot. Focusing then on a single
slot, use the notation as described in Figure 7.
Ez [n] w[n]
x = nΔ x
Ez [n−1]
Δ x
y z
The impedance per unit length is the ratio between slot current and axial electric field:
Ez
Zs =
I
For the purpose of this analysis we will use the symbol x as the radial distance from the bottom
of the slot. Assume the slot can be divided radially into a number of regions or ’slices’, each with
23
radial height Δx. We further assume that currents are axially (z) directed and that magnetic field
crosses the slot in the y direction. Under these assumptions the electric field at the top of one of
the slices is related to the electric field at the bottom of the slice by magnetic field crossing through
the slice. Using the trapezoidal rule for integration:
Δx
E z (x) − E z (x − Δx) = jωµ0 H y (x) + H y (x − Δx)
2
The magnetic field is simply:
1
Z x 1 X
H y (x) = σw(x)Ez (x)dx = nI
w(x) 0 wn i=1 n
where In is the total current flowing in one slice. Note that this can be reformulated into a ladder
network by again using the trapezoidal rule for integration: current flowing in slice number n would
be:
Δx
In = σ (wn E n + wn−1 E n−1 )
2
Now the slot may be described as is shown in the ladder network of Figure 8. The incremental
reactance of one slice is:
2
Xn = ωµ0 Δx
(wn + wn−1 )
and the resistance of a slice is:
1 2
Rn =
σΔx wn + wn−1
R[n] R[n−1]
The procedure is to start at the bottom of the slot, corresponding to the right-hand end of the
ladder (the inductance at the bottom of the slot is infinite so the first slice has only the resistance),
and building toward the top of the slot.
24
This effect can be carried even further by making use of multiple cages, such as is shown in
Figure 9. Here there are two conductors in a fairly complex slot. Estimating the impedance of this
slot is done in stages to build up an equivalent circuit.
wd
hd
h2
hs
w2
ws
h1
w1
Assume for the purposes of this derivation that each section of the multiple cage is small enough
that currents can be considered to be uniform in each conductor. Then the bottom section may be
represented as a resistance in series with an inductance:
1
Ra =
σw1 h1
µ0 h1
La =
3 w1
The narrow slot section with no conductor between the top and bottom conductors will contribute
an inductive impedance:
hs
Ls = µ 0
ws
The top conductor will have a resistance:
1
Rb =
σw2 h2
Now, in the equivalent circuit, current flowing in the lower conductor will produce a magnetic field
across this section, yielding a series inductance of
h2
Lb = µ 0
w2
By analogy with the bottom conductor, current in the top conductor flows through only one third of
the inductance of the top section, leading to the equivalent circuit of Figure 10, once the inductance
of the slot depression is added on:
hd
Lt = µ 0
wd
Now, this rotor bar circuit fits right into the framework of the induction motor equivalent circuit,
shown for the double cage case in Figure 11, with
12lNS2 kS2
R2a = Ra
NR
25
1 2
Lt 3 Lb 3 Lb Ls La
∩∩∩∩ ∩∩∩∩ ∩∩∩∩ ∩∩∩∩ ∩∩∩∩
< <
<> Rb <> Ra
<> <>
12lNS2 kS2
R2b = Rb
NR
12lNS2 kS2 2
X2a = ω ( Lb + Ls + La )
NR 3
12lNS2 kS2 1
X2a = ω (Lt + Lb )
NR 3
a Ra
I� X1 X2b I2 X2a
∧∧∧
∨∨
∩∩∩∩ ∩∩∩∩ � ∩∩∩∩
⊃ < <
⊃ Xm <> R2b <> R2a
⊃ > s > s
⊃ < <
26
where wr is the average width of a conductor bar and lr is the axial end ring length.
Now, the ratio of losses (and hence the ratio of resistances) is found by multiplying the square
of current density ratio by the ratio of volumes. This is approximately:
2
Rend NR wr 2πR lr NR Rwr
= 2 = (118)
Rslot 2πp lr NR l wr πllr p2
4.11 Windage
Bearing friction, windage loss and fan input power are often regarded as elements of a “black art”.
We approach them with some level of trepidation, for motor manufacturers seem to take a highly
empirical view of these elements. What follows is an attempt to build reasonable but simple models
for two effects: loss in the air gap due to windage and input power to the fan for cooling. Some
caution is required here, for these elements of calculation have not been properly tested, although
they seem to give reasonable numbers
The first element is gap windage loss. This is produced by shearing of the air in the relative
rotation gap. It is likely to be a signifigant element only in machines with very narrow air gaps or
very high surface speeds. But these include, of course, the high performance machines with which
we are most interested. We approach this with a simple “couette flow” model. Air-gap shear loss
is approximately:
Pw = 2πR4 Ω3 lρa f (119)
where ρa is the density of the air-gap medium (possibly air) and f is the friction factor, estimated
by:
.0076
f= 1 (120)
Rn4
and the Reynold’s NumberRn is
ΩRg
Rn = (121)
νair
and νair is the kinematic viscosity of the air-gap medium.
The second element is fan input power. We base an estimate of this on two hypotheses. The
first of these is that the mass flow of air circulated by the fan can be calculated by the loss in the
motor and an average temperature rise in the cooling air. The second hypothesis is the the pressure
rise of the fan is established by the centrifugal pressure rise associated with the surface speed at
the outside of the rotor. Taking these one at a time: If there is to be a temperature rise ΔT in the
cooling air, then the mass flow volume is:
Pd
ṁ =
Cp ΔT
27
then power is given by:
Pfan = ΔP v̇
For reference, the properties of air are:
pVa
Br = (124)
2RlNa k1 ωs
Then flux density in the stator teeth is:
wt + w1
Bt = B r (125)
wt
where wt is tooth width and w1 is slot top width. Flux in the back-iron of the core is
R
Bc = B r (126)
pdc
where dc is the radial depth of the core.
28
One way of handling this loss is to assume that the core handles flux corresponding to terminal
voltage, add up the losses and then compute an equivalent resistance and reactance:
3|Va |2
rc =
Pcore
3|Va |2
xc =
Qcore
then put this equivalent resistance in parallel with the air-gap reactance element in the equivalent
circuit.
φ′ = φ − ωm t (131)
29
and
ωr R
Ez = Br (135)
p
Of course electric field in the rotor frame is related to rotor surface current by:
Ez = Z sK z (136)
Now these quantities can be related to the stator by noting that air-gap voltage is related to
radial flux density by:
p
Br = V (137)
2lNa k1 Rω ag
The stator-equivalent rotor current is:
π R
I2 = K (138)
3 Na ka z
Then we can find stator referred, rotor equivalent impedance to be:
V ag 3 4 l 2 2 ω Ez
Z2 = = N k (139)
I2 2 π R a a ωr K z
Now, if rotor surface impedance can be expressed as:
Z s = Rs + jωr Ls (140)
then
R2
Z2 = + jX2 (141)
s
where
34 l 2 2
R2 = N k Rs (142)
2πR a 1
34 l 2 2
X2 = N k Xs (143)
2πR a 1
Now, to find the rotor surface impedance, we make use of a nonlinear eddy-current model proposed
by Agarwal. First we define an equivalent penetration depth (similar to a skin depth):
s
2Hm
δ= (144)
ωr σB0
where σ is rotor surface material volume conductivity, B0 , ”saturation flux density” is taken to be
75 % of actual saturation flux density and
3 Na ka
Hm = |K z | = |I 2 | (145)
π R
Then rotor surface resistivity and surface reactance are:
16 1
Rs = (146)
3π σδ
Xs = .5Rs (147)
30
Note that the rotor elements X2 and R2 depend on rotor current I2 , so the problem is nonlinear.
We find, however, that a simple iterative solution can be used. First we make a guess for R2 and find
currents. Then we use those currents to calculate R2 and solve again for current. This procedure
is repeated until convergence, and the problem seems to converge within just a few steps.
Aside from the necessity to iterate to find rotor elements, standard network techniques can be
used to find currents, power input to the motor and power output from the motor, torque, etc.
5.1 Solution
Not all of the equivalent circuit elements are known as we start the solution. To start, we assume
a value for R2 , possibly some fraction of Xm , but the value chosen doesn not seem to matter
much. The rotor reactance X2 is just a fraction of R2 . Then, we proceed to compute an “air-gap”
impedance, just the impedance looking into the parallel combination of magnetizing and rotor
branches:
R2
Zg = jXm ||(jX2 + ) (148)
s
(Note that, for a generator, slip s is negative).
A total impedance is then
Zt = jX1 + R1 + Zg (149)
and terminal current is
Vt
It = (150)
Zt
Rotor current is just:
jXm
I2 = It (151)
jX2 + Rs2
Now it is necessary to iteratively correct rotor impedance. This is done by estimating flux
density at the surface of the rotor using (145), then getting a rotor surface impedance using (146)
and using that and (143 to estimate a new value for R2 . Then we start again with (148). The
process “drops through” this point when the new and old estimates for R2 agree to some criterion.
kp2
Xzp = Xm (152)
Np2 k12
31
kn2
Xzn = Xm (153)
Nn2 k12
where Np and Nn are the positive and negative going harmonic orders: For ‘belt’ harmonics
these orders are 7 and 5. For ‘zigzag’ they are:
Ns + p
Np = (154)
p
Ns − p
Nn = (155)
p
Now, there will be a current on the surface of the rotor at each harmonic order, and following 65,
the equivalent rotor element current is:
π R
I 2p = K (156)
3 Na kp p
π R
I 2n = K (157)
3 Na kn n
These currents flow in response to the magnetic field in the air-gap which in turn produces an
axial electric field. Viewed from the rotor this electric field is:
E p = sp ωRB p (158)
E n = sn ωRB n (159)
where the slip for each of the harmonic orders is:
sp = 1 − Np (1 − s) (160)
sn = 1 + Np (1 − s) (161)
and then the surface currents that flow in the surface of the rotor are:
Ep
Kp = (162)
Zsp
En
Kn = (163)
Zsn
where Zsp and Zsn are the surface impedances at positive and negative harmonic slip frequencies,
respectively. Assuming a linear surface, these are, approximately:
1+j
Zs = (164)
σδ
where σ is material restivity and the skin depth is
s
2
δ= (165)
ωs µσ
32
and ωs is the frequency of the given harmonic from the rotor surface. We can postulate that the
appropriate value of µ to use is the same as that estimated in the nonlinear calculation of the space
fundamental, but this requires empirical confirmation.
The voltage induced in the stator by each of these space harmonic magnetic fluxes is:
2Na kp lRω
Vp = Bp (166)
Np p
2Na kn lRω
Vn = Bn (167)
Nn p
Then the equivalent circuit impedance of the rotor is just:
3 4 Na2 kp2 l 1
R2p = (170)
2 π Np R σδp
3 4 Na2 kn2 l 1
R2n = (171)
2 π Nn R σδn
1
X2p = R2p (172)
2
1
X2n = R2n (173)
2
1. No-load losses in rotor teeth because of stator slot opening modulation of fundamental flux
density,
2. Load losses in the rotor teeth because of stator zigzag mmf, and
3. No-load losses in the solid rotor body (if it exists) due to stator slot opening modulation of
fundamental flux density.
33
Note that these losses have a somewhat different character from the other miscellaneous losses we
compute. They show up as drag on the rotor, so we subtract their power from the mechanical
output of the machine. The first and third of these are, of course, very closely related so we take
them first.
The stator slot openings ‘modulate’ the space fundamental magnetic flux density. We may
estimate a slot opening angle (relative to the slot pitch):
2πwd Ns wd Ns
θD = =
2πr r
Then the amplitude of the magnetic field disturbance is:
2 θD
BH = Br1 sin
π 2
In fact, this flux disturbance is really in the form of two traveling waves, one going forward and one
backward with respect to the stator at a velocity of ω/Ns . Since operating slip is relatively small,
the two variations will have just about the same frequency as viewed from the rotor, so it seems
reasonable to lump them together. The frequency is:
Ns
ωH = ω
p
Now, for laminated rotors this magnetic field modulation will affect the tips of rotor teeth. We
assume (perhaps arbitrarily) that the loss due to this magnetic field modulation can be estimated
from ordinary steel data (as we estimated core loss above) and that only the rotor teeth, not any of
the rotor body, are affected. The method to be used is straightforward and follows almost exactly
what was done for core loss, with modification only of the frequency and field amplitude.
For solid steel rotors the story is only a little different. The magnetic field will produce an axial
electric field:
ω
E z = R BH
p
and that, in turn, will drive a surface current
Ez
Kz =
Zs
√
Now, what is important is the magnitude of the surface current, and since |Z s | = 1 + .52 Rs ≈
1.118Rs , we can simply use rotor resistance. The nonlinear surface penetration depth is:
s
2B0
δ=
ωH σ |K z |
A brief iterative substitution, re-calculating δ and then |K z | quickly yields consistent values for δ
and Rs . Then the full-voltage dissipation is:
|K z |2
Prs = 2πRl
σδ
and an equivalent resistance is:
3|Va |2
Rrs =
Prs
34
Finally, the zigzag order current harmonics in the stator will produce magnetic fields in the
air gap which will drive magnetic losses in the teeth of the rotor. Note that this is a bit different
from the modulation of the space fundamental produced by the stator slot openings (although the
harmonic order will be the same, the spatial orientation will be different and will vary with load
current). The magnetic flux in the air-gap is most easily related to the equivalent circuit voltage
on the nth harmonic:
npvn
Bn =
2lRNa kn ω
This magnetic field variation will be substantial only for the zigzag order harmonics: the belt
harmonics will be essentially shorted out by the rotor cage and those losses calculated within
the equivalent circuit. The frequency seen by the rotor is that of the space harmonics, already
calculated, and the loss can be estimated in the same way as core loss, although as we have pointed
out it appears as a ‘drag’ on the rotor.
35
a Ra
I� X1 X2 I2
∧∧∧
∨∨
∩∩∩∩ ∩∩∩∩ �
⊃ <
⊃ Xm <> R2
⊃ >
< s
⊃
p R2
Te = 3 |I2 |2
ω s
where ω is the electrical frequency and p is the number of pole pairs. It is straightforward to find
I2 using network techniques. As an example, Figure 13 shows a series of torque/speed curves for
an induction machine operated with a wide range of input frequencies, both below and above its
“base” frequency. The parameters of this machine are:
Number of Phases 3
Number of Pole Pairs 3
RMS Terminal Voltage (line-line) 230
Frequency (Hz) 60
Stator Resistance R1 .06 Ω
Rotor Resistance R2 .055 Ω
Stator Leakage X1 .34 Ω
Rotor Leakage X2 .33 Ω
Magnetizing Reactance Xm 10.6 Ω
Strategy for operating the machine is to make terminal voltage magnitude proportional to frequency
for input frequencies less than the “Base Frequency”, in this case 60 Hz, and to hold voltage constant
for frequencies above the “Base Frequency”.
For high frequencies the torque production falls fairly rapidly with frequency (as it turns out,
it is roughly proportional to the inverse of the square of frequency). It also falls with very low
frequency because of the effects of terminal resistance. We will look at this next.
36
Induction Motor Torque
250
200
150
N−m
100
50
0
0 50 100 150 200 250
Speed, RPM
a X1
I� X2 I2
∩∩∩∩ �
∩∩∩∩
��
+ ⊃ <
⊃ Xm <> R2
− V ⊃
�� ⊃ <> s
If the machine is operated at variable frequency ω, but the reactance is established at frequency
ωB , current is:
V′
I= R2
j(X1′ + X2 ) ωωB + s
37
I� X1′ X2 I2
∩∩∩∩ �
a
∩∩∩∩
��
+ <>
V′ <> R2
�� < s
−
3p |V0′ |2 R2
sB
Te =
ωB (X1′ + X2 )2 + ( R2 2
s ) B
This would imply that torque is, if voltage is proportional to frequency, meaning constant applied
flux, dependent only on absolute slip. The torque-speed curve is a constant, dependent only on the
difference between synchronous and actual rotor speed.
This is fine, but eventually, the notion of “volts per Hz” runs out because at some number of
Hz, there are no more volts to be had. This is generally taken to be the “base” speed for the drive.
Above that speed, voltage is held constant, and torque is given by:
3p |V ′ |2 R2
sB
Te =
ωB (X1′ + X2 )2 + ( R2 2
sB )
The peak of this torque has a square-inverse dependence on frequency, as can be seen from Figure 16.
R2 3 ωp |V ′ |2 Rs2
Te = 3|I2 |2 = R2 2
s ((X1′ + X2 )( ωωB ))2 + (R1′ + s )
Now, we are interested in finding the peak value of that, which is given by the value of R2 s which
maximizes power transfer to the virtual resistance. This is given by the matching condition:
R2 ω 2
r
= R1′2 + ((X1′ + X2 )( ))
s ωB
38
Induction Motor Torque
250
200
N−m
150
100
50
0
0 500 1000 1500 2000
Speed, RPM
39
Breakdown Torque
300
250
200
Newton−Meters
150
100
50
0
0 20 40 60 80 100 120
Drive Frequency, Hz
Note the machine is symmetric (there is no saliency), and since we are referred to the stator,
the stator and rotor self-inductances include leakage terms:
LS = M + LSℓ
LR = M + LRℓ
dλdS
vdS = − ωλqS + rS idS
dt
dλqS
vqS = + ωλdS + rS iqS
dt
dλdR
0 = − ωs λqR + rR idR
dt
dλqR
0 = + ωs λdR + rR iqR
dt
40
Breakdown Torque
300
250
Newton−Meters
200
150
100
50
0 20 40 60 80 100 120
Drive Frequency, Hz
Note that both rotor and stator have “speed” voltage terms since they are both rotating with
respect to the rotating coordinate system. The speed of the rotating coordinate system is w with
respect to the stator. With respect to the rotor that speed is , where wm is the rotor mechanical
speed. Note that this analysis does not require that the reference frame coordinate system speed
w be constant.
Torque is given by:
LR M
idS = λdS − λdR
LS LR − M 2 LS LR − M 2
M LS
idR = λdS − λdR
LS LR − M 2 LS LR − M 2
Now, if we make the following definitions (the motivation for this should by now be obvious):
Xd = ω0 LS
Xkd = ω0 LR
Xad = ω0 M
!
M2
Xd′ = ω0 LS −
LR
41
the currents become:
ω0 Xad ω0
idS = λdS − λdR
Xd
′ Xkd Xd′
Xad ω0 Xd ω0
idR = λdS − ′ λdR
Xkd Xd′ Xd Xkd
The q- axis is the same.
Torque may be, with these calculations for current, written as:
3 3 ω0 Xad
Te = p (λdS iqS − λqS idS ) = − p (λdS λqR − λqS λdR )
2 2 Xkd Xd′
Note that the usual problems with ordinary variables hold here: the foregoing expression was
written assuming the variables are expressed as peak quantities. If RMS is used we must replace
3/2 by 3!
With these, the simulation model is quite straightforward. The state equations are:
dλdS
= VdS + ωλqS − RS idS
dt
dλqS
= VqS − ωλdS − RS iqS
dt
dλdR
= ωs λqR − RR idR
dt
dλqR
= −ωs λdR − RS iqR
dt
dΩm 1
= (Te + Tm )
dt J
where the rotor frequency (slip frequency) is:
ωs = ω − pΩm
For simple simulations and constant excitaion frequency, the choice of coordinate systems is
arbitrary, so we can choose something convenient. For example, we might choose to fix the coordi-
nate system to a synchronously rotating frame, so that stator frequency ω = ω0 . In this case, we
could pick the stator voltage to lie on one axis or another. A common choice is Vd = 0 and Vq = V .
42
This means that we can re-write torque as:
3
T e = p (λagd iqS − λagq idS )
2
Next, note that the rotor flux is, similarly, related to air-gap flux:
λdR = Λ0
λqR = 0
43
Now, suppose we can succeed in putting flux on the right axis, so that λqR = 0, then the two
rotor voltage equations are:
dλdR
0 = − ωs λqR + rR IdR
dt
dλqR
0 = + ωs λdR + rR IqR
dt
Now, since the rotor currents are:
λdR M
idR = − idS
LR LR
λqR M
iqR = − iqS
LR LR
The voltage expressions become, accounting for the fact that there is no rotor quadrature axis
flux:
dλdR λdR M
0 = + rR − idS
dt LR LR
M
0 = ωs λdR − rR iqS
LR
Noting that the rotor time constant is
LR
TR =
rR
we find:
dλdR
TR + λdR = M idS
dt
M iqS
ωs =
TR λdR
The first of these two expressions describes the behavior of the direct-axis flux: as one would
think, it has a simple first-order relationship with direct-axis stator current. The second expression,
which describes slip as a function of quadrature axis current and direct axis flux, actually describes
how fast to turn the rotating coordinate system to hold flux on the direct axis.
Now, a real machine application involves phase currents ia , ib and ic , and these must be derived
from the model currents idS and iqs . This is done with, of course, a mathematical operation which
uses a transformation angle θ. And that angle is derived from the rotor mechanical speed and
computed slip: Z
θ= (pωm + ωs ) dt
A generally good strategy to make this sort of system work is to measure the three phase currents
and derive the direct- and quadrature-axis currents from them. A good estimate of direct-axis flux is
made by running direct-axis flux through a first-order filter. The tricky operation involves dividing
quadrature axis current by direct axis flux to get slip, but this is now easily done numerically (as
44
λ dR
D M
ω S
1 + STa
÷D
N÷ T
M
N TR
i a* ia
i * o
d
-1 ib* ib Motor
T Amp o
Load
i q* ic
* o
i
ω
c
m
θ
∫ Σ
are the trigonometric operations required for the rotating coordinate system transformation). An
elmentary block diagram of a (possbly) plausible scheme for this is shown in Figure 19.
In this picture we start with commanded values of direct- and quadrature- axis currents, corre-
sponding to flux and torque, respectively. These are translated by a rotating coordinate transfor-
mation into commanded phase currents. That transformation (simply the inverse Park’s transform)
uses the angle q derived as part of the scheme. In some (cheap) implementations of this scheme
the commanded currents are used rather than the measured currents to establish the flux and slip.
We have shown the commanded currents i∗a , etc. as inputs to an “Amplifier”. This might be
implemented as a PWM current-source, for example, and a tight loop here results in a rather high
performance servo system.
References
[1] P.L. Alger, “Induction Machines”, Gordon and Breach, 1969
[2] A.E. Fitzgerals, C. Kingsley Jr., S.D. Umans, ”Electric Machinery”, Sixth Edition, McGraw
Hill, 2003
45
[3] D. Fink, H. W. Beaty, ”Standard Handbook for Electrical Engineers, Thirteenth Edition,
McGraw-Hill, 1993
46
r1 x1
∧∧∧
∨∨
∩∩∩∩
⊃ ⊃
⊃ jx2s ⊃ jx2c
⊃ ⊃
⊃ <> ⊃ ⊃ ⊃
⊃ jxag < rc ⊃ xc
⊃ <>
⊃
⊃ ⊃ <> r <> r
< s2s < s2c
<> <>
⊃ ⊃
⊃ jx2s5 ⊃ jx2c5
⊃ ⊃
⊃ ⊃ ⊃
⊃ jxa5
⊃
⊃ < r < r
<> s
2s5
<> s
2c5
<> 5 <> 5
⊃ ⊃
⊃ jx2s7 ⊃ jx2c7
⊃ ⊃
⊃ ⊃ ⊃
⊃ jxa7
⊃
⊃ <> r <> r
< s2s7 < s2c7
<> 7 <> 7
⊃ ⊃
⊃ jx2sm ⊃ jx2cm
⊃ ⊃
⊃ ⊃ ⊃
⊃ jxam
⊃
⊃ <> r <> r
<> s2sm <> s2cm
m m
< <
⊃ ⊃
⊃ jx2sp ⊃ jx2cp
⊃ ⊃
⊃ ⊃ ⊃
⊃ jxap
⊃
⊃ <> r2sp <> r2cp
<> s <> s
p p
< <
47
Massachusetts Institute of Technology
Department of Electrical Engineering and Computer Science
6.685 Electric Machinery
1 Introduction
In this document we develop models useful for calculating the dynamic behavior of synchronous
machines. We start with a commonly accepted picture of the synchronous machine, assuming that
the rotor can be fairly represented by three equivalent windings: one being the field and the other
two, the d- and q- axis “damper” windings, representing the effects of rotor body, wedge chain,
amortisseur and other current carrying paths.
While a synchronous machine is assumed here, the results are fairly directly applicable to
induction machines. Also, extension to situations in which the rotor representation must have
more than one extra equivalent winding per axis should be straightforward.
3 Park’s Equations
The first step in the development of a suitable model is to transform the armature winding variables
to a coordinate system in which the rotor is stationary. We identify equivalent armature windings
in the direct and quadrature axes. The direct axis armature winding is the equivalent of one of
the phase windings, but aligned directly with the field. The quadrature winding is situated so
that its axis leads the field winding by 90 electrical degrees. The transformation used to map
the armature currents, fluxes and so forth onto the direct and quadrature axes is the celebrated
Park’s Transformation, named after Robert H. Park, an early investigator into transient behavior
in synchronous machines. The mapping takes the form:
ud ua
u = u = T u = T ub (13)
q dq ph
u0 uc
Where the transformation and its inverse are:
cos(θ − 23π ) cos(θ + 23π )
cos θ
2
T = − sin θ − sin(θ − 3 ) − sin(θ + 23π )
2π
(14)
3 1 1 1
2 2 2
2
cos θ − sin θ 1
T −1 = cos(θ − 2π ) − sin(θ − 2π
) 1 (15)
3 3
2π 2π
cos(θ + 3 ) − sin(θ + 3 ) 1
This transformation maps balanced sets of phase currents into constant currents in the d-q frame.
That is, if rotor angle is θ = ωt + θ0 , and phase currents are:
Ia = I cos ωt
2π
Ib = I cos(ωt − )
3
2π
Ic = I cos(ωt + )
3
Then the transformed set of currents is:
Id = I cos θ0
Iq = −I sin θ0
Now, we apply this transformation to (1) to express fluxes and currents in the armature in the d-q
reference frame. To do this, extract the top line in (1):
The transformed flux is obtained by premultiplying this whole expression by the transformation
matrix. Phase current may be obtained from d-q current by multiplying by the inverse of the
transformation matrix. Thus:
λdq = T Lph T −1 I dq + T M I R (17)
The same process carried out for the lower line of (1) yields:
λR = M T T −1 I dq + LR I R (18)
If the conditions of (5) through (10) are satisfied, the inductance submatrices of (19) wind up being
of particularly simple form. (Please note that a substantial amount of algebra has been left out
here!)
Ld 0 0
Ldq = 0 Lq 0 (20)
0 0 L0
M Lakd 0
LC = 0 0 Lakq (21)
0 0 0
3
Note that (19) through (21) express three separate sets of apparently independent flux/current
relationships. These may be re-cast into the following form:
λd Ld Lakd M Id
3
λkd = L L L Ikd (22)
2 akd kd f kd
3
λf 2M Lf kd Lf If
" # " #" #
λq Lq Lakq Iq
= 3 (23)
λkq L
2 akq Lkq Ikq
λ0 = L0 I0 (24)
Where the component inductances are:
3
Ld = La0 − Lab0 + L2 (25)
2
3
Lq = La0 − Lab0 − L2 (26)
2
L0 = La0 + 2Lab0 (27)
Note that the apparently restrictive assumptions embedded in (5) through (10) have resulted in
the very simple form of (21) through (24). In particular, we have three mutually independent sets
of fluxes and currents. While we may be concerned about the restrictiveness of these expressions,
note that the orthogonality between the d- and q- axes is not unreasonable. In fact, because these
axes are orthogonal in space, it seems reasonable that they should not have mutual flux linkages.
The principal consequence of these assumptions is the de-coupling of the zero-sequence component
of flux from the d- and q- axis components. We are not in a position at this time to determine
the reasonableness of this. However, it should be noted that departures from this form (that is,
coupling between the “direct” and “zero” axes) must be through higher harmonic fields that will
not couple well to the armature, so that any such coupling will be weak.
Next, armature voltage is, ignoring resistance, given by:
d d
λ = T −1 λdq
V ph = (28)
dt ph dt
and that the transformed armature voltage must be:
V dq = T V ph
d
= T (T −1 λdq )
dt
d d
= λdq + (T T −1 )λdq (29)
dt dt
A good deal of manupulation goes into reducing the second term of this, resulting in:
0 − dθ 0
d −1 dθ
dt
T T = 0 0 (30)
dt dt
0 0 0
4
This expresses the speed voltagethat arises from a coordinate transformation. The two voltage/flux
relationships that are affected are:
dλd
Vd = − ωλq (31)
dt
dλq
Vq = + ωλd (32)
dt
where we have used
dθ
ω= (33)
dt
5 Per-Unit Normalization
The next thing for us to do is to investigate the way in which electric machine system are nor-
malized, or put into what is called a per-unit system. The reason for this step is that, when the
voltage, current, power and impedance are referred to normal operating parameters, the behavior
characteristics of all types of machines become quite similar, giving us a better way of relating
how a particular machine works to some reasonable standard. There are also numerical reasons for
normalizing performance parameters to some standard.
The first step in normalization is to establish a set of base quantities. We will be normalizing
voltage, current, flux, power, impedance and torque, so we will need base quantities for each of
these. Note, however, that the base quantities are not independent. In fact, for the armature, we
need only specify three quantities: voltage (VB ), current (IB ) and frequency (ω0 ). Note that we do
not normalize time nor frequency. Having done this for the armature circuits, we can derive each
of the other base quantities:
5
• Base Power
3
PB = VB IB
2
• Base Impedance
VB
ZB =
IB
• Base Flux
VB
λB =
ω0
• Base Torque
p
TB = PB
ω0
Note that, for our purposes, base voltage and current are expressed as peak quantities. Base voltage
is taken on a phase basis (line to neutral for a “wye” connected machine), and base current is
similarly taken on a phase basis, (line current for a “wye” connected machine).
Normalized, or per-unit quantities are derived by dividing the ordinary variable (with units) by
the corresponding base. For example, per-unit flux is:
λ ω0 λ
ψ= = (38)
λB VB
In this derivation, per- unit quantities will usually be designated by lower case letters. Two
notable exceptions are flux, where we use the letter ψ, and torque, where we will still use the upper
case T and risk confusion.
Now, we note that there will be base quantities for voltage, current and frequency for each of
the different coils represented in our model. While it is reasonable to expect that the frequency
base will be the same for all coils in a problem, the voltage and current bases may be different. We
might write (22) as:
ω0 IdB ω0 IkB ω0 I f B
Vdb Ld Vdb Lakd Vdb M
ψd id
ω0 IdB 3 ω0 IkB ω0 I f B
ψkd = Vkb 2 Lakd Vkb Lkd Vkdb Lf kd ikd (39)
ψf if
ω0 IdB 3 ω0 IkB ω0 I f B
Vf b 2 M Vf b Lf kd Vf b Lf
It is important to note that (40) assumes reciprocity in the normalized system. To wit, the following
expressions are implied:
IdB
xd = ω 0 Ld (41)
VdB
6
IkB
xkd = ω0 Lkd (42)
VkB
If B
xf = ω0 Lf (43)
Vf B
IkB
xakd = ω0 Lakd
VdB
3 IdB
= ω0 Lakd (44)
2 VkB
If B
xad = ω0 M
VdB
3 IdB
= ω0 M (45)
2 Vf B
IkB
xf kd = ω0 Lf kd
Vf b
If B
= ω0 Lf kd (46)
Vkb
These in turn imply:
3
VdB IdB = Vf B If B (47)
2
3
VdB IdB = VkB IkB (48)
2
Vf B If B = VkB IkB (49)
These expressions imply the same power base on all of the windings of the machine. This is
so because the armature base quantities Vdb and Idb are stated as peak values, while the rotor base
quantities are stated as DC values. Thus power base for the three- phase armature is 23 times
the product of peak quantities, while the power base for the rotor is simply the product of those
quantities.
The quadrature axis, which may have fewer equivalent elements than the direct axis and which
may have different numerical values, still yields a similar structure. Without going through the
details, we can see that the per-unit flux/current relationship for the q- axis is:
" # " #" #
ψq xq xakq iq
= (50)
ψkq xakq xkq ikq
The voltage equations, including speed voltage terms, (31) and (32), may be augmented to
reflect armature resistance:
dλd
Vd = − ωλq + Ra Id (51)
dt
dλq
Vq = ωλd + + Ra Iq (52)
dt
The per-unit equivalents of these are:
1 dψd ω
vd = − ψq + ra id (53)
ω0 dt ω0
ω 1 dψq
vq = ψd + + ra iq (54)
ω0 ω0 dt
2H dω
= Te + Tm (64)
ω0 dt
where now we use Te and Tm to represent per-unit torques.
8
6 Equal Mutual’s Base
In normalizing the differential equations that make up our model, we have used a number of base
quantities. For example, in deriving (40), the per-unit flux- current relationship for the direct
axis, we used six base quantities: VB , IB , Vf B , If B , VkB and IkB . Imposing reciprocity on (40)
results in two constraints on these six variables, expressed in (47) through (49). Presumably the
two armature base quantities will be fixed by machine rating. That leaves two more “degrees of
freedom” in selection of base quantities. Note that the selection of base quantities will affect the
reactance matrix in (40).
While there are different schools of thought on just how to handle these degrees of freedom, a
commonly used convention is to employ what is called the equal mutuals base system. The two
degrees of freedom are used to set the field and damper base impedances so that all three mutual
inductances of (40) are equal:
xakd = xf kd = xad (65)
The direct- axis flux- current relationship becomes:
ψd xd xad xad id
ψkd = xad xkd xad ikd (66)
ψf xad xad xf if
7 Equivalent Circuit
i�
d ra xal xf l if
rf �
∧∧∧
∨∨
∩∩∩∩ ∩∩∩∩ ∧∧∧
∨∨
+ + ⊃ +
⊃ xkdl
⊃
⊃ ⊃
The flux- current relationship of (66) is represented by the equivalent circuit of Figure 1, if the
“leakage” inductances are defined to be:
9
Many of the interesting features of the electrical dynamics of the synchronous machine may be
discerned from this circuit. While a complete explication of this thing is beyond the scope of this
note, it is possible to make a few observations.
The apparent inductance measured from the terminals of this equivalent circuit (ignoring resis-
tance ra ) will, in the frequency domain, be of the form:
ψd (s) Pn (s)
x(s) = = xd (70)
id (s) Pd (s)
Both the numerator and denominator polynomials in s will be second order. (You may convince
yourself of this by writing an expression for terminal impedance). Since this is a “diffusion” type
circuit, having only resistances and inductances, all poles and zeros must be on the negative real
axis of the “s-plane”. The per-unit inductance is, then:
If this is true, then the reactance is described by the pole-zero diagram shown in Figure 2.
Under this circumstance, the apparent terminal inductance has three distinct values, depending on
frequency. These are the synchronous inductance, the transient inductance, and the subtransient
inductance, given by:
Td′
x′d = xd ′ (74)
Tdo
T ′′
x′′d = xd′ d′′
Tdo
T ′ T ′′
= xd ′d d′′ (75)
Tdo Tdo
If the time constants are spread widely apart, they are given, approximately, by:
′ xf
Tdo = (76)
ω0 rf
′′ xkdl + xf l ||xad
Tdo = (77)
ω0 rkd
10
1 1
′
Tdo
� �
Tdo ”
× ×
1 1
Td ” Td′
log |x(jω)|
�
��
�
��
1 1 1 1 log ω
′
Tdo T ′
d Tdo ” Td ”
Finally, note that the three reactances are found simply from the model:
11
Then the state equations are:
dψd
= ω0 vd + ωψq − ω0 ra id (83)
dt
dψq
= ω0 vq − ωψd − ω0 ra iq (84)
dt
dψkd
= −ω0 rkd ikd (85)
dt
dψkq
= −ω0 rkq ikq (86)
dt
dψf
= ω0 vf − ω0 rf if (87)
dt
dω ω0
= (Te + Tm ) (88)
dt 2H
dδ
= ω − ω0 (89)
dt
and, of course,
Te = ψd iq − ψq id
xad = xd − xal
xad (x′d − xal )
xf l =
xad − x′d + xal
1
xkdl = 1 1 1
x′′
−xal − xad −
d
xf l
xf l + xad
rf =
ω0 Tdo′
12
Taylor series. Assuming a steady state operating point [ψd0 ψkd0 ψf 0 ψq0 ψkq0 ω0 δ0 ], the first-
order (small-signal) variations are described by the following set of equations. First, since the
flux-current relationship is linear:
−1
id1 xd xad xad ψd1
ikd1 = xad xkd xad ψkd1 (90)
if 1 xad xad xf ψf 1
" # " #−1 " #
iq1 xq xaq ψq1
=
(91)
ikq1 xaq xkq ψkq1
Vd = V sin δ Vq = V cos δ
dψq1
dψkd1
dψkq1
dψf 1
= −ω0 rf if 1 (96)
dt
dω1 ω0
= (Te1 + Tm1 ) (97)
dt 2H
dδ1
= ω1 (98)
dt
Te = ψd0 iq1 + ψd1 iq0 − ψq0 id1 − ψq1 id0
ψd = vq = V cos δ (99)
ψq = −vd = −V sin δ (100)
The set of differential equations changes only a little when this approximation is made. Note,
however, that it can be simulated with far fewer “cycles” if the armature time constant is short.
13
Now, if id and iq are determined, it is a bit easier to find the other currents required in the
simulation. Note we can write:
" # " #" # " #
ψkd xkd xad ikd xad
= + id (106)
ψf xad xf if xad
14
The quadrature axis rotor current is simply:
1 xaq
ikq = ψkq − iq (108)
xkq xkq
The torque equation is the same, but since it is usually convenient to assemble the fluxes behind
subtransient reactance, it is possible to use:
Now it is necessary to consider terminal voltage. This is most conveniently cast in matrix
notation. The vector of phase voltages is:
va
v ph = vb (110)
vc
Then, with similar notation for phase flux, terminal voltage is, ignoring armature resistance:
1 dψ ph
v ph =
ω0 dt
1 d n −1 o
= T ψ dq (111)
ω0 dt
Note that we may define the transformed vector of fluxes to be:
1 d n −1 ′′ o
v ph = T x T iph + T −1 e ′′ (115)
ω0 dt
Now it is necessary to make one assumption and one definition. The assumption, which is
only moderately restrictive, is that subtransient saliency may be ignored. That is, we assume
that x′′d = x′′q . The definition separates the “zero sequence” impedance into phase and neutral
components:
15
x0 = x′′d + 3xg (116)
Note that according to this definition the reactance xg accounts for any impedance in the neutral
of the synchronous machine as well as mutual coupling between phases.
Then, the impedance matrix becomes:
x′′ 0 0 0 0 0
′′ d
x = 0 x′′d 0 + 0 0 0 (117)
0 0 x′′d 0 0 3xg
In compact notation, this is:
1 d n −1 ′′ o 1 d n −1 o ′′ 1 de′′
T e = T e + T −1 (121)
ω0 dt ω0 dt ω0 dt
Now, the time derivative of the inverse transform is:
− sin(θ) − cos(θ) 0
1 d −1 ω 2π 2π
T = − sin(θ − ) − cos(θ − ) 0 (122)
ω0 dt ω0
3 3
− sin(θ + 23π ) − cos(θ + 2π
3 ) 0
Now the three phase voltages can be extracted from all of this matrix algebra:
x′′d dia xg d
va = + (ia + ib + ic ) + e′′a (123)
ω0 dt ω0 dt
x′′d dib xg d
vb = + (ia + ib + ic ) + eb′′ (124)
ω0 dt ω0 dt
x′′d dic xg d
vc = + (ia + ib + ic ) + ec′′ (125)
ω0 dt ω0 dt
16
Where the internal voltages are:
ω ′′
e′′a = − (e sin(θ) − ed′′ cos(θ))
ω0 q
1 de′′q 1 de′′
+ cos(θ) + sin(θ) d (126)
ω0 dt ω0 dt
ω ′′ 2π 2π
e′′b = − (eq sin(θ − ) − e′′d cos(θ − ))
ω0 3 3
1 2π de′′q 1 2π de′′d
+ cos(θ − ) + sin(θ − ) (127)
ω0 3 dt ω0 3 dt
ω 2π 2π
e′′c = − (e′′q sin(θ + ) − e′′d cos(θ + ))
ω0 3 3
1 2π de′′q 1 2π de′′d
+ cos(θ + ) + sin(θ + ) (128)
ω0 3 dt ω0 3 dt
This set of expressions describes the equivalent circuit shown in Figure 4.
′′
i� ′′
a xd ��
ea
va ∩∩∩∩ + −
��
ib x′′d e′′b
�� xg
�
vb ∩∩∩∩ + − ∩∩∩∩
��
ic x′′d e′′c
��
�
vc ∩∩∩∩ + −
��
dψkd
= −ω0 rkd ikd (129)
dt
dψkq
= −ω0 rkq ikq (130)
dt
dψf
= −ω0 rf if (131)
dt
dδ
= ω − ω0 (132)
dt
17
dω ω0
= Tm + e′′q iq + ed′′ id (133)
dt 2H
where: " # " #−1 " # " # !
ikd xkd xad ψkd xad
= − id
if xad xf ψf xad
and
1 xaq
ikq = ψkq − iq
xkq xkq
(It is assumed here that the difference between subtransient reactances is small enough to be
neglected.)
The network interface equations are, from the network to the machine:
2π 2π
id = ia cos(θ) + ib cos(θ − ) + ic cos(θ + ) (134)
3 3
2π 2π
iq = −ia sin(θ) − ib sin(θ − ) − ic sin(θ + ) (135)
3 3
and, in the reverse direction, from the machine to the network:
ω ′′
e′′a = − (e sin(θ) − ed′′ cos(θ))
ω0 q
1 de′′q 1 de′′
+ cos(θ) + sin(θ) d (136)
ω0 dt ω0 dt
ω ′′ 2π 2π
e′′b = − (eq sin(θ − ) − e′′d cos(θ − ))
ω0 3 3
1 2π de′′q 1 2π de′′d
+ cos(θ − ) + sin(θ − ) (137)
ω0 3 dt ω0 3 dt
ω 2π 2π
e′′c = − (e′′q sin(θ + ) − e′′d cos(θ + ))
ω0 3 3
1 2π de′′q 1 2π de′′d
+ cos(θ + ) + sin(θ + ) (138)
ω0 3 dt ω0 3 dt
And, of course,
θ = ω0 t + δ (139)
e′′q = ψd′′ (140)
e′′d = −ψq′′ (141)
xad xkdl ψf + xad xf l ψkd
ψd′′ = (142)
xad xkdl + xad xf l + xkdl xf l
xaq
ψq′′ = ψkq (143)
xaq + xkql
18
11 Network Constraints
This model may be embedded in a number of networks. Different configurations will result in
different constraints on currents. Consider, for example, the situation in which all of the terminal
voltages are constrained, but perhaps by unbalanced (not entirely positive sequence) sources. In
that case, the differential equations for the three phase currents would be:
x′′d dia x′′d + 2xg xg
= (va − e′′a ) − (vb − eb′′ ) + (vc − ec′′ ) ′′
′′ (144)
ω0 dt xd + 3xg xd + 3xg
x′′d dib ′′
x + 2xg xg
= (vb − e′′b ) d′′ − (va − ea′′ ) + (vc − ec′′ ) ′′
(145)
ω0 dt xd + 3xg xd + 3xg
x′′d dic ′′
x + 2xg xg
= (vc − e′′c ) ′′d − (vb − eb′′ ) + (va − ea′′ ) ′′
(146)
ω0 dt xd + 3xg xd + 3xg
′′
i�
a ra x′′d ��
ea
va ∧∧∧ ∩∩∩∩ + −
��
∨∨
ib ra x′′d e′′b
�� xg
�
∧∧∧ ∩∩∩∩ + − ∩∩∩∩
��
∨∨
ra x′′d ��
e′′c
∧∧∧ ∩∩∩∩ + −
��
∨∨
In this situation, we have only two currents to worry about, and their differential equations
would be:
dib ω0 ′′
= (e − e′′b − 2ra ib ) (147)
dt 2x′′d c
dia ω0
= (va − e′′a − ra ia ) (148)
dt x′′d + xg
and, of course, ic = −ib .
Note that here we have included the effects of armature resistance, ignored in the previous
section but obviously important if the results are to be believed.
19
13 Permanent Magnet Machines
Permanent Magnet machines are one state variable simpler than their wound-field counterparts.
They may be accurately viewed as having constant field current. Assuming that we can define the
internal (field) flux as:
ψ0 = xad if 0 (149)
Here, the “flux behind subtransient reactance” is, on the direct axis:
xkdl ψ0 + xad ψkd
ψd′′ = (160)
xad + xkdl
and the subtransient reactance is:
x′′d = xal + xad ||xkdl (161)
20
On the quadrature axis,
xad ψkq
ψq′′ = (162)
xad + xkql
and
x′′q = xal + xaq ||xkql (163)
In this case there are only four state equations:
dψkd
= −ω0 rkd ikd (164)
dt
dψkq
= −ω0 rkq ikq (165)
dt
dω ω0 ′′
= eq iq + ed′′ id + Tm (166)
dt 2H
dδ
= ω − ω0 (167)
dt
The interconnections to and from the network are the same as in the case of a wound-field
machine: in the “forward” direction, from network to machine:
2π 2π
id = ia cos(θ) + ib cos(θ − ) + ic cos(θ + ) (168)
3 3
2π 2π
iq = −ia sin(θ) − ib sin(θ − ) − ic sin(θ + ) (169)
3 3
and, in the reverse direction, from the machine to the network:
ω ′′
e′′a = − (e sin(θ) − ed′′ cos(θ))
ω0 q
1 de′′q 1 de′′
+ cos(θ) + sin(θ) d (170)
ω0 dt ω0 dt
ω 2π 2π
e′′b = − (e′′q sin(θ − ) − e′′d cos(θ − ))
ω0 3 3
1 2π de′′q 1 2π de′′d
+ cos(θ − ) + sin(θ − ) (171)
ω0 3 dt ω0 3 dt
ω 2π 2π
e′′c = − (e′′q sin(θ + ) − e′′d cos(θ + ))
ω0 3 3
1 2π de′′q 1 2π de′′d
+ cos(θ + ) + sin(θ + ) (172)
ω0 3 dt ω0 3 dt
21
The state equations are:
dψd
= ω0 vd + ωψq − ω0 ra id (175)
dt
dψq
= ω0 vq − ωψd − ω0 ra iq (176)
dt
dω ω0
= (ψd iq − ψq id + Tm ) (177)
dt 2H
dδ
= ω − ω0 (178)
dt
22
1 Introduction
1. How are they realized?
It has already been shown (see previous classes) that rings, or split-rings, can realize a
negative permeability (µ < 0) over a certain frequency band.
In addition to this, we need to realize a negative permittivity (² < 0).
It has also been shown (see previous classes) that:
• lossless:
ωp2 ne2
²metal = 1 − , where ωp =
ω2 ²0 me
(n: electron density, e: electron charge, me : effective mass of electrons).
• lossy:
ωp2
²metal =1− .
ω(ω + iγ)
no transmission
PSfrag replacements (² < 0) transmission
(² > 0)
ωp ω
1
2 Section 2. Why “left-handed”?
With these characteristics, ² < 0 has been realized already at infrared frequencies (where
metals behave like plasmas).
Problem: how to realize it at GHz frequencies?
Solution: by reducing n, the electron density.
One way of doing this is to confine the electrons in space. This can be achieved by an array
of rods for example, as shown in Fig. 2.
a
r
PSfrag replacements
Note: it is important that the wires are thin, so as to reduce the radiation interaction and
allow penetration into the structure.
Effect of the wires: to reduce n to
πr2
neff = n . (1)
a2
Finally, note also that the rods have to be parallel to the electric field. This, plus the (known
already) fact that rings have to be perpendicular to the magnetic field, gives an idea on how to
realize physically LH metamaterials (see Fig. 3).
2 Why “left-handed”?
At this point, we have a metamaterial which can realize
Let us write Maxwell’s curl equations for plane wave solutions and time harmonic notations:
0.5 mm
5 mm
z
1.3 cm
x
εs
FRONT BACK
(a)
1 mm
x εs
(b)
In standard materials, Eq. (3) implies that the tryad (Ē, H̄, k̄) forms a right-handed system.
However, under Eq. (2), we will have:
(Ē, H̄, k̄) form a left-handed (LH) tryad.
3 Properties of LH media
Some know characteristics are:
Ē
k̄ S̄
PSfrag replacements
H̄
Figure 4: Electric field (Ē), magnetic field (H̄), wave-vector (k̄) and Poynting power
(S̄) in an LH medium.
This last item is very significant, and we shall spend some time discussing it.
√
n= ² r µr , (5)
For those frequencies inside the left-handed band (i.e. in the band where ² < 0 and µ < 0),
we can write:
• Ray diagram:
5
n>0
θr
PSfrag replacements
θi −θr
n<0
PSfrag replacements
kx k̄1 k̄2 kx
S̄
k1z k2z
3.2 Energy
Traditionally, the energy is given by
W = ²E 2 + µH 2 . (9)
∂(²ω) 2 ∂(µω) 2
W = E + H (10)
∂ω ∂ω
and we must have:
∂(²ω)
> 0, (11a)
∂ω
∂(µω)
> 0. (11b)
∂ω
6 3.3 Properties of an LHM slab
When LH materials are studies as bulk materials, two models are commonly used for the
permittivity/permeability:
1. Drude model:
2
ωep
²r = 1 − , (12a)
ω(ω + iγe )
2
ωmp
µr = 1 − , (12b)
ω(ω + iγm )
Real(eps/eps0)
10 Imag(eps/eps0)
0.02
0.015
0
0.01
−10
0.005
−20
0
−0.005
−30
−0.01
−40
wp = 100 GHz −0.015 wp = 100 GHz
wp = 266.5 GHz wp = 266.5 GHz
wp = 500 GHz wp = 500 GHz
wp = 1000 GHz wp = 1000 GHz
−50 −0.02
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
w/w0 w/w0
Figure 5: Permittivity for various values of ωep in the Drude model (f0 = 30 GHz,
ωep = ωp ).
2. Resonant model:
2 − ω2
ωep eo
²r = 1 − , (13a)
ω 2 − ωeo
2 + iγ ω
e
2 − ω2
ωmp mo
µr = 1 − 2 2
, (13b)
ω − ωmo + iγm ω
where ω(em)o are the electric/magnetic resonant frequencies and ω(em)p are the elec-
tric/magnetic plasma frequencies. An illustration of this model is given in Fig. 6.
Real(eps/eps0) Imag(eps/eps0)
100 1
wp = 266.5 GHz
wp = 266.5 GHz wp = 500 GHz
80 wp = 500 GHz 0.8
60 0.6
40 0.4
20 0.2
0 0
−20 −0.2
−40 −0.4
−60 −0.6
−80 −0.8
−100 −1
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
w/w0 w/w0
Figure 6: Permittivity for various values of ωep in the resonant model of Eq. (13a)
(f0 = 30 GHz, ωep = ωp , ω0 = 100GHz).
y z
PSfrag replacements
Let us consider the case for which ²r = µr = −1 (working at the right frequency).
S I2
I1 z
We see that is the source is close enough to the slab (distance< d), the slab will produce
two images, one inside the slab and one outside.
The distance from source to the second image is
S − I2 = 2d .
2. Rigorous calculation:
Let us consider a TE wave impinging on this slab. We write, for a single interface:
We need to match the boundary conditions and, for simplicity, we set the boundary to be
at z = 0. We get:
• Tangential Ē field:
0
eikz z + re−ikz z = teikz z .
1
• Tangential H̄ field (H̄ = iωµ ∇ × Ē):
µ0 kz0 ikz0 z
eikz z − re−ikz z = te .
µ1 k z
Upon solving, we get the reflection/transmission coefficient from free-space to the medium:
2µ1 kz
t= , (15a)
µ1 kz + µ0 kz0
µ1 kz − µ0 kz0
r= . (15b)
µ1 kz + µ0 kz0
9
In a similar way, the reflection/transmission coefficients from the medium to free-space are:
2µ0 kz0
t0 = , (16a)
µ1 kz + µ0 kz0
µ0 kz0 − µ1 kz
r0 = . (16b)
µ1 kz + µ0 kz0
In order to obtain the field inside the slab, we shall compute the transmission coefficient as:
0 0 0 0 2 0
T = teikz d t0 + teikz d r0 eikz d r0 eikz d t0 + tt0 (r0 )2 e5ikz d + · · ·
∞ 0
0
X 2 0 tt0 eikz d
= tt0 eikz d (r0 )n e2inkz d = . (17)
n=0
1 − r 0 2 e2ikz0 d
q q
kz kz = k 2 − kρ2 kz = i kρ2 − k 2
q q
kz0 kz0 = − k 2 − kρ2 kz0 = i kρ2 − k 2
4kz2 e−ikz d
T = = e−ikz d .
4kz2
−4kz2 eikz d
T = = e−ikz d .
−4kz2 e2ikz d
10 Section 4. Does it really work?
T = e−ikz d . (20)
Conclusions:
• Two images can be formed, like shown in the ray diagram of the previous subsection.
1. Prism,
2. Gaussian beam.
LHM
RHM
PSfrag replacements
RHM
LHM
PSfrag replacements
Dispertion relations in Left-Handed Materials
Massachusetts Institute of Technology
6.635 lecture notes
1 Introduction
We know already the following properties of LH media:
1. ²r and µr are frequency dispersive.
From the past lectures, we know that these materials can be realized by a succession of wires
and rods:
• Periodic arrangement of rods: realizes a plasma medium with negative ² r over a certain
frequency band. The model for the permittivity is:
2
ωep
²r = 1 − . (1)
ω 2 + iγe ω
• Periodic arrangement of rings (split-rings) realizes a resonant µ r modeled as
2
F ωmp
µr = 1 − , (2)
ω 2 − ωmo
2 + iγ ω
m
where F is the fractional area of the unit cell occupied by the interior of the split-ring
(F < 1).
In the lossless case (γe = γm = 0), we can rewrite these two relations as:
ω 2 − ωep
2
²r = , (3a)
ω2
ω 2 − ω02 − F ω 2 ω 2 − ωb2
µr = = (1 − F ) , (3b)
ω 2 − ω02 ω 2 − ω02
√
where ωb = ω0 / 1 − F > ω0 . Therefore:
1 (ω 2 − ωep
2 )(ω 2 − ω 2 )
b
k 2 = ω 2 ²µ = (1 − F ) 2 2 . (4)
c ω − ω0
Upon identifying the regions where ²r and µr change signs, we can immediately get the
relation for k:
1
2 Section 2. Argument on n < 0
ω ω0 ωb ωp
²r − − − +
µr + − + +
k2 − + − +
The region ω ∈ [ω0 , ωb ], which also corresponds to ²r < 0 and µr < 0, corresponds to
k2 positive, which means k real. Therefore, there is propagation in this band, but not in the
adjacent ones.
It may still not be clear that k is negative, even if we write
√ √
q
k = ω 2 ²µ = ω 2 ²20 µ20 ²r µr = k0 n . (5)
2 Argument on n < 0
2.1 Complex Poynting theorem
We shall first recall the derivation of the complex Poynting theorem and the signification of the
various terms.
We start from Maxwell’s curl equation
∇ × Ē = iω B̄ , (6a)
∇ × H̄ = − iω D̄ + J¯ . (6b)
Upon multiplying Eq. (6a) by H̄ ? and substracting the complex conjugate of Eq. (6b) mul-
tuplied by Ē we get:
H̄ ? · ∇ × Ē−Ē · ∇ × H̄ ? = ∇ · (Ē × H̄ ? )
= iω B̄ · H̄ ? − iω D̄? · Ē − J¯? · Ē
= iω[B̄ · H̄ ? − Ē · D̄? ] − Ē · J¯? . (7)
On the right-hand side of the equation, the first terms corresponds to the divergence of
Poyting power, which is therefore positive. The second term relates to the complex EM energy,
and is therefore also positive. Consequently, the left-hand side term must also be positive, and
actually corresponds to the power supplied by J¯ to the volume.
We shall use this result hereafter.
3
¯ ,
∇2 Ē(r̄) + k 2 Ē(r̄) = −iωµJ(r̄) (9)
is rewritten with
to yield
∂2
E(x) + k 2 E(x) = −iωµj0 δ(x − x0 ) . (11)
∂x2
The solution to this equation is
1. First derivative:
∂E(x) ∂
= αik |x − x0 | eik|x−x0 | . (13)
∂x ∂x2
2. Second derivative:
∂ 2 E(x) ∂2 ∂
2
=αik 2
|x − x0 | eik|x−x0 | + α(−k 2 )( 2 |x − x0 |)2 eik|x−x0 |
∂x ∂x ∂x
2 ik|x−x0 |
= −αk e + 2iαkδ(x − x0 ) . (14)
Therefore:
∂ 2 E(x)
+ k 2 E(x) =2iαkδ(x − x0 ) = 2iα k0 n δ(x − x0 ) . (15)
∂x2
Comparing Eq. (11) to Eq. (15), we get
ωµj0 j 0 η 0 µr
α=− =− , (16)
2k0 n 2 n
so that finally the solution is:
j0 η0 µr ik|x−x0 |
E(x) = − e . (17)
2 n
1 η0 j02 µr
Z
P =− Ē · J¯? dV = > 0. (18)
2 V 4 n
The source must, on average, do positive work on the field. Yet, in LH regime, we have
µr < 0 so that we must have n < 0 as well.
4 Section 3. Dispersion relations
3 Dispersion relations
At this point, we know that n < 0 and k < 0. The difference between phase and group velocity
can be directly seen on the dispersion relation diagram.
ω
vφ = , (20a)
kz
∂kz −1
µ ¶
vg = . (20b)
∂ω
√
• Free-space: k = ω ²µ where ² = cte and µ = cte.
• Metamaterial:
ωp2
²r =1 − , (21a)
ω 2 + iγe ω
2 − ω2
ωmp mo
µr =1 − 2 2 + iγ ω
, (21b)
ω − ωmo m
ω 2 − ωp2
²r = , (22a)
ω2
ω 2 − ωmp
2
µr = 2 2
, (22b)
ω − ωmo
(22c)
ωp = 20e9 rad/s
ωmp = 20e9 rad/s
ωmo = 5e9 rad/s
γe = γ m = 0
k surface x 10
−6 Dispersion relation
10
x 10 3
4 2.5
ω [rad/s]
2
1.5
k
1
0
2 0.5
2
x 10
−6 0
0 −6
−2 x 10 0
k −2 k 0 1 2 3 4
z x
ω [rad/s] x 10
10
200
100
r
r
µ
ε
−100
−50 −200
0 1 2 3 4 0 1 2 3 4
ω [rad/s] 10
x 10 ω [rad/s] x 10
10
10 k surface (Gamma= 0)
x 10
4
3.5
2.5
ω [rad/s]
1.5
0.5
0
−3 −2 −1 0 1 2 3
kz −6
x 10
6 3.1 Lossless case (γe = γm = 0), ωmp = ωp
ωp = 30e9 rad/s
ωmp = 20e9 rad/s
ωmo = 5e9 rad/s
γe = γ m = 0 rad/s
k surface x 10
−6 Dispersion relation
10
x 10 4
4
3
ω [rad/s]
2
2
k
0 1
2
−6 0 2
x 10 −2 0 −6
−2 x 10 0
k k 0 1 2 3 4
z x
ω [rad/s] x 10
10
200
µr 100
εr
0
0
−100
−50 −200
0 1 2 3 4 0 1 2 3 4
ω [rad/s] x 10
10
ω [rad/s] x 10
10
10 k surface (Gamma= 0)
x 10
4
3.5
2.5
ω [rad/s]
1.5
0.5
0
−4 −3 −2 −1 0 1 2 3 4
k −6
z x 10
7
ωp = 30e9 rad/s
ωmp = 20e9 rad/s
ωmo = 5e9 rad/s
γe = γ m = 10e7 rad/s
k surface x 10
−6 Dispersion relation
10
x 10 4
4
3
ω [rad/s] 2
2
k
0 1
2
−6 0 2
x 10 −2 0 −6
−2 x 10 0
kz k 0 1 2 3 4
x
ω [rad/s] x 10
10
200
100
µr
εr
0
0
−100
−50 −200
0 1 2 3 4 0 1 2 3 4
ω [rad/s] x 10
10
ω [rad/s] x 10
10
3.5
2.5
ω [rad/s]
1.5
0.5
0
−4 −3 −2 −1 0 1 2 3 4
k −6
z x 10
8 3.1 Lossless case (γe = γm = 0), ωmp = ωp
ωp = 30e9 rad/s
ωmp = 20e9 rad/s
ωmo = 5e9 rad/s
γe = γ m = 10e8 rad/s
k surface x 10
−6 Dispersion relation
10
x 10 2.5
4 2
ω [rad/s]
1.5
2
k
1
0
2 0.5
2
x 10
−6 0
0 −6
x 10 0
kz −2 −2 k 0 1 2 3 4
x
ω [rad/s] x 10
10
40
20
µr
εr
0
0
−20
−50 −40
0 1 2 3 4 0 1 2 3 4
ω [rad/s] x 10
10
ω [rad/s] x 10
10
3.5
2.5
ω [rad/s]
1.5
0.5
0
−2.5 −2 −1.5 −1 −0.5 0 0.5 1 1.5 2 2.5
k −6
z x 10
9
ωp = 30e9 rad/s
ωmp = 20e9 rad/s
ωmo = 5e9 rad/s
γe = γ m = 10e9 rad/s
k surface x 10
−7 Dispersion relation
10
x 10 3
4 2.5
2
ω [rad/s]
2
1.5
k
1
0
2 0.5
2
x 10
−7 0
0 −7
−2 −2 x 10 0
kz kx 0 1 2 3 4
ω [rad/s] x 10
10
0 15
−2 10
r
εr
µ
−4 5
−6 0
−8 −5
0 1 2 3 4 0 1 2 3 4
ω [rad/s] x 10
10
ω [rad/s] x 10
10
3.5
2.5
ω [rad/s]
1.5
0.5
0
−4 −3 −2 −1 0 1 2 3 4
k −7
z x 10
10 3.1 Lossless case (γe = γm = 0), ωmp = ωp
ωp = 30e9 rad/s
ωmp = 20e9 rad/s
ωmo = 5e9 rad/s
γe = γ m = 10e10 rad/s
k surface x 10
−7 Dispersion relation
10
x 10 5
4 4
ω [rad/s]
3
2
k
2
0
4 1
2 4
−7 0 2
x 10 −2 0 −7
−2 x 10 0
kz −4 −4 kx 0 1 2 3 4
ω [rad/s] x 10
10
0.966 0.966
0.964 0.964
µr
εr
0.962 0.962
0.96 0.96
0 1 2 3 4 0 1 2 3 4
ω [rad/s] x 10
10
ω [rad/s] x 10
10
3.5
2.5
ω [rad/s]
1.5
0.5
0
−5 −4 −3 −2 −1 0 1 2 3 4 5
k −7
z x 10
11
x 10
10
k surface (Gamma= 0) x 10
10 k surface (Gamma= 10000000)
4 4
3.5 3.5
3 3
2.5 2.5
ω [rad/s]
ω [rad/s]
2 2
1.5 1.5
1 1
0.5 0.5
0 0
−5 −5
0 0
5 5
0 0
−6 5 −5 −6 5 −5
x 10 −6
x 10 −6
x 10 x 10
k
kz
kx k
x z
!#"$&%'#(*)+,
x 10
10 k surface (Gamma= 100000000) x 10
10 k surface (Gamma= 1000000000)
4 4
3.5 3.5
3 3
2.5 2.5
ω [rad/s]
ω [rad/s]
2 2
1.5 1.5
1 1
0.5 0.5
0 0
−5 −5
0 0
5 5
0 0
−6 5 −5 −6 5 −5
x 10 −6
x 10 −6
x 10 x 10
kx kz kx kz
x 10
10 k surface (Gamma= 1.000000e+10) x 10
10 k surface (Gamma= 1.000000e+11)
4 4
3.5 3.5
3 3
2.5 2.5
ω [rad/s]
ω [rad/s]
2 2
1.5 1.5
1 1
0.5 0.5
0 0
−5 −5
0 0
5 5
0 0
−6 5 −5 −6 5 −5
x 10 −6
x 10 −6
x 10 x 10
kx kz kx kz
For simplicity, we can study the lossy case for γe = γm amd ωmo = 0 (although we don’t
really simulate the same medium, the fundamental behavior is similar, and simpler to carry out
mathematically).
The model therefore reads:
ω 2 − ωp2 + iγω
²r = µ r = . (23)
ω 2 + iγω
We compute:
√ ω 2 − ωp2 + iγω
² r µr =
ω 2 + iγω
[ω 2 − ωp2 + iγω] [ω 2 − iγω]
=
ω4 + γ 2 ω2
ω (ω − ωp2 ) + γ 2 ω 2 + iγωωp2
2 2
= .
ω4 + γ 2 ω2
(24a)
√ ω 2 [ω 2 − (ωp2 − γ 2 )]
<{ ²r µr } = . (25)
ω4 + γ 2 ω2
Losses have the effect to lower the plasma frequency to
q
ωp0 = ωp2 − γ 2 . (26)
In addition, we also see that if γ is very large, the plasma effect will completey dissapear
(cf. dispersion relation for γ = 10e10 rad/s).
Green’s functions for planarly layered media
Massachusetts Institute of Technology
6.635 lecture notes
0
0 eik0 |r̄−r̄ |
g(r̄, r̄ ) = . (2)
4π|r̄ − r̄ 0 |
From Maxwell’s equations in frequency domain with an eiωt dependency, the wave equation
for electric field Ē(r̄) is:
¯ ,
∇ × ∇ × Ē(r̄) − k02 Ē(r̄) = iω J(r̄)
=0 for source free case . (3)
which solution is
µ ¶
0 1
G(r̄, r̄ ) = I + 2 ∇∇ g(r̄, r̄0 ) , (5)
k0
For the use of Green’s functions in scattering problems, it is useful to express the Green’s
function in the same coordinates as the problem, which can be rectangular, cylindrical, spherical,
etc. Here we shall concentrate on the rectangular representation (Cartesian).
1
2 Section 2. Cartesian coordinates
2 Cartesian coordinates
2.1 Scalar Green’s function
The formulae are derived from Eq. (1) and the Fourier transform of the quantities:
ZZZ +∞
1 0
g(r̄, r̄ ) = 0
3
dk̄ eik̄·(r̄−r̄ ) g(k̄) , (6a)
(2π) −∞
ZZZ +∞
1 0
δ(r̄ − r̄ 0 ) = 3
dk̄ eik̄·(r̄−r̄ ) , (6b)
(2π) −∞
∂2 ∂2 ∂2
Introducing the differential operator (∇2 = ∂x2
+ ∂y 2
+ ∂z 2
) we write:
ZZZ +∞ ZZZ +∞
2 ik̄·(r̄−r̄ 0 ) 0
(∇ + k02 ) dk̄ e g(k̄) = dk̄ (∇2 + k02 )eik̄·(r̄−r̄ ) g(k̄)
−∞ −∞
ZZZ +∞
0
= dk̄ (−kx2 − ky2 − kz2 + k02 )eik̄·(r̄−r̄ ) g(k̄)
−∞
ZZZ +∞
0
=− dk̄ eik̄·(r̄−r̄ ) , (8)
−∞
1
g(k̄) = . (9)
kx2 + ky2 + kz2 − k02
Using Eq. (6a), we therefore need to evaluate the following integral:
+∞
1 1
ZZZ
0
g(r̄, r̄0 ) = dkx dky dkz eik̄·(r̄−r̄ ) . (10)
(2π)3 −∞ kx2 + ky2 + kz2 − k02
Note that Eq. (10) can be integrated along one of the three axis. In remote sensing applica-
tion, the vertical axis is usually taken to be the z axis, (xy) being the transverse plane (planar
components). We therefore choose to evaluate Eq. (10) along kz , and we split:
k̄ = kx x̂ + ky ŷ + kz ẑ = k̄⊥ + kz ẑ , (11a)
r̄ = r̄⊥ + z ẑ , (11b)
r̄0 = r̄⊥
0
+ z 0 ẑ . (11c)
3
We will perform the integral of Eq. (10) in the complex plane, using Cauchy’s theorem and
the Residue theorem. Before doing this, we have to be careful not to have divergent integrals.
Since we integrate in kz , the condition is:
• if z > 0, we have to choose kz00 > 0, which means that for complex plane integration, we
need to deform the contour into the upper plane.
• if z < 0, we have to choose kz00 < 0, which corresponds to a deformation into the lower
plane.
In addition, we see from Eq. (10) that the integrand has a pole at
so that
∞
1 ik̄⊥ ·(r̄⊥ −r̄0 ) ik0z (z−z 0 )
ZZ
0 i
g(r̄, r̄ ) = dk̄⊥ e ⊥ e , for z − z 0 > 0. (15)
(2π)2 −∞ 2k0z
The treatment for z < 0 follows the same reasoning so that we write for all (z − z 0 ):
∞
1 ik̄⊥ ·(r̄⊥ −r̄0 ) ik0z |z−z 0 |
ZZ
0 0 i
∀(z − z ) ∈ R, g(r̄, r̄ ) = dk̄⊥ e ⊥ e . (16)
(2π)2 −∞ 2k0z
• ...
∂2
ZZ ∞ · ¸
i 1 ik̄⊥ ·r̄⊥ 2 ik0z |z|
g(r̄) = dk̄⊥ e 2ik0z δ(z) − k0z e
∂z 2 (2π)2 −∞ 2k0z
ZZ ∞ ZZ ∞
δ(z) ik̄⊥ ·r̄⊥ i
=− dk̄⊥ e − 2 dk̄⊥ k0z eik̄⊥ ·r̄⊥ eik0z |z|
(2π)2 −∞ 8π −∞
ZZ ∞
i ik̄⊥ ·r̄⊥ ik0z |z|
= −δ(r̄) − 2 dk̄⊥ k0z e e . (21)
8π −∞
Again, all the other terms of ∇∇ applied to the integrand give −k̄ k̄ so that the Green’s
function becomes:
ZZ ∞ · ¸
1 k̄ k̄ ik̄·r̄
dk̄⊥ I− 2 e for z > 0 ,
δ(r̄) i
k0z · k0 ¸
G(r̄, r̄0 ) = −ẑ ẑ + 2 ZZ−∞
∞ (22)
k02 8π 1 K̄ K̄
dk̄⊥ I − 2 eiK̄·r̄ for z < 0 ,
−∞ k0z k0
where
k̄ = kx x̂ + ky ŷ + k0z ẑ , (23a)
K̄ = kx x̂ + ky ŷ − k0z ẑ . (23b)
Some notes:
5
1. The Dirac delta function is known as the singularity of the Green’s function and is impor-
tant in calculating the fields in the source region.
2. The different signs ensure that the integral converges for evanescent waves, i.e. when
kx2 + ky2 > k02 .
3. The square bracket in the expression of the Green’s functions can be expressed in terms
of superposition of TE and TM waves, as we shall see.
k̄ × ẑ 1 1
TE: ê(k0z ) = =q [x̂ky − ŷkx ] = (x̂ky − ŷkx ) , (24a)
|k̄ × ẑ| kx2 + ky2 kρ
1 k0 kρ
TM: ĥ(k0z ) = ê(k0z ) × k̄ = − z (x̂ky + ŷkx ) + ẑ . (24b)
k0 ko kρ ko
After translating to the origin, we get for the dyadic Green’s functions:
ZZ ∞ · ¸
1 0
dk̄⊥ ê(k0z )ê(k0z ) + ĥ(k0z )ĥ(k0z ) eik̄·(r̄−r̄ ) for z > z 0 ,
0 δ(r̄) i
k0z ·
G(r̄, r̄ ) = −ẑ ẑ 2 + 2 ZZ−∞
∞ ¸
k0 8π 1 0
d k̄ ê(−k )ê(−k ) + ĥ(−k ) ĥ(−k ) eiK̄·(r̄−r̄ ) for z < z 0 ,
0z 0z 0z 0z
⊥
k
−∞ 0z
(26)
where
(Note that K̄, ê(−k0z ) and ĥ(−k0z ) form another orthonormal set of vectors about K̄).
0 0
∞ dk̄⊥ 1
{[ê(k0z ) eik̄·r̄ ] ê(k0z ) e−ik̄r̄ +[ĥ(k0z ) eik̄·r̄ ] ĥ(k0z ) e−ik̄r̄ } for z > z 0 ,
RR
δ(r̄) i −∞ k 0z
G(r̄,r̄ 0 )=−ẑ ẑ 2 + 8π 2
k0 0 0
1
for z < z 0 ,
RR ∞
−∞ dk̄⊥ k 0z
{[ê(−k0z ) eiK̄·r̄ ] ê(−k0z ) e−iK̄ r̄ +[ĥ(−k0z ) eiK̄·r̄ ] ĥ(−k0z ) e−iK̄ r̄ }
(28)
If we now consider a layered medium problem, with an arbitrary number of layers and a
source in the top region (incident wave), we write:
∞ ½ ¾
1
ZZ
0 i −iK̄·r̄ 0 −iK̄·r̄ 0
G(r̄, r̄ )i0 = 2 dkx dky K̄e ê(−k0z ) e + K̄h ĥ(−k0z ) e (29)
8π −∞ k0z
with
1. For z < z 0 , i = 0:
2. For region `, i = `:
3. For region t, i = t:
where
q
k`z = k`2 − kx2 − ky2 , (33a)
k̄` =x̂kx + ŷky + ẑk`z , (33b)
K̄` =x̂kx + ŷky − ẑk`z , (33c)
and the coefficients A` , B` , C` and D` are determined from the boundary conditions.
The boundary conditions apply to the tangential electric and magnetic fields. Thus, in
terms of Green’s functions, we need to satisfy the continuity of ẑ × G(r̄, r̄ 0 ) and ẑ × ∇ ×
7
G(r̄, r̄0 ). Let us write this at the interface between media (`) and (` + 1), by separating
the TE and TM components:
A0 = R T E , B0 = 1 , C0 = R T M , D0 = 1 , (35a)
TE TM
At = 0 , Bt = T , Ct = 0 , Dt = T . (35b)
Before evaluating these coefficients, we can build a recursive scheme to calculate the ampli-
tudes from region ` to region ` + 1.
For example, it is straightforward to build a propagation matrix for TE modes from Eq. (34a)
and (34b): Ã ! Ã !
A`+1 eikz `+1 d`+1 TE A` eikz ` d`
=V (36)
B`+1 e−ikz `+1 d`+1 B` e−ikz ` d`
A similar procedure of course applied to the TM modes:
à ! à !
C`+1 eikz `+1 d`+1 TM C` eikz ` d`
=V (37)
D`+1 e−ikz `+1 d`+1 D` e−ikz ` d`
In order to end up the recursive method, we have to express the reflection and transmission
coefficient in the first and last regions, respectively. We shall only illustrated this point here, as
it has been developed in previous classes.
Let us consider a plane wave incident from region 0, with its plane of incidence parallel to
∂
the (xy) plane. All fields vectors are independent on y, so that ∂y = 0 in Maxwell’s equations.
Thus, we can decompose the fields into their TE and TM components. We get in region `:
• TE modes:
1 ∂
H`x = − E`y , (38a)
iωµ` ∂z
1 ∂
H`z = E`y , (38b)
iωµ` ∂x
µ 2 ¶
∂ ∂ 2
+ + ω ² ` ` E`y = 0 .
µ (38c)
∂x2 ∂y
8 2.4 Treatment of layered media
• TM modes:
1 ∂
E`x = H`y , (39a)
iω²` ∂z
1 ∂
E`z = − H`y , (39b)
iω²` ∂x
µ 2 ¶
∂ ∂ 2
+ + ω ² ` ` H`y = 0 .
µ (39c)
∂x2 ∂y
By matching the boundary conditions, and upon using the already known notation
µ` kz (`+1)
p`(`+1) = , (41a)
µ`+1 k`z
1 − p`(`+1)
R`(`+1) = . (41b)
1 + p`(`+1)
A` e2ik`z d`
2
[1 − 1/R`(`+1) ] e2i[k(`+1) z +k`z ]d`
= + A
, (42)
B` R`(`+1) 1/R`(`+1) e2ik(`+1) z d` + B`+1
`+1
Introduction
Model Theory is the part of mathematics which shows how to apply logic to
the study of structures in pure mathematics. On the one hand it is the ultimate
abstraction on the other, it has immediate applications to every-day mathematics.
The fundamental tenet of Model Theory is that mathematical truth, like all truth,
is relative. A statement may be true or false, depending on how and where it is
interpreted. This isn't necessarily due to mathematics itself, but is a consequence
of the language that we use to express mathematical ideas.
What at rst seems like a deciency in our language, can actually be shaped into
a powerful tool for understanding mathematics. This book provides an introduction
to Model Theory which can be used as a text for a reading course or a summer
project at the senior undergraduate or graduate level. It is also a primer which will
give someone a self contained overview of the subject, before diving into one of the
more encyclopedic standard graduate texts.
Any reader who is familiar with the cardinality of a set and the algebraic closure
of a eld can proceed without worry. Many readers will have some acquaintance
with elementary logic, but this is not absolutely required, since all necessary con-
cepts from logic are reviewed in Chapter 0. Chapter 1 gives the motivating examples
and we recommend that you read it rst, before diving into the more technical as-
pects of Chapter 0. Chapters 2 and 3 are selections of some of the most important
techniques in Model Theory. The remaining chapters investigate the relationship
between Model Theory and the algebra of the real and complex numbers. Thirty
exercises develop familiarity with the denitions and consolidate understanding of
the main proof techniques.
Throughout the book we present applications which cannot easily be found
elsewhere in such detail. Some are chosen for their value in other areas of mathe-
matics: Ramsey's Theorem, the Tarski-Seidenberg Theorem. Some are chosen for
their immediate appeal to every mathematician: existence of innitesimals for cal-
culus, graph colouring on the plane. And some, like Hilbert's Seventeenth Problem,
are chosen because of how amazing it is that logic can play an important role in
the solution of a problem from high school algebra. In each case, the derivation
is shorter than any which tries to avoid logic. More importantly, the methods of
Model Theory display clearly the structure of the main ideas of the proofs, showing
how theorems of logic combine with theorems from other areas of mathematics to
produce stunning results.
The theorems here are all are more than thirty years old and due in great part
to the cofounders of the subject, Abraham Robinson and Alfred Tarski. However,
we have not attempted to give a history. When we attach a name to a theorem, it
is simply because that is what mathematical logicians popularly call it.
The bibliography contains a number of texts that were helpful in the prepa-
ration of this manuscript. They could serve as avenues of further study and in
addition, they contain many other references and historical notes. The more recent
titles were added to show the reader where the subject is moving today. All are
worth a look.
This book began life as notes for William Weiss's graduate course at the Uni-
versity of Toronto. The notes were revised and expanded by Cherie D'Mello and
2
William Weiss, based upon suggestions from several graduate students. The elec-
tronic version of this book may be downloaded and further modied by anyone for
the purpose of learning, provided this paragraph is included in its entirety and so
long as no part of this book is sold for prot.
Contents
Chapter 0. Models, Truth and Satisfaction 4
Formulas, Sentences, Theories and Axioms 4
Prenex Normal Form 9
Chapter 1. Notation and Examples 11
Chapter 2. Compactness and Elementary Submodels 14
Compactness Theorem 14
Isomorphisms, elementary equivalence and complete theories 15
Elementary Chain Theorem 16
Lowenheim-Skolem Theorems 19
The L os-Vaught Test 21
Every complex one-to-one polynomial map is onto 23
Chapter 3. Diagrams and Embeddings 24
Diagram Lemmas 25
Every planar graph can be four coloured 25
Ramsey's Theorem 26
The Leibniz Principle and innitesimals 26
Robinson Consistency Theorem 27
Craig Interpolation Theorem 31
Chapter 4. Model Completeness 32
Robinson's Theorem on existentially complete theories 32
Lindstrom's Test 35
Hilbert's Nullstellensatz 38
Chapter 5. The Seventeenth Problem 39
Positive denite rational functions are the sums of squares 39
Chapter 6. Submodel Completeness 45
Elimination of quantiers 45
The Tarski-Seidenberg Theorem 49
Chapter 7. Model Completions 50
Almost universal theories 52
Saturated models 54
Blum's Test 55
Bibliography 61
Index 62
3
CHAPTER 0
6. if ' is ( ! ) then
A j= 'x0 : : : xq ] means that A j= x0 : : : xq ] implies A j= x0 : : : xq ],
7. if ' is ( $ ) then
A j= 'x0 : : : xq ] means that A j= x0 : : : xq ] i A j= x0 : : : xq ],
8. if ' is 8vi , then
A j= 'x0 : : : xq ] means for every x 2 A A j= x0 : : : xi;1 x xi+1 : : : xq ],
9. if ' is 9vi , then
A j= 'x0 : : : xq ] means for some x 2 A A j= x0 : : : xi;1 x xi+1 : : : xq ]:
Exercise 3. Each of the formulas of Example 2 is satised in any model A for
any language L by any (long enough) sequence x0 x1 : : : xq of A. This is where
you test your solution to Exercise 2.
We now prove two lemmas which show that the preceeding concepts are well-
dened. In the rst one, we see that the value of a term only depends upon the
values of the variables which actually occur in the term. In this lemma the equal
sign = is used, not as a logical symbol in the formal sense, but in its usual sense to
denote equality of mathematical objects | in this case, the values of terms, which
are elements of the universe of a model.
Lemma 1. Let A be a model for L and let t(v0 : : : vp ) be a term of L. Let
x0 : : : xq and y0 : : : yr be sequences from A such that p q and p r, and let
xi = yi whenever vi actually occurs in t(v0 : : : vp ). Then
tx0 : : : xq ] = ty0 : : : yr ]
.
Proof. We use induction on the complexity of the term t.
1. If t is vi then xi = yi and so we have
tx0 : : : xq ] = xi = yi = ty0 : : : yr ] since p q and p r:
2. If t is the constant symbol c, then
tx0 : : : xq ] = I (c) = ty0 : : : yr ]
where I (c) is the interpretation of c in A.
3. If t is F (t1 : : : tm) where F is an m-placed function symbol, t1 : : : tm are
terms and I (F ) = G, then
tx0 : : : xq ] = G(t1 x0 : : : xq ] : : : tm x0 : : : xq ]) and
ty0 : : : yr ] = G(t1 y0 : : : yr ] : : : tm y0 : : : yr ]).
By the induction hypothesis we have that ti x0 : : : xq ] = ti y0 : : : yr ] for
1 i m since t1 : : : tm have all their variables among fv0 : : : vp g. So
we have tx0 : : : xq ] = ty0 : : : yr ].
In the next lemma the equal sign = is used in both senses | as a formal
logical symbol in the formal language L and also to denote the usual equality
of mathematical objects. This is common practice where the context allows the
reader to distinguish the two usages of the same symbol. The lemma conrms that
satisfaction of a formula depends only upon the values of its free variables.
0. MODELS, TRUTH AND SATISFACTION 8
Lemma 2. Let A be a model for L and ' a formula of L, all of whose free and
bound variables occur among v0 : : : vp . Let x0 : : : xq and y0 : : : yr (q r p) be
two sequences such that xi and yi are equal for all i such that vi occurs free in '.
Then
A j= 'x0 : : : xq ] i A j= 'y0 : : : yr ]
Proof. Let A and L be as above. We prove the lemma by induction on the
complexity of '.
1. If '(v0 : : : vp ) is the formula t1 = t2 , then we use Lemma 1 to get:
A j= (t1 = t2 )x0 : : : xq ] i t1 x0 : : : xq ] = t2 x0 : : : xq ]
i t1 y0 : : : yr ] = t2 y0 : : : yr ]
i A j= (t1 = t2 )y0 : : : yr ]:
2. If '(v0 : : : vp ) is the formula R(t1 : : : tn ) where R is an n-placed relation
symbol with interpretation S , then again by Lemma 1, we get:
A j= R(t1 : : : tn )x0 : : : xq ] i S (t1 x0 : : : xq ] : : : tn x0 : : : xq ])
i S (t1 y0 : : : yr ] : : : tn y0 : : : yr ])
i A j= R(t1 : : : tn )y0 : : : yr ]:
3. If ' is (:), the inductive hypothesis gives that the lemma is true for . So,
A j= 'x0 : : : xq ] i not A j= x0 : : : xq ]
i not A j= y0 : : : yr ]
i A j= 'y0 : : : yr ]:
4. If ' is ( ^ ), then using the inductive hypothesis on and we get
A j= 'x0 : : : xq ] i both A j= x0 : : : xq ] and A j= x0 : : : xq ]
i both A j= y0 : : : yr ] and A j= y0 : : : yr ]
i A j= 'y0 : : : yr ]:
5. If ' is ( _ ) then
A j= 'x0 : : : xq ] i either A j= x0 : : : xq ] or A j= x0 : : : xq ]
i either A j= y0 : : : yr ] or A j= y0 : : : yr ]
i A j= 'y0 : : : yr ]:
6. If ' is ( ! ) then
A j= 'x0 : : : xq ] i A j= x0 : : : xq ] implies A j= x0 : : : xq ]
i A j= y0 : : : yr ] implies A j= y0 : : : yr ]
i A j= 'y0 : : : yr ]:
7. If ' is ( $ ) then
A j= 'x0 : : : xq ] i we have A j= x0 : : : xq ] i A j= x0 : : : xq ]
i we have A j= y0 : : : yr ] i A j= y0 : : : yr ]
i A j= 'y0 : : : yr ]:
8. If ' is 8vi , then
A j= 'x0 : : : xq ] i for every z 2 A A j= x0 : : : xi;1 z xi+1 : : : xq ]
i for every z 2 A A j= y0 : : : yi;1 z yi+1 : : : yr ]
i A j= 'y0 : : : yr ]:
The inductive hypothesis uses the sequences x0 : : : xi;1 z xi+1 : : : xq and
y0 : : : yi;1 z yi+1 : : : yr with the formula .
0. MODELS, TRUTH AND SATISFACTION 9
However, the ; denition of a limit of a function has prenex rank 2 and is much
more di!cult for students to comprehend at rst sight:
89 8x((0 < ^ 0 < jx ; aj < ) ! jF (x) ; Lj < ):
A formula of prenex rank 4 would make any mathematician look twice.
CHAPTER 1
10. (8x)(9y)(x + y = 0)
11. (8x)(8y)(x + y = y + x)
12. (8x)(8y)(8z )(x (y z ) = (x y) z )
13. (8x)(x 1 = x)
14. (8x)(x = 0 _ (9y)(y x = 1)
15. (8x)(8y)(x y = y x)
16. (8x)(8y)(8z )(x (y + z ) = (x y) + (y z ))
17. 0 6= 1
18. (8x)(8y)(8z )(x < y ! x + z < y + z )
19. (8x)(8y)(8z )(x < y ^ 0 < z ! x z < y z )
20. for each n 1 we have the formula
(8x0 )(8x1 ) (8xn )(9y)(xn yn + xn;1 yn;1 + + x1 y + x0 = 0 _ xn = 0)
k
z }| {
where, as usual, yk abbreviates y y y
The latter formulas express that each polynomial of degree n has a
root. The following formulas express the intermediate value property for
polynomials of degree n: if the polynomial changes sign from w to z , then
it is zero at some y between w and z .
21. for each n 1 we have
(8x0 ) : : : (8xn )(8w)(8z )(xn wn + xn;1 wn;1 + + x1 w + x0 )
(xn z n + xn;1 z n;1 + + x1 z + x0 ) < 0
! (9y )(((w < y ^ y < z ) _ (z < y ^ y < w))
^ (xn y n + xn;1 y n;1 + + x1 y + x0 = 0))]
The most fundamental concept is that of a sentence
being true when inter-
preted in a model A. We write this as A j=
, and we extend this concept in the
following denitions.
Definition 11. If " is a set of sentences, A is said to be a model of ", written
A j= ", whenever A j=
for each
2 ". " is said to be satisable i there is some
A such that A j= ".
Definition 12. A theory T is a set of sentences. If T is a theory and
is a
sentence, we write T j=
whenever we have that for all A if A j= T then A j=
.
We say that
is a consequence of T . A theory is said to be closed whenever it
contains all of its consequences.
Definition 13. If A is a model for the language L, the theory of A, denoted
by ThA, is dened to be the set of all sentences of L which are true in A,
f
of L : A j=
g
This is one way that a theory can arise. Another way is through axioms.
Definition 14. " T is said to be a set of axioms for T whenever " j=
for
every
in T in this case we write " j= T .
Remark. We will generally assume our theories are closed and we will often
describe theories by specifying a set of axioms ". The theory will then be all
consequences
of ".
Example 5. We will consider the following theories and their axioms:
1. NOTATION AND EXAMPLES 13
1. The theory of Linear Orderings (LOR) which has as axioms sentences 1-4
from Example 4.
2. The theory of Dense Linear Orders (DLO) which has as axioms all the
axioms of LOR, and sentence 5, 6 and 7 of Example 4.
3. The theory of Fields (FEI) which has as axioms sentences 8-17 from Example
4.
4. The theory of Ordered Fields (ORF) which has as axioms all the axioms of
FEI, LOR and sentences 18 and 19 from Example 4.
5. The theory of Algebraically Closed Fields (ACF) which has as axioms all
the axioms of FEI and all sentences from 20 of Example 4, i.e. innitely
many sentences, one for each n 1.
6. The theory of Real Closed Ordered Fields (RCF) which has as axioms all
the axioms of ORF, and all sentences from 21 of Example 4, i.e. innitely
many sentences, one for each n 1.
Exercise 6. Show that :
1. Q j= DLO
2. R j= RCF using the Intermediate Value theorem
3. C j= ACF using the Fundamental Theorem of Algebra
where Q, R and C are as in Example 3.
Remark. The theory of Real Closed Ordered Fields is sometimes axiomatized
dierently. All the axioms of ORF are retained, but the sentences from 21 of
Example 4, which amount to an Intermediate Value Property, are replaced by the
sentences from 20 for odd n and the sentence
(8x)(0 < x ! (9y)y2 = x)
which states that every positive element has a square root. A signicant amount of
algebra would then be used to verify the Intermediate Value Property from these
axioms.
CHAPTER 2
Remark. The Compactness Theorem is the only one for which we do not give
a complete proof. If the reader has not previously seen the Completeness Theorem,
there are other proofs of the Compactness Theorem which may be more easily
absorbed: set theoretic (using ultraproducts), topological (using compact spaces,
hence the name) or Boolean algebraic. However these topics are too far aeld to
enter into the proofs here. We will use the Compactness Theorem as a starting
point | in fact, all that follows can be seen as its corollaries.
Exercise 7. Suppose T is a theory for the language L and
is a sentence of
L such that T j=
. Prove that there is some nite T 0 T such that T 0 j=
.
Recall that T j=
i T f:
g is not satisable.
Definition 15. If L, and L0 are two languages such that L L0 we say that
L is an expansion of L and L is a reduction of L0
0
Definition 16. Given a model A for the language L, we can expand it to a
model A0 of L0 by giving appropriate interpretations to the symbols in L0 n L. We
say that A0 is an expansion of A to L0 and that A is a reduct of A0 to L. We also
use the notation A0 jL for the reduct of A0 to L.
Theorem 2. If a theory T has arbitrarily large nite models, then it has an
innite model.
Proof. Consider new constant symbols ci for i 2 N , the usual natural num-
bers, and expand from L, the language of T , to L0 = L fci : i 2 N g.
Let
" = T f:ci = cj : i 6= j i j 2 N g:
We rst show that every nite subset of " has a model by interpreting the
nitely many relevant constant symbols as dierent elements in an expansion of
some nite model of T . Then we use compactness to get a model A0 of ".
14
2. COMPACTNESS AND ELEMENTARY SUBMODELS 15
The model that we require is for the language L, so we take A to be the reduct
of A0 to L.
Claim. If the statement is true when ' is , then the statement is true when
' is :.
Proof of Claim. Fix k 2 N and a0 : : : ap in Ak .
A j= (:)a0 : : : ap ] i not A j= a0 : : : ap ]
i not Ak j= a0 : : : ap ]
i Ak j= (:)a0 : : : ap ]:
2. COMPACTNESS AND ELEMENTARY SUBMODELS 18
Claim. If the statement is true when ' is 1 and when ' is 2 then the state-
ment is true when ' is 1 ^ 2 .
Proof of Claim. Fix k 2 N and a0 : : : ap in Ak .
A j= (1 ^ 2 )a0 : : : ap ] i A j= 1 a0 : : : ap ] and A j= 2 a0 : : : ap ]
i Ak j= 1 a0 : : : ap ] and Ak j= 2 a0 : : : ap ]
i Ak j= (1 ^ 2 )a0 : : : ap ]:
Claim. If the statement is true when ' is then the statement is true when '
is 9vi .
Proof of Claim. Fix k 2 N and a0 : : : ap in Ak . Note that
A = fAj : j 2 Ng.
A j= 9vi a0 : : : ap ] i A j= 9vi a0 : : : aq ]
where q is the maximum of i and p (by Lemma 2),
i A j= a0 : : : ai;1 a ai+1 : : : aq ] for some a 2 A
i A j= a0 : : : ai;1 a ai+1 : : : aq ]
for some a 2 Al for some l k
i Al j= a0 : : : ai;1 a ai+1 : : : aq ]
since the statement is true for ,
i Al j= 9vi a0 : : : aq ]
i Ak j= 9vi a0 : : : aq ] since Ak Al
i Ak j= 9vi a0 : : : ap ] (by Lemma 2):
By induction on the complexity of ', we have proven the statement for all
formulas ' which do not contain the connectives _, ! and $ or the quantier 8.
To verify the statement for all ' we use Lemma 3. Let ' be any formula of L. By
Lemma 3 there is a formula which does not use _, !, $ nor 8 such that
j= (8v0 ) : : : (8vp )(' $ ):
Now x k 2 N and a0 : : : ap in Ak . We have
A j= (' $ )a0 : : : ap ] and Ak j= (' $ )a0 : : : ap ]:
A j= 'a0 : : : ap ] i A j= a0 : : : ap ]
i Ak j= a0 : : : ap ]
i Ak j= 'a0 : : : ap ]
which completes the proof of the theorem.
(2) for any formula (v0 : : : vq ) and any i q and any a0 : : : aq from A:
if there is some b 2 B such that
B j= a0 : : : ai;1 b ai+1 : : : aq ]
then we have some a 2 A such that
B j= a0 : : : ai;1 a ai+1 : : : aq ]:
Proof. Only the implication (2) ) (1) requires a lot of proof. We will prove
that for each formula '(v0 : : : vp ) and all a0 : : : ap from A we will have:
A j= 'a0 : : : ap ] i B j= 'a0 : : : ap ]
by induction on the complexity of ' using only the negation symbol :, the connec-
tive ^ and the quantier 9 (recall Lemma 3).
1. The cases of formulas of the form t1 = t2 and R(t1 : : : tn ) come immediately
from the fact that A B.
2. For negation: suppose ' is : and we have it for , then
A j= 'a0 : : : ap ] i not A j= a0 : : : ap ]
i not B j= a0 : : : ap ] i B j= 'a0 : : : ap ]:
3. The ^ case proceeds similarly.
4. For the 9 case we consider ' as 9vi . If A j= 9vi a0 : : : ap ], then the
inductive hypothesis for and the fact that A B ensure that
B j= 9vi a0 : : : ap ]. It remains to show that if B j= 'a0 : : : ap ] then
A j= 'a0 : : : ap ].
Assume B j= 9vi a0 : : : ap ]. By Lemma 2, B j= 9vi a0 : : : aq ]
where q is the maximum of i and p. By the denition of satisfaction, there
is some b 2 B such that
B j= a0 : : : ai;1 b ai+1 : : : aq ]:
By (2), there is some a 2 A such that
B j= a0 : : : ai;1 a ai+1 : : : aq ]:
By the inductive hypothesis on , for that same a 2 A,
A j= a0 : : : ai;1 a ai+1 : : : aq ]:
By the denition of satisfaction,
A j= 9vi a0 : : : aq ]:
Finally, by Lemma 2, A j= a0 : : : ap ].
Recall that jBj is used to represent the cardinality, or size, of the set B. Note
that since any language L contains innitely many variables, jLj is always innite,
but may be countable or uncountable depending on the number of other symbols.
We often denote an arbitrary innite cardinal by the lower case Greek letter .
Theorem 5. (Downward L owenheim-Skolem Theorem)
Let B be a model for L and let be any cardinal such that jLj < jBj. Then B
has an elementary submodel A of cardinality .
Furthermore if X B and jX j , then we can also have X A.
2. COMPACTNESS AND ELEMENTARY SUBMODELS 20
2 ".
Obtain a model C00 for L00 from D00 by simply switching elements of the universe
of D00 with A to ensure that for each a 2 A the interpretation of ca in C00 is a.
Hence the universe of C00 contains A and C00 j= ". Let C be the reduct of C00
to L. The following argument will show that A C.
Let ' be any formula of L and a0 : : : ap any elements from A. Let
be the
sentence of L0 formed by replacing free occurances of vi with cai . We have
A j= 'a0 : : : ap ] i
2 "
i D00 j=
i C00 j=
i C j= 'a0 : : : ap ]:
However, C may have size strictly larger than . In this case we obtain our
nal B by using the previous theorem to get B C with A B. It is now
straightforward to conclude that A B.
2. COMPACTNESS AND ELEMENTARY SUBMODELS 21
A graph with the property that every pair of vertices is connected with an edge
is called complete. At the other extreme, a graph with no edges is called discrete.
A very important theorem in nite combinatorics says that most graphs contain an
3. DIAGRAMS AND EMBEDDINGS 26
We now use Model Theory to go from the innite to the nite. Let
be
the sentence, of the language of graph theory, asserting that there is no complete
subgraph of size n.
(8x1 : : : 8xn ):E (x1 x2 ) _ :E (x1 x3 ) _ _ :E (xn;1 xn )]:
Let be the sentence asserting that there is no discrete subgraph of size n.
(8x1 : : : 8xn )E (x1 x2 ) _ E (x1 x3 ) _ _ E (xn;1 xn )]:
Let T be the set consisting of
, and the axioms of graph theory.
If there is no r as Ramsey's Theorem states, then T has arbitrarily large nite
models. By Theorem 2, T has an innite model, contradicting the claim.
The following theorem of A. Robinson nally solved the centuries old problem
of innitesimals in the foundations of calculus.
Theorem 11. (The Leibniz Principle)
There is an ordered eld R called the hyperreals, containing the reals R and an
innitesimal number such that any statement about the reals which holds in R also
holds in R.
< 0 1 i. We will make the statement of the theorem
Proof. Let R be hR + <
precise by proving that there is some model B, in the same language L as R and
with the universe called R , such that R B and there is b 2 R such that
0 < b < a for each positive a 2 R.
3. DIAGRAMS AND EMBEDDINGS 27
For each real number a, we introduce a new constant symbol ca , and in addition
another new constant symbol d. Let " be the set of sentences in the expanded
language given by:
ThRR f0 < d < ca : a is a positive real g
We can obtain a model C j= " by the compactness theorem. Let C0 be the reduct
of C to L. By the elementary diagram lemma R is elementarily embedded in C0 ,
and so there is a model B for L such that C0
= B and R B.
of L such that T1 j=
and T2 j= :
.
Proof. The direction ) is easy and motivates the whole theorem.
We begin the proof in the ( direction. Our goal is to show that T1 T2 is
satisable. The following claim is a rst step.
Claim. T1 f sentences
of L : T2 j=
g is satisable.
Proof of Claim. Using the compactness theorem and considering conjunc-
tions, it su!ces to show that if T1 j=
1 and T2 j=
2 with
2 a sentence of L, then
f
1
2 g is satisable. But this is true, since otherwise we would have
1 j= :
2 and
hence T1 j= :
2 and so :
2 would be a sentence of L contradicting our hypothesis.
The basic idea of the proof from now on is as follows. In order to construct a
model of T1 T2 we construct models A j= T1 and B j= T2 and an isomorphism
f : AjL ! BjL between the reducts of A and B to the language L, witnessing
= BjL. We then use f to carry over interpretations of symbols in L1 n L
that AjL
from A to B , giving an expansion B of B to the language L1 L2 . Then, since
B jL1
= A and B jL2 = B we get B j= T1 T2 .
The remainder of the proof will be devoted to constructing such an A, B and
f . A and B will be constructed as unions of elementary chains of An's and Bn 's
while f will be the union of fn : An ,! Bn .
3. DIAGRAMS AND EMBEDDINGS 28
The other links in the elementary chain are provided by the following result.
Claim. For each n 0 there are models An+1 j= T1 and Bn+1 j= T2 with an
elementary embedding
fn+1 : An+1 jL ,! Bn+1 jL
such that
An An+1 Bn Bn+1 fn+1 extends fn and Bn range of fn+1 :
A0 A1 An An+1
#f0 #f1 #fn #fn+1
B0 B1 Bn Bn+1
The
S
proof of this
S
claim will be discussed
S
shortly. Assuming the claim, let
A = n2N An , B = n2N Bn and f = n2N fn . The Elementary Chain Theorem
gives that A j= T1 and B j= T2 . The proof of the theorem is concluded by simply
verifying that f : AjL ! BjL is an isomorphism.
The proof of the claim is long and quite technical it would not be inappropriate
to omit it on a rst reading. The proof, of course, must proceed by induction on
3. DIAGRAMS AND EMBEDDINGS 29
n. The case of a general n is no dierent from the case n = 0 which we state and
prove in some detail.
Claim. There are models A1 j= T1 and B1 j= T2 with an elementary embedding
f1 : A1 jL ,! B1 jL such that A0 A1 , B0 B1 , f1 extends f0 and
B0 range of f1.
A0 A1
#f0 #f1
B0 B1
Proof of Claim. Let A+0 be the expansion of A0 to the language L+1 = L1
fca : a 2 A0 g formed by interpreting each ca as a;2 A0 A+
0 is just another notation
for (A0 )A0 . The elementary diagram of A+0 is Th A+0 A+0 . Let B0 be the expansion
of B0 jL to the language
L = L fca : a 2 A0 g fcb : b 2 B0 g
formed by interpreting each ca as f0 (a) 2 B0 and each cb as b 2 B0 .
;
We wish to prove that Th A+0 A+0 ThB0 is satisable. By the compactness
;
theorem it su!ces to prove that Th A+0 A+0 f
g is satisable for each
in ThB0 .
For such a sentence
, let ca0 : : : cam cb0 : : : cbn be all those constant symbols
occuring in
but not in L. Let '(u0 : : : um w0 : : : wm ) be the formula of L
obtained from
by replacing each constant symbol cai by a new variable ui and
each constant symbol cbi by a new variable wi . We have B0 j=
so
B0 jL j= 'f0 (a0 ) : : : f0 (am ) b0 : : : bn ]
So B0 jL j= 9w0 : : : 9wn 'f0 (a0 ) : : : f0 (am )]
Since f0 is an elementary embedding we have :
A0 jL j= 9w0 : : : 9wn 'a0 : : : am ]
Let '^(w0 : : : wn ) be the formula of L+1 obtained by replacing occurances of ui
in '(u0 : : : um w0 : : : wn ) by cai then A+0 j= 9w0 : : : 9wn '^. So, of course,
; +
A0 A+0 j= 9w0 : : : 9wn '^
and this means that there are d0 : : : dn in A+0 = A0 such that
(A+0 )A+0 j= '^d0 : : : dn ]:
; +
We can now expand A0 A+0 to a model D by interpreting each cbi as di to obtain
;
D j=
and so Th A+0 A+0 f
g is satisable.
;
Let E j= Th A+0 A+0 ThB0 . By the elementary diagram lemma A+0 is ele-
mentarily embedded into EjL+1 . So there is a model A+1 for L+1 with A+0 A+1 and
an isomorphism g : A+1 ! EjL+1 . Using g we expand A+1 to a model A01 isomorphic
to E. Let A1 denote A01 jL we have A1 j= ThB0 .
;
We now wish to prove that Th (A1 )A1 Th B+0 B+0 is satisable, where B+0 is
the common expansion of B0 and B0 to the language
L2+ = L2 fca : a 2 A0 g fcb : b 2 B0 g:
3. DIAGRAMS AND EMBEDDINGS 30
Model Completeness
The quantier 8 is sometimes said to be the universal quantier and the quan-
tier 9 to be the existential quantier.
A formula ' is said to be quantier free whenever no quantiers occur in '.
A formula ' is said to be universal whenever it is of the form 8x0 : : : 8xk
where is quantier free.
A formula ' is said to be existential whenever it is of the form 9x0 : : : 9xk
where is quantier free.
A formula ' is said to be universal-existensial whenever it is of the form
8x0 : : : 8xk 9y0 : : : 9yk where is quantier free.
We extend these notions to theories T whenever each axiom
of T has the
property.
Remark. Note that each quantier free formula ' is trivially equialent to the
existential formula 9vi ' where vi does not occur in '.
Exercise 18. Let A and B be models for L with A B. Verify the following
four statements:
(i) A B i BA j= Th(AA ) i AA j= Th(BA ):
(ii) A B i BA j= 4A i BA j=
for each quantier free
of Th(AA ):
(iii) A B i BA j=
for each existential
of Th(AA ):
(iv) A B i AA j=
for each universal
of Th(BA ):
Definition 28. A model A of a theory T is said to be existentially closed if
whenever A B and B j= T , we have AA j=
for each existential
of Th(BA ):
Remark. If A is existentially closed and A0 = A then A0 is also existentially
closed.
Definition 29. A theory T is said to be model complete whenever T 4A
is complete in the language LA for each model A of T .
Theorem 14. ( A. Robinson )
Let T be a theory in the language L. The following are equivalent:
(1) T is model complete,
(2) T is existentially complete, i.e. each model of T is existentially closed.
(3) for each formula '(v0 : : : vp ) of L there is a universal formula (v0 : : : vp )
such that T j= (8v0 : : : 8vp )(' $ )
(4) for all models A and B of T , A B implies A B.
Proof. (1) ) (2):
Let A j= T and B j= T with A B. Clearly AA j= 4A it is also easy to see that
BA j= 4A. Now by (1), T 4A is complete and both AA and BA are models of
this theory so they are elementarily equivalent.
32
4. MODEL COMPLETENESS 33
So let
be any sentence of LA (existential or otherwise). If BA j=
then
AA j=
and (2) follows.
(2) ) (3):
Lemma 4 shows that it su!ces to prove it for formulas ' in prenex normal form.
We do this by induction on the prenex rank of ' which is the number of alternations
of quantiers in '. The rst step is prenex rank 0. Where only universal quantiers
are present the result is trivial. The existential formula case is non-trivial it is the
following claim:
Claim. For each existential formula '(v0 : : : vp ) of L there is a universal
formula (v0 : : : vp ) such that
T j= (8v0 ) : : : (8vp )(' $ )
Proof of Claim. Add new constant symbols c0 : : : cp to L to form
L = L fc0 : : : cp g
and to form a sentence ' of L obtained by replacing each instance of vi in ' with
the corresponding ci ' is an existential sentence. It su!ces to prove that there is
a universal sentence of L such that T j= ' $ .
Let C j= ". By the diagram lemma, there is a model B for L such that AjL B
and B = CjL. Thus both AjL and B are models of T . Furthermore BA = C so
that BA j= ' . We now use (2) to get that (AjL)A j= ' . But (AjL)A is just AA
and so A j= ' . This means T ; j= ' and nishes the proof of the claim.
We will now do the general cases for the proof of the induction on prenex rank.
There are two cases, corresponding to the two methods available for increasing the
number of alternations of quantiers:
(a) the addition of universal quantiers
(b) the addition of existential quantiers.
For the case (a), suppose '(v0 : : : vp ) is 8w0 : : : 8wm (v0 : : : vp w0 : : : wm )
and has prenex rank lower than ' so that we have by the inductive hypothesis
that there is a quantier free formula (v0 : : : vp w0 : : : wm x0 : : : xn ) with new
variables x0 : : : xn such that
T j= (8v0 : : : 8vp 8w0 : : : 8wm )( $ 8x0 : : : 8xn )
Therefore, case (a) is concluded by noticing that this gives us
T j= (8v0 : : : 8vp )(8w0 : : : 8wm $ 8w0 : : : 8wm 8x0 : : : 8xn ):
Exercise 19. Check this step using the denition of satisfaction.
For case (b), suppose '(v0 : : : vp ) is 9w0 : : : 9wn (v0 : : : vp w0 : : : wm ) and
has prenex rank less than '. Here we will use the inductive hypothesis on :
which of course also has prenex rank less than '. We obtain a quantier free formula
(v0 : : : vp w0 : : : wm x0 : : : xn ) with new variables x0 : : : xn such that
T j= (8v0 : : : 8vp 8w0 : : : 8wm )(: $ 8x0 : : : 8xn )
Exercise 21. Suppose A A0 are models for L. Prove that for each sentence
of LA , if 4A0 j=
then 4A j=
.
Exercise 22. Prove that if T has a universal-existential set of axioms, then
the union of a chain of models of T is also a model of T .
Remark. The converse of this last exercise is also true it is called the Chang-
L os- Suszko Theorem.
Theorem 16. The following theories are model complete:
1. dense linear orders without endpoints. (DLO)
2. algebraically closed elds. (ACF)
Proof. (DLO): This theory has a universal existential set of axioms so that it
is closed under unions of chains. It is @0 -categorical (by Exercise 11) so Lindstrom's
test applies.
(ACF): We rst prove that for any xed characteristic p, the theory of alge-
braically closed elds of characteristic p is model complete. The proof is similar to
that for DLO, with @1 -categoricity (Lemma 7 ).
Let A B be algebraically closed elds. They must have the same character-
istic p. Therefore A B.
Corollary 4. Any true statement about the rationals involving only the usual
ordering is also true about the reals.
4. MODEL COMPLETENESS 38
Proof. Let A = hQ < <11 i and B = hR<<22 i where <1 and <2 are the usual
orderings. The precise version of this corollary is: A B. This follows from
Theorem 14 and Theorem 16 and the easy facts that A j= DLO, B j= DLO and
A B. The reader will appreciate the power of these theorems by trying to prove
A B directly, without using them.
Corollary 5. (Hilbert's Nullstellensatz)
Let " be a nite system of polynomial equations and inequations in several
variables with coecients in the eld A. If " has a solution in some eld extending
A then " has a solution in the algebraic closure of A.
Proof. Let
be the existential sentence of the language LA which asserts
the fact that there is a solution of ". Suppose " has a solution in a eld B with
A B. Then BA j=
. So B0A j=
where B0 is the algebraic closure of B. Let A0
be the algebraic closure of A. Since A B, we have A0 B0 .
By Theorem 16, ACF is model complete, so A0 B0 . Hence A0A B0A and
0
AA j=
.
CHAPTER 5
The following lemma shows why we deal with sums of rational functions rather
than sums of polynomials.
Lemma 10. x4 y2 + x2 y4 ; x2 y2 + 1 is positive denite, but not the sum of
squares of polynomials.
Proof. Let the polynomial be p(x y). A little calculus shows that the mini-
26 and conrms that p is positive denite.
mum value of p is 27
Suppose
l
X
p(x y) = (qi (x y))2
i=1
where qi (x y) are polynomials, each of which is the sum of terms of the form axm yn .
First consider powers of x and the largest exponent m which can occur in any of
the qi . Since no term of p contains x6 or higher powers of x, we see that we must
have m 2. Considering powers of y similarly gives that each n 2. So each
qi (x y) is of the form:
ai x2 y2 + bi x2 y + ci xy2 + di x2 + ei y2 + fi xy + gi x + hi y + ki
for some coe!cients ai bi ci di ei fi gi hi and ki . Comparing coe!cients of x4 y4
in p and the sum of the qi2 gives
l
X
0= a2i
i=1
so each ai = 0. Comparing the coe!cients of x4 and y4 gives that each di = 0 = ei .
Now comparing the coe!cients of x2 and y2 gives that each gi = 0 = hi . Now
comparing the coe!cients of x2 y2 gives
l
X
;1 = fi2
i=1
which is impossible.
Lemma 12. Let A = hA + < <A 0 1 i be an ordered eld such that each posi-
tive element of A is the sum of squares of elements of A. Let B be a eld containing
the reduct of A to f+ 0 1g as a subeld and such that zero is not the sum of non-
zero squares in B.
Let b 2 B n A be such that b is not the sum of squares of elements of B. Then
there is an ordering <B extending <A on B such that b <B 0.
Proof. It su!ces to nd a set P B of \positive elements" of B such that
(1) ;b 2 P
(2) 0 2= P
(3) c2 2 P for each c 2 B
(4) P is closed under + and
(5) for any c 2 B n f0g either c 2 P or ;c 2 P .
Once P has been obtained, we dene <B as follows:
c1 <B c2 i c2 ; c1 2 P:
For each a 2 A, if 0 <A a then a is a sum of squares and so by (3) and (4) a 2 P .
Thus <B extends < A .
So that all that remains to do is to construct such a P . The rst approximation
to P is P0 .
8 9
<Xl m
X =
Let P0 = : c2i ; d2j b : l m 2 N ci 2 B dj 2 B not all zero
i=1 j =1
We claim that (1), (2), (3) P and (4) holdPfor P0 . (1) and (3) are obvious. In
order to verify (2), note that if m 2 l 2
j =1 dj b = i=1 ci , then by the previous lemma
about reciprocals of sums of squares, b would be a sum of squares. Now (4) holds
by denition of P0 , noting that c2i (;d2j b) = ;(ci dj )2 b and
(;d2j b)(;d2k b) = (dj dk b)2 .
We now construct larger and larger versions of P0 to take care of requirement
(5). We do this in the following way. Suppose P0 P1 , P1 satises (1), (2), (3)
and (4), and c 2= P1 . We dene P2 to be:
fp(;c) : p is a polynomial with coe!cients in P1 g:
It is easy to see that ;c 2 P2 , P1 P2 and that (1), (3) and (4) hold for P2 .
To show that (2) holds for P2 we suppose that p(;c) = 0 and bring forth a
contradiction. Considering even and odd exponents we obtain:
p(x) = q(x2 ) + xr(x2 )
for some polynomials q and r with coe!cients in P1 . If r(c2 ) = 0 then q(c2 ) = 0.
But q(c2 ) is in P1 , which is a contradiction. On the other hand, if r(c2 ) 6= 0 then
0 = p(;c) = q(c2 ) ; cr(c2 )
which means that
2
2 2 1
c = q(c ) r(c ) r(c2 )
and since each of the factors on the right hand side is in P1 we get a contradiction.
Now we need:
5. THE SEVENTEENTH PROBLEM 42
Lemma 13. Every ordered eld can be embedded as a submodel of a real closed
ordered eld.
Proof. It su!ces to prove that for every ordered eld A there is an ordered
eld B such that A B and for each natural number n 1, B j=
n where
n is
the sentence in the language of eld theory which formally states:
If p is a polynomial of degree at most n and w < y such that
p(w) < 0 < p(y) then there is an x such that w < x < y and
p(x) = 0.
Consider the statement called IH(n):
For any ordered eld E there is an ordered eld F such that E F
and E j=
n .
IH(1) is true since any ordered eld E j=
1 . We will prove below that for each
n, IH(n) implies IH(n + 1).
Given our model A j= ORF, we will then be able to construct a chain of models:
A B1 B2 : : : Bn Bn+1
such that each Bn j= ORF f
n g. Let B be the union of the chain. Since the
theory ORF is preserved under unions of chains (see Exercise 22), B j= ORF.
Furthermore, the nature of the sentences
n allows us to conclude that for each n,
B j=
n and so B j= RCF. All that remains is to prove that for each n, IH(n)
implies IH(n + 1). We rst make a claim:
Claim. If E j= ORF f
n g and p is a polynomial of degree at most n + 1 with
coecients from E and a < d are in E such that p(a) < 0 < p(d) then there is
a model F such that E F, F j= ORF and there is b 2 such that a < b < d and
p(b) = 0.
Let us rst see how this claim helps us to prove that IH(n) implies IH(n+1).
Let E j= ORF we will use the claim to build a model F such that E F and
F j=
n+1 .
We rst construct a chain of models of ORF
E = E0 E1 : : : Em Em+1
such that for each m and each polynomial p of degree at most n +1 with coe!cients
from Em and each pair of a, d of elements of Em such that p(a) < 0 < p(d) there
is a b 2 Em+1 such that a < b < d and p(b) = 0.
Suppose Em has been constructed we construct Em+1 as follows: let "m be
the set of all existential sentences of LEm of the form
(9x)(ca < x ^ x < cd ^ p(x) = 0)
where p is a polynomial of degree at most n + 1 and such that ca , cd and the
coe!cients of the polynomial p are constant symbols from LEm and
(Em )Em j= p(ca ) < 0 ^ 0 < p(cd)
We claim that
ORF 4Em "m
is satisable.
5. THE SEVENTEENTH PROBLEM 43
Using the Compactness Theorem, it su!ces to nd, for each nite subset
f1 : : : k g
of "m , a model C such that Em C and
C j= ORF f1 : : : k g:
By IH(n), obtain a model F1 such that Em F1 and F1 j= ORF f
n g. By the
claim, obtain a model F2 such that F1 F2 and F2 j= ORF f1 g. Again by IH(n),
obtain F3 such that F2 F3 and F3 j= ORF f
n g. Again by the claim, obtain F4
such that F3 F4 and F4 j= ORF f2 g. Continue in this manner, getting models
of ORF
Em F1 : : : F2k
with each F2j j= j . Since each j is existential, we get that F2k is a model of each
j (see Exercise 18).
Let D j= ORF 4Em "m
and then use the Diagram Lemma to get Em+1 such that Em Em+1 , Em+1 j= ORF
and Em+1 j= "m , thus satisfying the required property concerning polynomials from
Em .
Let F be the union of the chain. Since ORF is a universal-existential theory,
F j= ORF (see Exercise 22) and F j=
n+1 by construction. So IH(n + 1) is proved.
We now nish the entire proof by proving the claim.
Proof of Claim. If p(x) = 0 for some x in E such that a < x < d then we
can let F = E. Otherwise, introduce a new element b to E where the place of b in
the ordering is given by:
b = supremumft 2 E : t < d and p(t) < 0g:
Note that continuity-style considerations show that b 6= d.
We now show that since E j=
n the polynomial p cannot be factored in E.
Suppose p(x) = q(x) s(x). The denition of b allows us to nd a1 and d1 such that
a1 < b < d1 and p(a1 ) < 0 < p(d1 ) and such that, other than possibly b, q and s
have no roots in the interval of E between a1 and d1 . Now since p(a1 ) p(d1 ) < 0
either q(a1 ) q(d1 ) < 0 or s(a1 ) s(d1 ) < 0 and q and s each have degree n, forcing
b to be an element of E.
The fact that p is irreducible over E means that we can extend hE + 0 1i
by quotients of polynomials in b of degree n to form a eld hF + 0 1i in the
usual way. We leave the details to the reader.
Note that the construction cannot force q(b) = 0 for any polynomial q(x) with
coe!cients from E of degree n. This is because we could take such a q(x) of
lowest degree and divide p(x) by it to get
p(x) = q(x) s(x) + r(x)
where degree of r < degree of q. This means that r(x) = 0 constantly and so p
could have been factored over E.
Now we must expand hF + 0 1 i to an ordered eld F while preserving the
order of E. We are aided in this by the fact that if q is a polynomial of degree at
most n with coe!cients from E then there are a1 and a2 in E such that a1 < b < a2
and q doesn't change sign between a1 and a2 this comes from the fact that E j=
n .
5. THE SEVENTEENTH PROBLEM 44
Submodel Completeness
Definition 30. A theory T is said to admit elimination of quantiers in L
whenever for each formula '(v0 : : : vp ) of L there is a quantier free formula
(v0 : : : vp ) such that:
T j= (8v0 : : : 8vp )('(v0 : : : vp ) $ (v0 : : : vp ))
Remark. There is a ne point with regard to the above denition. If ' is
actually a sentence of L there are no free variables v0 : : : vp . So T j= ' $
for some quantier free formula with no free variables. But if L has no constant
symbols, there are no quantier free formulas with no free variables. For this
reason we assume that L has at least one constant symbol, or we restrict to those
formulas ' with at least one free variable. This will become relevant in the proof
of Theorem 17 for (2) ) (3).
Exercise 24. If T admits elimination of quantiers in L and L has no constant
symbols, show that for each sentence
of L there is a quantier free formula (v0 )
such that
T j=
$ 8v0 $ 9v0
Definition 31. A theory T is said to be submodel complete whenever T 4A
is complete in LA for each submodel A of a model of T .
Exercise 25. Use Theorem 14 and the following theorem to nd four proofs
that every submodel complete theory is model complete.
Theorem 17. Let T be a theory of a language L. The following are equivalent:
(1) T is submodel complete
(2) If B and C are models of T and A is a submodel of both B and C, then every
existential sentence which holds in BA also holds in CA .
(3) T admits elimination of quantiers
(4) whenever A B, A C, B j= T and C j= T there is a model D such that
both BA and CA are elementarily embedded in DA .
Proof. (1) ) (2)
Let B j= T and C j= T with A B and A C. Then BA j= T 4A and
CA j= T 4A. So (1) and Lemma 6 give BA CA . Thus (2) is in fact proved for
all sentences, not just existential ones.
(2) ) (3)
Lemma 4 shows that it su!ces to prove (3) for formulas in prenex normal form.
We do this by induction on the prenex rank of '. This claim is the rst step.
Claim. For each existential formula '(v0 : : : vp ) of L there is a quantier free
formula '(v0 : : : vp ) such that
T j= (8v0 : : : 8vp )(' $ )
45
6. SUBMODEL COMPLETENESS 46
We now do the general cases for the proof of the induction on prenex rank.
There are two cases, corresponding to the two methods available for increasing the
number of alternations of quantiers:
(a) the addition of universal quantiers
(b) the addition of existential quantiers.
For case (a), suppose '(v0 : : : vp ) is 8w0 : : : 8wm(v0 : : : vp w0 : : : wm ) and
has prenex rank lower than '. Then : also has prenex rank lower than '
and we can use the inductive hypothesis on : to obtain a quantier free formula
1 (v0 : : : vp w0 : : : wm ) such that
T j= (8v0 : : : 8vp )(8w0 : : : 8wm )(: $ 1 )
So T j= (8v0 : : : 8vp )(9w0 : : : 9wm : $ 9w0 : : : 9wm 1 )
6. SUBMODEL COMPLETENESS 47
Suppose
is such a sentence, Th(BB ) j=
. Let fca0 : : : cap g be the set of
constant symbols from LA n L appearing in
.
Let '(v0 : : : vp ) be obtained from
by exchanging each cai for a new variable
ui . Let (v0 : : : vp ) be the quantier free formula from (3):
T j= (8v0 : : : 8vp )(' $ )
Let be the result of substituting cai for each ui in . is also quantier
free.
Since BB j=
, B j= 'a0 : : : ap ]. Since B j= T , B j= a0 : : : ap ] and so
BA j= . Since is quantier free and AA BA we have AA j= since
AA CA we then get that CA j= . Hence C j= a0 : : : ap ] and then since
C j= T we then get that C j= 'a0 : : : ap ]. But then this means that CA j=
and
so CC j=
so
is in Th(CC ) and we are done.
(4) ) (1)
Let B j= T and A B we show that T 4A is complete. Noting that BA j=
T 4A , we see that it su!ces by Lemma 6 to show that BA C0 for each
C0 j= T 4A .
For each such C0 , by the Diagram Lemma, there is a model C for L such that
A C and CA = C0 . Then C j= T so by (4) there is a D into which both BA and
CA are elementarily embedded.
In particular BA DA CA so we are done.
unary relation symbols and R and S are binary relation symbols having axioms
which state that there are innitely many things, that U V W is everything,
that U , V and W are pairwise disjoint, that R is a one-to-one function from U onto
V and that S is a one-to-one function from U V onto W .
Exercise 26. Show that T above is complete and model complete but not
submodel complete.
Hints: For completeness, use the L os-Vaught test and for model completeness
use Lindstrom's test. For submodel completeness use (2) of the theorem with
B j= T and A B where a 2 A = fb 2 B : B j= W (v0 )b]g along with the
sentence
(9v0 )(U (v0 ) ^ S (v0 ca )):
Remark. We will prove that each of the following theories admits elimination
of quantiers:
1. dense linear orders with no end points (DLO)
2. algebraically closed elds (ACF)
3. real closed ordered elds (RCF)
C. H. Langford proved elimination of quantiers for DLO in 1924. The cases of
ACF and RCF were more di!cult and were done by A. Tarski. Thus, by Exercise 25,
we will have model completeness of RCF which was promised at the beginning of
Chapter 5.
Exercise 27. Use part (4) of the previous theorem and the fact that RCF
admits elimination of quantiers to prove that RCF is complete another result
originally due to A. Tarski.
Hint: Show that Q of Example 3 can be isomorphically embedded into any real
closed eld and then use (4) from Theorem 17.
Exercise 28. Let T be the theory DLO in the language L = f< c1 c2 g where
c1 and c2 are constant symbols. Use the fact that DLO admits elimination of
quantiers in its own language f<g to show that T is submodel complete. But,
show also, that T is not complete.
As an application of quantier elimination of ACF we have the following:
Corollary 7. (Tarski)
The truth value of any algebraic statement about the complex numbers can be de-
termined algebraically in a nite number of steps.
Proof. Let C be the complex numbers in the language of eld theory L let
Remark. In fact Tarski's original proof actually gave an explicit method for
nding the quantier free formulas and this led, via the corollary above, to an ef-
fective decision proceedure for determining the truth of elementary algebraic state-
ments about the reals or the complex numbers.
6. SUBMODEL COMPLETENESS 49
Model Completions
Closely related to the notions of model completeness and submodel complete-
ness is the idea of a model completion.
Definition 32. Let T T be two theories in a language L. T is said to be
a model completion of T whenever T 4A is satisable and complete in LA for
each model A of T .
Lemma 14. Let T be a theory in a language L.
(1) If T is a model completion of T , then for each A j= T there is a B j= T
such that A B.
(2) If T is a model completion of T , then T is model complete.
(3) If T is model complete, then it is a model completion of itself.
(4) If T1 and T2 are both model completions of T , then T1 j= T2 and T2 j= T1 .
Proof. (1) Easy. Just use the diagram lemma and the word \satisable" in
the denition of model completion.
(2) Easier.
(3) Easiest.
(4) This needs a proof. Let A j= T2 . It will su!ce to prove that A j= T1 .
Let A0 = A. since A0 j= T and T1 is a model completion of T we obtain, from
(1), a model A1 j= T1 such that A0 A1. Similarly, since A1 j= T and T2 is a
model completion of T we obtain A2 j= T2 such that A1 A2 .
Continuing in this manner we obtain a chain:
A0 A1 A2 : : : An An+1
Let B be the union of the chain, fAn : n 2 N g. Each even A2n j= T2 . By (2)
and (4) of Theorem 14 we get that for each n, A2n A2n+2 and by the Elementary
Chain Theorem A0 B. Similarly A1 B. so A0 A1 and hence A j= T1 .
Remark. Part (4) of the above lemma shows that model completions are es-
sentially unique. That is, if model completions T1 and T2 of T are closed theories
in the sense of Denition 12 then T1 = T2 . Since there is no loss in assuming
that model completions are closed theories, we speak of the model completion of a
theory T .
Lemma 15. If T T are theories for a language L such that for each A j= T
there is a B j= T such that A B, then the following are equivalent:
(1) T is the model completion of T .
(2) For each A j= T , B j= T and C j= T such that A B and A C we have
a model D such that both BA and CA are elementarily embedded into DA .
50
7. MODEL COMPLETIONS 51
and hence
is in ThCA .
(2) ) (3) and (3) ) (4) easily follow from the denitions.
(4) ) (1) We rst show that T is model complete using Theorem 14 we show
that T is existentially complete. Let A j= T we show that A is existentially
closed. Let B j= T such that A B and let
be an existential sentence of LA
with BA j=
our aim is to prove that AA j=
.
We invoke (4) with C = A to get a model D such that A D and an isomorphic
embedding f : BA ! DA . Since
is existential it is of the form 9v0 : : : 9vp ' for
some quantier free formula '(v0 : : : vp ) of LA .
BA j= 9v0 : : : 9vp '
So for some b0 : : : bp in B we have
BA j= 'b0 : : : bp ]:
By Exercises 8 and 18 we have
DA j= 'f (b0 ) : : : f (bp )]
and so
DA j= 9v0 : : : 9vp ':
Now A D implies that AA DA so
AA j= 9v0 : : : 9vp ':
Hence AA j=
and T is model complete.
We now show that T is the model completion of T . Let A j= T by the
hypothesis on T and T we have that T 4A is satisable. We show that T 4A
is complete in LA by showing that for each B j= T and C j= T with A B and
A C we have BA CA .
Letting B and C be as above, we invoke (4) to obtain a model D such that
BA is isomorphically embedded into DA and C D. C D gives that D j= T .
The isomorphic embedding gives us a model E such that B E and DA = EA . So
E j= T . Using model completeness of T and Theorem 14 we can conclude that
B E. We have:
BA EA DA CA
and we are done.
Let's compare the denitions of model completion and submodel complete. Let
T be the model completion of T . Then T will be submodel complete provided
7. MODEL COMPLETIONS 52
The way to show that DLO, ACF and RCF admit elimination of quantiers is
now clear. We will use Theorem 17 and 18. This reduces to showing that DLO,
ACF and RCF are the model completions of LOR, FEI and ORF respectively. To
do this we will use Lemma 15, so we rst need to show that each pair of these
theories satisfy the general hypothesis of Lemma 15: if A j= T then there is a
B j= T such that A B.
For the case T = LOR and T = DLO is easy every linear order can be
enlarged to a dense linear order without endpoints by judiciously placing copies of
the rationals into the linear order.
The case T = FEI and T = ACF is just the well known fact that every eld
has an algebraic closure.
The case T = ORF and T = RCF is just Lemma 13.
So all that remains of the quest to prove elimination of quantiers for DLO,
ACF and RCF is to verify condition (4) of Lemma 15 in each of these cases. We
rephrase this condition slightly as:
For each A j= T , B j= T and C j= T with A B and A C there
is a D such that C D and an isomorphic embedding f : B ,! D
such that f A is the identity on A.
At this point the reader may already be able to verify this condition for one or
more of the pairs T = LOR and T = DLO, T = FEI and T = ACF, or T = ORF
and T = RCF. However the remainder of this chapter is devoted to a uniform
method.
Definition 34. Let L be a language and "(v0 ) a set of formulas of L in the
free variable v0 . A model A for L is said to realize "(v0 ) whenever there is some
a 2 A such that A j= 'a] for each '(v0 ) in "(v0 ).
Definition 35. The set of formulas "(v0 ) in the free variable v0 , is said to be
a type of the model A whenever
(1) every nite subset of "(v0 ) is realized by A
(2) "(v0 ) is maximal with respect to (1).
Remark. Every set of formulas "(v0 ) having property (1) of the denition of
type can be enlarged to also have property (2).
Lemma 16. Suppose A is an innite model for a language L. Let X A and
let "(v0 ) be a type of AX in the language LX . Then there is a B such that A B
and BX realizes "(v0 ).
Proof. Let T = ThAA "(c) where c is a new constant symbol and "(c) =
f'(c) : ' 2 "(v0 )g and of course '(c) is '(v0 ) with c replacing v0 .
By denition of type, for each nite T 0 T , there is an expansion A0 of A such
that A0 j= T 0 . The Elementary Diagram Lemma and the Compactness Theorem
will complete the proof.
7. MODEL COMPLETIONS 54
By the claim and the Compactness Theorem, there is a model C j= " ThAA .
By the Elementary Diagram Lemma, A is elementarily embedded into CjL, the
restriction of C to the language L. Therefore there is a model B for L such that
A B and BA = CjLA . It is now straightforward to check that BA realizes each
type " (v0 ).
The following lemma completes the proofs that each of the theories DLO, ACF
and RCF admit elimination of quantiers.
Lemma 19. Each of the following three pairs of theories T and T satisfy con-
dition (3) of Blum's Test.
(1) T = LOR theory of linear orderings. T = DLO, theory of dense linear
orderings without endpoints.
(2) T = FEI, theory of elds. T = ACF, theory of algebraically closed elds.
(3) T = ORF, theory of ordered elds. T = RCF, theory of real closed elds.
Proof of (1). Let A and B be linear orders, with B = A fbg and A B.
Let C be a jBj+ -saturated dense linear order without endpoints with A C.
We wish to nd an isomorphic embedding f : B ! C which is the identity on
A.
Consider a type of CA containing the following formulas:
ca < v0 for each a 2 A such that a < b
v0 < ca for each a 2 A such that b < a
Since C is a dense linear order without endpoints each nite subset of the type can
be realized in CA .
Saturation now gives some t 2 C realizing this type.
We set f (b) = t and we are nished.
Proof of (2). Let A be a eld and B a simple extension of A witnessed by b
such that B is a submodel of a eld (a commutative ring).
Let C be a jBj+ -saturated algebraically closed eld such that A C.
We wish to nd an isomorphic embedding f : B ! C which is the identity on
A.
There are two cases:
(I) b is algebraic over A,
(II) b is transcedental over A.
Case(I). Let p be a polynomial with coe!cients from A such that p(b) = 0
but b is not the root of any such polynomial of lower degree. Since C is algebraically
closed there is a t 2 C such that p(t) = 0. We extend the identity map f on A
to make f (b) = t. We extend f to the rest of B by letting f (r(b)) = r(t) for any
polynomial r with coe!cients from A. It is straightforward to show that f is still
a well-dened isomorphic embedding.
Case (II). Let us consider a type of CA containing the following set of formu-
las:
f:(p(v0 ) = 0)g
where p is a polynomial with coe!cients in fca : a 2 Ag.
Since C is algebraically closed, it is innite and hence each nite subset is
realized in CA .
Saturation will now give some t 2 C such that t realizes the type.
We set f (b) = t. Since t is transcedental over A, the extension of f to all of B
comes easily from the fact that every element of B n A is the value at b of some
polynomial function with coe!cients from A.
7. MODEL COMPLETIONS 58
Claim. Let D be a real closed ordered eld and q(x) a polynomial over D with
e 2 D and q(e) = 0. If there is an a < e such that q(x) > 0 for all a < x < e then
q0 (e) 0. If there is an a > e such that q(x) > 0 for all e < x < a then q0 (e) 0.
Here q0 is the rst derivative of q.
Proof of Claim. From the previous claim we get
!
q(x) ; q(e) = q0 (e) + (x ; e) Xn q (m) (e)
(x ; e)m ; 2
x;e m=2 m!
for any x 6= e in D. By choosing x close enough to e we can ensure that the entire
right hand side has the same sign as q0 (e). A proof by contradiction now follows
readily.
Claim. Let D be a real closed ordered eld and q(x) be a polynomial over D
with e 2 D and q(e) = 0. If w and z are in D such that w < e < z and q(w)q(z ) > 0
then there is a d in D such that w < d < z and q0 (d) = 0.
7. MODEL COMPLETIONS 59
Proof of Claim. Without loss of generosity q(w) > 0 and q(z ) > 0. Since q
has only nitely many roots, we can pick d1 to be the least x such that w < x e
and q(x) = 0. Since q(x) 6= 0 for all w < x < d1 , the Intermediate Value Property
of Real Closed Ordered Fields shows that q cannot change sign here and so q(x) > 0
for all w < x < d1 . By the previous claim, q0 (d1 ) 0. A similar argument with
z shows that there is a d2 such that e d2 < z and q0 (d2 ) 0. If d1 = e = d2
take d = e. If d1 < d2 the Intermediate Value Property gives a d with the required
properties.
Claim. Let D be a real closed ordered eld with an ordered eld E D. Let
f : E ! C be an isomorphic embedding into a real closed ordered eld. Let q be a
polynomial with coecients in E such that fx 2 D : q0 (x) = 0g E. Let d 2 D n E
be such that q(d) = 0 but d is not a root of a polynomial with coecients from E
which has lower degree. Then f can be extended over the subeld of D generated
by E fdg.
Proof of Claim. Since the nitely many roots of q0 from D actually lie in
E, we can get e1 and e2 in E such that e1 < d < e2 and q0(x) 6= 0 for all x in D
such that e1 < x < e2 . Furthermore for all x in E we have q(x) 6= 0. We can now
apply the previous claim to get that q(w) q(z ) < 0 for all w and z in E such that
e1 < w < d < z < e2 .
We now move to the real closed ordered eld C and the isomorphic embedding
f . For each w and z in E such that e1 < w < d < z < e2 we have f (w) < f (z )
and q(f (w)) q(f (z )) < 0. By the Intermediate Value property of C we get, for
each such w and z , a y 2 C such that f (w) < y < f (z ) and q(y) = 0. Since q has
only nitely many roots there is some t 2 C such that q(t) = 0, f (w) < t for all
e1 < w < d and t < f (z ) for all d < z < e2 .
We now extend f by letting f (d) = t and f (r(d)) = r(t) for any polynomial
r with coe!cients from E. It is straightforward to check that the extension is a
well-dened isomorphic embedding of the simple extension of E by d into C. We
use the fact that ORF is almost universal to extend the isomorphic embedding to
all of the subeld of D generated by E fdg, since we can rephrase the denition
of almost universal as follows:
Whenever C j= T , D j= T , E0 D and f : E0 ,! C is an isomorphic
embedding, then there is a model E00 j= T such that E0 E00 D
and f extends over E00 .
It is now time for the main part of the proof of this case. Using Lemma 13,
let D be a real closed ordered eld with B D. We have a polynomial p with
coe!cients from A such that p(b) = 0. By induction on the degree of p, we can show
that there is a sequence of elements d0 : : : dm = b of elements of D, a sequence of
subelds of D:
A = E0 E1 : : : Em+1
with each dj 2 Ej+1 n Ej and corresponding isomorphic embeddings
fj : Ej ! C
7. MODEL COMPLETIONS 60
coming from the previous claim and having the property that f0 is the identity and
fj+1 extends fj . In this way we extend the identity map f0 : A0 ,! C until we
reach fm+1 : Em+1 ,! C. We then note that since b 2 Em+1 we have B Em+1
and we are nished.
61
Index
A B, 18 elimination of quantiers, 49
A = fAn : n 2 Ng, 19 existentially closed, 35
ThA, 14 expansion
ThAA , 27 language, 17
j=, 14 model, 17
, 18
A jL , 17
0 0 FEI, 14
, 17 almost universal, 57
elds
tx0 : : : xq ], 7
axioms,theory of, 14
hC + 0 1i, 13 formula, 5
hQ << + 0 1i, 13 free variable, 6
hR < + 0 1i, 13
hN + < 0 1i, 18 isomorphic models, 18
isomorphically embedded model, 27
A. Robinson, 35 language, 6
ACF, 14 Leibniz Principle, 29
submodel complete, 57 Lindstrom's Test, 38
algebraically closed elds linear orders
axioms,theory of, 14 axioms,theory of, 14
almost universal, 57, 64 LOR, 14
axioms, 14 almost universal, 57
Blum's Test, 60 L os-Vaught Test, 24
bound variable, 6 Lowenheim-Skolem Theorems
Downward, 23
categorical Upward, 23
-categorical theory, 24
We shall here continue the treatment of multilayered media Green’s functions, starting from
the TE/TM decomposition we have presented in the previous document.
For the sake of illustration, let us consider a one layer medium with a reflection and trans-
mission region, as shown in Fig. 1.
z
¯ 0)
J(r̄
z0
#0: (²0 , µ0 )
PSfrag replacements
0 x
#1: (²1 , µ1 )
−d
#2: (²0 , µ0 )
.
We have shown before that the Green’s functions in various layers are expressed as:
· ¸
1
ZZ
i −iK̄·r̄ 0 −iK̄·r̄ 0
G`0 = 2 dk̄⊥ K̄e (k`z )ê(−k0z ) e + K̄h (k`z )ĥ(−k0z ) e , (1)
8π k0z
where
`=1 : K̄e (k1z ) = A ê(k1z ) eik̄1 ·r̄ + B ê(−k1z ) eiK̄1 ·r̄ , (2c)
K̄h (k1z ) = C ĥ(k1z ) eik̄1 ·r̄ + D ĥ(−k1z ) eiK̄1 ·r̄ , (2d)
1
2 Section 1. Transmission line analogy for multilayered media
By satisfying the boundary conditions at z = 0 and z = −d, we obtain the following system
(for TE waves, TM waves can be solved similarly):
1 + RT E =A + B (3a)
k0z k1
(−1 + RT E ) = z (A − B) , (3b)
µ0 µ1
A e−ik1z d + Beik1z d = T T E eik0z d , (3c)
k1z k0
(Ae−ik1z d − Beik1z d ) = − z T T E eik0z d . (3d)
µ1 µ0
1 − e2ik1z d
RT E = RT E ,
T E RT E e2ik1z d 01
(4a)
1 + R01 10
4ei(k1z −k0z )d
TTE = . (4b)
(1 + pT01E ) (1 + pT10E ) (1 + R01
T E RT E e2ik1z d )
10
1 − e2ik1z d
RT M = RT M ,
T M RT M e2ik1z d 01
(5a)
1 + R01 10
4ei(k1z −k0z )d
TTM = . (5b)
(1 + pT01M ) (1 + pT10M ) (1 + R01T M RT M e2ik1z d )
10
1 − pT10E T E RT E
1 + R01
A= 10
T E RT E e2ik1z d
e2ik1z d , (6a)
2 1 + R01 10
TE
1 + R01
B= T E RT E e2ik1z d
, (6b)
1 + R01 10
µ1 k 0 TM
2R10
C= e2ik1z d , (6c)
µ0 k1 (1 + pT01M ) (1 + R01
T M RT M e2ik1z d )
10
µ1 k 0 2
D= . (6d)
µ0 k1 (1 + pT01M ) (1 + R01
T M RT M e2ik1z d )
10
From 6.630, we know that a transmission line is characterized by its length d, characteristic
(p)
PSfrag replacements
impedance Zc and wavenumber kz , as defined in Fig. 2.
(p) d (p)
I1 I2
(p)
V1
(p) [Zc , kz ] (p)
V2
∂ (p)
V = − jkz Zc(p) I (p) , (8a)
∂z
∂ (p) 1
I = − jkz (p) V (p) , (8b)
∂z Zc
where p refers to the polarization (TE or TM), and
ωµ
ZcT E = , (9a)
kz
kz
ZcT M = . (9b)
ω²
The solution to Eqs. (8) is:
Therefore, the problem of computing the fields in multilayered media in the spectral domain
comes down to determining the voltage/current in equivalent transmission line network. The
analogy is illustrated in Fig. 3.
Figure 3: Transmission line analogy for horizontal electric source and vertical mag-
netic source. Other cases are obtained by duality.
The treatment of the source will not be demonstrated here and we shall just state the
final results (details can be found in the literature). Thus, depending on the source type and
orientation, different generators will have to be placed in the transmission line network. The
various cases are (magnetic sources can be obtained by duality):
1. Write the field components in terms of voltage and current. Locate the source and obser-
vation point.
2. Compute the equivalent transmission line network. Locate the source (type and position)
and observation.
3. Starting from the source replace all the layers above and below the source by equivalent
impedances (see Fig. 4).
To do this, start with the extreme boundary conditions and propagate back to the source
using:
5
Z up
PSfrag replacements
Z down
Figure 4: Equivalent upper and lower impedance.
(p) (p)
ZL + jZc tan(kz d)
Zin (p) = Zc(p) (p) (p)
. (12)
Zc + jZL tan(kz d)
4. Using standard circuit theory, compute V (p) and I (p) at the upper and lower limits.
To come back to the space domain, we need to evaluate the inverse Fourier transform. A
typical integral we have to perform is:
1
ZZ
f (r̄) = dkx dky f˜(kx , ky ) eikx x eiky y . (14)
(2π)2
By symmetry of the problem (x and y axis are equivalent), we can make a change of variables
and integrate one integral analytically. The proper change of variables is the following:
eikx x eiky y = ei[kρ ρ cos kφ cos φ+kρ ρ sin kφ sin φ] = eikρ ρ cos(kφ −φ) , (17)
so that
2π ∞
1
Z Z
f (r̄) = dkφ dkρ kρ f˜(kρ , kφ ) eikρ ρ cos(kφ −φ) . (18)
(2π)2 0 0
6 Section 2. Coming back to the space domain: Sommerfeld integral
By rotational symmetry, f˜(kρ , kφ ) = f˜(kρ ). In addition, we can expand the exponential part
using the following identity:
∞
X
ikρ ρ cos(kφ −φ) iβ cos θ
e =e = (−i)m Jm (β) eimθ , (19)
m=−∞
(where β and θ have just been defined to simplify the notation in the identity and have no
connection to physical parameters).
As we can see, the exponential function is the only term depending on kφ . Performing the
integration, we get:
Z 2π−φ 0 if m 6= 0 ,
eimθ = (20)
−φ 2π if m = 0 .
G̃ = Ã G = S0 [Ã]
cos 2φ
G̃ = −kx2 Ã G= S1 [Ã] − cos2 φ S0 [kρ2 Ã]
ρ
cos 2φ
G̃ = −ky2 Ã G=− S1 [Ã] − sin2 φ S0 [kρ2 Ã]
ρ
sin 2φ 1
G̃ = −kx ky à G= S1 [Ã] − sin 2φ S0 [kρ2 Ã]
ρ 2
7
must be real. Since this needs to happen in at least one layer, it yields the condition:
√
kρ < k0 max( ²` µ` ) , (24)
l
which puts an upper limit for the location of the poles. Although it can also be shown that poles
have to correspond to kρ > k0 , we do not need this constraint here and we can limit ourselves
√
to the interval [0, k0 maxl ( ²` µ` )].
In the lossless situation, the poles lie on the real axis, which renders the integral impossible
to evaluate as is. We therefore need to deform the contour in the complex k ρ -plane:
=(kρ )
b <(kρ )
PSfrag replacements
wn Sn + wn+1 Sn+1
Sn0 = . (27)
wn + wn+1
The break points xi have to be well chosen, and may for example be chosen based on the
asymptotic behavior of f . If we refer back to Sommerfeld integrals, we can take the asymptotic
expansion of the Bessel function:
s
2 π π
Jν (kρ ρ) ' cos(kρ ρ − ν − ) . (30)
πkρ ρ 2 4
where x0 is the first break point greater than α. We will then approximate S by Sn as
Z xn
Sn = f (x)dx . (32)
α
The problem of Sommerfeld’s integrals is that the remainder
Z ∞
rn = S − S n = − f (x)dx (33)
xn
decays slowly, so that we want to accelerate the series from which S n is evaluated (see Eq. (29c)).
For Sommerfeld-type integrals, we write the generic form as:
Z ∞
I= g(kρ ρ) f (kρ ) dkρ , (34)
α
and the partial integral as Z xn
I(n) = g(kρ ρ) f (kρ ) dkρ . (35)
α
The remainder is therefore
Z ∞
I − I(n) = g(kρ ρ) f (kρ ) dkρ . (36)
xn
This integral can be expanded into an infinite series of inverse powers of ρ by integration by
part. For example, with g(kρ ρ) = eikρ ρ (which is a generic form for Sommerfeld integrals), we
can write: · ¸
i i i 00
I − I0 (n) = eian ρ fn + fn0 + ( )2 fn + . . . , (37)
ρ ρ ρ
where fn = f (xn ), fn0 = f 0 (xn ), etc. Note that if the right-hand side term converges (and it
does in our case), the dominant term is O(ρ−1 ). Yet, if we now construct
fn+1 I0 (n) + fn I0 (n + 1)
I1 (n) = , (38)
fn + fn+1
it appears to be a better estimate of I since the error is in O(ρ−2 ).
If, in addition,
f (kρ ) ∼ Ckρα e−βkρ , (39)
we can approximate fn and fn+1 and write
I0 (n) + η0 I0 (n + 1)
I1 (n) = , (40)
1 + η0
where η0 = [n/(n + 1)]α eβπ/ρ . At higher orders, a better approximation is given by:
I1 (n) + η1 I1 (n + 1)
I2 (n) = , (41)
1 + η1
where η1 = [(n − 1/2)/(n + 1/2)]α−2 eβπ/ρ .
In practical applications, the parameters α and β may have to be adjusted for an optimum
convergence.
The technique presented briefly here is known as the “weighted average” method, and more
details can be found in the literature under this keyword.
Integral Equations in Electromagnetics
Massachusetts Institute of Technology
6.635 lecture notes
Most integral equations do not have a closed form solution. However, they can often be
discretized and solved on a digital computer.
Proof of the existence of the solution to an integral equation by discretization was first
presented by Fredholm in 1903.
In general, integral equations can be divided into two families:
1. When the unknown is in the integral only, the integral equation is called of the first kind.
2. When the unknown is both inside and outside the integral, the integral equation is called
of the second kind.
For electromagnetic applications, we can have both scalar and vector integral equations.
#1
J n̂
S
#2
S∞
1
2 Section 1. Scalar integral equations
Upon performing (Eq. (1a)g1 (r̄, r̄0 )−Eq. (2b)Φ1 (r̄, r̄0 )), we get:
[g1 (r̄, r̄0 ) ∇2 Φ1 (r̄) − Φ1 (r̄) ∇2 g1 (r̄, r̄0 )] = J(r̄)g1 (r̄, r̄0 ) + δ(r̄ − r̄ 0 )Φ1 (r̄) . (3)
Upon integrating Eq. (3) over the entire volume and using the identity ∇ · (g∇Φ − Φ∇g) =
g∇2 Φ− Φ∇2 g, we get:
Z Z
0 0
dv ∇ · [g1 (r̄, r̄ )Φ1 (r̄) − Φ1 (r̄)∇g1 (r̄, r̄ )] = dvJ(r̄) g(r̄, r̄ 0 ) + Φ1 (r̄0 ) . (4)
V V
By Gauss theorem, we reduce the left-hand side integral to a surface integral. Also
Z
dvJ(r̄)g1 (r̄, r̄0 ) = −Φinc (r̄0 ) . (5)
V
We therefore obtain
Z
− ds n̂ · [g1 (r̄, r̄0 )∇Φ1 (r̄) − Φ1 (r̄)∇g1 (r̄, r̄0 )] = −Φinc (r̄) + Φ1 (r̄0 ) , r̄0 ∈ V . (6)
S+S∞
By invoking the radiation condition, the integral over S∞ vanishes, leaving (and exchanging
r̄ and r̄0 so that primed coordinates correspond to sources and unprimed ones to observation):
Z
Φ1 (r̄) = Φinc (r̄) − ds0 n̂ · [g1 (r̄, r̄0 )∇0 Φ1 (r̄0 ) − Φ1 (r̄0 )∇0 g1 (r̄, r̄0 )] r̄ ∈ V1 . (7)
S
For r̄ ∈ V2 , the wave equation has no source and therefore the integration of the delta
function yields a zero value. Performing the same steps for this second case, we get the generic
relation:
Z Φ (r̄) r̄ ∈ V
1 1
Φinc (r̄) − ds0 n̂ · [g1 (r̄, r̄0 )∇0 Φ1 (r̄0 ) − Φ1 (r̄0 )∇0 g1 (r̄, r̄0 )] = (8)
S 0 r̄ ∈ V2
This is directly evocative of Huygens’ principle:
• For r̄ ∈ V1 : the total field Φ1 (r̄) is the sum of the incident field plus the field due to the
surface currents on the surface S.
• For r̄ ∈ V2 : the surface source on S produces a field that exactly opposes Φinc , yielding
the extinction theorem.
Applying the same reasoning to region 2, we write (where there is no incident field):
Z 0 r̄ ∈ V1
ds0 n̂ · [g2 (r̄, r̄0 )∇0 Φ2 (r̄0 ) − Φ2 (r̄0 )∇0 g2 (r̄, r̄0 )] = (9)
S Φ2 (r̄) r̄ ∈ V2
3
Note that the sign reversal is due to the definition of the normal vector n̂ which has to point
outward from the surface. Here, since we use the same n̂ as before, we have to take it as being
negative.
Eqs. (8) and (9) have four independent unknowns:
which can be related by the boundary conditions. Here, g(r̄, r̄ 0 ) and n̂ · ∇g(r̄, r̄ 0 ) are the kernel
of the integral equation.
¯ ,
∇ × ∇ × Ē1 (r̄) − ω 2 ²1 µ1 Ē1 (r̄) =iωµ1 J(r̄) (11a)
∇ × ∇ × Ē2 (r̄) − ω 2 ²2 µ2 Ē2 (r̄) =0 , (11b)
For Green’s functions that satisfy the radiation condition, Eq. (14) becomes:
Z ½ ¾
0 0 0 0
Ē1 (r̄ ) = Ēinc (r̄ ) + ds n̂ · iωµ1 G1 (r̄, r̄ ) · n̂ × H̄1 (r̄) − [∇ × G1 (r̄, r̄ )] · n̂ × Ē1 (r̄) . (17)
S
We perform the same steps for the other region to eventually obtain (and again interchanging r̄
and r̄0 ):
Ē1 (r̄) r̄ ∈ V1
Z ½ ¾
Ēinc (r̄) + ds0 iωµ1 G1 (r̄, r̄0 ) · n̂ × H̄1 (r̄0 ) − [∇0 × G1 (r̄, r̄0 )] · n̂ × Ē1 (r̄0 ) =
S 0 r̄ ∈ V2
(18a)
Z ½ ¾ 0 r̄ ∈ V1
0 0 0 0 0 0
− ds iωµ2 G2 (r̄, r̄ ) · n̂ × H̄2 (r̄ ) − [∇ × G2 (r̄, r̄ )] · n̂ × Ē2 (r̄ ) =
S Ē2 (r̄) r̄ ∈ V2
(18b)
this system can be solved for n̂ × Ē1 (r̄) and n̂ × H̄1 (r̄).
Note also that Eqs. (18) can be written in terms of electric and magnetic currents, and
magnetic Green’s functions:
This equation, in which Φ represent the electric field, is referred to as the electric field integral
equation (EFIE).
Note that as r̄ gets closer to the surface, Φ(r̄) → 0 (from the boundary condition) so that
we do not need to distinguish between approaching the surface from one side or the other. In
fact, we can unify the equations and write:
Z
Φinc (r̄) − ds0 g(r̄, r̄0 ) n̂ · ∇Φ(r̄ 0 ) = 0 , r̄ ∈ S , r̄ 0 ∈ S . (25)
S
In addition, g(r̄, r̄ 0 )
has an integrable singularity as r̄ → r̄ 0 . Let us consider the surface
depicted in Fig. 2, with z = f (x):
s µ ¶2
p
2 2
df
ds = dx + dz = dx 1 + , (26)
dx
such that the integral equation becomes:
s µ ¶2
df
Z
Φinc (r̄) = dx0 1 + g(x, f (x), x0 , f (x0 )) (n̂ · ∇Φ(r̄ 0 )) , at z 0 = f (x0 ) , (27)
∆x dx
6 4.2 Neumann boundary conditions: MFIE
In this case, it makes a difference if we approach the surface from the top or from the bottom.
In fact
Z
Φinc (r̄+ ) + ds0 Φ(r̄0 ) n̂ · ∇g(r̄+ , r̄0 ) = Φ(r̄+ ) , (32a)
S
Z
Φinc (r̄− ) + ds0 Φ(r̄0 ) n̂ · ∇g(r̄− , r̄0 ) = 0 . (32b)
S
These two equations seems inconsistent with one another. We shall show that in fact, they
are actually consistent with each-other, due to the singularity of the Green’s functions.
We shall examine what happens when we let r̄ approach the surface. Fig. 3 is an illustration
of the situation at the immediate vicinity of the surface. The integral part of the equation can
be written as:
Z Z Z
0 0 0 0 0 0
ds Φ(r̄ ) n̂ · ∇g(r̄, r̄ ) = P V ds Φ(r̄ ) n̂ · ∇g(r̄, r̄ ) + ds0 Φ(r̄0 ) n̂ · ∇g(r̄, r̄ 0 ) , (33)
S S piece
where P V denotes the principal value and ‘piece’ refers to the integration over the local domain
shown in Fig. 3. For this integral, we use the local coordinates:
r̄
PSfrag replacements
Z0
−a r̄ = (0, Z)
a X0
Figure 3: Zoom on the rough surface.
Eq. (39) is called the magnetic field integral equation (since Φ represents the magnetic field)
and is an integral equation of the second kind.
8 Section 5. Solving the EFIE
Let us subdivide the integration domain into N small elements, each of length ∆ = L/N ,
and centered at xm (m ∈ [1, N ]). Thus, constraining the observation at these discrete locations,
the integral equation becomes
Z L/2
dx0 K(xm , x0 ) U (x0 ) = b(xm ) . (43)
−L/2
Next, if we suppose that U (x) is constant in each interval, we replace the integral by a sum over
all segments, excluding the singular term:
X Z
∆x K(xm , xn ) U (xn ) + U (xm ) dx0 K(xm , x0 ) = b(xm ) . (44)
n=1 m
n6=m
Note that we have to single out the singularity, i.e. the mth interval because K(xm , x0 ) is
singular at x0 = xm . This part is known as the self-patch contribution.
For a 2D problem, we have:
i (1) p
K(xm , x0 ) = H (k (xm − x0 )2 + (f (xm ) − f (x0 ))2 ) . (45)
4 0
i (1) p
K(xm , x0 ) = H0 (k|x0 − xm | 1 + f 0 (xm )2 ) . (46)
4
(1)
For small argument: H0 (α) ≈ 1 + i π2 ln(αγ/2) where γ is the Euler constant (γ ' 1.78).
Therefore: · µ ¶¸
0 i 1 γ 0
p
K(xm , x ) = 1 + i ln k(x − xm ) 1 + f (xm ) .
02 (47)
4 π 2
9
where ln(e) = 1.
We can therefore cast the integral equation into a matrix equation of the form:
N
X
Amn Un = bm , (49)
n=1
where
1 Introduction
In the previous lecture, we wrote the EFIE for an incident TE plane wave on a PEC surface.
The solution was then obtained by some types of “intuitive” arguments, such as dividing the
integration domain into small elements and supposing that the unknown does not vary too much
over each elementary cell.
We shall now see more rigorously what we actually did, and show that it was in fact a simple
version of the Method of Moments.
R. F. Harrington was the first to use the method of moments (MoM) in electromagnetics
and his book remains a fundamental reference (and very easy to read!):
R. F. Harrington, “Field Computation by Moment Method” (is now available from IEEE Press).
f (x) ²0
x
²1
Figure 1: Rough surface S separating two media.
1
2 Section 2. PEC surface with TE incident wave: EFIE
where
s µ ¶2
df
1+ [n̂ · ∇Φ(r̄ 0 )]|z 0 =f (x0 ) = U (x0 ) , (3a)
dx
Φinc (x, f (x)) = b(x) , (3b)
0 0 0
K(x, x ) = g(x, f (x), x , f (x )) , (3c)
1. x ∈ {xi } , i = 1, . . . , N.
which eventually yielded the following system of equations (supposing that the problem related
to the singularity of the Green’s function has been accounted for):
N
X
Amn Un = bm . (5)
n=1
This is a matrix equation with the two indices m and n corresponding to:
m: observation point → unprimed coordinates.
Mathematically:
The steps we had to perform to from Eq. (2) to Eq. (5) are
stating that now the unknowns become {Un } which are the amplitudes of the function.
The integral equation becomes:
Z L/2 X
dx0 K(x, x0 )U (x0 ) = ∆x Un K(x, xn ) = b(x) . (7)
−L/2 n
3
(a) Mesh the structure (i.e. choose the intervals over which Un will be defined).
(b) Expand the unknown U (x) into basis functions.
2. The second steps concerns the observation: dot-multiply both sides of the equation by a
test function (or weighting function).
This is a very simple, yet very widely used version of the method of moments.
L(f ) = g , (9)
where L is a linear operator, g is known, and f is to be determined. We shall now perform the
two essential steps we have highlighted above.
where αn are constant. The set fn is called expansion function, or basis functions.
Note that for an exact solution, the summation should be taken to ∞, but has to be
truncated in practice.
2. It is assumed that a suitable inner product has been defined for the problem. Now, we
define a set of weighting functions, or testing functions, w1 , w2 , . . . , wN in the range of
L, and take the inner product of the previous equation with wm :
X
αn < wm , Lfn >=< wm , g > . (11)
n
4 Section 4. A simple example for electrostatic
where
< w1 , Lf1 > < w1 , Lf2 > . . . α1 < w1 , g >
< w2 , Lf1 > < w2 , Lf2 > . . . ,
[Amn ] = [αn ] = α [gm ] = < w2 , g > . (13)
2 ,
.. .. .. .. ..
. . . . .
If the matrix [Amn ] is not singular, the unknowns αn are simply given by:
and the original function f can be reconstructed using Eq. (10). We can now generalize the
following definitions:
Let us consider a square plate of side 2a lying on the z = 0 plane with its center at the
origin (see Fig. 2. Let σ(x, y) represent the surface density on the plate, assumed to have zero
thickness. The electrostatic potential Φ at any point in space is given by
z
y
2a
PSfrag replacements 2a x
2b
2b
Figure 2: Discretized square plate.
5
Z a Z a
Φ(x, y, z) = dx0 dy 0 σ(x0 , y 0 ) g(r̄, r̄ 0 ) , (16a)
−a −a
1
g(r̄, r̄0 ) = , (16b)
p 4π²R
R = (x − x0 )2 + (y − y 0 )2 + z 2 . (16c)
The boundary condition is Φ = V = constant on the plate. The integral equation for the
problem is therefore
Z a Z a
1 σ(x0 , y 0 )
V = dx0 dy 0 p , (17)
−a −a 4π² (x − x0 )2 + (y − y 0 )2 + z 2
where the unknown to determine is σ(x0 , y 0 ). Let us perform the three steps mentioned before:
1. Mesh the structure: divide the plate into N squares of size 2b (see Fig. 2).
It is obvious to see that this integral is singular at (xm , ym ) ∈ ∆Sm . In this simple case
fortunately, we can perform the integration analytically (this is not always the case), and write:
Z b
0
Z b
1 1 2b √
Ann = dx dy 0 p = ln(1 + 2) , (21a)
−b −b 4π² x02 + y 02 π²
∆Sn b2
Amn ' = p , m 6= n . (21b)
4π²Rmn π² (xm − x0 )2 + (ym − y 0 )2
To rewrite this with the language of linear space:
1. By meshing both plates (i.e. meshing everything, which is of course always possible). This
will yield matrix twice as big as the previous one, to solve for twice as many unknowns.
2. By using the image theory, and saying that the new problem is equivalent to the one of a
unique plate on top of a ground plane at z = −d. In that case, we only have to change
the Green’s function to take the ground plane into account, and we keep the same number
of unknowns as in the initial problem. When possible, this solution is better because
computationally less expensive (analytically more expensive).
Basically, a general trend is to have a Green’s function that represents as much as possible
the environment and to mesh only those parts that are external to the environment. This is in
fact the reason why people are looking for Green’s functions in layered media, periodic media,
etc.
5 Vectorial MoM
We can of course apply the MoM to the vectorial case, like for example the equation:
Z
Ē(r̄) = iωµ ¯ 0) .
ds0 Ge (r̄, r̄0 ) · J(r̄ (23)
S
The general expansion of the current will be:
¯ 0) = αn f¯n (r̄0 ) ,
X
J(r̄ (24)
n
yielding Z
ds0 Ge (r̄, r̄0 ) · f¯n (r̄0 ) .
X
Ē(r̄) = iωµ αn (25)
n Sn
The third step is to dot-multiply the equation with a testing function h̄m and integrate over
the cell surface (i.e. perform the inner product):
Z Z Z
ds h̄m (r̄) · Ge (r̄, r̄0 ) · f¯n (r̄0 ) .
X
0
Ē(r̄) · h̄m (r̄) = iωµ αn ds (26)
Sm n Sn Sm
The double double-integral on the right-hand side of this equation is known as the
“impedance term” since we can cast this system of equation into a matrix representation as:
1. Entire domain basis functions: using these functions to expand the unknowns is analogous
to a Fourier expansion or to a modal expansion. These types of functions yield a good
convergence of the method but are not versatile since the geometry need be regular in order
to have the modes defined. Note that in this case there is no use to mesh the geometry.
2. Sub-domain basis functions: they rely on a proper meshing of the geometry, which can be
rectangular, triangular, etc. The choice of basis functions is here very wide, from Dirac δ
(like for the weighting functions shown in this document), pulses (basis functions shown
in this document), piecewise linear, etc.
Finally, we can mention that point matching, which is easy to grasp and straightforward to
implement, may not yield an optimal convergence. In most of the applications, the Galerkin
technique is better, which consists in choosing the same testing functions as the basis functions.
This applies to both sub-domain and entire domain functions.
Time Domain Method of Moments
Massachusetts Institute of Technology
6.635 lecture notes
1 Introduction
The Method of Moments (MoM) introduced in the previous lecture is widely used for solving
integral equations in the frequency domain. Yet, some attempts have been made recently at the
use of the MoM in the time domain. We shall briefly expose this approach here.
∂ ∂ ¯ t) ,
∇ × Ē(r̄, t) = − B̄(r̄, t) − M̄ (r̄, t) , ∇ × H̄(r̄, t) = D̄(r̄, t) + J(r̄, (1a)
∂t ∂t
∇ · B̄(r̄, t) = m(r̄, t) , ∇ · D̄(r̄, t) = ρ(r̄, t) , (1b)
D̄(r̄, t) = ²Ē(r̄, t) , B̄(r̄, t) = µH̄(r̄, t) , (1c)
¯ t) + ∂ ρ(r̄, t) = 0 ,
∇ · J(r̄,
∂
∇ · M̄ (r̄, t) + m(r̄, t) = 0 . (1d)
∂t ∂t
For the time-domain MoM, it is easier to work with the potentials, and make use of the
well-known “retarded potentials” theory. In view of doing this, we write the definition:
1
H̄(r̄, t) = ∇ × Ā(r̄, t) (2a)
µ0
∂
Ē(r̄, t) = −∇φ(r̄, t) − Ā(r̄, t) . (2b)
∂t
Both the vector potential Ā and the scalar potential φ satisfy the wave equation which, in
time-domain domain, writes:
∂2 ¯ t) ,
∇2 Ā(r̄, t) − ²0 µ0 Ā(r̄, t) = −µ0 J(r̄, (3a)
∂t2
∂2 ρ(r̄, t)
∇2 φ(r̄, t) − ²0 µ0 2 φ(r̄, t) = − . (3b)
∂t ²0
These potentials are linked by the time-domain Lorentz gauge:
∂
∇ · Ā(r̄, t) + ²0 µ0 φ(r̄, t) = 0 . (4)
∂t
1
2 Section 2. Time domain equations
We can defined also a time-domain Green’s function which satisfies the time-domain scalar
equation:
1 ∂2
(∇2 − 2 2 )g(r̄, r̄ 0 , t, t0 ) = −δ(r̄ − r̄ 0 ) δ(t − t0 ) , (5)
c ∂t
which solution is (in free-space):
1 0 |r̄−r̄ 0 | 0
0 0 4π|r̄−r̄ 0 | δ(t − t − c ) t>t,
g(r̄, r̄ , t, t ) = (6)
0 t < t0 .
From this, the solution to the wave equation for Ā and φ can be written as:
Z Z ∞ Z ¯ 0 , t − R/c)
J(r̄
0 0¯ 0 0 0 0
Ā(r̄, t) =µ0 dv dt J(r̄ , t ) g(r̄, r̄ , t, t ) = µ0 dv 0 , (7a)
V −∞ V 4πR
ρ(r̄, t − R/c)
Z
φ(r̄, t) = dv 0 , (7b)
V 4π²0 R
where R = |r̄ − r̄ 0 |. These wave equations are known as the time retarded potentials, and
essentially say that the potential (either Ā or φ) can be calculated at a given point in space r̄
and given time t from all previous times.
From these equations, we can calculate the space-time electromagnetic fields:
1
Z ¯ 0, τ )
J(r̄
H̄(r̄, t) = dv 0 ∇ × , τ = t − R/c , (8a)
4π R
1
Z
ρ(r̄, τ ) µ0
Z ¯ 0, τ )
∂ J(r̄
0
Ē(r̄, t) = − dv ∇ − dv 0 . (8b)
4π²0 V R 4π ∂t R
R̄
∇R = , (10a)
R
1 R̄
∇ =− 3, (10b)
R R
∂ 1 ∂ 1 R̄ ∂
∇ρ(r̄0 , τ ) = ρ(r̄0 , τ )∇τ = − ∇R ρ(r̄0 , τ ) = − ρ(r̄, τ ) . (10c)
∂τ c ∂τ c R ∂τ
We can therefore continue with the electric field as:
· ¸
1 0 1 R̄ ∂ 1 ∂ ¯ 0
Z Z
0 R̄ 0 µ0
Ē(r̄, t) = dv 2
ρ(r̄ , τ ) + 3 ρ(r̄ , τ ) − dv 0 J(r̄ , τ )
4π²0 c R ∂τ R 4π R ∂τ
· ¸
1 1 ∂ 1 1 ∂ ¯ 0
Z Z
R̄ µ0
= dv 0 ρ(r̄0 , τ ) + ρ(r̄0 , τ ) 2 − dv 0 J(r̄ , τ ) . (11)
4π²0 c ∂τ R R 4π R ∂τ
3
We can perform the same type of calculations for the magnetic field using the relation
¯ 0 , τ ) = − 1 R̄ × ∂ J(r̄
∇ × J(r̄ ¯ 0, τ ) . (12)
c R ∂τ
We get:
· ¸
1 1 R̄
Z
∂ ¯ 0 R̄ ¯ 0, τ ) .
H̄(r̄, t) = dv 0 − × J(r̄ , τ ) − × J(r̄ (13)
4π c R2 ∂τ R3
Upon gathering the expressions for the electric and magnetic field, we eventually get:
½· ¸ ¾
1 1 ∂ 1
Z
0 0 0 R̄ µ0 ∂ ¯ 0
Ē(r̄, t) = dv ρ(r̄ , τ ) + ρ(r̄ , τ ) − J(r̄ , τ ) (14a)
4π c ∂τ R ²0 R 2 R ∂τ
· ¸
1 1 ∂ ¯ 0 1 ¯ 0
Z
R̄
H̄(r̄, t) = dv 0 J(r̄ , τ ) + J(r̄ , τ) × 2 . (14b)
4π c ∂τ R R
Upon using the boundary conditions for the electric and magnetic field, we construct the
integral equations in a standard way:
1
Z
i
⇒ n̂ × Ē (r̄, t) + n̂ × ds0 [. . .] (15)
4π
1¯ 1
Z
J(r̄, t) = n̂ × H̄ i (r̄, t) + n̂ × P V ds0 [. . .] (16)
2 4π
For the sake of comparison, we can write the MFIE in the frequency domain and in the time
domain:
Z
¯ i
J(r̄) = 2n̂ × H̄ (r̄) + 2n̂ × P V ¯ 0 ) × ∇0 g(r̄, r̄0 )
ds0 J(r̄ r̄ ∈ S , (17a)
· ¸
¯ t) = 2n̂ × H̄ i (r̄) + 1 n̂ × P V 1 ∂ ¯ 0 1 ¯ 0
Z
R̄
J(r̄, ds0 J(r̄ , τ ) + J(r̄ , τ) × 2 . (17b)
2π c ∂τ R R
Note that in the principal value, we essentially exclude the part for which R = 0. Since
τ = t − R/c and R 6= 0, we always have that τ < t. The time domain equations therefore
state that the current at location r̄ and time t is equal to a known term 2n̂ × H̄ i (r̄, t) plus a
¯ This is the basis for solving the time domain
term (integral) known from the past history of J.
integral equation by iterative methods, the most well-known one being the marching-on-in-time.
4 Section 3. The marching-on-in-time technique
In order to apply the MoM, we discretize the current both in space and in time:
M N
¯ 0 , t0 ) = J¯p (m0 , n0 ) Ps (r̄0 − r̄m0 ) Pt (t0 − tn0 ) ,
X X
J(r̄ (19)
m0 =1 n0 =0
where P denotes the simple pulse function. In addition, we also apply point-matching, which
means that we take the following testing functions:
Wmn (r̄m , tn ) = δ(r̄ − r̄m ) δ(t − n∆t) = δ(r̄ − r̄m ) δ(t − tn ) , (20)
3.2 Example
Let us consider a 1D example governed by the following integral equation:
x0
|x − x0 |
Z
g(x, t) = K(x, x0 ) f (x0 , τ )dx0 , x ∈ [−x0 , x0 ], τ = τ (x, x0 , t) = t − . (21)
−x0 c
N X
X J
f (x0 , τ ) ' ai0 j 0 Pi0 j 0 (x0 , τ ) , (22)
i0 =1 j 0 =1
Note that we use the definitions: xi0 = i0 dx , tj 0 = j 0 dt , and dx = cdt . In order to apply point
matching, we take the following testing functions:
Coming back to the definition of τ , we write (with the test and the expansion):
|xi − xi0 | dx
τ = tj − = jdt − |i − i0 | = (j − |i − i0 |)dt , (26)
c c
such that the coefficient ai0 j 0 becomes ai0 ,j−|i−i0 | . The integral equation becomes:
N
X Z (i0 + 12 )dx
gij = ai0 ,j−|i−i0 | dx0 K(idx , x0 ) . (27)
i0 =1 (i0 − 21 )dx
In this equation, only the first term involves time step j, all the others terms being at j − 1,
j − 2, . . . Therefore, we can solve for aij :
· N ¸
1 X
aij = gij − Zii0 ai0 ,j−|i−i0 | . (30)
Zii 0 i =1
i0 6=i
The value of all aik are known for k < j, so that aij is completely specified in closed form by
those and the present value of gij . This process is known as a 1D march-on-in-time approach.
Time-domain MoM is nowadays in its early stage and, although it has been successfully
applied to various simple situations, still suffers from numerical instabilities. More work is in
progress...
Study of EM waves in Periodic Structures
with addenda: “Study of EM waves in Periodic Structures (mathematical details)”
1 Introduction
We will study here the distribution of electromagnetic fields in dielectric periodic media. The
main difference with the previous topic comes from the word “dielectric”. Obviously, even a 2D
periodic dielectric medium cannot be studied with the Green’s functions presented in a previous
lecture, since the Green’s function was for periodic metallic structures.
• Since the material is constituted of real dielectric, we suppose we work in a small enough
frequency band such that we can ignore the frequency dispersive behavior of ².
2 Wave equations
2.1 Wave equations for H̄
Starting from Maxwell’s equations and using a permittivity ² = ²(r̄), it is straightforward to
show that we can write the following equations:
· ¸ µ ¶2
1 ω
∇× ∇ × H̄(r̄) = H̄(r̄) , (1a)
²(r̄) c
∇ · H̄(r̄) = 0 . (1b)
1
2 2.2 Wave equations for Ē
where ¸ ·
1
Θ=∇× ∇× . (3)
²(r̄)
Upon solving, we get the eigenvectors which correspond to the field patterns of the harmonic
modes, and the eigenvalues which are proportional to the squared frequencies of these modes.
Note that Θ is a linear operator and that it is Hermitian. The demonstration of the last
property is straightforward:
Define a scalar product < F̄ , Ḡ >= dr̄ F̄ ? (r̄) · Ḡ(r̄) and show that (by
R
integration by part for example): < F̄ , ΘḠ >=< ΘF̄ , Ḡ >, which is the
definition of a Hermitian operator. The consequence of this property is of
course that Θ has real eigenvalues.
H̄k (r̄) = eik̄·r̄ ūk (r̄), ūk (r̄ + R̄) = ūk (r̄) . (5)
ω(k̄) 2
· ¸ µ ¶
1
(ik̄ + ∇) × (ik̄ + ∇) × ūk (r̄) = ūk (r̄) , (6)
²(r̄) c
Note that because ūk (r̄ + R̄) = ūk (r̄), the eigenvalue problem can be restricted to a small
zone in space, which would necessarily imply a discrete spectrum of eigenvalues. Therefore, we
expect a set of discrete modes for each k̄.
3
3.1 Direct lattice (some details are given in the “mathematical details” addenda)
A photonic crystal is a periodic structure (that we will take to be dielectric here) in 1D, 2D or
3D.
Any vector r̄ 0 in space can be written as
r̄0 = r̄ + R̄ , (8)
where α1,2,3 ∈ {. . . , −2, −1, 0, 1, 2, 3, . . .} and ā1 , ā2 and ā3 are the lattice vectors.
From the lattice, we can construct the Wigner-Seitz cell as shown in Fig. 1.
Figure 1: Wigner-Seitz cell for an arbitrary position of points: the cell is constructed
by joining the center element to its closest neighbors and drawing perpendicular lines
from to the center of these segments. The polygon thus created is the smallest repeat-
able cell of the periodic lattice, and is defined as the Wigner-Seitz cell.
Note that there exist only one type of lattice for a 1D photonic crystal, five distinct types
for 2D photonic crystals (rectangular, square, hexagonal or triangle, centered rectangular and
oblique), and fourteen for 3D photonic crystals.
3.2 Reciprocal lattice (some details are given in the “mathematical details” addenda)
We will use here the same notation as [Joannopoulos, Meade, and Winn, “Photonic Crystals”]
and write the reciprocal translational vector as Ḡ:
where β1,2,3 ∈ {. . . , −2, −1, 0, 1, 2, 3, . . .} and b̄1 , b̄2 and b̄3 are the lattice vectors in the spectral
domain.
For the sake of illustration, Tab. 1 gives the definition of vectors ā and b̄ for square and
triangular lattices.
4 3.3 Bloch-Floquet theorem
b̄2
ā2
ā1 b̄1
PSfrag replacements
Figure 2: Direct square lattice and corresponding reciprocal lattice with highlighted
Brillouin zone.
b̄2
ā2
ā1
PSfrag replacements b̄1
Figure 3: Direct triangular (or hexagonal) lattice and corresponding reciprocal lattice
with highlighted Brillouin zone.
4 Bragg-like diffraction
The standard Bragg diffraction is illustrated in Fig. 4. Here, we will derive another diffraction
condition, equivalent to Bragg, and shall see that the diffraction is entirely governed by the
reciprocal vector Ḡ.
5
θ θ
θ a
PSfrag replacements
Referring to Fig. 5, we can write the scattering amplitude in terms of the reflection coefficient
Γ at position r̄ times a phase factor.
r̄
O
k̄ 0
k̄
PSfrag replacements
such that XZ
F (k̄, k̄ 0 ) = Γ̃(Ḡ)ei(Ḡ−∆k̄)·r̄ dv , (13)
Ḡ
6 Section 4. Bragg-like diffraction
where ∆k̄ = k̄ 0 − k̄. This amplitude is maximal when Ḡ − ∆k̄ = 2mπ or, when m = 0,
Ḡ = ∆k̄. (14)
This is an important relation which, again, is a condition for maximal diffraction. Upon ex-
panding back in terms of k̄ and k̄ 0 and rising to the square, we write (noting that |k̄| = |k̄ 0 | = k):
k 2 = k 2 + 2k̄ · Ḡ + G2 , (15)
Ḡ Ḡ
k̄ · ( ) = ( )2 . (17)
2 2
This last equation has a nice geometrical interpretation shown in Fig. 6 which shows that
the vectors k̄ that satisfy the maximum diffraction condition are actually those which lie on the
edge of the Brillouin zone.
D
ḠD /2
O ḠC /2 C
PSfrag replacements
Figure 6: Graphical representation of Eq. (17): each vector k̄ (black vector) with its
tip on a dashed line (not all represented) will satisfy the equation. Graphically: all
those k̄ have the same projection on the generating vector Ḡ/2 (red vector).
Therefore:
The edge of the Brillouin zone plus its center (Ḡ = 0) satisfy the maximum
diffraction condition.
This condition can also be rewritten in terms of group velocity: for those k̄ which tip lie on
the edge of the Brillouin zone and k̄ = 0, the component of the group velocity normal to the
7
Bragg diffraction planes tends to zero since the electromagnetic wave tends to be completely
reflected for these k̄:
µ ¶norm
norm
vg (k̄ ∈ BZ tip) = ∇k ω(k̄) (k̄ ∈ BZ tip) → 0 . (18)
For the symmetry points, the diffracted wave is reflected in the direction of the incident wave so
that for these points, the total group velocity is zero. This can be directly seen on the dispersion
curves where, at the symmetry points of the crystal, the tangent to the curve is horizontal
(except possibly for those points corresponding to a zero frequency).
5 Mathematical details
Using all the principles shown before, we can construct the eigenvalue system for H̄ and then
solve for Ē. The detailed mathematical manipulations are given in the annex document “Study
of EM waves in Periodic Structures (mathematical details)”.
Note that to build the system, we need to evaluate the Fourier coefficients of the permittivity
(or the inverse of the permittivity, κ). We shall show how to get these coefficients for the case
of infinite dielectric rods ²a of circular cross-section organized in a square lattice, embedded in
a background of ²b . We therefore place ourselves in a 2D situation where the Ḡ vector will be
written Ḡρ to denote that it does not depend on z (and similarly R̄ will be noted R̄ρ ).
1
X Z
iḠρ ·ρ
²(ρ) = ²̃(Ḡρ )e , ²̃(Ḡρ ) = ²(ρ)e−iḠρ ·ρ , (19)
Ω Ω
Ḡρ
where again the subscript ρ in R̄ρ denotes a dependency on x and y only, Rc is the radius of the
dielectric rods and S denotes the step function. Merging these two equations, we get:
Z · ¸
1 X
²̃(Ḡρ ) = ²b + (²a − ²b ) S(Rc − |ρ − R̄ρ |) e−iḠρ ·ρ dρ ,
Ω Ω
R̄ρ
1 1
Z Z X
−iḠρ ·ρ
= ²b e dρ + (²a − ²b ) S(Rc − |ρ − R̄ρ |)e−iḠρ ·ρ dρ . (21)
Ω Ω Ω Ω
R̄ρ
Let us call the first integral I1 (Ḡρ ) and the second integral I2 (Ḡρ ), and evaluate them separately.
Evaluation of I1 (ρ)
8 Section 5. Mathematical details
Evaluation of I2 (ρ)
For the second integral, we can make the change of variable ρ0 = ρ − R̄ρ . Since ρ spans the
whole domain Ω and R̄ρ is the translational vector, ρ0 spans the whole space. We can therefore
replace the sum of integrals over Ω by a single integral over the whole 2D space. We write then:
If Ḡρ = 0:
²a − ² b πRc2
ZZ
I2 (Ḡρ ) = dρ0 S(Rc − |ρ − R̄ρ |) = (²a − ²b ) = fr (²a − ²b ), (23)
Ω Ω
πRc2
where fr = Ω is the fractional volume.
If Ḡρ 6= 0:
ZZ ∞
1 0
I2 (Ḡρ ) = (²a − ²b ) dρ0 S(Rc − |ρ − R̄ρ |) e−iḠρ ·ρ
Ω −∞
Z Rc Z 2π
²a − ² b 0
= 0 0
dρ ρ dφe−iGρ ρ cos(φ−θ)
Ω 0 0
Z Rc
²a − ² b
= 2π dρ0 ρ0 J0 (ρ0 Gρ ), (24)
Ω 0
where we have used the change of variable (now standard) x0 = ρ0 sin φ, y 0 = ρ0 cos φ,
Gx = Gρ sin θ, Gy = Gρ cos θ, and the well-known identity for the Bessel function. Upon
R
using xJ0 (αx)dx = x/α J1 (αx), we continue with
· 0 ¸ Rc
2π ρ
I2 (Ḡρ ) = (²a − ²b ) J1 (Gρ ρ0 )
Ω Gρ 0
2J1 (Gρ Rc )
= fr (²a − ²b ) . (25)
Gρ Rc
The reconstruction of the permittivity is then straightforward, and examples of two lattice
are given in Fig. 7.
9
6 Dispersion curves
At this point, we have everything to build the eigensystem (34) given in the additional document
“Study of EM waves in periodic structures (mathematical details)”. Solving it gives a set of
eigenvalues that are directly related to the dispersion curve of the material. An example is given
in Fig. 8.
0.7
0.6
Frequency ω a/2π c
0.5
0.4
0.3
0.2
TE
PSfrag replacements TM
0.1
0
Γ X M Γ
r0 = r + R = r + α1 a1 + α2 a2 + α3 a3 , (1)
2. The spectral domain is defined by a basis, (b1 , b2 , b3 ), and similarly, the translational vector
is written as
G = β1 b1 + β2 b2 + β3 b3 , (2)
3. The two basis are linked since the functions (fields, permittivity) are periodic. For example,
if we write the permittivity:
1
ZZZ
iG·r
dr3 ²(r) e−iG·r . (3)
X
Fourier expansion: ²(r) = ²̃(G) e where ²̃(G) =
Vcell
G
²̃(G) eiG·(r+R)
X
Periodicity: ²(r + R) =
G
²̃(G) eiG·r eiG·R = ²(r)
X
= (4)
G
bj · ai = 2πδij . (6)
1
2 Section 1. Introduction: periodic media nomenclature
4. Bloch-Floquet theorem:
Since EM fields are periodic, we can write them as a propagating function times a function
with the same periodicity as the medium:
eiG·r ,
X
ζ (r) = ζ̃ (8)
k G
G
e ei(k +G)·r ,
X
E (r) = (9a)
k G
G
h ei(k +G)·r .
X
H (r) = (9b)
k G
G
From Maxwell’s equation, we can easily obtain the following wave equations in source-free
regions (with ² = ²(r)):
µ ¶2
ω
∇ × ∇ × E(r) = µr ²r (r) E(r) , (10a)
c
· ¸ µ ¶2
1 ω
∇× ∇ × H(r) = µr H(r) , (10b)
²r (r) c
To make these equations more symmetrical, we shall work with 1/²r (r) instead of ²r (r)
directly, so that we define
1
κ̃r (G) eiG·r .
X
κr (r) = = (11)
²r (r)
G
µ ¶2
ω
κr (r)∇ × ∇ × E(r) = µr E(r) , (12a)
c
· ¸ µ ¶2
ω
∇ × κr (r)∇ × H(r) = µr H(r) . (12b)
c
3
· ¸ µ ¶2 X X
0
ω
(k + G ) × (k + G ) × e 0 ei(k +G )·r = ²̃r (G − G ) e 0 ei(k +G)·r . (17)
X 0 0 0
− µr
0
G c 0
G
G G G
00
We can simplify by exp (ik · r) and multiply by exp (−iG · r) to get:
· ¸ µ ¶2 X X
0
i(G −G )·r
00
ω 00
²̃r (G − G ) e 0 ei(G−G )·r . (18)
X 0 0 0
− (k + G ) × (k + G ) × e 0 e = µr
0
G c 0
G
G G G
If we integrate this equation over the entire space, we can pull all the terms out of the
0 00 00
integral, except ei(G −G )·r on the left-hand side and. ei(G−G )·r on the right-hand side. Yet,
we have
1
ZZZ 00
dr3 ei(G−G )·r =
00
3
δ(G − G ) , (19)
V (2π)
00
so that Eq. (18) becomes (upon substituting G by G since these are dummy variables):
· ¸ µ ¶2 X
ω 0
−(k + G) × (k + G) × e = µr ²̃r (G − G ) e 0 , ∀G . (20)
G c 0
G
G
4 2.2 Method 2: expansion of the inverse of the permittivity
· ¸ µ ¶2 X
i(k +G +G )·r
0 00
ω 0
e 0 ei(k +G )·r . (21)
00
XX 0 0
− κ̃r (G )(k + G ) × (k + G ) × e 0 e = µr
00 0
G c 0
G
G G G
0 00
which, upon substituting G = G + G , simplifying by exp (ik · r), multiplying by
00 00
exp (−iG · r), integrating over the whole space, using Eq. (19) and finally substituting G
by G, becomes:
· ¸ µ ¶2
X 0 0 0 ω
− κ̃r (G − G )(k + G ) × (k + G ) × e 0 = µr e , ∀G . (22)
0
G c G
G
0
(k + G ) · h 0 = 0 . (23)
G
We can therefore define three vectors (ê1 , ê2 , ê3 ) such that
0 0
k + G = |k + G | ê3 , (24a)
ê1 · ê3 = ê2 · ê3 = 0 , (24b)
and (ê1 , ê2 , ê3 ) for an orthonormal tryad. In that case, we can decompose
X
h 0 = h1 0 ê1 + h2 0 ê2 = hλ 0 êλ . (25)
G G G G
λ=1,2
We need now to introduce this expression into Eq. (12b). First, we compute
· ¸ 0
(k + G ) × êλ ei(k +G )·r ,
XX 0
∇ × H (r) = i hλ 0 (26)
k G
0 λ
G
so that
· ¸ 0 00
κ̃r (G ) (k + G ) × êλ ei(k +G +G )·r
XXX 00 0
κr (r)∇ × H (r) = i hλ 0
k G
G G λ
00 0
· ¸
κ̃r (G − G ) (k + G ) × êλ ei(k +G)·r .
XXX 0 0
= i hλ 0 (27)
G
0 λ
G G
Taking the next curl, we write:
5
· ¸ · ¸
κ̃r (G−G ) (k+G)×[(k+G )×êλ ] ei(k +G)·r , (28)
XXX 0 0
∇× κr (r)∇×H (r) = − hλ 0
k G
0 λ
G G
· µ ¶2 X X ¸
ω 0
i(k +G)·r
hλ 0 ei(k +G )·r êλ .
XXX 0 0
− hλ 0 κ̃r (G−G ) (k+G)×[(k+G )×êλ ] e = µr
G c G
G G λ
0
G λ
0
(29)
Always by the same token (multiplying by the proper functions and integrating over whole
space), we write:
· ¸ µ ¶2 X
XX 0 0 ω
− hλ 0 κ̃r (G − G ) (k + G) × [(k + G ) × êλ ] = µr hλ00 êλ00 ∀G . (30)
G c G
0 λ 00 λ
G
We can further simplify this expression by dot-multiplying the equation by êλ0 and noting
that (using C · (A × B) = B · (C × A))
· ¸ · ¸ · ¸
0 0
(k + G) × [(k + G ) × êλ ] · êλ0 = − (k + G ) × êλ · (k + G) × êλ0 (31)
X X½· ¸ · ¸¾ µ ¶2
0 0 ω
(k + G ) × êλ · (k + G) × êλ0 κ̃r (G − G ) hλ 0 = µr h λ 0 . (32)
G c G
0 λ
G
0 00 0 00 0
Upon exchanging G and G (transformations: G → G , G → G, G → G ), we obtain
X X½· ¸ · ¸¾ µ ¶2
0 0 ω
(k + G) × êλ · (k + G ) × êλ0 κ̃r (G − G) hλ = µr h λ 0 0 . (33)
G c G
λ
G
which is the relation given in [Joannopoulos et al., 1995, p. 129]. Upon using the same
notation, we rewrite Eq. (33) as:
µ ¶2
ω
Θk
X
h = µr h , (34a)
(λG),(λG)0 (λG) c (λG)0
λG
where
· ¸ · ¸
Θk
0 0
= κ̃r (G − G) (k + G) × êλ · (k + G ) × êλ0 . (34b)
(λG),(λG)0
6 3.1 Matrix form
Remembering that ê3 × ê1 = ê2 and ê3 × ê2 = −ê1 , we can write:
à !
ê2 · ê2 −ê2 · ê1
[ê3 × êλ ] · [ê3 × êλ0 ] = , (36)
−ê1 · ê2 ê1 · ê1
so that we write the operator as:
· ¸ · ¸
Θk
0 0
= κ̃ r (G − G) (k + G) × ê λ · ( k + G ) × ê λ 0
(λG),(λG)0
¯ ¯ ¯ ¯ Ã !
0 ¯ ¯ ¯ 0 ¯ ê 2 · ê 2 −ê 2 · ê 1
= κ̃r (G − G) ¯¯(k + G)¯¯ · ¯¯(k + G )¯¯ , (37)
−ê1 · ê2 ê1 · ê1
1 Introduction
In the previous class, we have introduced various concepts necessary for the study of EM waves
in photonic crystal structures. We shall now use these concepts to explain various results such
as:
In particular, we will show an example of how a periodic structure can exhibit k-surfaces typical
of a negative refraction material (the concept of k-surface for left-handed materials was first
introduced in the class of February 24, 2003).
This class is based on the following two references::
1
2 Section 2. Retrieving the permittivity
² f + ² (1 − f ) if Gρ = 0
a r b r
²̃(Gρ ) = (1)
(²a − ²b )fr 1 ρ Rc )
2J (G
elsewhere.
Gρ Rc
Yet, the information on the basis is also included in Eq. (1), since both Gρ and fr will depend
on it.
It is straightforward to show that:
µ ¶2
Rc
☞ Square lattice: fr = π (where Rc is the radius of the inclusions and a the
a
lattice constant).
2π Rc 2
µ ¶
☞ Triangular lattice: fr = √ .
3 a
An illustration for square and triangular lattices is given in Fig. 1 (note that since the
permittivity is obtained from its Fourier coefficients, a unavoidable Gibbs phenomenon will
occur).
(a) Square lattice (fr = 38.48%). (b) Triangular lattice (fr = 44.43%).
3 Band diagrams
The purpose of all the mathematical developments presented so far (getting the Fourier coef-
ficients of the permittivity, building the eigensystem, etc) is to eventually obtain the eigenval-
ues and eigenvectors of the problem. Eigenvalues correspond to dispersion diagrams whereas
eigenvectors correspond to the actual field distributions. We shall limit ourselves here to the
consideration of eigenvalues only.
ωk 2
µ ¶
Θ H̄(r̄) = H̄(r̄) , (2a)
c
· ¸
1
Θ=∇× ∇× , (2b)
²(r̄)
(with the condition ∇ · H̄(r̄) = 0 to reduce the size of the system), we get a set of eigenvalues
for each incident k̄. The band diagrams are then constructed by sweeping all possible k̄.
Because of the periodicity of the medium the “all possible” k̄ can be reduced to the first
Brillouin zone and, by symmetry, further reduced to the irreducible Brillouin zone. In addition,
as we have seen, it is also enough to span the edge of the Brillouin zone since it corresponds to
the maximum diffraction condition. Again by symmetry, we further reduce the domain to the
edge of the irreducile Brillouin zone.
This zone will of course depend on the lattice, as is shown in Figs 2 and 3 of the March 19
2003 notes. For both cases, the irreducible Brillouin zone is depicted in Fig. 2.
M
M K
X
Γ
Γ
Figure 2: Irreducible Brillouin zones (red region) for (a) a square lattice and (b) a
triangular lattice. Each region is defined between symmetry points of the crystal.
In order to span the edge of the irreducible Brillouin zone, we therefore need to know the
coordinates of the symmetry points of the crystal. It is easy to show that:
4 Section 3. Band diagrams
• Square lattice:
Γ → (kx = 0, ky = 0) , (3a)
π
X → (kx = , ky = 0) , (3b)
a
π π
M → (kx = , ky = ) . (3c)
a a
• Triangular lattice:
Γ → (kx = 0, ky = 0) , (4a)
2π 2π
K → (kx = , ky = √ ) , (4b)
3a 3a
2π
M → (kx = 0, ky = √ ) . (4c)
3a
Having these limit points, we can sweep k̄, solve the eigensystem and obtain the eigenvalues.
An example is given in Fig. 3.
5
0.8
0.7
Frequency ω a/2π c
0.6
0.5
0.4
0.3
0.2
PSfrag replacements
0.1
0
Γ X M Γ
0.8
0.7
Frequency ω a/2π c
0.6
0.5
0.4
0.3
0.2
PSfrag replacements
0.1
0
Γ M K Γ
Figure 3: Band diagram for TE and TM modes as function of the normalized fre-
quency for (a) a square lattice and (b) a triangular lattice. Notice the absolute band
gap for TM modes. Parameters are: ²a = 1, ²b = 13, Rc /a = 0.48. Important note:
TE modes are here defined as transverse to the axis of the crystal (z axis), therefore
with an in-plane electric field!! Note also that these curves have not yet fully reached
convergence.
6 Section 4. k-surfaces
4 k-surfaces
4.1 Well-chosen example
We can generalize the band diagram of the previous section (which, again, spans only the edge
of the Brillouin zone), to the entire zone or even to the entire photonic crystal.
The example we shall consider from now on is taken from [1] and the parameters are sum-
marized in Tab. 1. In addition, we shall work with TE modes (remember that in this notation,
TE corresponds to an in-plane electric field).
Lattice: square
Inclusions: ²a = 1
Background: ²b = 12
Radius : Rc /a = 0.35
Table 1: Parameters for our example [1].
0.4
Frequency ω a/2π c
0.3
0.2
TE
0.1
PSfrag replacements
Γ
Γ
0
0 X M 1
Figure 4: Band diagram for TE modes for the structure defined in Tab. 1. The
horizontal line is at the frequency corresponding to a change of curvature of the k-
surface, and the green line is the light-line shifted to M.
.
7
We can also extend the plot from the edge of the irreducible Brillouin zone to the entire
structure, for various eigenstates. This is represented in Figs. 5 to 7.
1. Light-line: the light-line gives the radius of the k-surface of an EM wave impinging on
the crystal from free-space. In order to have a possible coupling, whole (or part) of the
free-space k-surface has to be included in one of the k-surfaces of the crystal. The light-line
is represented in Fig. 4 by the green line (it is actually the translation of the light-line to
M ). Its intersection with say the curve of the first eigenvalue gives the maximal frequency
for which the condition mentioned above (total inclusion of the free-space k-surface into a
k-surface of the crystal) is satisfied.
Note also that in order to have a negative refraction, the free-space k-surface has to be
included in one of the k-surfaces converging to M and therefore, the actual crystal needs
to be rotated by 45◦ .
2. Power bending: around M , the power is converging to a single point when frequency is
increased. Yet, the direction of the power can be deduced from the gradient of the k-curves
and therefore, directly from their radius of curvature. Hence, if the radius of curvature is
such that the gradient is pointing toward M , the refraction will be negative.
The frequency at which the radius of curvature diverge is given by the horizontal line in
Fig. 4, which can be obtained by a direct inspection of Fig. 5(c).
As it can be seen, the power can bend on the opposite side of the normal. However, the
phase is still propagating forward, which justifies the title of the paper: “All-angle negative
refraction without negative index”.
We can also examine the second eigenvalue and see if a similar phenomenon can appear. The
k-surface has been depicted in Fig. 6(b) and is shown again in Fig. 9 with less curves represented.
It is clear from this figure that again, there is a point at which the radius of curvature of the
k-curves changes, and energy converges to a single point (Γ this time) as frequency increases.
It is therefore again possible to have a negative index of refraction. Notice that in this case
8 4.2 Negative refraction
0.25
0.2
0.15
0.1
0.05
0
4000
2000 4000
2000
0
0
−2000
−2000
−4000 −4000
0.22
160
0.2
140
0.18
120
0.16
100 0.14
0.12
80
0.1
60
0.08
40
0.06
20 0.04
0.02
20 40 60 80 100 120 140 160
(c) 2D k-surface.
0.36
0.34
0.32
0.3
0.28
0.26
4000
2000 4000
2000
0
0
−2000
−2000
−4000 −4000
160 0.34
140
0.33
120
0.32
100
0.31
80
60 0.3
40 0.29
20
0.28
(b) 2D k-surface.
however, because of phase matching, power and phase are in directions that make an angle
greater than π/2. Although it is not necessarily π like in a pure left-handed regime, it is still in
the regime of left-handed behavior. Fig. 10 can be completed to see this phenomenon.
10 4.2 Negative refraction
0.45
0.44
0.43
0.42
0.41
0.4
0.39
0.38
4000
2000 4000
2000
0
0
−2000
−2000
−4000 −4000
160
0.435
140
0.43
120 0.425
100 0.42
80 0.415
60 0.41
0.405
40
0.4
20
0.395
20 40 60 80 100 120 140 160
(b) 2D k-surface.
50
0.3
45
40
0.29
35
30
0.28
25
20
0.27
15
10 0.26
0.25
5 10 15 20 25 30 35 40 45 50 55
1.2
34
32
0.8
30 0.6
0.4
28
0.2
26
0
−0.2
24
−0.4
22
−0.6
22 24 26 28 30 32 34
Figure 10: k-surface for the second eigenvalue for the specific frequency where the
radius of curvature is positive.