Академический Документы
Профессиональный Документы
Культура Документы
x
Hx
Hx
x
1
Hx
x
n
_ _
;
T
x
Hx Hx:
2. IDA-PBC and I&I for underactuated mechanical systems
Both design methodologies are briey introduced, and their
structural similarities are highlighted. We consider lossless mechan-
ical systems in Hamiltonian representation
:
_
q
_
p
_ _
0 I
I 0
_ _
T
q
Hq; p
T
p
Hq; p
_
_
_
_
0
Gq
_ _
u: 1
qAR
n
and pAR
n
denote the vectors of generalized coordinates
and momenta, uAR
m
, mon, the vector of input forces/torques and
GqAR
nm
the input matrix which has full column rank m in
the considered operation range of the system. I AR
nn
is the
corresponding identity matrix. The Hamiltonian or total energy of
the system is assumed to have a simple mechanical structure, i.e. it is
composed of a kinetic energy T : R
n
R
n
-R plus potential energy
V : R
n
-R:
Hq; p
1
2
p
T
M
1
qp
..
Tq;p
Vq; Mq M
T
q40; 2
where Mq AR
nn
denotes the mass matrix.
Remark 1. Physical damping, especially in unactuated coordi-
nates, can produce additional difculties in the design process,
and is therefore excluded in this contribution. In IDA-PBC, it is not
possible to generate a closed-loop system of the above simple
mechanical form if a dissipation condition [6] is violated. By
relaxing the structure of the target system, the problem can be
obviated and stabilizing IDA-PBC controllers can be derived [9],
though in a more complicated procedure. Interestingly, the result-
ing controllers contain structures which also appear when sign-
indenite damping in mechanical systems is considered [18],
a relation which gives rise to future investigations.
2.1. IDA-PBC
The IDA-PBC state feedback control law
u
IDA
u
es
u
di
is composed of two parts corresponding to the design steps energy
shaping and damping injection.
Energy shaping: The goal of energy shaping is to nd a non-
linear state feedback law u
es
q; p; v to transform (1) into another
lossless pH system
d
:
_
q
_
p
_ _
0 J
1
q
J
T
1
q 0
_ _
T
q
H
d
q; p
T
p
H
d
q; p
_
_
_
_
0
Gq
_ _
v 3
with new input v and a closed-loop (virtual) energy
H
d
q; p
1
2
p
T
M
1
d
qp
..
T
d
q;p
V
d
q; M
d
q M
T
d
q40; 4
which possesses again a simple mechanical structure. The virtual
potential energy V
d
q must admit a minimum at the new desired
equilibrium q
n
such that H
d
q; p serves as a storage function for
(3) with the collocated output
y G
T
q
T
p
H
d
q; p: 5
Accounting for the fact that generalized velocities and momenta
also in the closed loop are related by
_
q M
1
qp, it follows that
J
1
q M
1
qM
d
q:
Furthermore, it must be ensured that the dynamics, which is
not affected by the control input, remains unchanged. This is
expressed by the matching PDE
G
?
q
T
q
Hq; p G
?
qJ
T
1
q
T
q
H
d
q; p: 6
The full rank left hand annihilator G
?
qAR
nmn
with the
property G
?
qGq 0 eliminates the input functions from
the matching equation, which results from the comparison of
the second rows of (1) and (3). The energy shaping control nally
follows from solving the matching equation for u, using the
pseudo-inverse G
q G
T
qGq
1
G
T
q:
u u
es
v G
q J
T
1
q
T
q
H
d
q; p
T
q
Hq; p v: 7
Damping injection. With a feedback of the collocated output,
e.g. by a constant mm matrix K
di
40,
v u
di
K
di
G
T
q
T
p
H
d
q; p
P. Kotyczka, I. Sarras / European Journal of Control 19 (2013) 445453 446
(damping injection), the new equilibrium q
n
; 0 of the passive
system (3) and (5) is stabilized asymptotically under the condition
of zero state detectability [5]. As a result, the interconnection and
damping matrix in (3) becomes
0 J
1
q
J
T
1
q R
2
q
_ _
; R
2
q GqK
di
G
T
qZ0:
2.2. Immersion and invariance
On the other hand, in the I&I approach for pH systems the
desired behavior of the system to be controlled is captured by the
choice of a target dynamical pH system, of lower dimension than
the original system 2so2n,
t
:
_
q
_
p
_
_
_
_
0
q
q
R
t
_ _
T
q
H
t
p
H
t
_
_
_
_
;
p
_ _
8
with the Hamiltonian function H
t
: R
s
R
s
-R
H
t
q
;
p
1
2
T
p
M
1
t
q
p
..
Tt
q
;
p
V
t
q
;
and states
q
;
p
AR
s
. Moreover, M
t
M
T
t
40 is the target inertia
matrix, R
t
R
T
t
Z0 is the target damping matrix, both of dimen-
sion s s, and T
t
, V
t
dene the target kinetic and potential energy
functions, respectively.
t
has an asymptotically stable equili-
brium at
n
q
; 0 if V
t
q
has a strict minimum at
n
q
.
The objective is to nd a controller which guarantees that the
closed-loop system asymptotically behaves like the target system
(
t
) achieving asymptotic model matching, as opposed to the more
restrictive exact matching of IDA-PBC (or correspondingly the
Controlled Lagrangians method). This is formalized by nding a
manifold in state space that can be rendered invariant and
attractive, with internal dynamics a copy of the desired closed-
loop dynamics, and designing a control law that steers the system
state towards the manifold. The following steps summarize the I&I
approach [2]:
Immersion map: Choosing the immersion mapping as
_ _
;
with
i
: R
s
R
s
-R
n
, the matching (or immersion) conditions can
be written, with x q
T
p
T
T
AR
2n
the vector of coordinates and
momenta of the original system:
T
p
Hj
x
1
_
T
q
Hj
x
G
1
c
2
_
:
Both immersion conditions must be satised by all available
design parameters in the target system (8) as well as the immer-
sion mapping and the virtual feedback control law c that
renders the manifold invariant. The rst equation is related to the
equation
T
p
H J
1
T
p
H
d
in the IDA-PBC approach, which denes the
matrix J
1
q. Multiplying the second equation with the left hand
annihilator G
?
and replacing the target dynamics on the
right hand side, a PDE is obtained which has a structure similar to
the IDA-PBC matching PDE (6):
G
?
T
q
Hj
x
G
?
2
_
: 9
Implicit manifold: The manifold which is to be rendered
attractive and invariant is dened as
M fxAR
n
R
n
jx 0g fxAR
n
R
n
jx for some AR
s
R
s
g:
The mapping x describes the distance of the state x from the
target manifold and therefore denes the so-called off-manifold
coordinates x. Together with the on-manifold coordinates ,
a regular coordinate change can be dened, which will be
exploited later (Fig. 1).
Manifold attractivity and trajectory boundedness: The I&I control
law x; has to ensure that the trajectories of the closed-loop
system
_
x
0 I
I 0
_ _
T
q
Hq; p
T
p
Hq; p
_
_
_
_
0
Gq
_ _
x;
remain bounded, while the target manifold is approached asymp-
totically, i.e. the off-manifold coordinates which satisfy
_
x
x
0 I
I 0
_ _
T
q
Hq; p
T
p
Hq; p
_
_
_
_
0
Gq
_ _
x;
_
_
_
_
_
_
;
tend to zero: lim
t-1
t 0.
Immersion vs. matching PDE. At this point, the IDA-PBC match-
ing PDE (6) is reminded and compared to the I&I immersion PDE
(9). Both PDEs can be split into a kinetic and a potential energy
part. Below, the equations are set one above the other in order to
underscore the structural similarity (arguments are omitted for
brevity):
Kinetic energy:
G
?
j
x
T
q
Tj
x
q
T
t
2
R
t
p
T
t
0
G
?
T
q
T M
d
M
1
T
q
T
d
R
2
T
p
T
d
0
Potential energy:
G
?
j
x
T
q
Vj
x
q
V
t
0
G
?
T
q
V M
d
M
1
T
q
V
d
0
This similarity will be exploited in the following section at the
example of the Acrobot in order to derive corresponding control
laws from both perspectives.
3. A comparative study with the Acrobot
The considered system is the so-called Acrobot, a 2 DOF under-
actuated mechanical system with two rigid links, see Fig. 2. The
pivot rotates freely, the intermediate joint is actuated. The control
task is to stabilize the unstable upright stretched-out congura-
tion of the mechanism q
n
1
q
n
2
0.
3.1. Model
The Hamiltonian description of the system dynamics with
generalized coordinates and momenta q q
1
q
2
T
, p p
1
p
2
T
,
Fig. 1. Illustration of asymptotic matching in I&I.
P. Kotyczka, I. Sarras / European Journal of Control 19 (2013) 445453 447
is given by (1) and (2), with the inertia matrix
Mq
2
c
1
c
2
2c
3
cos q
2
c
2
c
3
cos q
2
c
2
c
3
cos q
2
c
2
_ _
;
which only depends on q
2
, and the potential energy
Vq c
4
cos q
1
c
5
cos q
1
q
2
:
The constants c
1
; ; c
5
depend on masses, lengths and inertia.
Physical dissipation is not considered, as it has been already
discussed above. Only the intermediate joint is actuated, conse-
quently the input vector is G e
2
, and the corresponding left hand
annihilator is G
?
e
T
1
, where e
i
denotes the i-th unit column
vector.
3.2. IDA-PBC
The IDA-PBC matching PDE according to (6) is
q
1
_
1
2
p
T
M
1
q
2
pVq
_
e
T
1
M
d
qM
1
q
2
T
q
_
1
2
p
T
M
1
d
qpV
d
q
_
:
10
The design parameters in M
d
q and V
d
q must be chosen such
that (10) is satised. It turns out that the kinetic part of this PDE is
trivially true for a closed-loop inertia matrix M
d
const:, and only
the potential energy matching PDE
m
T
d;1
M
1
q
2
T
V
d
q
q
1
Vq 11
has to be considered in the sequel. Notice that M
d
const: is one
possible parameterization to solve the IDA-PBC matching problem,
which in the course of the paper is used as a reference for the I&I
approach.
In the following, the inverse original mass matrix will be denoted
as M
1
q
2
Mq
2
, and column/elementwise:
Mq
2
m
1
q
2
m
2
q
2
m
11
q
2
m
12
q
2
m
12
q
2
m
22
q
2
_ _
:
The new mass matrix is written row/elementwise
M
d
m
T
d;1
m
T
d;2
_
_
_
_
m
d;11
m
d;12
m
d;12
m
d;22
_ _
:
The linear matching PDE (11) then has the structure
a
1
q
2
q
1
V
d
q a
2
q
2
q
2
V
d
q
q
1
Vq 12
with the coefcients
a
i
q
2
m
T
d;1
m
i
q
2
; i 1; 2;
which are summarized in the vector aq
2
a
1
q
2
a
2
q
2
T
.
Solution of the matching PDE. With the coordinate change
z tq,
z
1
t
1
q
1
; q
2
q
1
_
q
2
q
n
2
a
1
a
2
d
z
2
t
2
q
1
; q
2
q
2
and its inverse q t
1
z,
q
1
t
1
1
z
1
; z
2
z
1
_
z
2
z
n
2
a
1
a
2
d
q
2
t
1
2
z
1
; z
2
z
2
13
the potential energy matching PDE (12) is transformed into the
ODE
z
2
~
V
d
z
q
1
VJt
1
z
a
2
z
2
: 14
The right hand side can be simply integrated over z
2
and the
solution
~
V
d
z
~
V
p
d
z
~
V
h
d
z
1
T
q
t
1
q 0;
and therefore arbitrary functions of z
1
t
1
q can be added to the
particular solution in order to shape the resulting potential energy.
From previous work it is known that controller parameterizations
can be easily derived for the Acrobot such that prespecied
closed-loop local linear dynamics is achieved, while all IDA-PBC
design parameters meet the deniteness requirements M
d
40,
q
n
arg min V
d
q, and R
2
Z0, see [7].
Energy shaping: With the appropriately tuned potential energy,
the energy shaping control law (7) for the Acrobot is
u
es
m
T
d;2
Mq
2
T
V
d
q
q
2
Hq; p: 15
The closed-loop system under this controller is a lossless pH
system (3) with new positive denite energy H
d
q; p.
Damping injection: To asymptotically stabilize the new equili-
brium q
n
; 0, damping is injected via output feedback k
di
40
u
di
k
di
G
T
T
p
H
d
q; p k
di
m
T
d;2
p: 16
To prove that the equilibrium is indeed asymptotically stable,
it can be shown that q
n
; 0 is the only possible state which
remains in the set where
_
H
d
0 holds (LaSalle).
3.3. Immersion and invariance
In contrast to IDA-PBC, in I&I lower dimensional target dynamics
is specied. For the Acrobot a target pH system with only one
degree of freedom is assumed:
_
q
_
p
_ _
0
q
q
0
_ _
q
H
t
q
;
p
p
H
t
q
;
p
_ _
17
Fig. 2. Sketch of the Acrobot, u denotes the torque applied to the intermediate
joint.
P. Kotyczka, I. Sarras / European Journal of Control 19 (2013) 445453 448
with
H
t
q
;
p
1
2
2
p
m
t
V
t
q
: 18
Observe that the target dynamics, when H
t
q
;
p
is positive denite
around its equilibrium
n
q
; 0 is only Lyapunov stable, whereas
usually asymptotically stable target dynamics R
t
40 is specied
in I&I.
The target dynamics must be immersed into an invariant
manifold, i.e. a mapping must exist such that x
n
n
and,
in addition, a virtual control law c such that the trajectories of
the target system (on the manifold) are trajectories of the original
system. When the structure of the mapping according to
q
p
_ _
_ _
;
q
1
q
2
_ _
11
12
_ _
;
p
1
p
2
_ _
21
22
_ _
;
is assumed, the immersion condition is
0 I
I 0
_ _
T
q
H
T
p
H
_
_
_
_
x
0
G
_ _
c
2
_ _
0
0
_ _
q
H
t
p
H
t
_ _
:
19
This condition for the 2 DOF Acrobot system represents 3 scalar
equations which must be met by all free design parameters and
one equation which can be solved for the virtual control law
c.
Choice of design parameters: Multiplying the second row of (19)
with the left hand annihilator G
?
e
T
1
, where m
t
const: is
chosen (inspired by M
d
const: in IDA-PBC), yields
q
1
Vj
q
1
21
21
m
1
t
p
q
V
t
_ _
:
Dening
p
1
21
p
and
q
a
2
q
;
the equation becomes
q
V
t
q
1
VJ
1
a
2
:
The equation is structurally similar to (14), with the difference that
~
V
d
has 2 arguments z
1
and z
2
, while V
t
is only a function in
q
.
Solution of the immersion PDE: Dene the generalized coordi-
nate part of the immersion map q
1
in analogy to q t
1
z
in (13), with z
1
restricted to the equilibrium value z
1
z
n
1
:
q
1
11
q
z
n
1
_
q
n
q
a
1
a
2
d
q
2
12
q
q
: 20
Then a solution of this ODE is identical with the (known) solution
of the IDA-PBC potential energy PDE, restricted to z
1
z
n
1
:
V
t
q
~
V
d
z
n
1
;
q
:
The remaining design quantities
22
q
;
p
and m
t
are xed
such that the rst vector-valued row of (19) is true:
M
1
22
q
;
p
_ _
a
1
a
2
0
1 0
_
_
_
_
a
2
q
m
1
t
p
a
2
q
V
t
_ _
:
The two scalar equations can be solved (after multiplication with
M
q
) for the inertia of the target system
m
t
m
T
1
q
M
q
m
d1
m
d;11
;
which is constant, as has been assumed, as well as the remaining
component of the immersion mapping
p
2
22
p
m
d;12
m
d;11
p
:
Intermediate result: target dynamics. Given a solution of the IDA-
PBC energy shaping problem for the Acrobot as described in the
previous section with V
d
q
1
; q
2
or
~
V
d
z
1
; z
2
, the positive denite
closed-loop potential energy function and the constant positive
denite mass matrix M
d
m
d;ij
i;j 1;2
.
The 1 DOF system (17) with
m
t
m
d;11
V
t
q
~
V
d
z
n
1
;
q
q
m
T
d;1
m
2
_
_
z
n
1
_
n
q
m
T
d;1
m
1
m
T
d;1
m
2
p
m
d;12
m
d;11
p
_
_
_
_
21
is the corresponding immersion mapping x . The restriction
of the closed-loop potential energy
~
V
d
from IDA-PBC to the
equilibrium value of the characteristic coordinate z
1
z
n
1
provides
a potential energy function for the I&I target system.
What remains, is to express the invariant manifold implicitly
(dene off-manifold coordinates), choose a control law such that
the off-manifold coordinates tend to zero and the overall system
trajectories remain bounded. In the I&I approach the way how to
derive this stabilizing control law is not explicitly specied. In
contrast, the IDA-PBC procedure directly leads to the energy
shaping and damping injection control laws (15) and (16).
3.4. I&I stabilization and relation to IDA-PBC
In this subsection, the uncontrolled system is expressed in the
on- and off-manifold coordinates, corresponding to the above
dened immersion mapping. An obvious choice for the I&I con-
troller to keep the trajectories bounded turns out to be identically
the IDA-PBC energy shaping control law. Moreover, dissipation
introduced via IDA-PBC damping injection can be allocated in the
off-manifold subsystem.
Implicit manifold and manifold attractivity: A mapping x,
x q
T
p
T
T
which satises the set identity fx g fx 0g
is given by (for q
n
1
0)
x
q
1
_
q
2
q
n
2
a
1
a
2
d
p
2
m
d;12
m
d;11
p
1
_
_
_
p
_ _
: 22
A control law x; x has to be determined such that the
trajectories of the system
_
x f x gxx; x
remain bounded and lim
t-1
t 0, where
_
x
xf xgxx; x:
Then the invariant manifold is attractive and the equilibrium x
n
is
stable.
As the target dynamics (17) was dened only Lyapunov stable,
asymptotic stability of the desired equilibrium cannot be directly
deduced when the manifold is made attractive in the studied
P. Kotyczka, I. Sarras / European Journal of Control 19 (2013) 445453 449
example. Therefore, having in mind the IDA-PBC control laws
derived above, the solution of the stabilization problem is con-
sidered from another perspective:
System in on/off manifold coordinates: The immersion mapping
(21) and the off-manifold coordinates (22) dene the regular
coordinate change ; q; p with
p
_
_
_
1
q; p
2
q; p
q
2
p
1
_
_
_
_
:
In the following, the original open-loop model of the Acrobot will
be expressed in a pH form in the new coordinates and .
However, the energy gradient on the right hand side will not be
taken from the physical energy Hq; p, but the shaped closed-loop
energy H
d
q; p from the IDA-PBC design according to (4). Eval-
uated in the new coordinates , it can be written as
^
H
d
;
^
V
d
q
;
q
^
T
d
p
;
p
with the expressions for the potential and kinetic energy part
^
V
d
q
;
q
~
V
d
z
1
q
; z
2
q
and
^
T
d
p
;
p
1
2
m
d;11
jM
d
j
2
p
1
2m
d;11
2
p
:
The corresponding mass matrix for this kinetic energy term is
diagonal:
^
M
d
diag
jM
d
j
m
d;11
; m
d;11
_ _
:
Using these conventions, the open-loop system can be written as
_
q
_
p
_
q
_
p
_
_
_
0 j
12
0 0
0 0 0 0
0 j
32
0 j
34
0 0 j
34
0
_
_
_
q
^
H
d
p
^
H
d
q
^
H
d
p
^
H
d
_
_
_
0
f
p
u
0
0
_
_
_
_
with the elements
j
12
q
jM
d
j
jM
q
j
1
m
T
d;1
m
2
;
j
32
q
jM
d
j
jM
q
j
m
11
m
d;11
;
j
34
q
m
T
d;1
m
2
q
;
of the interconnection matrix. The expression
f
p
;
q
2
H
m
d;12
m
d;11
q
1
HJ
1
;
in the second ODE is nothing else than the drift termon the right hand
side of
_
p
_ p
2
m
d;12
=m
d;11
_ p
1
, expressed in transformed coordi-
nates. It ensures the equivalence of the above state differential
equations with the original open-loop system representation.
Trajectory boundedness: A control law which provides Lyapunov
stability of the equilibrium
n
;
n
q
n
; 0 is certainly given by
u
es
f
p
; j
12
q
^
H
d
; j
32
q
^
H
d
; : 23
Thus, the closed-loop system becomes
_
q
_
p
_
q
_
p
_
_
_
0 j
12
q
0 0
j
12
q
0 j
32
q
0
0 j
32
q
0 j
34
0 0 j
34
q
0
_
_
_
q
^
H
d
p
^
H
d
q
^
H
d
p
^
H
d
_
_
_
_
:
Both the - and the subsystem now are coupled via (a) the mixed
terms in
^
V
d
q
;
q
and (b) the element j
32
q
of the interconnection
matrix. Observe that
^
H
d
q
z
n
1
;
p
0 H
t
q
;
p
holds and indeed, with as output vector, the target systems (17)
and (18) represents the corresponding zero dynamics.
The system at this stage is lossless and lacks damping for
asymptotic stability of its equilibrium. A closer look at the control
law (23) and its transformation into the original states q; p
reveals that it exactly matches the IDA-PBC energy shaping state
feedback (15). Thus it seems appropriate to apply in addition the
damping injection feedback (16) derived above to achieve asymp-
totic stability.
Damping injection: The damping injection law
u
di
k
di
p
2
H
d
q; p k
di
2
^
H
d
;
leads to the closed-loop state representation
_
q
_
p
_
q
_
p
_
_
_
0 j
12
q
0 0
j
12
q
k
di
j
32
q
0
0 j
32
q
0 j
34
0 0 j
34
q
0
_
_
_
q
^
H
d
p
^
H
d
q
^
H
d
p
^
H
d
_
_
_
_
;
which, as already has been mentioned in the IDA-PBC part, has the
desired asymptotically stable equilibrium.
4. Linear mechanical systems
The equivalence result derived for the Acrobot system relies on
the solution of the homogeneous matching PDE in IDA-PBC which
denes the characteristic coordinate z
1
t
1
q. The corresponding
(partial) transformation can be used at the same time to dene a
part of the immersion mapping in I&I. For a system with n degrees
of freedom and only one passive joint, the homogeneous version
of the potential energy matching PDE has n1 locally indepen-
dent solutions which dene n1 characteristic coordinates to
shape the resulting closed-loop energy. However, the transforma-
tion can in general not be expressed in a simple analytic way as in
the Acrobot 2 DOF example
1
. Determining expressions for the
characteristic coordinate functions, which identically can be used
in the I&I design process, is a problem itself.
This section briey sketches some ideas on how to approach
the equivalence question for linear mechanical systems. First, from
the perspective of the mentioned class of systems with an
unactuated pivot, then from a more general point of view.
4.1. Kinematic chains with unactuated pivot
In the linear (or linearized) case, considered in this section, the
mass matrices M and M
d
are both constant, and the expressions
for the potential energies V and V
d
are quadratic forms in q.
According to Section 3, the I&I target system is supposed to have 1
DOF.
IDA-PBC matching PDE: The potential energy matching PDE in
IDA-PBC can be written, as a generalization of (12) to the n DOF
case,
q
V
d
q a
q
1
Vq3a
1
q
1
V
d
qa
n
q
n
V
d
q
q
1
Vq;
1
At least, if the approach as sketched above is followed, with M
d
const: and a
simple choice of G
?
.
P. Kotyczka, I. Sarras / European Journal of Control 19 (2013) 445453 450
with the constants
a
i
m
T
d;1
m
i
; i 1; ; n;
and reminding that m
i
, m
d;i
denote the i-th columns of the
matrices M M
1
and M
d
, respectively. For aAR
n
, there exists
a square matrix T AR
nn
such that the rst n1 elements of the
vector Ta are zero, e.g.
T
t
T
1
t
T
n
_
_
_
_ I
n
0
nn1
a
a
n
_ _
:
The coordinate change z Tq with
z
i
q
i
a
i
a
n
q
n
; i 1; ; n1 z
n
q
n
and its inverse q T
1
z with
q
i
z
i
a
i
a
n
z
n
; i 1; ; n1 q
n
z
n
renders the potential energy matching PDE an ODE
zn
~
V
d
z
q
1
VJT
1
z
a
n
:
The structure coincides with (14), and consequently, the solution
can be written as
~
V
d
z
~
V
p
d
z
~
V
h
d
z
1
; ; z
n1
:
In this case, n1 characteristic coordinates are available for
energy shaping.
I&I immersion mapping: Inspired by the 2 DOF case, the
following immersion mapping is proposed:
q
1
q
n1
q
n
_
_
_
1;1
1;n1
1;n
_
_
z
n
1
a
1
an
q
z
n
n1
a
n 1
an
q
q
_
_
_
_
and
p
2
p
m
d;1
m
d;11
p
;
where m
d;1
is the rst column of M
d
. Moreover, dening
m
T
d;1
m
n
a
n
; m
t
m
d;11
;
it turns out that the unactuated parts of the immersion condition
(19) are satised. Particularly, the immersion PDE
q
V
t
q
1
VJ
1
a
n
has again the same structure as the transformed matching PDE
from IDA-PBC, restricted to the equilibrium values of z
n
1
; ; z
n
n1
,
and therefore the corresponding solution is
V
t
q
~
V
d
z
n
1
; ; z
n
n1
;
q
:
Consequently, the 1 DOF target systems (17) and (18), with the
above choices of , the target mass m
t
and the potential energy
V
t
q
is a feasible target system for the n DOF linearized kinematic
chain with unactuated pivot. The statement can be proven in a
straightforward way by replacing the corresponding terms in the
immersion condition (19).
The discussion on deriving feedback laws to keep the closed-
loop trajectories bounded and the target manifold attractive, in
accordance with the energy shaping and damping injection steps
of IDA-PBC, as carried out in Subsection 3.3, is omitted here.
4.2. General linear mechanical systems
In the preceding subsection the target system was chosen to have
only 1 DOF in order to capture the desired behavior of the unactuated
pivot. In this part, our focus is turned to the case of linear mechanical
systems with an even number of n DOF and a choice of a higher
dimensional target system.
To this end, consider the linear version of the pH system (1)
and express it as a partition of two pH subsystems with the
unactuated state x
1
q
T
1
; p
T
1
T
and the actuated state x
2
q
T
2
; p
T
2
T
,
x
1
; x
2
AR
n
, as
:
_
x
1
_
x
2
_ _
J
1
0
0 J
2
_ _
P
1
x
1
P
2
x
2
_ _
0
G
2
_ _
u; 24
with uAR
n=2
the vector of inputs.
2
The corresponding Hamiltonian
is given as
Hx
1
; x
2
1
2
x
T
1
P
1
x
1
1
2
x
T
2
P
2
x
2
with positive denite diagonal matrices P
1
, P
2
AR
nn
.
Remark 2. Note that the dynamics (1) can always be transformed
into the dynamics (24), that have decoupled the x
1
from the x
2
dynamics, by an appropriate coordinate change, see for example [13].
IDA-PBC: Consider now the coordinate change dened by
x
1
x
2
_ _
x
1
x
2
x
1
_ _
x
1
_ _
with AR
nn
, rank n. The resulting pH dynamics take the form
~
:
_ x
1
_
_ _
J
1
J
1
T
J
1
J
2
J
1
T
_ _
P
1
T
P
2
x
1
T
P
2
P
2
x
1
P
2
_ _
0
G
2
_ _
u;
25
with the transformed Hamiltonian function given by
~
Hx
1
;
1
2
x
T
1
P
1
T
P
2
x
1
1
2
T
P
2
T
P
2
x
1
:
Following the IDA-PBC methodology as presented in Subsection
2.1, the desired (transformed) pH dynamics are chosen similarly to (3)
as
3
d
:
_
x
1
_
_ _
J
d
1
R
d
1
0
0 J
d
2
R
d
2
_
_
_
_
P
d
1
x
1
P
d
2
_
_
_
_
; 26
with the desired energy function
~
H
d
x
1
;
1
2
x
T
1
P
d
1
x
1
1
2
T
P
d
2
;
and positive denite diagonal matrices P
d
1
, P
d
2
AR
nn
.
Matching the transformed dynamics (25) with the desired
dynamics (26) yields
Matching conditions:
J
1
P
1
x
1
J
d
1
R
d
1
P
d
1
x
1
J
2
P
2
J
1
P
1
x
1
J
2
P
2
G
2
u J
d
2
R
d
2
P
d
2
:
The rst of the matching conditions xes the dynamics of the
unactuated subsystem while multiplying the second matching
equation by the left hand annihilator G
?
2
of G
2
yields, after
2
The choice of the dimensions x
1
; x
2
AR
n
, uAR
n=2
, where n is an even number,
is made for simplicity of presentation.
3
In the expressions of the desired closed-loop system presented here, the
dissipation terms resulting from the damping injection are included in order to
obtain an insight on how this would affect the matching conditions and the proof
of asymptotic stability. Accordingly for the choice of the target system in I&I
dened below.
P. Kotyczka, I. Sarras / European Journal of Control 19 (2013) 445453 451
comparing the terms in x
1
and ,
G
?
2
J
2
P
2
J
1
P
1
0 27
G
?
2
J
2
P
2
J
d
2
R
d
2
P
d
2
0: 28
I&I : On the other hand, following the I&I approach presented
in Subsection 2.2 the linear target system dynamics are chosen
consistently to (8) as
_
J
t
R
t
P
t
with AR
n
, i.e. sn/2, and the target Hamiltonian function
H
t
1
2
T
P
t
;
where P
t
AR
nn
a positive denite matrix. Choosing the target
manifold to be such that the unactuated dynamics are matched to
the target dynamics gives the expression
x
1
x
2
_ _
_ _
_ _
:
The corresponding matching conditions for the I&I approach are in
this case given by
Immersion condition:
J
1
P
1
J
t
R
t
P
t
J
2
P
2
G
2
c J
t
R
t
P
t
:
Similar to the IDA-PBC part, the rst matching condition xes
the target dynamics while multiplying the second matching
equation by the left annihilator G
?
2
, and using the rst matching
equation, yields
G
?
2
J
2
P
2
J
1
P
1
0: 29
Comparing the expressions in (27) and (29) shows that these
equations are identical. The second matching condition of the IDA-
PBC approach given in (28) is absent in the I&I side which is
consistent to the fact that the I&I approach is aiming at asymptotic
matching thus, imposing less constraints as those imposed by the
exact matching of IDA-PBC.
Finally, choosing the I&I control law x; to be the IDA-PBC
controller with J
t
J
1
d
, R
t
R
1
d
will result in the closed-loop system
dened by (26). Note however that any other controller x;
ensuring that lim
t-1
t 0 would result in a closed-loop system
of the form
_
x
1
_
_ _
J
t
R
t
L
0 J
_ _
P
t
x
1
P
_ _
;
with a certain matrix L, since it is known by [14] that any
(asymptotically) stable linear system can always be written in
the pH form with a skew-symmetric J
, a positive semi-denite
(positive denite) R
, of corresponding
dimensions.
5. Discussion and outlook
In the comparative study which formed the main part of this
contribution, a solution of the IDA-PBC energy shaping plus damping
injection problem is assumed to be given for the Acrobot example. In
particular, the coordinate change, which transforms the potential
energy matching PDE into an ODE is used to dene one part of the
immersion mapping in the corresponding I&I problem. The potential
energy of the lower dimensional target system corresponds to the
closed-loop potential energy in IDA-PBC, restricted to the equilibrium
value of the characteristic coordinate. Accordingly, the inertia of the
I&I target system equals the rst element of the IDA-PBC closed-loop
mass matrix.
The on-manifold coordinates can be naturally complemented by
a set of off-manifold coordinates , which together constitute a
regular coordinate change. Writing down the transformed state
differential equations in pH form again taking advantage of the
known solution of the IDA-PBC energy shaping problem provides a
clear structure of the state representation. Based on this formulation
an energy shaping control law can be simply derived such that the
system becomes stable with an equilibrium at the minimum of the
previously shaped energy. The system at this stage has the structure
of two interconnected subsystems, where it has to be mentioned that
both subsystems are not only coupled via the obvious coupling
element j
32
of the interconnection matrix, but already via the mixed
terms in the potential energy. With straightforward computations it
can be checked that the feedback law which provides this convenient
structure is exactly the energy shaping controller from IDA-PBC.
In the nal damping injection step, the passive IDA-PBC output
feedback is applied, which provides damping in the subsystem.
The system gets the structure of two coupled oscillators, one of
them damped. The damping is propagated between both system
parts and nally the system comes to rest at the minimum of its
virtual energy.
As a result of the above study it can be claimed that the structural
similarity (in parts) of the IDA-PBC and the I&I approaches can be
fruitfully exploited. In particular, given a solution of the controller
design problem using the IDA-PBC methodology, this solution has an
equivalent representation in the context of I&I, which has an inter-
esting structure.
At the end of this contribution the attention is directed to an
instrumental question in I&I, namely how to choose the states of
the target system. In [3] the unactuated part of a mechanism is
mentioned as a sensible choice. However, it is also noted that the
choice is not unique and can be driven by the solvability of the
immersion conditions. This is what can be observed with the
solution to the Acrobot problem presented here. While
q
q
2
and
p
p
1
is probably not the most intuitive choice for the target
system states, it allows for a solution of the I&I problem, which is
inspired from and equivalent to the (known) solution of the IDA-
PBC problem.
Another motivation to study the relationship between the two
approaches is based on the recent results in [17] which show that
the I&I approach is able to stabilize a desired equilibrium for the
CartPendulum system having a constant desired inertia matrix
which is known not to be possible using the IDA-PBC approach.
In the last section, some equivalences of IDA-PBC and I&I have
been revealed for linear mechanical systems. The results in Section
4.1 can be a useful local starting point when the corresponding
nonlinear problem for the considered class of systems with
one degree of unacutation is attacked. In Subsection 4.2, direct
equivalences of the IDA-PBC and I&I equations for a more general
class of linear mechanical systems have become obvious.
Future research questions in the presented context are cer-
tainly (a) the solution of the nonlinear controller design problem
for the class of systems with n DOF and unactuated pivot. To this
end, at least an approximate expression of the coordinate change
(13) must be derived, corresponding to Eqs. (17) and (18) in [7],
using computer algebra. Then Eq. (20) can be correspondingly
generalized for the n1 DOF immersion mapping. (b) A further
investigation of systems with different structure, like the Cart-
Pendulum system would be a topic of interest. (c) Recently in [1]
a methodology was proposed to replace the PDEs of IDA-PBC with
algebraic inequalities by constructing an approximate integral
and a dynamic extension as an alternative to solving the PDEs.
Although this idea was rst introduced in the observer design
using the I&I framework, see for example the recent article on
mechanical systems [4] and references therein, it has not been
P. Kotyczka, I. Sarras / European Journal of Control 19 (2013) 445453 452
exploited in the I&I stabilization setting and is certainly to be
investigated. And nally, (d) for the linear case, it would be
desirable to obtain necessary and/or sufcient conditions for I&I
stabilization similarly to the works [13,14] on linear IDA-PBC.
References
[1] J.A. Acosta, A. Astol, On the PDEs arising in IDA-PBC, in: Proceedings of 48th IEEE
CDC/28th Chinese Control Conference, Shanghai, China, 2009, pp. 21322137.
[2] A. Astol, D. Karagiannis, R. Ortega, Nonlinear and Adaptive Control with
Applications, Springer, 2008.
[3] A. Astol, R. Ortega, Immersion and invariance: a new tool for stabilization
and adaptive control of nonlinear systems, IEEE Transactions on Automatic
Control 48 (4) (2003) 590606.
[4] A. Astol, R. Ortega, A. Venkatraman, A globally exponentially convergent
immersion and invariance speed observer for mechanical systems with non-
holonomic constraints, Automatica 46 (1) (2010) 182189.
[5] C.I. Byrnes, A. Isidori, J.C. Willems, Passivity, feedback equivalence, and the
global stabilization of minimumphase nonlinear systems, IEEE Transactions on
Automatic Control 36 (1991) 12281240.
[6] F. Gmez-Estern, A.J. van der Schaft, Physical damping in IDA-PBC controlled
underactuated mechanical systems, European Journal of Control 10 (2004)
451468.
[7] P. Kotyczka, Local linear dynamics assignment in IDA-PBC for underactuated
mechanical systems, in: Proceedings of 50th IEEE CDC/11th ECC, Orlando,
2011, pp. 65346539.
[8] P. Kotyczka, Local linear dynamics assignment in IDA-PBC, Automatica 49
(2013) 10371044.
[9] P. Kotyczka, S. Delgado-Londoo, On a generalized port-Hamiltonian repre-
sentation for the control of damped underactuated mechanical systems, in:
Proceedings of 4th IFAC Workshop on Lagrangian and Hamiltonian Methods
for Non Linear Control, Bertinoro, Italy, 2012, pp. 149154.
[10] P. Kotyczka, I. Sarras, Equivalence of immersion and invariance and IDA-PBC
for the Acrobot, in: Proceedings of 4th IFAC Workshop on Lagrangian and
Hamiltonian Methods for Non Linear Control, Bertinoro, Italy, 2012, pp. 3641.
[11] X. Liu, R. Ortega, H. Su, J. Chu, Adaptive control of nonlinearly parameterized
nonlinear systems, in: Proceedings of American Control Conference, San Luis,
CA, 2009, pp. 1113.
[12] R. Ortega, E. Garca-Canseco, Interconnection and damping assignment
passivity-based control: a survey, European Journal of Control 10 (2004)
432450.
[13] R. Ortega, Z. Liu, H. Su, Control via interconnection and damping assignment of
linear time-invariant systems: a tutorial, International Journal of Control 85
(2012) 603611.
[14] S. Prajna, A.J. van der Schaft, G. Meinsma, An LMI approach to stabilization of
linear port-controlled Hamiltonian systems, Systems and Control Letters 45
(2002) 371385.
[15] I. Sarras, On the stabilization of nonholonomic mechanical systems via
immersion and invariance, in: Proceedings of 18th IFAC World Congress,
Milan, Italy, 2011, pp. 72277232.
[16] I. Sarras, J.A. Acosta, R. Ortega, A.D. Mahindrakar, Constructive immersion and
invariance stabilization for a class of underactuated mechanical systems, in:
Proceedings of 8th IFAC Symposium on Nonlinear Control Systems, Bologna,
Italy, 2010, pp. 108113.
[17] I. Sarras, J.A. Acosta, R. Ortega, A.D. Mahindrakar, Constructive immersion and
invariance stabilization for a class of underactuated mechanical systems,
Automatica 49 (2012) 14421448.
[18] I. Sarras, R. Ortega, E. Panteley, Asymptotic stabilization of nonlinear systems
via sign-indenite damping injection, in: Proceedings of 51st IEEE Conference
on Decision Control, Maui, USA, 2012, pp. 29642969.
[19] M.W. Spong, The swing up control problem for the acrobot, IEEE Control
Systems 15 (1) (1995) 4955.
[20] A.J. van der Schaft, L2-Gain and Passivity Techniques in Nonlinear Control,
Springer-Verlag, London, 2000.
P. Kotyczka, I. Sarras / European Journal of Control 19 (2013) 445453 453