0 оценок0% нашли этот документ полезным (0 голосов)

2 просмотров8 страницMar 19, 2020

© © All Rights Reserved

PDF, TXT или читайте онлайн в Scribd

© All Rights Reserved

0 оценок0% нашли этот документ полезным (0 голосов)

2 просмотров8 страниц© All Rights Reserved

Вы находитесь на странице: 1из 8

Trailer Vehicle

Claudio Altafini, Alberto Speranzon, and Bo Wahlberg

driving along simple paths for a miniaturized vehicle composed of a truck

and a two-axle trailer. The paths chosen are straight lines and arcs of circles.

When reversing, the truck and trailer under examination can be modeled

as an unstable nonlinear system with state and input saturations. The sim-

plified goal of stabilizing along a trajectory (instead of a point) allows us

to consider a system with controllable linearization. Still, the combination

of instability and saturations makes the task impossible with a single con-

troller. In fact, the system cannot be driven backward from all initial states

because of the jack-knife effects between the parts of the multibody vehicle;

it is sometimes necessary to drive forward to enter into a specific region of

attraction. This leads to the use of hybrid controllers. The scheme has been

implemented and successfully used to reverse the radio-controlled vehicle.

Fig. 1. The radio-controlled truck and trailer.

Index Terms—Backward driving, hybrid automata, jack knife, multi-

body wheeled vehicle, state and input saturation.

constraints. The “reduced” control goal of stabilization along a line

(instead of a point) allows us to consider a system with controllable

I. INTRODUCTION

linearization, so that local asymptotic stability can be achieved via Ja-

This paper describes a feedback control scheme used to stabilize cobian linearization. Still, the combination of instability and saturations

the backward motion of the radio-controlled truck and trailer shown results in so-called jack-knife effects on the two relative angles between

in Fig. 1. The miniaturized vehicle is a 1:16 scale of a real commer- the truck and the dolly and between the dolly and the semitrailer. This

cial vehicle and reproduces in detail the geometry of the full-scale one; makes the task of backward driving impossible to solve with a single

it has four axles, actuated front steering, and an actuated second axle controller. The scheme we use here is based on the observation that the

to govern the longitudinal motion. Like the real one, it presents sat- system of equations is homogeneous in one of the inputs (the longi-

urations on the steering angle and on the two relative angles between tudinal velocity v ). Homogeneity here means that the sign of v alone

the bodies. It is equipped with potentiometers and differential encoders discriminates between forward and backward motion. The former, un-

so that full state feedback is possible. Our control task is to drive the like reversing, is open-loop stable, which implies that we can use it in

system backward along a preassigned straight line, avoiding jack-knife order to get close enough to the equilibrium before switching to back-

effects on the angles. ward motion. The scheme is formalized in a switching controller with

There is a moderate amount of literature on backward steering a logic variable that allows switching between the two different modes

control of wheeled multiple vehicles reporting on experimental results (forward and backward), each of them governed by a linear state feed-

achieved with different control techniques and with different kinds back designed via linear quadratic techniques on the Jacobian lineariza-

of vehicles, mainly specially built laboratory mobile robots (see, for tions. Switching in the logic variable occurs when the integral curve of

example, [15], [18], [20], [23], and [30]). Numerous papers treat the closed-loop system hits suitably defined switching surfaces. There

the backing problem with tools spanning from neural network [25], is a certain freedom in the design of the two switching surfaces; the

fuzzy control [11], [16], [30], learning, genetic algorithms, and expert important condition is that they do not touch each other. The criterion

systems [6], [12], [17], [26]. Only a few works make use of more we follow here is that the reversing mode can be activated only when

theoretical tools stemming from the literature on control of kinematic the system is entered inside the region of attraction of the local stabi-

vehicles (overviewed, for example, in [5], [19]), see [7], [18], and [29]. lizing controller. Since the nonlinear system is subject to saturations,

According to such formalism, our system is a general three-trailer, very little can be said analytically about the region of convergence of a

general because of the kingpin hitching between the second axle controller. However, an ellipsoid playing the role of invariant set for the

and the dolly. The off-axle connection is important here because it closed-loop saturated system can be identified by numerical methods.

indicates that the system is not differentially flat [9] nor feedback The second switching surface is meant to “inform” the controller that

linearizable and so simple motion planning techniques, like those backward motion is going unstable and that a reallignement of the rel-

based on algebraic tools [24], cannot be applied. See also [29] for ative angles is needed (accomplished by moving forward). In the nom-

reverse control of a truck with trailer via feedback linearization in the inal system, this second switching surface is never in use; however, it is

simpler case of no off-axle hitching. sometimes useful in practice in order to reject disturbances and sensor

From a system theory point of view, the control problem is quite errors. Furthermore, since the equilibrium occurs along a trajectory in-

challenging: it is an unstable nonlinear system with state and input stead of at a rest point, the (possibly destabilizing) perturbations af-

fecting the system have to be considered as nonvanishing. In synthesis,

the switching can be seen as an extra feedback loop around the two

Manuscript received January 24, 2001; revised June 8, 2001, and September

5, 2001. This paper was recommended for publication by Associate Editor H. different closed-loop modes. The switching surfaces and the switching

Arai and Editor A. De Luca upon evaluation of the reviewers’ comments. logic are designed in such a way that the desired equilibrium inside

C. Altafini is with the Division of Optimization and Systems Theory, the backward motion regime is given the character of global attractor

Royal Institute of Technology, SE-10044 Stockholm, Sweden (e-mail: from all the initial conditions in a prespecified domain. This switching

altafini@math.kth.se).

A. Speranzon and B. Wahlberg are with the Department of Signals, Sensors scheme is described in Section IV. Once the local controllers for the

and Systems, Royal Institute of Technology, SE-10044 Stockholm, Sweden. different regimes of motion are available (linear feedback design in

Publisher Item Identifier S 1042-296X(01)11165-1. presence of saturation for forward/backward motion along lines/arcs is

Authorized licensed use limited to: Institut Teknologi Bandung. Downloaded on February 19,2020 at 10:00:20 UTC from IEEE Xplore. Restrictions apply.

916 IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, VOL. 17, NO. 6, DECEMBER 2001

and jack-knife effects appear on both angles. The other two states do

not present saturations, however, for practical reasons of limited space

when maneuvering, it is convenient to assume the following:

jy3 j y3 = 75 cm j3 j 3 =

rad:

2

Summarizing, the domain of definition of p is

D = (0y3 ; y3 ) 2 (03 ; 3 ) 2 (03 ; 3 ) 2 (02 ; 2 ) :

(8)

Also, the input has a saturation

Fig. 2. The kinematic model of the truck and trailer. jj s = 0:43 rad: (9)

treated in Section III), there is a certain freedom in designing the logic The steering driver tolerates very quick variations, so we do not assume

loop. The choice above corresponds to the most elementary case of hy- any slew rate limitation in the steering signal.

brid automaton (two states, two transition rules) for the logic loop. As

B. Jacobian Linearization Along Trajectories

an example, in Section V, we describe another simple enough scheme

based on a combination of three different finite states. Both logic de- The system (6) is homogeneous in the longitudinal input v . Fixing

signs were implemented and successfully used to reverse the real ve- v as a given nonnull function means having a drift component, which

hicle. A few experimental tests are described in Section VI. gives a nonvanishing term to the differential equations of the system.

The steering angle can be used to give asymptotic stability to the

II. KINEMATIC EQUATIONS AND LINEARIZATION system along a trajectory. The trajectories which admit a constant equi-

librium point in this way are those corresponding to straight lines or

A. Kinematic Model arcs of circles. The first type of equilibrium involves four of the five

Call (x3 ; y3 ) the Cartesian coordinates of the midpoint of the rear- p

states of (1)–(5), for example, the vector , while for the circular trajec-

most axle, 3 its absolute orientation angle, 3 and 2 the relative ori- p

tories only the relative posture = [3 ; 2 ]T has a constant equilib-

entation angles, respectively, between the rearmost trailer body and the rium point in the system (6) (or a different basis, like that corresponding

dolly and between the dolly and the truck body. L3 , L2 , M1 , and L1 to a Frenet frame must be chosen, see [5] and [27] for details).

are the lengths of the different parts of the body as indicated in Fig. 2. 1) Straight Line Linearization: The equilibrium point of is the p

The inputs are the steering angle and the longitudinal velocity at the p

origin e = 0 and it corresponds to a nominal value of the steering

second axle v . The differential equations describing the kinematics are input e = 0. The linearized system is

p_ = v (Ap + B) (10)

L1

where

M

y_3 =v cos 3 cos 2 1 + 1 tan 2 tan sin 3 (2)

L1 0 1 0 0 0

0 0 1 0 0

sin 3 cos M

2 1 + 1 tan tan A= L B=

_3 =v

L3 L1 2 (3) 0 0 0 L1 L

1 0 LML : (11)

0 0 0 0 L1 L +M

_3 =v cos 2 1 tan 2 0 M1 tan LL

L2 L1 2) Linearization Along a Circular Trajectory: Consider the sub-

0 sinL33 1 + M 1 tan tan

2 (4)

system of (6) relative to p

L1

p_ = v A(p) + B(p; ) :

_2 =v tan 0 sin 2 + M1 cos 2 tan :

(12)

(5)

L1 L2 L1 L2 Proposition 1: The equilibrium point of (12) corresponding to a

Call p= [y3 3 3 2 ]T the configuration state obtained by ne- given steering angle e is

glecting the longitudinal component x3 . In a compact way, the state

2 = arctan M1 + arctan L2 (13)

equations are written as r1 r2

p_ = v (A(p) + B(p; )) : (6) 3 = arctan r3

L3

(14)

backward motion. The entire state is measured via two potentiometers where r1 = L1 =tan e , r2 = r12 + M12 0 L22 , and r3 = r22 0 L32

on the relative angles 2 and 3 and a pair of optical encoders on the are the radii of the circular trajectories followed by the midpoints P1 ,

two wheels of the rearmost axle. P2 , and P3 of the axles (see Fig. 3).

1) State and Input Saturations: Both the relative angles 2 and 3 Proof: At the steady state, with nominal steering angle e , all

present hard constraints the axles follow concentric circular trajectories. Look at Fig. 3. All

the calculations are straightforward from trigonometry, starting from

j2 j 2 = 0:6 rad; j3 j 3 = 1:3 rad: a fixed e .

p

(7)

The linearization of (12) around e = [3 2 ]T and e is now

These limitations are due to the front and rear body touching each given by

other and to the dolly touching the wheels. They are particularly im-

portant since for back-up maneuvers the equilibrium point is unstable p_ = v A (p 0 p e) + B ( 0 e ) (15)

Authorized licensed use limited to: Institut Teknologi Bandung. Downloaded on February 19,2020 at 10:00:20 UTC from IEEE Xplore. Restrictions apply.

IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, VOL. 17, NO. 6, DECEMBER 2001 917

(a) (b)

where

Fig. 4. (a) The succesful initial conditions and the fitted ellipsoid E^. (b) The

cos 2 cos 3 cos 2

+ f(

pe ) switching surfaces (in ).

A = L3 L2

0 0 cosL 2

2 M1 tan tan

1+

L1 2 e initial conditions containing the origin are positively invariant sets [2]

for the closed-loop linear system. Such ellipsoids are level surfaces of

f(pe ) = sin 2 sin 3 + M1 sin 2

0 cos 2L3sin 3 tan e the quadratic Lyapunov function VB = pT PB p.

L3 L1 L2 2) Qualitative Analysis of the Basin of Attraction: It is in general

and

0M cos 2 sin 2 sin 3 difficult to draw conclusions on the invariance properties of the flow of

1 + tan2 e

1

+

B = L 1 L2 L3 :

a nonlinear system. If in addition one takes into account the state and

1

1+

M1

cos 2 1 + tan2 e

input constraints (7)–(9), then an analytic description becomes almost

impossible [10]. Therefore, in order to obtain estimates of the region of

L1 L2 attraction of the linear controller = 0KB p and of the contractivity

of the resulting integral curves, we rely on the numerical simulation of

III. LOCAL CONTROLLERS FOR BACKWARD AND FORWARD MOTION the closed-loop behavior of the original nonlinear system (6), paired

with the linear controller KB

In this section, we describe the local controllers to be used in the

different regimes of motion: drive forward or backward and linearize p_ = FB (p) = v A(p) + B(p; 0KB p) : (17)

along a straight line or an arc of circle.

In order to obtain a graphical representation of the results, in the fol-

A. Reversing Along a Line for the Jacobian Linearization lowing we neglect the y3 component of the state space, which is by far

Assume v is a given negative constant. the less critical one with the LQ controller in use.

1) The Linear Quadratic Controller: Consider the straight line The cloud of initial conditions that represents the region of attraction

backing case. The linearization (10) is open-loop unstable: the closely resembles an ellipsoid in p ^ = [3 3 2 ]T space. The fitting of

characteristic polynomial of the uncontrolled system is an ellipsoid E^ strictly contained in the set of succesfull initial conditions

can be done by direct investigation, see Fig. 4(a). The principal axes

det (sI 0 vA) = s2 s+ v s+ v : (16) q^ = [q1 q2 q3 ]T of the ellipsoid are related to p^ by an orthogonal

L2 L3 transformation

Since (10) is controllable, the origin of the nonlinear system (6) can p^ = R^E q^ 2 SO(3):

R^ E

be made an asymptotically stable equilibrium by linear state feedback.

We treat it as a linear quadratic optimization problem and in the weight Calling " , " , and " the semiaxes of E^, the ellipsoid is given by the

1 2 3

assignment we use the rule of thumb of trying to have decreasing algebraic equation

closed-loop bandwidths when moving from the inner loop to the outer

E^ = q" q22 + q32 = 1 :

2

one in a nested loopshaping design. In fact, the relative displacement 1

+ (18)

y3 comes after a cascade of two integrators from the relative angles as 2

1 "22 "32

can be seen on the linearization (11). It turns out that such a heuristic

Taking into account also the y3 component of the initial conditions, the

ellipsoid E 2 4 is given by

reasoning is very important in the practical implementation in order

to deal with the saturations.

Calling KB = BPB the gain proposed by the solution of the LQ

E = q" q22 + q32 + q42 = 1

2

1

problem, where PB is the solution of the Lyapunov equation, the 2

+

"22 "32 "42

(19)

closed-loop linear system p_ = v (A 0 BKB )p has two real and two

1

complex conjugated eigenvalues; all three real parts are distinct. This is with "4 "i , i = 1; 2; 3. In D , the difference with respect to Fig. 4(a)

enough to say that the unconstrained linear closed-loop system forms can hardly be appreciated.

a contraction map for positive times, maxt0 ke(A0BK )t k = 1, From Fig. 4(a), we draw the qualitative conclusion that for the

where k 1 k is the operator norm. In other words, the ellipsoids of closed-loop nonlinear system E^ is a positively invariant set.

Authorized licensed use limited to: Institut Teknologi Bandung. Downloaded on February 19,2020 at 10:00:20 UTC from IEEE Xplore. Restrictions apply.

918 IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, VOL. 17, NO. 6, DECEMBER 2001

When v > 0, in (16) the two unstable poles move on the open left

half of the complex plane. Considering the subsystem means ne- p^ jy (t) 0 y j

3 3

t

sin e0j (A0BK )j

^ ^ ^

3 d

glecting one of the two poles in the origin. The origin of is asymp- p^ 0

totically stabilizable by linear feedback and this time convergence for i.e.,

(

the nonlinear system in 0=2; =2 2 01.3; 1.3 2 00.6; 0.6) ( ) ( ) (A0BK )j 0 cos

is a less critical problem. The reason for neglecting y3 when moving cos e0j ^ ^ ^

j1y j

3 3

(A^ 0 B^ K^ F )

3

3 min

natural time constant several orders of magnitude higher when com-

pared to the other states.

Assume for example v = cos ^0 cos^ ^ :3 3

^ ^

1. Extracting from (11) the three-dimen- (A 0 B KF )

3 min

sional (3-D) system (A; B ), linearization around the origin of (3)–(5),

^

it is possible to choose a gain KF such that the closed-loop system

p^_ = v(A^ B^ K^ )^p

0 F is asymptotically stable and has three distinct real Since ts is inversely proportional to the smallest eigenvalue of the

modes. The practical rule here for the selection of the eigenvalues is to closed loop, the more the slowest mode (i.e., 3 ) is “speeded up” by

try to have all three closed-loop poles of the same order of magnitude. ^ p^

KF , the sooner enters inside E . Obviously, moving eigenvalues

The unavoidable input saturation will not destroy stability anyway. As- deeper in the left half of the complex plane implies more problems

suming no control on y3 , the variation in y3 due to the forward closed with the input saturation.

loop is hard to compute explicitly, but a worst-case analysis can give The bound (20) can be used to characterize the region of attraction in

an upper bound on it. ^

D for an ellipsoid like E as attractor set for the forward motion case.

Proposition 2: Assume the task of the forward controller Neglecting the input saturation, it basically coincides with D except

^ ^ ^

KF is to steer the system (A; B ) inside the ellipsoid E . The for a cut in the y3 direction.

variation on y3 starting from any admissible initial condition Corollary 1: In D , the region of attraction to an ellipsoid E of the ^

p =[

0 ]

y3 3 3 2 T is bounded by ^

controller KF is given by

3 3 3

2 (0 ; ) 2 (0 ; )

(20)

(A 0 B KF )

3

3 min 3 3 2 2 (23)

= +(

with j1 M1 =L1)tan tan s j and = p"^ .

2 3 max

where

= j1 + (M =L )tan tan s j, then

Proof: From (2), if 1 1 2 1 + ML tan tan s cos 0 cos

1y () =

2 3 3

:

jy_ j jsin (t)j : (A^ 0 B^ K^ F )

3

3 (21)

3 3 min

We need to compute a bound on the value of (t) and to quantify Such a restriction is not really drastic; in numbers, with our choice of

the settling time ts of the 3 mode from 3 to its entering into the

3

^

KF , it amounts to about 30 cm. Furthermore, one can add that y3 1

ellipsoid E . Since for the forward motion stability is not a problem is a worst-case bound and that the choice of y3 75 cm is purely =

even in presence of saturations, deriving a bound on the settling time arbitrary.

of 3 , we consider only the linearized system. The stable closed-loop

system p^_ = (A^ B^ K^ )^p

0 F has three distinct real modes. Its integral C. Reversing Along an Arc of Circle (Alignment Control)

p^ ( ) =

curves t p^

e(A^ 0B^ K^ )t 0 can be bounded as follows: p p^

If instead of or only the backward stabilization of the rela-

p = [ ]

3 2 is required, then a linearization like (15) can

e0j (A0BK )jI t p^ kp^ (t)k

tive angles

^ ^ ^

0

2

2 be used and the desired equilibrium point e can be indifferently the p

kp^ (t)k e0j (A0BK )jI t p^ ^ ^ ^

origin or a pair of constant angles like in (13) and (14). In this case,

2 0 the truck and trailer will be stabilized along a circular trajectory as

2

computed in Proposition 1. We can consider the circular trajectories

with I3 the 3-D identity matrix. Since there are no multiple closed-loop corresponding to, e.g., je j e =( )

4/5 s (see Proposition 1 for

eigenvalues, also the 3 mode alone is bounded by the slowest mode of the corresponding radii), for which the equilibrium point is compatible

the closed loop with the system constraints and a certain margin is left around it before

reaching the steering actuator saturation. The controller 0KB = p

j (t)j e0j (A0BK )jt :

3

^ ^ ^

3 can be computed like in Section III-A by another LQ problem. The re-

gion of attraction of the equilibrium is an ellipse in the plane, for p

In order to compute ts , we need to have a value of 3 t () each value of e in je j e . If a Frenet frame is chosen on the arc

^ : p^ ^ p^ =

which is certainly inside the ellipsoid E f T PE 2 g . of circle [27], then the encoders information can be used to attain local

( ^ ) p^ = min

The circle min PE k k2 2

( ) p^ = i=1; 2; 3 i k k2 p p

stabilization of (and not just of ) along the desired arc.

^

(do not count 4 ) is contained inside E , therefore calling

^

min = min ( ) = ( max

i=1;2;3 i ( )) = ( ^ )

1= i=1; 2; 3 "i

p 1="max , IV. SWITCHING CONTROLLER

= ^ = ^

for the 3 variable alone 3 =min "max does the job. The region of attraction of the backward controller is only a subset

The desired bound on the settling time is then

E^ of the entire domain D. Starting from outside E^, it is necessary to

first drive forward, for example, with a controller like KF until the ^

ln j j ^

system enters inside E and only then switch to backward motion. When

ts = :

A^ 0 B^ K^ F

(22) reversing, the main manifestation of a destabilizing perturbation is a

min jack-knife effect on the relative angles. Just like on a full-scale truck

Authorized licensed use limited to: Institut Teknologi Bandung. Downloaded on February 19,2020 at 10:00:20 UTC from IEEE Xplore. Restrictions apply.

IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, VOL. 17, NO. 6, DECEMBER 2001 919

and trailer vehicle, the only way to recover from such a situation is

to move forward and try again. So, in order to guarantee stability of

the backward motion in D and not only inside the ellipsoid E for the^

nominal model, and in order to cope with the perturbations, one single

controller is not enough. The switching variable between the two con-

trollers is the longitudinal velocity v . For example, we assume that

v 2 f01; 1g 1 I . The backward regime is selected by v 01 and

+ = =

the forward one by v =+ 1. Since the longitudinal input v is a con-

trol input, if we assume that v 2 I then v becomes a controlled logic

variable. Moreover, if the selection of the logic value of v is made ac- Fig. 5. The hybrid automaton associated with the control logic.

cording to a partition of the state space, the overall system with multiple

controllers becomes a feedback controlled system. This is feasible in asymptotically coverges to the origin in backward motion from any

our case since we have on-line full state information available. inital condition in D .

Proof: From the analysis of Section III and looking at the

A. Selection of the Two Switching Surfaces switching rules of Fig. 5, the following order relation is the only

Two are the switching surfaces that delimit the partition of the state possible one for the system:

space and their crossing in a prescribed direction by the flow of the C+ ! C ! C0 :

system induces a sign change in v . This, in its turn, causes the inver- v = +1 v = +1 v = 01

sion of the direction of motion and induces the activation of the cor-

In fact, from any p0 2 D , the controller KB steers the system inside

^

responding linear state feedback controller. These switching surfaces

(call them S0+ and S+0 ) have to be chosen such that they give to the S+0 and S+0 is a positively invariant set for the controller KF . In the

=

point p 0 of the backward motion the character of global attractor (in ^

two regions C+ and C , the controller KB stabilizes only p. Once S+0

D ). Since in both regimes the origin is the closed-loop local asymptoti- is fixed, so is D and in D the corresponding excursion on y3 cannot

cally stable equilibrium point, we choose both S0+ and S+0 as closed exit D by Corollary 1.

hypersurfaces in 4 containing the origin in their interior. So, for the nominal system, the switching surface S0+ is never in

1) The Switching Surface From Forward to Backward Motion: use. Due to the unstable equilibrium point, the effect of perturbations

S+0 : From Section III-A, S+0 has to be contained inside E . The is critical in C0 . Since the whole stabilization developed here occurs

simplest choice is to consider S+0 = E for some such that along a trajectory, we cannot expect the perturbations affecting the

( )

1/2 < < 1. The tradeoff is the following: system to be vanishing at the equilibrium point of (17). For example,

• if S+0 is large ( ! 1), the system will be sensitive to distur- the two potentiometers for the measure of the relative angles 2 and 3

bances and more easily destabilized by perturbations (meaning introduce an error of 64 also at steady state. Similarly, all the distur-

more switches can occur); bances affecting the real system can be considered nonvanishing. When

• if S+0 is small ( ! 1/2), the forward regime will be very long, a perturbation is large enough to pull the state out of E the system

which is often unacceptable for practical implementations. diverges. Trying to quantify the amplitude of the destabilizing pertu-

Ellipsoids smaller that E1=2 are also not recommendable for other rea-

bations and, consequently, trying to inferr total stability for a class of

bounded perturbations is very hard in our situation because of the input

sons, like the possibility of being completely “jumped over” in case of

saturation involved. The destabilized system keeps driving backward

until it hits the S0+ surface. After that, it inverts the direction of mo-

relevant sensor error.

tion and tries again to converge inside S+0 with the forward controller.

2) The Switching Surface From Backward to Forward Motion:

S0+ : Such a switching surface has to “tell” the system that backing In this part, stability is not undermined by the pertubations because the

is not going well and the trailers need to be realigned. The choice is

quite flexible; the only constraint is that S+0 , S0+ , and the sides of

system is open-loop stable, but perhaps the convergence rate (and there-

D must not intersect. In particular, the set distance between S+0 and 1

fore the settling time ts and y3 ) can be.

As said above, if the S0+ and S+0 do not touch each other, degen-

S0+ gives the hysteresis between the two regimes. If this distance is erate switching phenomena (normally referred to as Zeno chattering)

positive, problems like chattering will be avoided. One simple choice

for S0+ is for example to use a cube in 4 which is a rescaling of D

do not occur. Furthermore, also the different pole placement philosophy

by a factor less than 1.

^

adopted in the two controllers KF and KB (in one the critical mode,

the 3 mode, is slow, in the other it is instead faster) is meant to avoid

a chattering type of behavior (like keep moving the system back and

B. Control Logic for v

forth between the same points on S0+ and S+0 ) which can happen if

D is divided into three nonintersecting regions: the two closed-loops resemble each other.

• C0 = region inside S+0 where v = 01;

• C = region between S0+ and S+0 where v can be either +1 or V. ANOTHER SWITCHING SCHEME

01; S

At the switching C 000! C0 , if j3 j is large, instead of the controller

• C+ = region outside S0+ (C+ = D \(C[C0 )? ) where v = +1. K^ F one can think of using a different strategy, based on realigning only

Changes on v occur only at crossing with the rules of the finite state the 2 and 3 angles leaving 3 free and then recover 3 if needed by

machine of Fig. 5. reversing along an arc of circle with K B . Neglecting 3 in the forward

motion means reducing considerably its duration, as the 3 mode is

C. Convergence for the Nominal and Perturbed System the slowest of the three. This strategy allows to greatly increase the

For the nominal system, we can assert the following. convergence rate when y3 is large and y3 1 3 > 0. A typical situation

Theorem 1: Under the assumptions of invariance of E , the system is shown in Fig. 6. The modes needed for its implementation are:

^

(6) with the two controllers KF and KB , respectively, for the cases 1) forward control of 2 and 3 ;

=

v 01 and v =+ 1 and with the feedback rule of Fig. 5 for v 2 I 2) reverse along an arc of circle;

Authorized licensed use limited to: Institut Teknologi Bandung. Downloaded on February 19,2020 at 10:00:20 UTC from IEEE Xplore. Restrictions apply.

920 IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, VOL. 17, NO. 6, DECEMBER 2001

Fig. 8. Experiment # 1: Sketch of the motion of the vehicle. The dotted line

represents the path followed by the (x ; y ) point.

Assume v = 01, 3 > 0 and y3 > 0 (Fig. 6). Call p ~ the state on the

switching surface S+0 and x ~3 the corresponding coordinate on the ref-

erence line. The arc of circle tangent to both the line through the point

(x3 ; y3 ) of orientation ~3 and to the straight line y3 = 0 is unique,

as it has a radius r~3 = y~3 =(sin ~3 tan(~3 =2)) and center of rotation

x3 0 y~3 (1 + cos ~3 )=sin ~3 ; y~3 =(sin ~3 tan(~3 =2))). Since v = 01,

(~

we take the length of the arc from P3 to the axis y3 = 0, r~3 1 ~3 , as

duration of the reversing along arc of circle mode i.e., as time between

the switch v : +1 ! 01 and the switch from reversing along arc of

circle to reversing along straight line. The S0+ switching surface re-

mains in use while for the switching v : +1 ! 01 we consider only

the 2 , 3 angles: S+0 = 22 ="22 + 32 ="32 = 1 with "2 and "3 of

the same order of magnitude. The forward controller then is a reduced

version of K ^ F with only two nonnull gains. The reduced system (12)

with v = +1, p e = [0 0]T is asymptotically stabilizable and the du-

ration of the forward motion between S0+ and S+0 is normally quite

short compared to that of Section IV. Calling t~ the time at which the

flow of the system hits S+0 , the complete hybrid automaton is depicted

in Fig. 7.

VI. PRACTICAL IMPLEMENTATION AND EXPERIMENTAL RESULTS Fig. 9. Experiment # 1: (a)Relative angles and and (b) steering input .

For the truck and trailer shown in Fig. 1, the controller was imple-

mented using a commercial version of PC/104. It was written in C-lan- and first drives forward in order to realign itself, then reverses along

guage and used at a frequency of about 10 Hz since the velocity of the the reference line. Notice that since the 3 mode is slower than those

system was very low. Figs. 8 and 9 present the result of a simple real of the relative angles, most of the forward motion is needed to get 3

maneuver. The switching scheme used is the two-state automaton de- inside the ellipsoid S+0 . It is instructive to compare [Fig. 9(a)] the ac-

scribed in Section IV. The vehicle starts with saturated relative angles tivity of the feedback input when the open loop system is stable (upper

Authorized licensed use limited to: Institut Teknologi Bandung. Downloaded on February 19,2020 at 10:00:20 UTC from IEEE Xplore. Restrictions apply.

IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, VOL. 17, NO. 6, DECEMBER 2001 921

VII. CONCLUSION

paper we have tried to study and “integrate” different aspects: its the-

oretical conceptualization, its solution, and its practical implementa-

tion. From a theoretical viewpoint, stabilization in the presence of si-

multaneous instability and hard constraints is a very challenging and

largely open field of research. From a practical perspective, a difficult

real problem is studied and a solution is proposed which automates the

practical way of solving it, by the experienced drivers.

REFERENCES

[1] B. Anderson and J. B. Moore, Optimal Control. Linear Quadratic

Methods. Englewood Cliffs, NJ: Prentice-Hall, 1989.

[2] F. Blanchini, “Set invariance in control (survey paper),” in Automatica,

1999, vol. 35, pp. 1747–1767.

[3] P. Bolzern, R. M. De Santis, A. Locatelli, and D. Masiocchi, “Path-

tracking for articulated vehicles with off-axle hitching,” IEEE Trans.

Contr. Syst. Technol., vol. 6, pp. 515–523, 1998.

[4] F. Bullo and R. Murray, “Experimental comparison of trajectory trackers

for a car with trailers,” in IFAC World Congress, San Francisco, CA,

1996.

[5] C. C. de Wit, “Trends in mobile robot and vehicle control,” in Control

^ space and the ellipsoid E^. The points

Fig. 10. The trajectory of Fig. 8 in the p

+ represent initial conditions from which the backward controller alone fails to

Problems in Robotics. Lecture Notes in Control and Information Sci-

ences, K. P. Valavanis and B. Siciliano, Eds. Berlin: Springer-Verlag,

converge on experimental trials. 1998.

[6] K. Chellapilla, “Evolving nonlinear controllers for backing up a

truck-and-trailer using evolutionary programming,” in Evolutionary

Programming VII 7th Int. Conf., 1998, pp. 417–426.

[7] A. W. Divelbiss and J. Wen, “Nonholonomic path planning with in-

equality constraints,” in Proc. 1994 IEEE Int. Conf. on Robotics and

Automation, 1994, pp. 52–57.

[8] J. De Dona, “Input Constrained Linear Control,” Ph.D. , Department

of Electrical and Computer Engineering, The University of Newcastle,

Australia, 2000.

[9] M. Fliess, J. Levine, P. Martin, and P. Rouchon, “Flatness and defect of

nonlinear systems: Introductory theory and examples,” Int. J. Control,

vol. 61, no. 6, pp. 1327–1361, 1995.

[10] E. G. Gilbert and K. T. Tan, “Linear systems with state and control con-

straints: The theory and application of maximal output admissible set,”

IEEE Trans. Automat. Contr., vol. 36, pp. 1008–1020, 1991.

[11] S. K. Halgamuge, T. A. Runkler, and M. Glesner, “A hierarchical hybrid

fuzzy controller for realtime reverse driving support of vehilces with

long trailers,” in Proc. Third IEEE Conf. on Fuzzy Systems, 1994, pp.

1207–1210.

[12] D. F. Hougen, M. Gini, and J. Slagle, “Rapid unsupervised connectionist

learning for backing a robot with two trailers,” in Proc. 1997 IEEE

Int. Conf. on Robotics and Automation, Albuquerque, NM, 1997, pp.

2950–2955.

[13] R. E. Jenkins and B. P. Yuhas, “A simplified neural network solution

through problem decomposition: The case of the truck backer-upper,”

Fig. 11. Experiment # 2: Sketch of the motion of the vehicle. The dotted line IEEE Trans. Neural Networks, vol. 4, pp. 718–720, 1993.

represents (x ; y ). [14] R. E. Kalman, “When is a linear control system optimal?,” Trans. ASME,

pp. 51–60, Mar. 1964.

[15] D. H. Kim and J. H. Oh, “Experiments of backward tracking control for

plot) and when it is unstable (lower plot). The experimental validity of trailer systems,” in Proc. 1999 IEEE Int. Conf. on Robotics and Automa-

the heuristic ellipsoid E was verified by several trials. For the back- tion, Detroit, MI, 1999, pp. 19–22.

ward controller KB alone, some of the unsucessful initial conditions [16] G. S. Kong and B. Kosko, “Adaptive fuzzy systems for backing up a

belonging to the quadrant (3 = 0; 2 0; 3 0) are shown in

truck and trailer,” IEEE Trans. Neural Networks, vol. 3, pp. 211–223,

Fig. 10: they all lay outside E . In the same picture the switched integral

1992.

[17] J. R. Koza, “A genetic approach to finding a controller to back up

curve used in the experiment of Fig. 8 is also reported. In this exper- a tractor-trailer truck,” in Proc. 1992 American Control , 1992, pp.

iment we use S+0 = E4=5 . As expected, after entering inside E , the 2307–11.

[18] F. Lamiraux and J. P. Laumond, “A practical approach to feedback con-

trajectory does not escape anymore. The second experiment (Fig. 11)

trol for a mobile robot with trailer,” in Proc. 1998 IEEE Int. Conf. on

shows part of the switching scheme described in Section V. Starting Robotics and Automation, Leuven, Belgium, 1998, pp. 3291–3296.

from a jack-knife position, the vehicle moves forward, realigns the rel- [19] J. P. Laumond, Robot Motion Planning and Control, Lecture Notes

ative angles and then reverses along an arc of circle until hitting the in Control and Information Sciences, J. P. Laumond, Ed. Berlin:

desired reference line. At this point the third state (reversing along the Springer-Verlag, 1998.

[20] W. Li, T. Tsubouchi, and S. Yuta, “On a manipulative difficulty of a

straight line) takes over (not shown in the experiment). Notice how the mobile robot with multiple trailers for pushing and towing,” in Proc.

forward part of the motion is shorter than in the first experiment as only 1999 IEEE Int. Conf. on Robotics and Automation, Detroit, MI, 1999,

the two relative angles are considered in the realignment. pp. 13–18.

Authorized licensed use limited to: Institut Teknologi Bandung. Downloaded on February 19,2020 at 10:00:20 UTC from IEEE Xplore. Restrictions apply.

922 IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, VOL. 17, NO. 6, DECEMBER 2001

[21] D. Liberzon and A. S. Morse, “Basic problems in stability and design of large distances, ignoring the specific issue of how to collect the

switched systems,” IEEE Contr. Syst. Mag., vol. 19, pp. 59–70, 1999. agents back into one unit. Docking the agents, however, is essential

[22] W. H. Ma and H. Peng, “Worst-case maneuvers for the roll-over for recovery, recharging, and sample processing. It also allows robots

and jackknife of articulated vehicles,” in Proc. Amer. Control Conf.,

Philadelphia, PA, 1998, pp. 2263–2267. to physically cooperate or reconfigure so that they may connect

[23] Y. Nakamura, H. Ezaki, Y. Tan, and W. Chung, “Design of steering and achieve a mobility or mission advantage. The use of human

mechanism and control of nonholonomic trailer systems,” in Proc. 2000 teleoperation for final docking maneuvers is an undesirable solution

IEEE Int. Conf. on Robotics and Automation, San Francisco, CA, 2000, because it requires an open (and therefore, potentially detectable and

pp. 247–254.

[24] M. van Nieuwstadt and R. Murray, “Real time trajectory generation for

corruptible) communication channel, and it may lead to cognitive

differentially flat systems,” Int. J. Robust and Nonlinear Control, vol. 8, strain for the operator.

pp. 995–1020, 1998. Docking is especially important for a class of cascaded heteroge-

[25] D. Nguyen and B. Widrow, “The truck backer-upper: An example of neous robots called marsupials, where a “mother” agent transports

self-learning in neural networks,” in Proc. SPIE, vol. 1293, 1990, pp.

one or more “daughter” micro-rover agents to a mission site [1],

596–602.

[26] R. Parra-Loera and D. J. Corelis, “Expert system controller for [2]. Without docking, the mother agent cannot recover the daughter,

backing-up a truck-trailer system in a constrained space,” in Proc. 37th service it, or transport it home or to a new task site.

Midwest Symp. on Circuits and Systems, 1995, pp. 1357–1361. Docking is a challenging problem for several reasons, most notably

[27] C. Samson, “Control of chained systems: Application to path-following the need to accurately extract both relative position and orientation [3].

and time-varying point stabilization of mobile robots,” IEEE Trans. Au-

tomat. Contr., vol. 40, pp. 64–77, 1995. This article reports on a vision-based docking behavior that:

n

[28] O. J. Sørdalen, “Conversion of the kinematics of a car with trailers into 1) offers low-order complexity (is linear in the number of image

chained form,” in Proc. IEEE Int. Conf. on Robotics and Automation, pixels for the average case);

Atlanta, GA, 1993, pp. 382–387.

[29] M. Sampei, T. Tamura, T. Kobayashi, and N. Shibui, “Arbitrary path

2) uses a color segmentation algorithm that is lighting insensitive

tracking control of articulated vehicles using nonlinear control theory,” due to its use of the spherical coordinate transform [4] and adap-

IEEE Trans. Contr. Syst. Technol., vol. 3, pp. 125–131, 1995. tive segmentation [5];

[30] K. Tanaka, T. Taniguchi, and H. O. Wang, “Trajectory control of an ar- 3) requires no explicit communication between docker and dockee;

ticulated vehicle with triple trailers,” in Proc. 1999 IEEE Int. Conf. on 4) is reactive, and therefore compatible with reactive and hybrid

Control Applications, 1999, pp. 1673–1678.

[31] D. Tilbury, R. Murray, and S. Sastry, “Trajectory generation for the deliberative/reactive architectures, plus embedded systems;

N -trailer problem using Goursat normal form,” IEEE Trans. Automat. 5) has been shown to be faster than teleoperation with a similar

Contr., vol. 40, pp. 802–819, 1995. success rate.

The pattern of actions comprising how a single agent can dock was

originally explored by Arkin and Murphy [3] in 1990, producing a

Low-Order-Complexity Vision-Based Docking reactive, potential-fields style of behavior for docking. Much of the

subsequent docking research [6]–[10] either duplicates these results

Brian W. Minten, Robin R. Murphy, Jeff Hyams, and Mark Micire or produces more computationally complex methods. Spofford et

al. [2] also deal with docking, but are primarily concerned with

Abstract—This paper reports on a reactive docking behavior which uses agents’ vision abilities, rather than integration with motion control.

a vision algorithm that grows linearly with the number of image pixels. Our current work is an extension of our earlier contributions to

The docking robot imprints (initializes) on a two-colored docking fiducial [3]. The original approach used combinations of potential fields

upon departing from the dock, then uses region statistics to adapt the color encapsulated in a motor schema to dynamically control docking

segmentation in changing lighting conditions. The docking behavior was

without explicit path planning. The robot’s motion is computed

implemented on a marsupial team of robots, where a daughter micro-rover

had to reenter the mother robot from an approach zone with a 2-m radius instantaneously as a function of moving to the correct orientation

and 140 angular width with a tolerance of 5 and 2 cm. Testing during (tangential to the dock) and relative depth (attraction). This motor

outdoor conditions (noon, dusk) and challenging indoor scenarios (flashing schema had low computational complexity, making it suitable for a

lights) showed that using adaptation and imprinting was more robust than daughter, as long as a low-order-complexity vision algorithm could

using imprinting alone.

be found. In the 1990 work, the pose of the robot relative to

Index Terms—Artificial intelligence, control, docking, multi-agents, vi- the dock was extracted from a black and white image of a large

sion-based robotics. square landmark using the inverse perspective transform.

The work presented in this paper differs from that described above

I. INTRODUCTION primarily in the visual component. The vision system in this work is

color-based, and it uses the change in the number of pixels between

Much of the initial focus with multi-agent teams has been on two colored regions to guide the robot.

strategies for the deployment and cascaded control of members over

Manuscript received July 24, 2000; revised May 7, 2001. This paper was III. APPROACH

recommended for publication by Associate Editor J. Leonard and Editor S.

Hutchinson upon evaluation of the reviewers’ comments. This work was sup- The general approach to the docking behavior follows [3], which

ported by DARPA through award F04701-99-C-0317. divided the area around a dock into four subareas as shown in Fig. 1.

The authors are with the Department of Computer Science and En- It is assumed that the daughter will move to the mother using a

gineering, University of South Florida, Tampa, FL 33620 USA (e-mail:

minten@csee.usf.edu; murphy@csee.usf.edu; hyams@csee.usf.edu; mmi- GPS location or other a priori location. The subarea over which the

cire@csee.usf.edu). daughter is moving toward the mother but cannot perceive her is

Publisher Item Identifier S 1042-296X(01)10914-6. called the Ballistic Region, because the daughter is operating under a

Authorized licensed use limited to: Institut Teknologi Bandung. Downloaded on February 19,2020 at 10:00:20 UTC from IEEE Xplore. Restrictions apply.