Time Optimal Return of A Dynamic Object

∗
Time Optimal Return of a Dynamic Object

arXiv:0811.3016v1 [math.OC] 18 Nov 2008
†
Aleksandr Koshelev
Abstract—We solve the problem concern- of motion. The unknown parameters and the op-
ing a time optimal return of a particle with timal time are numerically determined by Newton’s
a prescribed velocity to the origin by apply- method from the maximum-principle boundary con-
ing a magnitude-bounded force. The equa- ditions an arbitrary magnitude and direction of the
tions of controlled motion are derived and initial velocity. Control regimes with coinciding ini-
explicitly integrated, and the optimal open- tial and final positions (or velocities) can be used to
loop control and optimal time are analyzed construct quasi-optimal global motion.
depending on the parameters of the problem.
The qualitative behavior of the solution is es- II. Formulation of the problem
tablished, and the solution is compared with
other regimes of motion. The results are of The control system under consideration, the ter-
interest for control theory and its applications minal conditions, and the functional are governed by
to controlled flight mechanics. the equations
Key words: Time optimal control, feedback. ẋ = v, x(0) = x0 , x(tf ) = 0 (1)

AMS Subject Classification: 49K15; 70B05.
v̇ = u, v(0) = v0 , v(tf ) = vf = (−1, 0)
I. Introduction |u| ≤ 1, tf → min x, v ∈ R2
u
In the case of general initial and final conditions The more general case of arbitrary values specified
(synthesis), the solution to a multidimensional prob- for the constant mass m, the control force u con-
lem faces considerable computational difficulties. Al- strained by |u| ≤ u0 , the terminal velocity vf , the
though the equations of controlled motion can be an- coinciding initial x0 and final xf positions, and the
alytically integrated in explicit form, the maximum- initial time t0 can be reduced to problem (1) by us-
principle boundary value problem can be solved nu- ing suitable substitutions. System (1) is rotationally
merically only for modified special formulations in symmetric. In the general case where the vectors x
which the final velocity is ignored [1–3]. A numerical– and v have a dimension n ≥ 2, the optimal control
analytical study with fixed magnitudes of velocity regime is equivalent to two-dimensional motion ([4,5].
was performed for equal values of the initial and final The control problem has a solution in the class of
velocities [4]. piecewise continuous functions. Necessary optimal-
A theoretically and practically interesting case is ity conditions of the maximum principle type [1] are
that where the initial and final positions coincide applicable to this problem.
while the corresponding velocities differ. It leads to The solution of time optimal control problem (1) is
planar (two-dimensional) motion for nonparallel ve- reduced to solving a two-point boundary value prob-
locity vectors. This plane of controlled optimal mo- lem for a control function of the form
tion is determined by the indicated vectors. In the
special case of parallel vectors, the problem degener- u∗ = q|q|−1 , p(t) = p0 = const, (2)
ates into the one-dimensional case, for which an ana-
lytical solution can easily be constructed. In the case q(t) = −p0 t + q 0 , q 0 = const,
of two-dimensional motion, the maximum principle
can be used to determine the structure of an opti- where p and q are the variables conjugate to x and v
mal control and to completely integrate the equations (momenta).
Substituting the control function u∗ (t) into the
∗ This paper based on my PhD thesis which I successively equations of motion and performing single and re-
defend in Institute for problems in mechanics named after
A.Yu. Ishlinskii Russian Academy of sciences.
peated integrations, we obtain representations for the
† Volga branch of Moscow automobile-road institute. Email: velocity v(t) and the position vector x(t), respectively
al.koshelev@gmail.com [4, 5].
As a result, we find explicit representations of the By reducing (5) to three scalar combinations and
desired phase variables x(t) and v(t) in terms of alge- taking into account (6), we obtain a system of three
braic and logarithmic functions. They have the form transcendental equations for tf , µ, and σ: (7)
of ”linear” expressions in terms of vectors ξ and η:
Z t tf = (fx2 (µ, σ)))−1/2 (7)
∗
v(t) = v0 + u (τ )dτ = v0 + Vξ (t)ξ + Vη (t)η (3) v02 = f02 (µ, σ)(fx2 (µ, σ))−1
0
x(t) = x0 + v0 t + Xξ (t)ξ + Xη (t)η fx2 (µ, σ)|v0 |c = −f12 (µ, σ), c = cos(v0 , vf )
Q The functions fx , f0 , and f1 depend only on µ ≥ 0

u∗ = , Q = −ξt + η and |σ| ≤ 1. They are obtained by taking the scalar
R
product and have the form
R = |Q| = (ρ2 t2 − 2σρt + 1)1/2
Here, ξ and η are the two-dimensional vectors p0 fx2 (µ, σ) = a2ζ µ2 + 2aζ aη µσ + a2η (8)
and q0 normalized by |q0 | > 0 and η is the unit vector.
Thus, ρ = |ξ|, ρσ = (ξ, η), and the 2 × 2 matrix f12 (µ, σ) = µ2 aζ (aζ + bζ ) + µσ(aζ (aη + bη )
(Vξ,η , Xξ,η ) is nonsingular in the generic case. The +aη (aη + bη )) + aη (aη + bη )
scalar functions Vξ,η and Xξ,η depend on time t and
the parameters ρ and σ in a rather intricate manner: f02 (µ, σ) = (aζ + bζ )2 µ2 + 2µσ(aζ + bζ )
×(aη+ bη ) + (aη + bη )2
Vξ = −ρ−2 (σV + R)|t0 , Vη = ρ−1 V |t0 (4) The second and third equations in (7), combined
with (8), give the governing system of equations for
1
Xξ = [(−ρτ + 3σ)R + (−2ρστ + 3σ 2 − 1)V ]t0 µ and σ. The unknown tf is then determined from
2ρ3 the first expression. Note that a similar system was
t solved by a rather complicated separation of variables
+ 2 (1 + σV (0))
ρ in [5]. That approach is also applicable to the case
t under consideration, but Newton’s method was found
Xη = ρ−2 [−R + (ρτ − σ)V ]t0 − V (0) to be preferable.
ρ
The quantities to be determined are the unknown IV. Optimal trajectories
vectors ξ and η and the optimal time tf .
We present the main results on the construction
III. Solution of the problem of optimal trajectories and examine their basic prop-
erties (Fig. 1, 2). It was established by contradic-
The boundary value problem is reduced to a sys-
tion that the optimal trajectory lies in the angle be-
tem of four algebraic equations for ξ, η, and tf ob-
tween the initial velocity and the positive abscissa
tained from (3) and (4) taking into account boundary
axis. Moreover, the properties of u∗ (t) imply that
conditions (1). Instead of ξ, we introduce the vector
the angle between the position vector x(t) and the
ζ = ξtf , where |ζ| = µ = ρtf . This makes it pos-
positive abscissa axis varies monotonically with time
sible to separate the unknown tf in the equations.
(see below).
The system of two vector equations for ζ, η and tf is
After the substitution v0′ = v0 |v0 |−2 , the optimal
transformed into
time becomes t′f = tf |v0 |−1 . The parameters are
vf = tf (aζ ζ + aη η) (5) transformed as follows: µ′ = µ, σ ′ = σ, ξ ′ = ξ|v0 |,
and η ′ = η. When |v0 | > 1, the solution to prob-
v0 − vf = tf (bζ ζ + bη η) lem (1) can be reduced to the case 0 < |v0′ | ≤ 1. If
|v0′ | < |v0 |, then control u′ (t) at the initial point cor-
The scalar coefficients aξ,η and bξ,η in (5) are func-
responds to the control u(t) at the terminal point.
tions of only µ and σ:
When |v0 | = 1, the velocities and time do not vary:
a = (µ2 − 2µσ + 1)1/2 − 1, (6) v0′ = v0 and t′f = tf .
The various segments of the optimal trajectories
b = ln((µ − σ + (µ2 − 2µσ + 1)1/2 )/(1 − σ)) can be interpreted using the solution to the one-
dimensional problem. The found features of the opti-
aζ = −(1/2µ−3)[(µ + 3σ)a + µ + (3σ 2 − 1)b]
mal control and the optimal time correspond the case
aη = bζ = µ−2 (a + σb), bη = −b/µ of parallel v0 and vf (x, v, and u are scalar functions).
x2
4
0.75
3
x2 0.5
3
4
2
2
2.5 0.25
5
1 1
-5 -2.5 0 2.5 x1 -0.5 -0.25 0 0.25 x1
Figure 1: Family of optimal trajectories for |v0 | = 3 Figure 2: Family of optimal trajectories for α = 0.75π
and various initial velocity directions. and various initial velocity magnitudes.
N 1 2 3 4 5 N 1 2 3 4
α 0.1π 0.25π 0.5π 0.75π 0.9π |v0 | 0.3 0.7 1 1.3
V. One-dimentional case solutions to the maximum principle problem: opti-

mal (tf = 0, u(t) = 0) and nonoptimal (tf = 4,
The control function can be rewritten in the form u(τ ) = sign(−2τ + 1)). In the case√of zero initial ve-
−p0 τ + q0 locity, the optimal time is t0f = 1+ 2. The switching
u= (9) 2
| − p0 τ + q0 | curve has the form x = ∓ v2 for v0 ≥ −1 and v0 < −1,
respectively. We find the set of initial velocities cor-
Here, τ = t/tf is normalized time. Switching occurs responding to a terminal time tf . For η = ±1, we
at τ ∗ = 1/λ, where λ > 1. In view of (9), the phase have the expressions
variables have the form
λ± = 2tf (tf ∓ 2)−1/2
p
u(τ ) = ηsign(−λτ + 1) q
±
vf − v0 v0 = −1 ± t f ∓ 2t2f ∓ 4tf
η= sign(2 − λ), ζ = λη
|vf − v0 |
Let us examine the functions v0± (tf ) when tf ≥ 2,
Terminal conditions (1) for x and v yield the rela- since λ± < 1 for tf < 2. The minimum time tf = 2
tions is reached at v0± = 1. We have v0− (tf ) ≥ 1 for tf ≥ 2,
v0+ (tf ) ≥ 0 for 2 ≤ tf ≤ t0f , and v0+ (tf ) < 0 for
t2f
x(τ ) = v0 τ tf +η 2 (|λτ −1|(1−λτ )+2λτ −1) (10) tf > t0f . For the terminal time tf = t0f , we have
2λ
√ √
q
tf v + 0
(t ) = 0, v ∗
= v − 0
(t ) = − 2−2+ 10 + 8 2
v(τ ) = v0 + η , τ ∈ [0, 1] 0 f 0 f
λ(1 − |λτ − 1|)
Thus, one value of the optimal time corresponds to
one velocity when v0 < 0 or v0 > v ∗ and to two veloc-
λ(1 + |v0 |2 + 2|v0 |c)1/2 λ(v0 + 1)
tf = , η= ities in one direction when 0 ≤ v0 ≤ v ∗ (see Section
|2 − λ| tf (λ − 2) VI). The found properties of one-dimensional motion
are used to interpret the solution to the problem in
λ2 (1 − v0 ) − 4λ + 2v0 + 2 = 0
p the general (two-dimensional) case.
2 ± 2 + 2v 2
λ = λ± = 0
, v0 6= 1 VI. Numerical simulation
1 − v0
According to (10), when v0 = 1, the control u is Figures 1 and 2 show the families of optimal tra-
a constant vector; when v0 = −1, we obtain two jectories. The optimal trajectories lie in the angle
φ/π ψ/π
1
1
2
3
4
2
1.5 3 0.5
4 6
1 0
0 0.5 α/π 0 0.5 α/π
Figure 3: Initial control directions φ as functions of Figure 4: Final control directions ψ as functions of α
α for various initial velocity magnitudes for various initial velocity magnitudes
N 1 2 3 4 5 N 1 2 3 4 5 6
|v0 | 0.5 0.7 0.9 1 3 |v0 | 0.3 0.7 0.9 1 1.3 3
between the initial velocity direction and the positive the required terminal velocity. When |v0 | ∈ [0, 1],
abscissa axis. The angle characterizing the direction we have ψ → π as α → 0 or α → π. When |v0 | ∈
of x(t) is a monotone function of time. (1, +∞), ψ → 0 as α → 0 and ψ → π as α → π.
The optimal control is analyzed for various initial Thus, for high initial velocities (v0 > 1), the behav-
velocities. The resulting family makes it possible to ior of the control at the initial stage is similar to the
find the optimal control and to determine its features. case of low velocities (v0 < 1) at the final stage of mo-
Note the properties of the open-loop control u∗ (t) at tion. The constraints on the direction of the control
the initial and terminal times of motion (Fig. 3, 4). vector imply that the velocity component along the
We analyze these values as functions of the initial ordinate axis decreases at the initial time (Fig. 3).
velocity direction, i.e., the angle α for fixed velocity Moreover, there are no optimal controls under which
magnitudes. The desired angles φ (i.e., the directions the object accelerates at the initial stage of motion
of u) at the initial time are in the triangle bounded and decelerates in the neighborhood of the terminal
by the lines φ = 2π, φ = α + π, and α = 0 (Fig. 3). point.
The object accelerates (i.e., the velocity magnitude Now, we analyze the optimal time as a function of
increases) when φ ≥ α + 1.5π and slows down when the initial velocity direction α (Fig. 5,6). When α =
α + π ≤ φ < α + 1.5π. Thus, deceleration occurs 0 or π, the optimal time corresponds to the solution of
when |v0 | ∈ [1; +∞) and α ∈ [0, π] and when α ∈ the one-dimensional problem in Section V. Moreover,
[π/2, π] and |v0 | ≥ 0 at the initial time. When α ∈ when |v0 | = 1, tf (α) → 4 as α → π, i.e., the optimal
[0, π/2) and |v0 | ∈ [0, 1), the motion switches from time tends to a nonoptimal solution. In the interval
acceleration to deceleration as α increases. According α ∈ [0, π], the function tf (α) increases monotonically.
to the solution of the one-dimensional problem, when When α ∈ [0, π/2] and v0 ∈ [0, v∗], there is a domain
|v0 | ∈ [1, +∞), we have φ → π as α → 0 and φ → 2π where tf ≤ ff0 (Fig. 5). In this domain, the same tf
as α → π; when |v0 | ∈ [0, 1), we have φ → 2π as can be reached at two different velocity magnitudes.
α → 0 or α → π (see Section V). Moreover, any two curves intersect when 0 ≤ |v0 | ≤
A similar diagram can be constructed for tf (Fig. 1.
4). The range of α is bounded by the lines ψ = A transition occurs at the intersection point: as
π, ψ = α, and α = 0. The transition between the α increases, the curve with a higher velocity corre-
deceleration and acceleration regimes for obtaining sponds to a longer time; and the curve with a lower
the required velocity vf occurs on the line ψ = π/2. velocity, to a shorter time. The properties of the
When α ∈ [π/2, π] and |v0 | > 0 and when |v0 | ∈ control-switching curve (see Fig. 3) and the envelope
[0, 1] and α ∈ [0, π], the object accelerates toward of the family of curves tf (α) (Fig. 5,6) imply that,
tf tf
7
1
6
5
4 4
2
2.25
3 3
3
2 4
1
5
2 2 6
0 0.5 α/π 0 0.25 α/π
Figure 5: Optimal time as a function of the initial Figure 6: Optimal time as a function of the initial ve-
velocity direction α ∈ [0, π] for various initial velocity locity direction α ∈ [0, π/2] for various initial velocity
magnitudes magnitudes
N 1 2 3 4 5 6 7 N 1 2 3 4 5 6
|v0 | 0 0.1 0.5 1 1.1 1.2 1.3 |v0 | 0 0.1 0.3 0.5 0.7 1
at the intersection point, deceleration is an optimal celeration (the initial velocity magnitude decreases),
control strategy for an object with an higher velocity then acceleration, the reversion of the control’s sign,
at the initial stage and acceleration is optimal for the and deceleration. When α = 0, φ = π, and |v0 | = 1,
control of an object with an initially lower velocity. the control is a constant vector implementing decel-
The envelope is the control-switching curve at the eration and acceleration. When α = 0, we have φ = 0
initial point (Fig. 3), i.e., for fixed α ∈ [0, π/2]. The for all the velocities |v0 | ≥ 0 with |v0 | =
6 1. Decelera-
minimum optimal time is reached when v0 is orthog- tion is followed by acceleration, then the sign of the
onal to η. For fixed values α ∈ [π/2, π], the minimum control reverses, and again deceleration is followed by
time is reached at v0 = 0. acceleration.
Thus, the velocity magnitude attains a minimum
Conclusions when α = 0, φ = 0, and |v0 | = 1 and a maximum and
two minima when α = π, φ = 0, and |v0 | ≥ 0 with
These results make it possible to draw an anal- |v0 | =
6 1. A maximum and a minimum occur in the
ogy with the case of one-dimensional motion. The remaining cases.
return of the object from the initial position to the In the two-dimensional case when |v0 | = const and
same one with turning the velocity can be divided α ∈ (0, π), the control leads to switching the acceler-
into two stages: motion from the initial position to ation and deceleration regimes corresponding to the
an intermediate point x1 with at the velocity v1 and boundary values of α. This switch occurs at the ini-
motion from this point to the final position. In the tial stage of the motion if |v0 | < 1 and at the final
case of one-dimensional motion, acceleration and de- stage if |v0 | > 1 (Fig. 3,4).
celeration are possible at each stage. These regimes
determine the properties of the trajectories and the References
control in the two-dimensional problem.
These features are represented as acceleration and [1] L. S. Pontryagin, V. G. Boltyanskii, R. V.
deceleration regimes. In the one-dimensional case, Gamkrelidze, and E. F. Mishchenko, The Math-
the velocity magnitude can have three extrema and ematical Theory of Optimal Processes (Nauka,
up to four basic control regimes are possible in the Moscow, 1969; Gordon Breach, New York,
course of this motion. When α = 0, φ = 0, and 1986).
0 < |v0 | < 1, we have acceleration, the reversion of
the control’s sign, deceleration, and again accelera- [2] G. Leitmann An Introduction to Optimal Con-
tion. When α = 0, φ = π, and |v0 | > 1, we have de- trol, McGraw-Hill, NY, 1966
[3] A. Bryson; Ho, Yu-Chi, Applied Optimal Control
: Optimization, Estimation and Control, John
Wiley & Sons, 1975. Revised Printing, 481 pp.
[4] L. D. Akulenko and A. P. Koshelev, Time-
Optimal Steering of a Dynamic Object to a Given
Position under the Equality of the Initial and Fi-
nal Velocities, Journal of Computer and System
Sciences International, p. 921 - 928, Vol. 42, No.
6, 2003
[5] L. D. Akulenko, A. P. Koshelev, Time-optimal

steering of a point mass to a specified position
with the required velocity, Journal of applied
mathematics and mechanics, Pages 200-207, Vol-
ume 71, Issue 2, 2007

Time Optimal Return of A Dynamic Object

Загружено:

Сведения о документе

Исходное описание:

Оригинальное название

Авторское право

Доступные форматы

Поделиться этим документом

Поделиться или встроить документ

Параметры публикации

Этот документ был вам полезен?

Это неприемлемый материал?

Авторское право:

Доступные форматы

Time Optimal Return of A Dynamic Object

Загружено:

Авторское право:

Доступные форматы

∗

Time Optimal Return of a Dynamic Object

Key words: Time optimal control, feedback. ẋ = v, x(0) = x0 , x(tf ) = 0 (1)

Q The functions fx , f0 , and f1 depend only on µ ≥ 0

-5 -2.5 0 2.5 x1 -0.5 -0.25 0 0.25 x1

V. One-dimentional case solutions to the maximum principle problem: opti-

0 0.5 α/π 0 0.25 α/π

[5] L. D. Akulenko, A. P. Koshelev, Time-optimal

Вам также может понравиться