Академический Документы
Профессиональный Документы
Культура Документы
Garth P. McCORMICK
The George Washington University, Washington, D.C., U.S.A.
1. Introduction
* This work was supported by Contract AFORS-73-2504, U.S. Air Force, Office of Scientific
Research.
147
148 G.P. McCormick / Computability of global solutions
minimize f (x ) ,
(A)
subject to x~R_CE n.
Iteration k (k >>- 1)
minimize f(x) ,
(Ak, l)
subject to x E R k,lC E n
Convergence condition
If f ( x k'K ) <. v k'K + e, then the algorithm terminates and x k,K is con-
sidered the global solution p o i n t for (A).
If the convergence condition is n o t met, then iteration k + 1 is per-
formed where for l = 1 .... , k, 1 4= K, programs (A k+m,l) are identical
with (Ak'/), program (A k+ 1,K) has the constraint set R k+l'K , and pro-
gram (A k+l'k+l) has the constraint set R k+l'k+l , where these two sets
are a partition of R k, K such that
Comments
sequence of values v k'K(k) would be v*. Care has to be taken that this
be the case since the algorithm would never terminate without this pro-
perty.
The details on the implementation of step 1 and another m e t h o d for
computing the vk'l's will be considered in Part II.
In this section are presented the details for the generation of convex
underestimating programming problems. Basic to this is the notion of
a convex underestimating function. Suppose a function has the form
T [ t ( x ) ] + U[u(x)] • V [ v ( x ) l (2)
the function, and the concave envelope is the lowest concave function
which everywhere overestimates the function. The convex envelope of
T(.) in the interval [a t, b t] is denoted by e r ( . ) and the concave enve-
lope by E r ( . ). The envelopes are functions of the interval as well as the
function T(. ) but for simplicity of notation this dependence will not be
made explicit. Similar notation is used for the other functions.
Compute a point at which each function achieves its minimum on its
interval domain, and a point at which each function achieves its maxi-
mum. (Note that the ability to do this is implied by the availability of
the convex and concave envelopes.) That is, let Zrmtn, Zrmax, A r, B r be
the values where
where mid(z1, z2, z3) is the function which selects the middle value of
three scalars.
The proof that the function in (4) is convex and that it underesti-
mates T[t(x)] in the region of interest is left to the reader. It is impor-
tant to notice two other properties of this function. It is a "tight" un-
derestimating function in that if the interval [a t, bt] were shrunk, the
underestimating function would be higher; and if a t = bt, equality
would hold in (4). Furthermore, if T(. ) is continuously differentiable
on [at, bt] and if ct(x) and Ct(x ) are also continuously differentiable,
then so is the convex underestimating function for T[t(x)] in (4).
The concave overestimating function for T[ t(x)] is
i ~ ~.,~ + + e V
~.~ ~ ,., ~v/
~S
o ]1 II II ~" ~'~ J II
'S
0
"~ ~ " V/ -V
II II II II II ~II II
g o.~
rj t.i ~ bl
~4
o
f~
0
~J e,i
0
II I
¢. ~, eq
G.P. McCormick / Computability of global solutions "153
m a x [ B v U + B u V - B u B V, A v U + A u V - A g A V] .
( A v e u [ m i d { c u ( X ) , Cu(X),ZUmin}] (5)
+ A g E v [mid(co(x), Co (x),ZmVax }] - A u A v } .
minimize fix),
x~E n
minimize
x ~ En+l Xn+l ,
where Fi(x), a(x), and fi(x) are convex and continuously differentiable.
Obviously the problem
minimize Xn+ 1 ,
x ~ E n+1
minimize X N (x ) ,
x~ En
where Xi(x) = x i for i = 1, ..., n, and the remainder of the Xi(x)'s are de-
fined recursively as follows:
GivenXP(x) f o r p = 1, ..., i - l , then f o r i = n + l , ...,N,
i-1 i-1
xi(x)=~ Ti[XP(x)] +~ V i [XP(x)] , U i [Xq(x)] (6)
p=l P p=l q=l q,P P,q '
( X l x 2 ) X 2 t x 2 - 1 e-XlX2t
0 ['(X2) dt .
. . . [ap,
the interval . . . bp ]. Let Zm~n,p/Zm'ax,p
Tz Tl denote a minimizer/maximizer
156 G.P. McCormick / Computability of global solutions
Ai = T i (zT,.i ), Bi i -T,i
T,p p- mln,p T,p = T ; ( z m a x , p ) "
Similar notation holds for the factors of the product terms. There V re-
places T above in one case, and U and q replace T and p in the other.
Let ci(x)/Ci(x) denote the convex/concave functions it is desired to
compute which underestimate/overestimate X i ( x ) . Now
ci(x) = x i = C i ( x ) f o r i = 1, .,., n .
Assume that c p (x), C p (x) are known for p = 1, ..., i - 1. Then for any x
satisfying ap ~< X P ( x ) <<. bp f o r p = 1, . . . , i - l ,
X i ( x ) >>.c i ( x )
i-1
= ~ eT, [ m i d { c P ( x ) , C P ( x ) , zT"imm,p }]
p=l
i-1 p
+ p~lq=~
~ ~ max{B~,p " e iv,q [ m i d ( c q ( x ) , c q (X)'Zmm,q}l
u,i
+B! • e i [ m i d { c P ( x ) , C P ( x ) Z V'i }]
u,q V,p - -' m m , p
C q ( x ) , zU,. i }] + A i . E i [mid{cP(x),
mm,q u,q V,p
CP(x), z V,i }] - A i . A i v , p}
max,p u, q "
(Here, without loss of generality, it has been assumed that Aiu.q < 0, all
other bounds are nonnegative.) The upper bounding concave function
is determined in an analogous way.
Several comments are in order. In general, every convex/concave
function need not be computed. It is not necessary, for example, to
compute C N ( x ) . If a constraint never fails to be satisfied, its convex
(concave) under (over) estimating function need not be computed un-
less needed for an Xi(x) which contains that constraint in its definition.
G.P. McCormick / Computability of global solutions 157
For most problems, the ai's and bi's are given or known from natural
conditions on the problem. If some of them are not given and are need-
ed (for some i > n), they can be c o m p u t e d recursively from the bounds
assumed available on the first n Xt"s (which are, of course, the original
problem variables).
The underestimating convex programming problem is then:
minimize cN ( x ) ,
subject to el(x) <~ b i for i = 1 , . . . , N - 1 ,
Ci(x) >i a i for i = 1 , . . . , N - 1 .
Clearly, any point feasible to problem (A) is feasible to this one, and the
objective function value is no greater for the convex underestimating
problem.
(ex l - x ~ ) ( x l x 2) <~ 0
X 3 = ( X 1 e X1 ) ( X 2 ) , X 4 = X 3 + ( - X 1 )(X2) 3 . (9)
3
/
/
II
II
iii I I
I I i/
1/111/i/ i /
,' / / / I i .
I I ii I I
.".~"...>~, .~ ,'//i
~ I I ."
II. /I
/ , W / / / / . / ' ",
.~/ / / , I / / i i I I I l t
I I i I I/ l ~ " 1 I i l l / i l l I / I I / / I
/11 I i I I I / / / .I / I I
I I I I I l I I I I/ I I /
i I I i I.II I / II t !
l l i III I I I I / I I i
I I / / I,I / / I I, / / I /
' I' i / /' I / ~1 '
Fig. 1. Points feasible to inequality, with those in convex hull but not feasible.
and
- 2 ~ < x 2 ~< 3 .
(x 1 ex l ) x 2 I> - 2 x l e xl - 0 . 3 6 7 9 x 2 - 0 . 7 3 5 8
and
(Xl e x l ) x 2 ~> 3 x l e x~ + 6 0 . 2 5 7 x 2 - . 1 8 0 . 7 7 1 .
- 2 X l exl / > - 3 0 . 3 1 2 2 x I - 2 9 . 5 7 6 5 .
(-xl)(x32) >~ 8x 1 - 3 x 3 - 24
( - x l ) ( x 3 ) ~> - 2 7 x 1 + x 3 - 2 7
, 3
--3x 2 when -2 ~< x 2 ~< - 1 . 5 ,
~(X2) =
-20.25x 2 -20.25 when - 1 . 5 < X 2 ~< 3 .
I X 4 >~- 3 0 . 3 1 2 2 x 1 - O . 3 6 7 9 x 2 - O . 7 3 5 8 - 2 9 . 5 7 6 5 + 8 x 1+6(x2)-24
= - 2 2 . 3 1 2 2 x 1 - 0.3679x 2 - 54.3122 + 8(x 2) ,
II X 4 >~ - 3 0 . 3 1 2 2 x 1 - O . 3 6 7 9 x 2 - O . 7 3 5 8 - 2 9 . 5 7 6 5 - 2 7 x l +~(x2) - 2 7
- 57.3122x1_O.3679x2-57.3122+~(x2),
160 G.P. McCormick / Computability o f global solutions
1 2
L i
-2 -1 \ x
,%
%
\
\
\
\ ~ ( - 3 X 32
-30-
-50"
7
/
//
'2
1 2
-1-
Fig. 3. Comparison of convex hull of example inequality with overestimating convex region.
Boundary of convex hull.
. . . . Boundary o f overestimating convex region.
for an interval. It is a line for an adjacent one, may revert to the func-
tion itself for the next, and so on. The determination of h o w many dif-
ferent representations are required depends upon the size of the interval
and also on the function. A computer program which does this for the
sin function has been coded. The trigonometric functions are the most
complicated ones.
The next step is to determine the exact point at which the convex
envelope changes representation from a line to follow the curve of the
function (or vice versa). Presumably bounds on the interval in which
this occurs are given by the first step. This is simply the problem of find-
ing a point where the slope of the curve equals the slope of the line con-
necting the point with a given one. If T ( z ) is the function o f a single
variable, and a is a given point at the start of the interval, then the prob-
lem is to find a point t ~ [a, b] such that
t k +1 = t k + T - - S ~ ) tk - a (11)
x2
3.0'
/r
////
/ /
// //
//: //
c=4.0 /// /// v z~_c=1.5
/
/Ii /P~I/I/
//
/~]11/11
\ //
/
/
-1.5 4.0
and
X 5 (x) = sin[X 3 (x)] + ( X 4 (x)) 2 - 3X 1 (x)/2 + 5X 2 (x)/2 + I .
sin (x3)
1.o,+
I
/
~l/I
/
/
- .79X31"
- 2.58 k89
. . . . . . . i~
I ~--, J- I
Fig. 5. Convex envelope of sin(X3) for -4.5 ~<X3 ~<7.
G.P. McCormick / Computability of global solutions 165
where
to show that if the rules for splitting the regions to create new program-
ming problems are done properly, the lower bound on the global solu-
tion value would tend to the global solution value in the limit.
First, notation needs to be established. The factorable programming
representation of Problem (A k'l) of Section 2, for l = 1, ..., k is written
(pk, l) minimize c N, k, l (x ) ,
subjec t to ,t <. c 'k'l(x) , ci'k't(x) <<. bk.
fori = 1,..,,N- 1,
where the defining equations for ci'k'l(x) are as in (7), except that where
necessary the superscripts k, l must be appended there.
Let yk, l denote a solution point for (pk, l) and let v k,i denote the solu-
tion value. Clearly, since each pk, l is a convex underestimating problem
for A k,l, then v k,l is lower b o u n d on the solution value of ,A k,l, vk,,t.
This is consistent with the general description of the global solution
m e t h o d described in Section 2. It remains to show h o w the partition, or
splitting (S) is done when the problems are written in canonical factor-
able form.
Consider the following differences:
Tip[Xp (yk'K )] --ei'k'KT,p [mid (cP'k'K (yk'K )' cP'k'K (yk'K )' zT"i'k'Kmm,p }]
i=n+l,...,N; p = 1 , . . . , i - 1; (12)
- m a x (Biizg.'p ei'k'Kv,q [mid (c q'k'K (yk,K), cq, k,K (yk, K ), zU, i,k,Kqmin, }]
+ oi, k,K • ei'k'K[mid{cP'k'K(y k'K~,,v'(7pk'K(ty'k'K'), zV'ikm~n',p'K }]
U,q V,p
- B i,k,K. B i,k,K (13)
U,q V,p '
Ai, k, K. ei, k,K [ m i d {c q,k,K (yk, K ), cq, k,K (yk, K ), zU, i,k,K }]
V,p U,q mln,q
G.P. McCormick / Computability of global solutions 167
E£k,K [mid { cP, k,K (yk, K ), cP, k,X (yk, K ), zT, i,k,g 1] _ T;[ X p (yk, K )]
1, p max,p ~-
K~' , Ei'k'Xv,p [mid {c p' k,K (yk, K ), Cp, k,K (yk, K ), z mV'i'k'Kax,p }l
min {B~:k.
q = 1,...,p.
It has been assumed, without loss of generality, that in all cases of the
A s and B , s, A t!k K
, ' is less than zero.
Consider also u,qthe constraints for Problem i k , K
Because the constraints (16) and (17) are not necessarily satisfied at
yk, K, the differences i n (12) through (15) may not all be nonnegative.
In fact, the T's, U's and V's may not be defined outside their ranges, in
which case the differences may not exist. These will not be used. Con-
sider only those differences in (12) and (13) where the associated i does
not have the property that (17) is satisfied at yk, X. Consider only those
differences in (14) and (1 5) where the associated i does not have the
property that (16) is satisfied at yk, K. ifyX, K is not a global solution to
the factorable programming problem (A), then at least one of the re-
168 G.P. McCormick / Computability of global solutions
maining differences must be positive, and the associated XP(y k'K) (or
associated X q ( y k,K) when differences of the form (13) and (15) are
considered) must be within its/their bounds, i.e.,
must hold.
The set of x's feasible to the underestimating problem is larger than
the feasible region of the original problem. The problem functions may
not be defined outside their original ranges. In this case, it may not be
possible to determine which constraint is not "satisfied" for purposes
of applying the splitting rule. To ensure this, an additional assumption
will be made, that the problem functions are continuous in the region
feasible to the underestimating problem. Formally, define, for p -- 1, ...,
N-l,
ap =inf{XP(x)lCq(X)<~bq, Cq(X)>~aq f o r q = 1 , . . . , p - I } ,
Theorem 1. I f the T i (" )'s are continuous on [ap, bp ] for i = 1, ..., N;p =
1, ..., i - 1, and if th~ U's and V's are continuous on their corresponding
intervals, and if y k'x is not a global solution to the factorable program-
ming problem, then at least one o f the differences (12) through (15) is
positive and the corresponding values o f the XP (yk'K ), (or X P ( y k'K )
and Xq (yk'K ), depending upon the case) lie within their required in-
tervals, i.e., satisfy (18)/(18) and (19).
Proof - Obviously, a]k'K <<.X](y k'x ) ~ bk'x for , ] = 1, ..., n, since the con-
vex and concave underestimating/overestimating functions are just the
problem variables themselves. Let i be any index greater t h a n or equal
to n + 1 and less than or equal to N for which the differences (12)
through (15) are zero, and for which the X p's and X q's involved satisfy
G.P. McCormick / Computability of global solutions 169
Since c i'k'K (yk, X ) <~ bki,K, and C i'k'K (yk, K ) >~ ak, K, it follows t h a t
The splitting rule is then: Find the difference in (12) through (15)
which is maximal with the following restrictions: do not compute (12)
and (13) for any i where Xi(y k ' r ) satisfies (17); do not compute (14)
and (15) for any i where Xi(y k'g) satisfies (16): do not compute any
differences in (12) and (14) where the X p (yk, K) does not satisfy (18);
do not compute any differences in (13) and (15) where either X p (yk, l¢:)
does not satisfy (18) or X q (yk, K) does not satisfy (19).
Let I and P denote the superscript and subscript associated with the
maximum difference if it is of the form (12). (The case when the maxi-
m u m difference is of the form (14) is analogous and will not be con-
sidered.) For iteration k + 1 problems (A k+l'l) for./= 1, ..., k, l :~ K are
identical with problems (Ak't). Problem (A k+l'K ) is the same as problem
(A k,K ) except that the constraint Xe(x)>~ akp"K is replaced by
X P ( x ) ~ [hP'k'K + X P ( y k ' K ) ] / ' 2 (20)
'-'~T,I
where
hP, k,K = mid{cP, k,K (yk, K ), cP, k,K (yk, K ~ zT, I,k,K }
TI J' min,P '
and problem (A k+l,k+l ) is the same as problem (Ak'g), except that the
constraint XP (x) <~ b k,g is replaced by
of the two functions in the max{., • }is higher, the difference portion
associated with the index P can be evaluated at any one of four values:
. V , I k,K , or Zm~x,~0
cP'k'K(yk'K), cP'k'K(yk'K), ~mYn',P V I,k.K . A similar state-
ment holds for quantities associated with the index Q. Denote the value
at which the difference is evaluated in the first case by hf/k,f K and in
the second by h Q,k,K
u, I " If
(22)
X P ( yk' K ) - h iP/ ,k[K >~X Q ( y k , K ) _ h Q , k,K
"°U,I '
Lemma 1. Consider all the differences in (12) and (13) for which the
index i involved does not correspond to a constraint o f the form (17)
which is satisfied an infinite number o f times at y k,K . In addition, con-
sider all those differences in ( 1 4 ) a n d (1 5 ) f o r which the index i involved
does not correspond to a constraint o f the form (16) which is satisfied
an infinite number o f times at yk, K. Consider also those terms in (12)
and (13) for which the index i = N. From these terms, consider only
those for which the X P ( y k,K ) and x q ( y k'K ) involved (if necessary)
satisfy
a k, K ~ X p (yk, K ) ~ b k, K
and
ak, K <. X q (y k,K ) <~ bk, K
q q
Proof. Assume the contrary, i.e., that the limiting maximum difference
is not zero. Assume without loss of generality that the term which ini-
tiates the splitting rule an infinite number of times is of the form (12).
Let I and P be indices associated with the term which initiates the split-
ting rule an infinite number of times and which tends to the m a x i m u m
difference. One limiting interval can be assumed (again, without loss of
generality) to have the form
Since both XP(y) and ~er,I are in the interval, it follows that they are
equal. Then, since the convex envelope agrees with the function at its
endpoints,
= 0
T,P ,
172 G.P. McCormick / Computability of global solutions
~T,i,k,K } is
Proof. In this case the mid(cP'k'K(yk'K), cP'k,K(yk'X), ~mm,p
either c p,k,K (yk, X) o r Z_T,i,k,K
min, p . In either case,
Up(~)=-bp = X p ( y ) .
Proof. The proof follows the proof of Lemma 2 and will not be given.
All the possibilities have not been covered in Lemmas 2 and 3. In
Lemma 2 the conclusion follows if (26) is substituted for (23), and (27)
for (24). In Lemma 3 the conclusion holds if (23) is substituted for
(26), and (24) for (27). The lemmas required for the differences in (13)
and (15) also follow from the same line of reasoning used in Lemma 2
and will not be stated explicitly.
Proof. Let 0 be the set of indices for which (30) and (31) hold. Let 01
be the set of indices for which both (16) and (17) are satisfied an infi-
nite number of times. Let 02 be the set of indices not in 01, for which
X i ( y k'K ) ~ b k.'K
l
(32)
tends to zero.
174 G.P. McCormick / Computability of global solutions
X N ( y ) ~ 1)* .
O* /> ~N (~) .
X N ( y ) = O* .
G.P. McCormick / Computability of global solutions 175
6. Summary
Acknowledgment
References
[1] E.M.L. Beale and J.A. Tomlin, "Special facilities in a general mathematical programming
system for nonconvex problems using ordered sets of variables", in: J. Laurence, ed., Pro-
ceedings of the fifth international conference on operational research (Tavistock Publica-
tions, London, 1970) pp. 447-454.
[2] J.E. Falk and R.M. Soland, "An algorithm for separable nonconvex programming prob-
blems", Management Science 15(9) (1969) 550-569.
[3] G.P. McCormick, "Converting general nonlinear programming problems to separable non-
linear programming problems", Technical Paper Serial T-267, Program in Logistics, The
George Washington University, Washington, D.C. (1972).
[4] G.P. McCormick, "Attempts to calculate global solutions of problems that may have local
minima", in: F.A. Lootsma, ed., Numerical methods for nonlinear optimization (Academic
Press, New York, 1972) pp. 209-221.
[5] R.M. Soland, "An algorithm for separable nonconvex programming problems II: noncon-
vex constraints", Management Science 17(11) (1971) 759-773.