Ad 0767826
Ad 0767826
Ad 0767826
vy^K*-"
7
<7fe7P-b
ENGINEERING DESIGN
HANDBOOK
to*.:; . ' ?=' vsmm
REDSTONE ^Ci.\;:i;/iCir:-r.:::{;f-.TiON Cf.NT
5 0510 00078496 4
MECHANICAL SYSTEMS
L
nn f.*" >-*"".- <?"" '' * w
."^.S!
y&i *.<> k~i^-- ' -i-v. '.;;A,
15 July 1973
AMC PAMPHLET
NO. 706-192
TABLE OF CONTENTS
Paragraph Page
LIST OF ILLUSTRATIONS ix
LIST OF TABLES xiii
PREFACE XV
Paragraph Page
Paragraph Page
31 Introduction 31
3_2 Properties of Linear Programs 32
3_3 The Simplex Algorithm 38
33.1 Determination of a Basic Feasible Point 38
3-3.2 Solution of LP 3-9
3 3.3 The Degenerate Case 310
3_4 Minimum Weight Truss Design 3 12
35 An Application of Linear Programming
to Analysis 3 14
References 3 17
111
AMCP 706-192
Paragraph rage
45 Steepest Descent Solution of the Finite
Dimensional Optimal Design Problem 427
45.1 An Approximation of the Problem OD 428
45.2 Solution of the Approximate Problem 430
45.3 Steepest Descent Algorithm 433
45.4 Use of the Computational Algorithm 434
References 435
51 Introduction 51
5 1.1 Lightweight vs Structural Performance
Trade-offs 5-2
5 1.2 Weapon Development Problems Associated
With Lightweight Requirements 52
5 1.2.1 Aircraft Armament 52
5-1.2.2 Gun Barrel Design 5-3
5-1.2.3 Towed Artillery 5-3
5-1.2.4 Other Weapon Problems 5-4
5 1.3 Plan for Technique Development 54
52 Elements of the Elastic Structural
Design Problem 54
5-2.1 The Optimality Criterion 5-5
52.2 Stress and Displacement Due to
Static Loading 55
52.3 Natural Frequency and Buckling 56
5-2.4 Method of Solution 5-7
53 Steepest Descent Programming for
Optimal Structural Design 57
53.1 Linearized Cost and Constraint Functions 58
53.2 Steepest Descent Algorithm for
Optimal Structural Design 5 10
53.3 Computational Considerations 510
54 Optimization of Special Purpose Structures 5 11
5-4.1 A Minimum Weight Column 5-11
54.2 A Minimum Weight Vibrating Beam 514
54.3 A Minimum Weight Portal Frame With a
Natural Frequency Constraint 516
54.4 A Minimum Weight Frame With Multiple
Failure Criteria 5 18
AMCP70G-192
Paragraph Page
Pa e
Paragraph 8
7- -1 Introduction 71
7- -1.1 The Class of Problems Considered 71
7- -1.2 Historical Development 71
7- -1.3 Methods Employed 72
7- -2 A Minimum Weight Column 72
7- -3 A Minimum Weight Structure With
Angular Deflection Requirements 75
7- -3.1 Statement of the Problem 75
7- -3.2 Tower With One Design Variable 77
7- -3.2.1 Method 1. Tower With Base Rigidly
Fastened to the Earth 78
7-3.2.2 Method 2. Tower With Base Pinned to
Earth and With Top Supported by
Guy Lines 712
73.3 Tower With Two Design Variables 713
7-3.3.1 Method 1. Tower With Base Rigidly
Fastened to the Earth 714
7-3.3.2 Method 2. Tower With Base Pinned to
Earth and With Top Supported by
Guy Lines 7-14
73.4 Discussion of Results 715
74 Minimum Weight Design of Beams With
Inequality Constraints on Stress and
Deflection 7-16
AMCP 706-192
Paragraph Page
3 1 Introduction 81
32 A Steepest Descent Method for the Basic
Optimal DesignProblem 82
3-2.1 The Problem Considered 8-2
32.2 Effects of Small Changes in Design
Variables and Parameters 82
32.3 A Steepest Descent Approach 85
33 A Steepest Descent Method for a General
Optimal Design Problem 815
3-3.1 The Problem Considered 8-15
33.2 The Effect of Small Changes in Design
Variables and Parameters 816
3-3.3 A Steepest Descent Computational
Algorithm 8-23
3-4 Steepest Descent Programming for a Class of Systems
Described by Partial Differential Equations 825
3-4. 1 The Class of Problems Considered 825
3-4. 2 Effect of Small Changes in Design
Variables and Parameters 827
84.3 A Steepest Descent Computational
Algorithm 8-29
Vii
AMCP 706-192
Paragraph Page
91 Introduction 91
9-2 Steepest Descent Method for Optimal
Structural Design 92
9-3 A Minimum Weight Column 9-8
9_4 A Minimum Weight Vibrating Beam 99
95 A Minimum Weight Vibrating Frame 910
96 A Minimum Weight Frame With Multiple
Failure Criteria 914
9-7 A Minimum Weight Vibrating Plate 9-16
References 918
INDEX i_ i
Mil
AMCP 706-192
LIST OF ILLUSTRATIONS
IX
AMCP 70G-192
LIST OF TABLES
xiv
AMCP 706-192
PREFACE
This text treats a broad class of optimal design problems through use of a
consistent set of computational techniques ideally suited for computer
application to mechanical design problems. No attempt has been made to be
exhaustive in the treatment of optimization techniques or the full range of
mechanical applications. Rather, the class of problems treated is concisely
formulated (in Chapters 4 and following) in terms of design and state
variables that occur in mechanical design. A steepest-descent approach
which has served as a workhorse, reliable technique in fields such as
aerodynamic system design, control theory, and nonlinear programming is
developed here for mechanical system design.
The Handbook was written by Dr. Edward J. Haug, Jr. of the U. S. Army
Weapons Command. It is based on lecture materials used by him in a
two-semester graduate sequence on "Optimization of Structural Systems",
taught at the University of Iowa since 1968. Examples treated in the text are
derived primarily from Dr. Haug's research, Dr. Jasbir Arora's University of
Iowa dissertation, and the work of Messrs. Tom Streeter and Stephen Newell
of the U.S. Army Weapons Command.
The Engineering Design Handbooks fall into two basic categories, those
approved for release and sale, and those classified for security reasons. The
Army Materiel Command policy is to release these Engineering Design
Handbooks to other DOD activities and their contractors and other
Government agencies in accordance with current Army Regulation 70-3 1,
dated 9 September 1966. It will be noted that the majority of these
Handbooks can be obtained from the National Technical Information
Service (NTIS). Procedures for acquiring these Handbooks follow:
Commander
Letterkenny Army Depot
ATTN: AMXLE-ATD
Chambers burg, PA 17201
XVI
AMCP 708-192
CHAPTER 1
1-1 SYNTHESIS VS ANALYSIS IN ENGI- sign changes are made and the structure is re-
NEERINGDESIGN analyzed. This process continues until the
designer is satisfied with his design. This has
Engineering is defined (Ref. 1) as "the art been the principal use of the computer in the
or science of making practical application of design process.
the knowledge of pure sciences such as
physics, chemistry, biology, etc.". Although In general, then, before the designer can
broad, this definition implies that the job of assure himself that he has the best system, he
engineering is to synthesize, or put together, must be capable of analyzing all candidates.
useful systems by applying knowledge and In the past half century, outstanding advances
methods derived from the "pure" sciences. in engineering analysis have been made. The
The meaning of "practical" in the given digital computer has allowed the engineer to
definition should be interpreted as best, or quantitatively analyze the behavior of systems
optimal; i.e., the job of engineering design is that were examined only qualitatively in the
to develop the best possible system for the past. The mechanical sciences, particularly,
given application, consistent with the re- have benefited from this boom in analysis
sources allocated to the development phase. capability. Structural analysis, stress analysis,
The purpose of this handbook is to present a analysis of mechanisms, and heat transfer
class of methods that allow for efficient use analysis, just to name a few, have made
of the computer in the design process. spectacular advances in the past twenty years.
Since the computer can be viewed simply Until the early 1960's, and even to the
as a device to handle large quantities of data present day to a lesser extent, the attention of
and perform simple algebraic operations and engineering research has been focused pri-
logic rapidly, it is important to look first into marily on developing analysis capability. Dur-
the role of calculation in design. The usual ing this period of emphasis on analysis,
approach to design is to conceive of a inadequate attention was paid to developing a
candidate system and then test it to see if it synthesis, or design, capability that is able to
works. Great strides have been made with efficiently use the newly developed analysis
digital computers in the past two decades to methods. In some of the mechanical sciences,
allow for numerical analysis as a test of the this problem is particularly acute. In struc-
idea, or concept, rather than previous cut-and- tural mechanics, for example, it is possible to
try techniques. For example, in structural analyze a structure under a given loading to
design one chooses the configuration and obtain accurate values for stress, displace-
member sizes, and then tests the structure by ment, and even natural frequency. It is not
analyzing its response to given loads. If the clear, however, how a structure should be laid
structure does not behave as desired, then de- out and proportioned to efficiently utilize
1-1
AMCP 706-192
I
"XT
ES
Examine
" " * *L Complete
yp.fv
leeasn ana ^
Function
Analysis
_ Conceptual
Design * Optimal
Design
Description
Inputs Output
Objective
1-2
AMCP 708-192
The purpose of this text is not to give a to describe analytically. Conceptual design, as
detailed treatment of system engineering, but its name implies, is the identification of the
rather to present aspects of the theory of various concepts or basic system configura-
computer aided design, with emphasis on tions that might meet the system objectives.
optimal design. The simplified model of a It is desirable in this step to leave the
system engineering process shows that opti- concepts as general as possible so as not to
mal design is a part of system engineering, eliminate candidate systems that might be
but, indeed, by no means the dominant part. very effective. For example, if the function to
The purpose of this paragraph is to discuss the be performed is to propel a vehicle over the
interface of optimal design with the remaining surface of the earth, conceptual designs might
essential elements of system engineering. include wheels, tracks, legs, air cushion, etc.
System engineering begins with the identifi- It is important at this time to identify
cation of a need by a potential user of the ranges of values of parameters describing the
system to be developed. It is often the case system so that, for any parameter in this
that the user knows that he needs a system to range of values, the system will perform the
do a job, but he may have difficulty in stating functions identified in the previous step, i.e.,
his needs and objectives quantitatively. It the set of parameters describing admissible
then becomes the joint responsibility of the systems is identified. It is at this time that the
system engineer and user to quantify system experienced designer can be extremely valu-
objectives so that a meaningful set of objec- able in reflecting state-of-the-art capabilities
tives may be established for the development of technologies involved in the system devel-
to follow. opment.
Once the needs and objectives for a system The optimal design step has as its objective
are identified, it is necessary to define func- the choice of the undetermined parameters
tions that must be performed by the system identified in the previous step. These param-
and any subsystems that are required. This eters must be in the ranges defined by
process is called function analysis, and its technological limitations and system func-
purpose is to pick out functions or operations tions. The criterion for choosing system
that must be performed in order to accom- parameters is maximization of system worth
plish the mission required of the system being or value. It should be emphasized that a
developed. These functions then become mathematically precise optimum may be im-
lower level objectives for the development of possible to attain and must therefore serve
subsystems. Identification of functions tends only as a goal. Methods for choosing system
to be qualitative in nature. However, once a parameters should, however, have the prop-
function or operation is identified, it must be erty that if an optimum does exist, then given
described in quantitative terms, if at all enough patience and computer time, that
possible. For example, if a function must optimum should be approached as a limit.
occur quickly, the time allowed should be
specified. What appears to be the final step in the
system engineering model of Fig. 1-1, Descrip-
The next step shown in Fig. 1-1 is one of tion, is, in reality, probably just an inter-
the most difficult functions in system engi- mediate step. Unless the system design pro-
neering and certainly the most difficult step cedure has been unusually effective, the sys-
1-3
AMCP 706-192
tern decided upon will probably not satisfy it applies to the mechanical sciences. There
the user. More likely, it will probably not are peculiarities of mechanical design, as
satisfy the system engineering team. Having opposed to classical control system design,
the results of one pass through the system which require specialized treatment. Further,
engineering process, the user can probably the mathematics involved in mechanical sys-
remember some constraints which he forgot tem design is quite different from the math-
to specify and which the optimum system ematics of control theory. These distinctions
violates. The designer probably also will see are highlighted throughout the text.
concepts that he did not see before. Much as
the user, he too will remember technological In the chapters that follow, optimal control
constraints which he forgot to specify and theory is interpreted as treating feedback
which the optimum system violates. Finally, controllers; i.e., an optimal control system has
the sponsoring activity will undoubtedly de- active elements that sense errors in output,
cide that it will be all right to decrease the due to fluctuations in inputs, and adjust
measure of system value a small amount if it system controls so as to maximize some
will save some money. measure of system performance. Optimal de-
sign, on the other hand, is taken as the
The next step in the procedure is for each problem of choosing system elements or
member of the team to take a deep breath, parameters describing these elements, which
sigh and go back to work, armed with his are fixed for the life of the elements, so that
hard earned new knowledge. It is for this the system is optimum in some sense. In
purpose that all the feedback paths in the control literature this is called open loop
model of Fig. 1-1 are shown. This iterative control. The principal difference in the two
procedure is then continued until the sponsor- problems is that the variables chosen in the
ing activity decides that the system developed optimal design problem are fixed for the life
is what it really needs. This will probably be of the system, whereas variables in a feedback
another human decision, rather than a pro- control device are to be adjusted according to
grammed mathematical one. inputs as the system operates. Mathematical-
ly, the difference in the two results is that the
The remaining chapters will be devoted to control law describes how the system vari-
the problem of computer aided and optimal ables should be adjusted as a function of the
design. If the design methods presented later state of the system, whereas an optimum
are to be of maximum value to the reader, he design is simply a set of parameters describing
must have a clear picture of how these system elements and will not be changed
methods fit into the larger problem of system during the life of the system. This distinction
engineering. For further literature on the is not uniform in the control literature but is
basic ideas involved in system engineering, see used here to identify the class of problems
Refs. 3, 4, and 5. treated.
1-4
AMCP 706-192
of this kind. For example, in designing a techniques available for design optimization.
structure one must be concerned with stresses Since the subject of this handbook is com-
due to applied loads. These stresses are puter aids to design, the practical distinction
interpreted as the state of the structural is made here. For a unifying mathematical
system. They are determined by a boundary- treatment, the reader is referred to Ref. 7.
value problem that cannot be interpreted as a
sequential process (initial-value problem). In Finally, it is important to realize that
some design problems it is possible to define engineering design optimization and engineer-
auxiliary variables so that the governing equa- ing analysis are fundamentally different in
tions form an initial-value problem with addi- nature. In analysis, one is generally assured
tional constraints. This procedure, however, that a solution exists and numerical methods
generally complicates the problem unneces- are generally stable. In optimal design, on the
sarily. For this reason the problems in other hand, existence of even a nominal
succeeding chapters are formulated as bound- design satisfying objectives is not assured,
ary as opposed to initial-value problems. much less existence of an optimal design.
Moreover, even when an optimum exists,
In order to illustrate the use of the meth- numerical methods for its solution are often
ods presented, applications are made pri- quite sensitive to initial estimates and require
marily in optimal structural design. Applica- much computational art for iterative con-
tions are chosen to illustrate the use of the vergence. These properties will be observed
methods on problems having a number of over and over in this handbook when example
design variables which might be found in problems are treated.
engineering applications. Further, since many
of the methods are relatively new, it is It is important that the reader take a
anticipated that improvements in computa- mathematical outlook when doing computer
tional efficiency may be realized in specific aided design and optimization. A purely
problems if advantage is taken of special intuitive approach can lead to erroneous
features of the class of problems treated. results that may not be apparent until some-
one happens onto a nominal design which is
It is appropriate to highlight a significant vastly superior to a "supposed" optimum
computational distinction between two design.
classes of design problem. The reader may
note that Chapters 2 through 5 of this text 1-4 MATHEMATICAL PRELIMINARIES
deal with problems in which system design
and performance are specified by a finite The level of mathematical background re-
number of parameters (real numbers). Chap- quired for an understanding of the methods
ters 6 through 9, on the other hand, deal with of optimal design presented in the following
systems that are described by functions on chapters is a course in advanced calculus and
some given space or time domain. Mathemati- the ability to use matrix notation. Since
cally, these problems are called finite and engineers often require results of rather deep
infinite dimensional, respectively. Optimiza- mathematical analyses to solve real-world
tion theory for these two classes of problems problems, several results have been accepted
can be put in the same form, but there are with references given to proofs. The purpose
very real differences in the computational of this paragraph is to present notation and
1-5
AMCP 706-192
some basic mathematical ideas used through- Two vectors are called orthogonal if their
out the text. inner product is zero.
Unless otherwise noted, all vector variables Just as the idea of collecting n real numbers
will be column vectors. A vector of the form into a vector in R", it is helpful to define a
(Eq. 1-1) may be interpreted as a point in vector functiong(x) for x&R" as
n-dimensional real space, R" . The spaced" is
simply the collection of all n-vectors of real
numbers. For example, the real line is A1 and g(x)- (1-4)
the plane is R2.
g(x)<0 (1-6)
and is called a norm on". There are many
norms defined on" but Eq. 1-2 will be
where inequality is taken componentwise, i.e.,
sufficient for the purposes of this text. Along
Eq. 1-6 is defined to mean the same thing as
with the idea of norm on Rn goes the concept
Eq. 1-5.
of dot product or inner product. The inner
product of two elementsx andy oiRn is One of the most useful notations in the
(1-3) following chapters is the idea of the derivative
x, y > =x'y
i -1 of a vector function with respect to its vector
1-6
AMCP 706-192
dfjx) (1-9)
V/(x)s In many places in the following chapters,
dx
Taylor's Theorem will be used to obtain an
approximate expression for a function at a
The gradient is one of the few standard point sufficiently near a point where the
symbols which denotes a row vector rather function is known. The most common ap-
than a column vector. Likewise, for a real proximation is the one obtained by deleting
valued function the matrix of second deriva- second and higher order terms. For example,
tives may be defined as the matrix if \\x ~y\\ is small,
d2f(x) d2f{x)
= 72f(x) = (1-10)
f(y) -fix) =* (y-x) (1-12)
dx2' OX; OXj
dx
An important theorem in the analysis of where by Eq. 1-11 the error in Eq. 1-12 is
functions appearing in optimal design prob- at most a constant times \\y x\\2 if f(x) has
lems is Taylor's Theorem. bounded second order derivatives. The left
side of Eq. 1-12 is generally denoted by
Taylor's Theorem: Let the real valued 8f(x), where y x is denoted Sx. In this
function fix) have k + 1 continuous deriva- notation,
tives in Rn . Then for xeR", there is a point
= ax + (1 u)y with 0 < a < 1, such that
;
/<*) dx Sx.
' (1-13)
" ofix)
/O0 =/(*)+ 2 -^-t-ty-x,) (1-11) Eq. 1-13 may be thought of as a total
; = 1 dx- ' '
differential. Even for vector functions g(x),
1 n n d2f(x) Eq. 1-13 holds for each component so if
+
2 2 . ' : ' (yt -x;.)0,.-x.)
2 /'=1 1=1 3jC; 9X;
8g(x)=[d8l(x),.. ., 6gm(x)]T, then
1-7
AMCP 706-192
dg(x,z) dgt{x,z)
dx dx-
m X n
and
dgjx.z)
dz dz,
_I m X p
Figure 1-2. Structural Requirement
Most of the common notation used in later
chapters now has been defined. Special nota-
tion and results required locally for some problem is to design a structure that supports
development will be defined and used there. the device under consideration and which is as
light as possible for purposes of transporta-
1-5 ILLUSTRATIVE MILITARY COM- tion and erection on the battlefield, or per-
PUTER AIDED DESIGN PROBLEMS haps mounting on a helicopter. A basic design
requirement for this structure is that the
In this paragraph two illustrative military device mounted on the top shall not have an
optimal design problems are formulated, and angular deflection of more than 6 radians, in
computer aided design techniques are out- order to hit the receiver or target. The loading
lined for their solution. The treatment here is that is to be considered is a wind load of up
only for the purpose of introducing concepts. to a given velocity, which would cause angular
These examples are treated in more depth in deflection of the top of the tower.
Chapters 7 and 8.
The needs and objectives in this design
1-5.1 OPTIMAL DESIGN OF STRUCTURES problem are well established, so no additional
inputs need be considered at the present time.
The optimization technique described in Further, the requirement that the tower
this paragraph was initially developed for support the device with only a given allowable
application to minimum weight structural angular deflection is the only basic function
design problems. For this reason, and to give required of the tower; thus the function
an engineering feel for application of the analysis block of Fig. 1-1 is also complete.
technique, the method will be presented along The next stage, and one that is quite difficult
with examples from the field of optimal to describe analytically, is that of arriving at
AMCP 708-192
conceptual towers which might perform the Figs. 1-3(A) and (B), involve rigidly fixing the
given mission. tower at its base to the fundamental support-
ing structure. In both towers, variable spacing
Four different conceptual designs are as a function of height is allowed between
shown in Fig. 1-3. The first two concepts, vertical members of the structure. In addition,
*-Z6
1-9
AMCP 706-192
In addition to the design variables, a main ture, it is required that the design variables be
part of the design problem is the behavior of bounded uniformly away from zero. This is
the structure under wind load, since one of given formally by the inequality
the major constraints on behavior of the
structure is that the angular deflection of the b >b
i io> 0,i=\,. ,m. (1-19)
top of the tower not exceed an angle 6. For
this reason, the angular deflection of each of The fundamental constraint in the present
the joints must be determined, along with problem is that the angular deflection at the
lateral deflection due to lateral wind loading. top of the tower not exceed the angle 0. This
This is a relatively routine analysis problem is expressed analytically by the inequality
when one uses the techniques of finite ele-
ment structural analysis. Not shown in Fig. u,u e. (i-20)
1-3, but required in the construction, are
cross members which maintain spacing of the The final step in formulation of an optimal
main vertical members. In order to state the design problem is to identify the cost func-
optimal design problem mathematically, first tion to be minimized. In the present case, the
define vectors of design variables b( and state cost function is structural weight J and is
variables z, given by an expression of the form
b=[b,,b2,. ..bm]T
(1-16) / = 7.2 cibi (1-21)
Z = [Z,, Zi, -z
n]
Using finite element structural analysis tech- where y is material density and c( are weight-
niques, define the stiffness matrix as ing factors representing lengths of structural
elements and weight requirements for lateral
A(b)=laij(b)]nxn (1-17) stiffners.
where the dependence of stiffness on the We now have an optimal structural design
design variables is explicitly shown. Using this problem that is well formulated from a
matrix, the structural response is given by the mathematical point of view. The objective is
following matrix equation to find the design variables bt through b
that satisfy constraint Eqs. 1-19 and 1-20, and
A(b)z=q (1-18) which minimize the structural weight as given
by Eq. 1-21. The technique used to solve this
where q is the wind loading matrix. problem, and in fact a large class of optimal
system design problems, is based on a very
Now that the relationship between the simple idea of engineering design. The idea of
design variables and the structural response is the technique is to allow small variations in
specified by Eq. 1-18, the next step in some nominal design, and analyze the effect
formulating an optimal design problem is the of these variations on the equations of the
identification of constraints. In order to problem and the cost function associated with
prevent dimensions or structural areas from the problem. As a result of allowing only
going to zero, resulting in an unstable struc- small design changes, certain approximations
1-10
AMCP 708-192
may be made that allow the best change in 66,. >*,-*,- (1-24)
design variables to be determined in order to
decrease the cost function of the problem as Or, if the angular deflection constraint is
much as possible, while still not violating violated, for example,
constraints of the design problem. For
example, one might choose as an initial zi > e (1-25)
estimate of the optimal design a uniform
tower as shown in Fig. 1-4. The estimated then, to correct the constraint error it is
design variable in this case is denoted by b^\ required that
1-11
AMCP 706-192
1-12
AMCP 707-192
1-5.2 APPLICATION OF THE STEEPEST (A) One Design (B) Two Design
DESCENT METHOD IN INTERAC- Variable Variable
TIVE COMPUTER AIDED DESIGN Figure 1-7. Tower With Base Simply Supported
and Top Supported With Guy Lines
Very often in design problems, it is not
practical to specify a unique cost function to Consider, for example, the problem treated
be minimized, hence the formal optimization in par. 1-5.1. The initial estimate of the
problem described in par. 1-5.1 does not optimum tow er was taken as a uniform tower.
apply directly. The fact that the vector B in The components of the vector 6b can be
Eq. 1-28 is a direction of steepest descent, projected on a cathode ray tube, along with a
however, is extremely valuable information to picture of the structure as shown in Fig. 1-8.
a designer. The application of this informa- The algebraic sign of the components of 8b,
tion to a structural design problem, using corresponding to each of the design variables,
interactive graphics, is a technique which is an indication of the effect a change in that
shows considerable promise in design. design variable will have on the cost function
TABLE 1-1
WEIGHTS OF TOWERS
Number of
Design
Variables
1-13
AMCP 706-192
861
1-14
AMCP 706-192
personnel in making design changes that will tires. A photograph of the first prototype of
have desirable effects on overall aircraft struc- this weapon is shown in Fig. 1-10.
tural properties, for example. This is extreme-
ly important in large-scale structural design The recoil mechanism for this weapon was
due to the difficulty in determining the effect designed according to traditional recoil mech-
of changes in an individual design parameter anism design goals. Namely, the objective in
on several different structural properties. the design was for a constant retarding force
Computation of these data and interactive which is transmitted by the recoil mechanism
aspects of the technique are discussed in to the undercarriage, as shown in Fig. 1-11. A
Chapter 5. recoil mechanism was designed which de-
livered approximately this recoil force R(t) as
This design technique is feasible from a a function of time.
computational point of view in that very little
additional computer time is required to When the weapon was built and fired, a
generate sensitivity information from stress nearly constant recoil occurred, as desired;
and vibration analyses that are required. While but, at high angles of fire, the weapon
most structural optimization work has been exhibited unacceptable dynamic response.
done in the batch mode, it is shown in During firing, the tires of the weapon com-
Chapter 5 that utilization of the steepest- pressed and after firing and the subsequent
descent technique with interactive graphics is release of the recoil forces, the weapon
a much more practical way to design struc- rebounded off the ground approximately 6 in.
tures, particularly in cases where several This unacceptable behavior required a re-
measures of structural performance are im- design cycle for the recoil mechanism with a
portant. design goal of minimizing the dynamic re-
sponse, or hop, of the weapon after firing.
Development and display of sensitivity
information in design is a form of information It was determined that the peak recoil
transfer to design personnel. This technique force could be allowed to reach 22,000 lb
depends on the availability of interactive without damaging the support structure. The
graphics software and hardware, which are optimization problem is then to determine
currently being developed. the recoil force R(f) as a function time such
that
1-15
AMCP 706-192
problem, the dynamic response h (t) is deter- The same philosophy of small design
mined by the second order differential equa- changes about some nominal estimate, as in
tions of motion of the artillery piece. the structural design problem of par. 1-5.1,
was employed in this case. A sensitivity
R(t)
22,000
20,000 -
Figure 1-11. Traditional Recoil Design Goal Figure 1-12. Recoil Distribution in Time
1-16
AMCP 706-192
REFERENCES
1. The American College Dictionary, Ran- Systems, John Wiley & Sons, New York,
dom House, New York, 964. 1962.
2. C. Y. Sheu and W. Prager, "Recent Devel- 5. M. K. Starr, Product Design and Decision
opments in Optimal Structural Design", Theory, Prentice-Hall, Englewood Cliffs,
Appl. Mech. Rev., Vol. 21, No. 10, Octo- New Jersey, 1963.
ber 1968, pp. 985-992.
6. C. Goffman, Calculus of Several Variables,
3. H. H. Goode and R. E. Machol, System Harper and Row, New York, 1965.
Engineering, McGraw-Hill, New York,
1957. 7. D. G. Luenberger, Optimization By Vector
Space Methods, John Wiley & Sons, New
4. W. Gosling, The Design of Engineering York, 1969.
1-17
AMCP706192
CHAPTER 2
When one speaks of optimal design, he Example 2-1 is included here as an aid to
2-1
AMCP 706-192
2-2
AMCP 706-192
The function fix) has a strict absolute In case x is in R", results analogous to
maximum at x = 1, absolute minima (not those just obtained are given in Theorem 2-1.
strict) at x = 0 and 2, relative maxima at x = 1
and 3, and relative minima atx = 0 and 2. Theorem 2-1: Necessary Condition: Let
fix) be defined on a subset D ofR" and have
In Definitions 2-1 and 2-2 no continuity or a continuous derivative in a neighborhood of
differentiability requirements were placed on a point x which is in the interior of D. If fix)
2-3
AMCP 706-192
2-4
AMCP 706-192
This theorem does not hold, in general, if This theorem is of obvious value to the
any of the hypotheses are deleted. For ex- designer. It assures him that if his design
ample, consider the function/(x) = x onD = problem satisfies the hypotheses of Theorem
(x\ 0< x < 1). D is not closed and/O) does 2-3 and if he has found a relative minimum
not have an absolute minimum in D. If then he is through; he has also found the
Z)={x|0<x< 1} then D is closed and fix) absolute minimum.
has an absolute minimum at x = 0.
Computational methods for finding ex-
Note: The hypotheses of Theorem 2-2 may be trema based on the theorems of this para-
weakened by demanding that fix) be only graph generally involve the solution of non-
lower semi-continuous rather than continu- linear algebraic equations. In particular, Eq.
ous. For proof, see Ref. 1, page 58. 2-3, which is in general nonlinear, can be
solved by a numerical method to locate all
Theorem 2-3 depends on the concept of admissible interior extrema. Methods for solv-
convexity. ing such equations are given in Ref. 3,
Chapter 2. It generally has been found,
Definition 2-4: A subset D of R" is called a however, that direct methods for finding
convex set if whenever x andj are inD, then extrema are superior to the solution of Eq.
the straight line segment x + 6iy x), 0< 8 2-3. For this reason no computational
2-5
AMCP 708-192
methods based on the indirect method will be The object here is to treat more general
presented here. functions, but it is possible to make a
quadratic approximation to / [xM + ots]
It will be the purpose of the remainder of which will hold near the minimum point.
this chapter to present methods that the Then, the minimum point of the approximat-
designer may use to locate interior relative ing function, which may be easily found, is an
minima. Relative minima on the boundary of approximation of the true minimum point.
the admissible region will be treated in Chap-
ters 3 and 4. The quadratic approximation of / [xM +
asj is constructed by passing a quadratic
2-3 ONE-DIMENSIONAL MINIMIZATION curve in a through three computed values of
the function. The distance between the three
In the direct minimization methods to trial points will be 6 > 0, where 6 is initially
follow, a multidimensional minimization chosen to be a small fraction of the expected
problem will be reduced to a sequence of one- range of a. It is known, however, that if the
dimensional minimization problems; i.e., the starting point of the process is quite far from
problem of determining a scalar a so that a the minimum point then the minimum point
given function^(a) will be a minimum. of the approximating function may not be
near the true minimum point. To prevent
In the problem of minimizing f(x) for x in making large, inaccurate steps in this case, a
R", all the methods of solution presented in maximum allowable step size A is chosen
this chapter are based on successive improve- before the process begins. A reasonable choice
ments in certain directions; i.e., at a point x(,) of A is 50% of the expected range of a.
one finds a direction, s, in which f(x) de-
creases. The object is now to move along the The following algorithm implements the
vector x(,) + as, by adjusting a, a > 0, until procedure described:
f(x) is as small as possible. The resulting point
is then called x('+1 \ and the entire process is Step 1. Define a0 = Oand/ = 1.
repeated. It is clear that the intermediate
problem of determining a so as to minimize Step 2. Compute
/(x(,) +as) is one-dimensional. This paragraph
will be devoted to presentation of methods h =/[x('>+(/-' -6)s
for solving the one-dimensional problem.
/o=/[*(0+/-1 s]
2-3.1 QUADRATIC INTERPOLATION
h =/[JC(0+(a/-1 +6)s].
(,)
If the function /Qc + as) of the scalar
variable a x^ and the unit vector s are Step 3. A quadratic polynominal in a
fixed were quadratic in a,then the value of a1'1 = z is fitted through ( 6, /,),
a which minimizes the function could be (0, /o), (6, fi). Its minimum is
found by setting
2
-~2(/1-2/0+/2ylt/l lu
2-6
AMCP 708-192
straight line with minimum at ? = a = (k 1)5. If the initial step 6 was too
A , depending on which of/j and small, many steps will have to be made before
f2 is smaller. the minimum point is located.
2-7
AMCP 706-192
The Fibonacci search method has the prop- Step 3. Compare/ [JC(0 +aas] and/ [xu)
erty of being best in a certain sense among all + ubs] and go to Step 4, 5, or 6.
search techniques which isolate a in an
interval. A measure of the effectiveness of any Step 4. If/ [x(,) + <y] < / [x(,) +abs] ,
then a2 < a*'-1 < <xb. By the choice
such technique is the ratio of the length of
the largest interval in which a may lie after n of a and ab , the new points s =
steps to the length of the original interval a,' and a'u = a,boahave a'. = a .
which contained a. It is shown in Ref. 4, page Compute now / [JC(,) + a'as]
253, that if/ [x(,) + as] has a unique relative where a'a = ae + 0.382 (a'u - a't).
minimum as a function of a,then Fibonacci Go to Step 7.
search minimizes the number of interval
Step 5. If/ [JC<'> +aes] > f [x + abs],
partitions.
then aa s a(,) < a,u . Similar to the
procedure in Step 4, put a'g = aa
The Fibonacci search technique may now
and a' = au so that a.'a = otb.
be given in the form of a computational
algorithm: Compute/[.x^'' + a.'bs] where ot'b =
0^+0.618(0^-^).
Step 1. First an upper bound must be Go to Step 7. -
found for a.,au. It is clear that 0 is
a lower bound, ae. For a chosen Step 6. If/ [x(,) +aas] = / [x(0 + <V]>
small step size 5 in a,let j be the put ae = aa and aj, = 6 -
smallest integer such that Return to Step 2.
a0 =;S 5(1.618)*.
fc = o The simplest and probably the best known
of the direct methods of minimization is the
Step 2. Compute/ [x(,) +
<xbs], where Method of Steepest Descent (or Gradient).
This method is based on the fact that if the
aa =ac +0.382(au -as) cost surface is smooth, then its tangent plane
is a good approximation to the surface near
a6 =ac +0.618(au - a8). the point of tangency. The philosophy of the
2-8
AMCP 706-192
Method of Steepest Descent is apparent in its as a better estimate of the minimum point
title. One wishes to change x(,) by an incre- and the process is continued until 7/ [x(y)] =
ment dx in such a way that /(x), x = x(,) + 0 oxdx is sufficiently small. This method may
dx, is reduced as much as possible for a given be given in compact form as the steepest
length of increment. The direction of the descent algorithm:
increment dx is called the direction of steep-
est descent. Step 1. Make the best engineering estimate
x'0) of the minimum point.
The direction of steepest descent is given
by Theorem 2-4. Step 2. Compute V/[x(0] and define a
normalized gradient s =
Theorem 2-4: Let f(x) be differentiable in II ?/[*"> Ill vW'l-Find^
R". The direction of steepest descent at a a(') which minimizes / [x(i) + as ]
point x is (where i is the number of iterations
completed). If V/ [x(0J = 0, ter-
dx = -aV/r(x) (2-4)
minate the process and x(z) is a
relative minimum point.
where a > 0 is a scalar factor.
The proof of Theorem 2-4 illustrates clear- Step 3. Put x(i+1) = x(0 - a.0)s. If 11 (0
ly that the direction of steepest ascent is and IIV / [x(,+ 1 *] || are less than
predetermined limits, terminate
dx = aVfT (x) (2-5) the process and let x(,+ 1) be the
approximation to the minimum
for a > 0. The reader should note carefully point. Otherwise return to Step 2.
that Eqs. 2-4 and 2-5 give only the direction
in the design parameter spaced" which yields It is interesting to note that successive,
the maximum rate of change of f{x). Since directions of steepest descent are orthogonal
the factor a is not determined explicitly, the to one-another in this algorithmi.e.,
size of step is not specified. V/[x (' + 1)] V/r[x(,) =0]. To see this, recall
that a(,) is chosen so that /[jc(i) as] is a
In order to start the steepest descent minimum in a. The necessary condition of
iterative technique, the designer makes the Theorem 2-1 then requires
best estimate of the design parameter avail- df 1 9/
able, x(0). The gradient V/[x(0>] is then 0= 2 ix (1 + 1)1
da 7/U (0, / = 1 ox-
computed at x(0) and a new point x(1) is
-0)1 =
1
determined by
dx, V/ " [x('> ]
7
2-9
AMCP 708-192
2-10
AMCP 706-192
In order to utilize Taylor's formula in- An easy way of improving the estimate of
cluding second degree terms, the following the minimum point is to change the length of
2-11
AMCP 708-192
the step Ax without altering its direction. The H(x) may not exist unless H{x) is
scalar a ~ 1 will be determined by methods of strictly positive definite.
par. 2-3 so as to minimize / [x(0) +a&x].
2. In nonconvex problems an iteration
This procedure may now be put down in does not necessarily decrease /[x(,)]
the form of a computational algorithm called when the current iterate x(r) is not near
Generalized Newton Method: the minimum point.
Step 1. Make an engineering estimate x(0) 3. For many engineering problems, H(x)
of the minimum point of f(x). will be extremely messy if not im-
possible to compute efficiently.
Step 2. Compute
2-12
AMCP 706-192
second-order derivatives at each iteration (x is vectors S' are linearly independent. To see
in/?"). In most engineering design problems that this is true, form the linear combination
this is an extremely tedious, if not impossible,
task. Further, the Generalized Newton Meth-
2 a,S' = Q,
od is not a learning process. j'=0 '
The methods presented in this paragraph where the a. are scalars. Multiplying this sum
require the computation of only first deriva- T
on the left by S' A yields
tives. However, by making use of information
obtained from previous derivatives, con-
vergence is speeded as the minimum is ap- "z a.S'7AS' = a,SiTAS' = 0
i=o ' '
proached. In fact, as one of the methods
progresses, it develops an approximation to and since S' AS' = 0, a. = 0. Since ;' was
the matrix of second derivatives. In this arbitrary, a;. = 0,/' = 0, 1,.... n 1, and this is
respect the methods here have the desirable just the definition of linear independence.
features of both the Method of Steepest
Descent and the Generalized Newton Method. Consider now the problem of minimizing
the convex function
All Methods of Conjugate Directions are
based on the philosophy "If a method works f(x)=BTx+-xTAx (2-9)
well in minimizing all positive definite qua-
dratic forms, then it ought to work pretty where x is in./?", B is an n x 1 matrix and^l is
well on any smooth cost function." To be a symmetric, positive definite, n x n matrix.
more specific, Conjugate Gradient Methods The central idea of all methods based on
are guaranteed to minimize any positive conjugate directions is contained in Theorem
definite quadratic form in n iterations (the 2-6.
design parameter is in R"). Although this
ideal behavior will not carry over to general
Theorem 2-6: Let S, .... S*1-1 be nonzero
cost functions, since a convex cost function
vectors in R" which are conjugate with
often looks very much like a positive definite
respect to the positive definite matrix A.
quadratic form, similar behavior could be
Choose scalars X = A('\ i = 0, ..., n \,
expected. Experience has shown that this is
successively which minimize
the case.
Since A is positive definite, the conjugate The two methods that follow are simply
2-13
AMCP 708-192
and
gk=VfT[x(k)] =AxM+B,
vl TAS
10 or from the proof of Theorem 2-6,
STAS
k-\
AssumingS1 ,..., 5* areA-conjugate, put gk =gi+1 +A S X<e)S8 (2-14)
2 = /+ 1
2-14
AMCP 706-192
yk+\T k + lT fe'+i
, ,+ 1 - >)
Thus, the g\ i = 0, 1, ..., n 1 are linearly g^i'AS' 8^' g
_ 7/[x(i+1)]y/7U"+1)]
Now, V/[x(]V/rtx('>] '
2-15
AMCP70S-192
Step 4. Terminate the process if little insight into use of the method. For a
IIV/[*(''+1)]llandlU<!'+1) _x('>|| direct proof of convergence, etc., the reader is
are sufficiently small. Otherwise, referred to Ref. 7.
return to Step 2.
The computational algorithm is:
When this algorithm is applied to problems
in which f(x) is not of the form of Eq. 2-9, Step 1. Make an engineering estimate x(0)
convergence will not occur in n steps. of the minimum point and choose
Fletcher and Reeves recommend that after n a symmetric positive definite
steps the algorithm should be "restarted", i.e., matrix H(0).
(0)
x( + i) should be treated as x in the
algorithm. In a sense, the first few iterations Step 2. For i = 0,..., compute
of the algorithm build up information about
the curvature of the cost surface. After n S=-^>V/r[#].
iterations, this information is discarded and a
new estimate of curvature is built up during Step 3. Compute a = oS'^ which minimizes
(/ (i)
the next n iterations. This method then does /[JC > +aS ].
not accumulate information about curvature
of the cost surface over the entire iterative Step 4. Compute
process.
aU) =aV)S(i)
2-6.2 THE METHOD OF FLETCHER AND
POWELL x+i)=x(/)+0(fl
The directions S^'\ generated by the al- Step 5. If || V/[x('+1>] || and ||x(,+ 1>
gorithm that follows, are conjugate if f(x) is Ai)v are sufficiently small, termi-
of the form of Eq. 2-9. This proof is given in nate the process. Otherwise return
Refs. 7 and 9. In Ref. 6 it is shown that the to Step 2.
method of Fletcher and Powell fits naturally
into a large class of conjugate direction Fletcher and Powell (Ref. 9) prove that this
methods. The derivation is tedious and lends algorithm has the following properties:
2-16
AMCP 706-192
2-17
AMCP 706-192
wise return to Step 2 with the /2O1, x2, x3, x4) = (xy + 10x2)2
same set of s'\ j = 1,,.., n.
f(yn +sis)
+ KK*! -x4)4
Put
and
xO+l) =yn +s
/3(x1(^x3)=x^ +2x; +:
(2-23)
Terminate the process if IIx 0+1)
x (0| is sufficiently small. Other-
wise return to Step 2 with the new
set of vectors ^ ..., xm~l, sm + l, The reader should verify that each of these
..., s", s functions has a strict absolute minimum
point. These points are (1,1), (0,0,0,0), and
For a discussion of use of the (0,0,0), respectively. Each iterative method
algorithm, see Ref. 11. will be started at points ( 1,1), (1,1,1,1),
and (1,1,1) for Eqs. 2-21, 2-22, and 2-23,
respectively. These functions will all be mini-
2-7 COMPARISON OF THE VARIOUS mized by each of the methods of pars. 2-4
METHODS through 2-6. The stopping criterion will be
that each component of the independent
During the development of the methods variable must be within 10" 2 of the known
presented in this chapter, theoretical advan- minimum point.
tages and disadvantages have been pointed
out. As a concrete test of these methods,
three functions will be minimized. Two of the Results will be presented in tabular form so
that a comparison of the behavior of each of
functions to be treated are terribly behaved
the methods may be made. For the sake of
and pose a meaningful test to any general
uniformity, each table will include the itera-
minimization technique. These functions re-
tion number i, the iterate x(,) = [x^, ....
semble a very deep valley at whose bottom (<)i T , and the value of the cost function.
the curvature in two orthogonal directions is
radically different. The third function is
quadratic and poses no serious obstacle to any 2-7.1 METHOD OF STEEPEST DESCENT
reasonable method. More specifically, these
functions are
2-7.1.1 COST FUNCTION: f^x) = 100(x2
/,(*,, x2)=100(x2 -x2)2 -x2)2 +(1 -x,)2.
(2-21)
+ (1 -x,)2 Exact solution: (1,1), /i(l,l) =0
2-18
AMCP 706-192
TABLE 2-2
It should be noted that the Steepest De-
STEEPEST DESCENT METHOD - ITERATIVE scent Method decreased the cost function
DATA FOR COST FUNCTION f2 (x) rapidly on the first iteration but in the first
flx(i)] 01 to (i) XA (I)
two problems failed to converge to the
*2
minimum point. That is typical behavior for
0 122.0 1.0 1.0 1.0 1.0 this method, particularly in problems for
1 16.43 0.9055 0.055 1.0 1.0 which the cost function has a long sharp
2 16.31 0.9019 0.023 0.9958 0.9581
5 16.03 0.8925 -0.0498 0.746
valley. It should be clear that blind use of the
0.969
6 15.06 0.886 -0.0756 0.923 0.463 Method of Steepest Descent can yield poor
9 12.25 0.641 -0.063 0.699 -0.156 results.
10 3.00 -1.048 0.0746 -0.1608 -0.9197
11 2.006 -1.039 0.1522 -0.258 -0.815
12 1.380 1.043 0.078 -0.298 -0.752 2-7.2 GENERALIZED NEWTON METHOD
13 1.188 1.033 0.1127 -0.3276 -0.7067
14 1.047 1.021 0.090 -0.362 -0.634 2-7.2.1 COST FUNCTION: /, (x) = 100(x2
15 1.041 -1.015 0.0949 -0.368 -0.619
16 1.040 -1.012 0.0960 -0.370 -0.611
-x\)2 +(1 -x,)2
38 1.039 -1.008 0.0967 -0.373 -0.603
74 1.039 -1.008 0.0968 -0.373 -0.6019 Exact solution: (1,1), ft (1,1) = 0
2-19
AMCP 708-192
W
/ Mx'"] *.'" x
x
7 2-7.2.3 COST FUNCTION: f3(x) = x] +2x
0 404.0 -1.0 -1.0 + 2x\ +2x!X2 +2*2*3
1 3.981 -0.9950 0.9869
2 3.403 -0.7919 0.5832 Exact solution: (0,0,0), /3(0) = 0.
3 2.588 -0.5248 0.2241
4 1.549 -0.1832 -0.5105
5 0.953 0.0887 -0.0271 TABLE 2-6
6 0.473 0.3642 0.1063
7 0.203 0.5955 0.3347 GENERALIZED NEWTON METHOD -
8 0.0531 0.8020 0.6315 ITERATIVE DATA FOR COST
9 0.0042 0.9536 0.9049 FUNCTION f3ix)
10 0.0002 0.9900 0.9810
11 2x 10 "6 1.0003 1.0007 f3[xu)] (/) (/I (/I
2-20
AMCP 706-192
8
0 000556
0001091
0 0297
0 03519 0
0 0029
003519
0 068/
0 02338
000?7
0 02503
~x2)2+(l -*,)2
0 000517 0 035322 0 03498 0 0231 12 0 0231 19
0 0005 17 0035318 0 003530 0 073108 0 0231 19
1 K 10-7 0 035305 0 003528 0 023011 0 0731 15 Exact solution": (1,1), /, (1,1) = 0.
2-21
AMCP 706-192
x1'! (/')
2-22
AMCP 708-192
h [xW] v 1 li)
i x4 li)
x
CONJUGATE DIRECTIONS WITHOUT 2 *3 dt
DERIVATIVES METHOD-ITERATIVE DATA
FOR COST FUNCTIONS, M 0.4421 0.0469 0.0127 0.2799 0.4284
2 0.1415 0.0469 0.0127 0.2799 0.2423
li) li) 0.1418 0.0510 0.0127 0.2799 0.2423
fx\x Oh x2
0.1210 0.0510 -0.0015 0.2799 0.2423
0.0498 0.0510 -0.0015 0.1875 0.2423
0 404.0 -1.0 -1.0
0.0246 0.0510 -0.0015 0.1875 0.1749
100.1 0.0049 -1.0000
3 0.0082 0.0536 -0.0104 0.1291 0.1324
0.9902 0.0049 0.0000
4 0.0020 0.638 -0.0181 0.0794 0.0882
1 0.9902 0.0261 0.0211
5 0.0018 0.1322 -0.0147 0.0892 0.0940
0.9485 0.0261 0.0007
6 0.0010 0.0828 -0.0109 0.0580 0.0603
0.9402 0.0429 0.0174
7 0.0005 0.0412 -0.0057 0.0377 0.0322
2 0.7922 0.1287 -0.0016
8 0.0000 0.0078 -0.007 0.0058 0.0050
0.7022 0.1815 0.0509
0.6172 0.2147 0.0436
3 0.3958 0.4058 0.1440
4 0.2895 0.4785 0.2422 2-7.5.3 COST FUNCTION: &) = x] +\
5 0.2591 0.5308 0.3015
+ 2x2, +ill +2fi|
6 0.0770 0.7258 0.5225
7 0.0282 0.8564 0.7246
8 0.0125 0.8942 0.8033
9 0.01 19 0.91 16 0.8373
10 0.0116 0.9039 0.8218 Exact solution: (0,0,0), f3 (0) = 0.
11 0.0125 0.9469 0.9065
12 0.0042 1.0363 1.0792
13 0.0002 0.9886 0.9781
14 0.0002 1.0032 1.0079 TABLE 2-15
W 0) 0) (i)
f3[x ]
Exact solution: (0,0,0,0), /2(0) = 0.
0 9.0 1.0 1.0 1.0
5.000 -1.0000 1.0000 1.0000
TABLE 2-14 3.000 -1.0000 0.0000 1.0000
1 1.000 -1.0000 0.0000 0.0000
CONJUGATE DIRECTIONS WITHOUT DERIVATIVES 0.000 0.0000 0.0000 0.0000
METHOD-ITERATIVEDATA FORCOST
FUNCTION f2 W
li) (i)
f2[x<"] x, (i) x4 li) The Conjugate Directions Without Deriva-
0 122.0 1.0 1.0 1.0 1.0 tives Method is not as efficient as some of the
109.1 0.2051 1.0000 1.0000 1.0000 methods that require computation of deriva-
18.45 0.2051 0.1140 1.0000 1.0000 tives. However, there are many problems in
7.667 0.2051 0.1140 0.4819 1.0000
which computation of derivatives is either
1 2.371 0.2051 0.1140 0.4819 0.4284
2.157 0.0469 0.1140 0.4819 0.4284 impossible or very difficult. In these prob-
1.075 0.0469 0.0127 0.4819 0.4284 lems, this method appears to be effective.
2-23
AMCP 706-192
2-24
AMCP 706-192
close to their values before the change in the minimization algorithm. In this way, rapid
design variables. By using an iterative tech- convergence to the new state of the system is
nique such as conjugate directions, the pre- attained. This approach has been applied with
vious state may be used as an estimate to start good success (Ref. 15).
REFERENCES
2-25
AMCP 706-192
CHAPTER 3
LINEAR PROGRAMMING
b. Buckling load must be greater than The class of problem considered in this
or equal to applied loads. chapter is very restricted. Only linear func-
tions are to be minimized subject to con-
c. Deflection of the structure must not straints which are linear in the design vari-
exceed specified limits. ables. In matrix notation this is, minimize
3-1
AMCP 706-192
3-2
AMCP 706-192
problem is the problem of determining that x describe an admissible object or process, i.e.,
mR" which minimizes one which performs the required service but is
not necessarily optimal. In LP the constraint
B1 x (3-5) set is a polyhedron and, according to Def. 2-4,
this constraint set is convex. Further, accord-
and which satisfies ing to the same definition, the cost function
> LP f{x) for LP is convex. If the constraint set is
Ax> C (3-6) bounded and nonempty, it is necessarily also
closed and all the hypotheses of Theorems 2-2
x> 0 (3-7) and 2-3 are satisfied. One then concludes that
fix) has a strict absolute minimum in the
where C = 0 is an m x 1 matrix, A is an m x n constraint and that is has no other relative
matrix, B is an n x 1 matrix and the minima.
symbolism < () as applied to matrices means
that the relation less than or equal to (greater Further, if fix) had a minimum in the
than or equal to) holds for corresponding interior of the constraint set, the necessary
components of the matrices. condition of Theorem 2-1 implies
- 0, i = i],...,
df
It should be pointed out that Eqs. 3-5 -= c, -n
dx, '
through 3-7 do not explicitly cover all linear
optimization problems. For example, it may which contradicts Def. 3-1 of LP. Therefore,
be required to maximize a linear objective fix) cannot have a minimum point in the
function. Further, equality constraints may interior of the constraint set but must take on
be imposed and negative values of the x. may its minimum at the boundary. Weyl has
be allowed. However, all these variations on shown, in fact, that the solution must lie on
the linear programming problem may be put one of the vertices of the polyhedral con-
into the form of the problem previously straint set (Ref. 2).
considered. An objective function may be
maximized by minimizing its negative, equal- In spite of this elementary theory, it is
ity constraints are nothing more than a pair of possible that a linear programming problem
inequality constraints (i.e., y = 0 if and only if may not have a solution. This may happen for
y s 0 and y < 0), and a negative x. may two reasons. First, the constraint set may be
always be written as the difference between empty; and second, the constraint set may be
two new non-negative variables. There is unbounded and the cost function may be
therefore, no loss of generality in considering decreased without restriction. In order to
only the problem expressed by Eqs. 3-5 facilitate discussion of these difficulties,
through 3-7. Definition 3-3 is made.
Definition 3-2: The constraint set for the Definition 3-3: If the constraint set of LP
linear programming problem of Def. 3-1 is the is nonempty (empty), the problem is called
set of points mR" which satisfy Eqs. 3-6 and feasible (infeasible). If the constraint set is
3-7. unbounded and the cost function is not
The constraint set associated with a prob- bounded below, then the problem is called
lem is just the set of design variables which unbounded.
3-3
AMCP 706-192
The concept of the dual problem that will Theorem 3-2: Let LP and LPD both be
be used in constructing solutions of LP's will feasible. Then both have solutions x and yt
now be discussed. The dual problem will also respectively, andBTx = CTy.
play a major role in obtaining results for more
general optimization problems. The proof of Theorem 3-2 is involved and
does not yield a method of constructing
Definition 3-4: The linear programming solutions. It may be found in Ref. 3, page 44,
problem of maximizing or Ref. 4, page 118.
3-4
AMCP 706-192
(1/2,1) 2.
df = Vf(\M2)dx = + y/T-
v ^1=^Z> o.
2 2
*-xl
which decreases f. Move in this direction to The solution of LP' is the same as the solution
the first vertex (1, 1/2). of LP.
3-5
AMCP 706-91
The information contained in Eqs. 3-12 using a method which is based largely on
and 3-13 is contained in the following matrix Theorem 3-3.
equation (called the simplex tableau):
Theorem 3-3: If in Eq. 3-15 b'. 0, i = 1,
an al2 ... aln _ct xt ux ..., n, and_c^ > 0, j = 1, ..., m, then the
solution of LP is
-c2
rt = 0, i = 1,..., n
(3-14)
S =
i -^J= 1,..., m
a
ml m2 w = 6.
bx b2 bn ! 0
It is clear from this theorem that any
method of choosing the variables sf and r,
Eq. 3-14 may be viewed asm + 1 equations
which wiU terminate with non-negative entries
involving the variables xt ,..., xn, ult... ,um,
in the last row and column, except perhaps
w. At present Eq. 3-14 may be interpreted as
for 6, will serve as a method of solving LP.
determining ux , ..., um , and w explicitly in
Before developing such a method, several
terms of xl,..., xn . It might be desirable to
definitions will be helpful.
determine some other combination of m + 1
of the variables in terms of the remaining n. Definition 3-5: In Eq. 3-15, the variables
Except in singular cases, this is possible. s.,j = 1 m, are called basic variables, while
the variables r., i = 1,..., n are called nonbasic
Assume now that m + 1 of the variables .?i, variables.
..., sm , and w have been determined explicitly
in terms of the remaining variables^ ,..., rn . Definition 3-6: The set of variables Sj,...,
Eq. 3-14 will then take the form s r r
m i >> will be called abasicpoint. If c.
[- < 0,;' = 1,..., m, in Eq. 3-15, then the basic
11 12
point will be called a basic feasible point.
21 22 2H - C2 r2 2
A certain geometric interpretation may
now be given for the nonbasic variables. In
(3-15) LP' it is clear that the boundary of the
constraint set of LP is obtained by setting
mi a'mi u
mn <-m various combinations of the variables xt, / = 1,
..., n and ;-, / = 1,..., m, equal to zero. In the
b\ b'2 b'n I spaced" of the design variable x, a vertex of
the polyhedral constraint set is obtained by
where primes denote coefficients obtained having n equality constraints among the xv i =
when the original set of equations is solved 1, ..., n, enforced. By the discussion, this
forsj,..., sm ,and w. occurs when r. = 0, i = 1,..., n. An edge of this
polyhedron is a line inR" obtained by setting
The solution of LP will be constructed r. = 0 for n 1 indices i. From Def 3-6 and
3-6
AMCP 706-192
V1
;+i
(3-16) r l a
a
H p (X
-> p P
(3-18)
7 V "J3
a!. r"'
fl J _p P
v
3-7
AMCP 708-192
-Cl ?i S\
3 6 1 ?2 = 2
- 5 3 2_ _1_ _w_
(3-19)
a c
m 1 ~~ m rn Sm
interchange the role of rl and 2.
8 1 W
3-8
AMCP 706-192
With this restriction in mind, consider the The process described may be given quite
values of c'. after the pivot step with i = i0. simply as the iterativeAlgorithm LP-A:
In order to insure c't > 0, i > k, it is Step 2. Choose any positive element ak,
required that in the kth row of the matrix of Eq.
3-19.
.0.+'*"' > 0,i>k,i*L (3-22)
Step 3. Choose z'0 asinEq. 3-25
<o/o
^ = min [X I _fL > 0 \ (3-25) Since by Eq. 3-19, w = b^rx +... + bnr +
Vo i>k Wo ' aHo 6, it is clear that if b. < 0 for some; =/
/o
then w may be reduced by increasing r. from
then c\ > 0, i > k and c'k > ck. If ck zero. If a pivot step is performed which makes
is still negative, the process may be repeated. r./o a basic variable then w will be decreased,
Otherwise choose the next entry above ck The choice of the basic variable s. which is
'0
which is negative and repeat the process. to be made nonbasic must be made in such a
way that the point obtained after the pivot
If all the cjy i > k are nonzero, only a finite step is still a basic feasible point, i.e., so that
number of basic points are possible since the c. > 0, i = 1. ..., m. However, this is
process is monotone (nonrepeating). If there precisely the restriction which led to the
exists a point with c. > 0, i=\, ..., m, this choice of i0 in par. 3-3.1. Therefore, the same
process must find it. The degenerate case in procedure for choosing ('0 may be employed
which some c, = 0, / > k is discussed later. here.
3-9
AMCP 708-192
Since w'= w ci0bj0/ajo/0, the pivot step Viewed geometrically, the difficulty occurs
determined here guarantees w' < w provided because the path which successive basic points
all c. > 0, i = 1, ..., m. In this case, follow on the polygonal constraint boundary
therefore, only a finite number of pivot steps may form a closed loop. To prevent this
may be made, and the process must terminate behavior with only a small error in the final
at the solution of the linear programming solution an entry, c., which is zero, is
problem. Termination occurs when*' > 0,/ = replaced by an arbitrarily small parameter E >
1,..., n. Theorem 3-3 shows that this is the 0. The problem is not degenerate any longer
solution of the linear programming problem. and cycling cannot occur. Therefore, the
The degenerate case where some c( = 0 will be altered problem will proceed toward the
discussed par. 3-3.3. solution.
This process is given explicitly mAlgorithm Example 3-4: Use the simplex algorithm to
LP-B: solve the LP
3-10
AMCP 706-192
For the first pivot step in algorithm LP-A, pivot step leads to
k = 2. Choose j0 = 1 since a21 = 3 is the
largest element in the second row. i0 = 2 is - 1/6 2/3 -11/6 11/6" 1 ^3
the only choice available in Eq. 3-25 and
1/3 - 1/3 2/3 1/3 2 ^2
C2
- 4> 0.
1
a22 _ 17/6 -7/3 37/6 29/6_ *l W
=
The pivot is a22 1- This pivot step inter-
.1- =
changes u2 and x2' The result is
Put/0=2,
11 4 -6 11 *1 1
Cy C2
= -11/4, = 1,
3 1-24 "2 - x2 fl
12 a
12
c C
i 2 _4_ .1-
Ml a2l
Since this is a basic feasible point and the
so z0 = 1. The pivot is then at t = 11. The
first three elements in the third row are
result of a pivot step is to interchange Xj and
positive, then the solution is immediate. The
M, . This results in the basic feasible point
nonbasic variables are zero,
Z. l/ll 4/11 -6/11 1 "l Xi
i = Xj =x2 =0
3/11 - 1/11 -4/11 1 2 = x2 and the basic variables take on the value
Choose /'0 = 3.
c, c2
x2 = 0
= 11/6, = 11/4,
13 Q23 x3=5/2.
=
so J'Q = 1. The pivot is a22 6/11 and a The minimum value of f(x') attained is 5/2.
3-11
AMCP 706-192
3-4 MINIMUM WEIGHT TRUSS DESIGN linearly independent equations in Eq. 3-26.
ff a is the maximum allowable stress (both
As will become apparent in subsequent
chapters, most optimal design problems are tensile and compressive) for the material from
nonlinear. Even the problems considered in which the truss is constructed, then stress
this paragraph appear at first glance to be constraints are
nonlinear. However, it is shown that the
problem can actually be solved as a linear I Sf I < oAf (3-27)
program. This will not be the case in general.
The class of problems and their solutions that Further, if p is the weight density of the
are discussed in this paragraph are taken from structural material, the total weight W of the
an outstanding paper by Dorn, Gomory, and truss which is to be minimized is
Greenberg (Ref. 5). Similar results have been
W-. p
reported more recently (Ref. 6). V/ (3-28)
The problem treated here is minimum where fi. is the length of the jth member.
weight design of plane trusses with constraints
on stress. The initial restrictions on the truss The problem of minimizing W of Eq. 3-28
include only the location of joints in the subject to the constraints of Eqs. 3-26 and
truss. The loads to be supported by the truss 3-27 is not the complete truss design problem.
are applied at joints. A member with non- In addition to the equilibrium conditions of
negative cross-sectional area is allowed to Eq. 3-26, a set of compatibility conditions
connect each pair of joints. If there are between displacements of the joints must be
joints, there may be M(M 0/2 members in satisfied. These compatibility conditions will
the truss. In general, then, statically indeter- be nonlinear in the variables S. and A,. In its
minate trusses are allowed. complete formuiation, then, the truss design
problem is not a linear programming problem.
Let A., j 1, .... n, denote the cross- It will be shown, however, that if the com-
sectional area of jth member and S. the load patibility conditions are ignored and the
in that member due to the external loads problem described by Eqs. 3-26, 3-27, and
applied to the truss; S. > 0 denotes tension. If 3-28 is solved, its solution satisfies the com-
m = 2;u,then equilibrium of the joints of the patibility conditions and is, therefore, the
truss is specified by the equations solution of the truss design problem.
x> 0. (3-32)
W= 2 |S|,
a t =i /' /
This linear programming problem may now
subject to Eq. 3-26. In order to treat this be solved by the simplex method. Before the
problem as a linear programming problem, solution of the linear programming problem
define can be taken as the solution of the truss
design problem, however, it must be shown
Sj, ifSf> 0 that it satisfies the compatibility conditions.
s; = { It is clear that if the truss specified by the
0, if Sf < 0 linear programming problem is statically
determinate, it satisfies the compatibility con-
0, if 5) > 0 ditions trivially (i.e., there are no compatibil-
s ity conditions). For the analysis here, stat-
i=<
-Sjt if 5) < 0 ically determinate is taken to mean that the
member forces 5 are uniquely determined by
Now, the given loads and the equilibrium conditions
of Eq. 3-26.
srs;sr
As pointed out in Ref. 5, page 32, there
and will be m* possibility nonzero components of
x (basic variables) in the solution, correspond-
\Si\=S*+Sr ing to linearly independent columns of the
matrix A ; i.e., only m* of the Sy will possibly
Denote be nonzero. According to Eq. 3-27, then, only
m* of the areas may be nonzero. Further,
since the rank of A is m*, the member forces
* =(s? ,...s; ,s; ,...,s- > are uniquely determined. The resulting truss
CT =(Ft Fm), is, therefore, statically determinate and hence
is the solution of the original truss design
problem.
A -(aift -av)m x2
3-13
AMCP 708-192
of solving engineering design problems. Sever- subinterval, the member sizes are different. A
al examples are solved in considerable detail plot of W vs a and the forms of optimal
in Ref. 5; the results of one of these problems trusses are shown in Fig. 3-4.
will be discussed here.
n 9- 12- 15-
Figure 3-4. Optimum Bridge Trusses
11- 14-
The discussion here only touches on the
i_* 10- highlights of the very complete treatment of
H+-:
I t r the truss design problem in Ref. 5. The
interested reader is encouraged to study this
outstanding article in detail.
Figure 3-3. Admissible Joints for Bridge Truss
Before leaving the truss design problem, a
point of interest in the present results and in
In the solution presented in Ref. 5, it is the results obtained in future chapters may be
assumed that the truss is symmetric about the noted. In Fig. 3-4 it is clear that at two values
line of joints 7-8-9. This assumption reduces of a the form of the optimal truss changes
the number of variables to 57. Further, due to form drastically. Still, even though the
the assumed symmetry, there are only 14 topology of the structure is not continuous in
independent equilibrium conditions. There- a, the weight apparently is a continuous
fore, there will be only 14 members which function of a. The same sort of behavior
can be nonzero in the optimum truss. In the occurs in a beam design problem with con-
solution presented in Ref. 5 the problem is straints on deflection which is discussed in
made nondimensional by defining a = h/Z and par. 7-4. These problems might lead one to
= H/V, where h and 2 are the vertical and suspect that there is some basic mathematical
horizontal spacing, respectively, and H and V structure of the optimal structural design
are applied loads shown in Fig. 3-3. problem that has not been uncovered.
The solution presented in Ref. 5, page 45, 3-5 AN APPLICATION OF LINEAR PRO-
for a fixed value of ( = 1) shows that there GRAMMING TO ANALYSIS
are three subintervals of values of a on each
of which the truss has a constant geometrical . A major application of linear programming
form. For different values of a within a given in engineering design is, oddly enough, in
3-14
AMCP 706-192
3-15
AMCP 706-192
Further, it is just a restatement of the best Figure 3-5. Boundary Condition for
approximation criterion of Eq. 3-44. Example 3-4
3-16
AMCP 706-192
The procedure is now to form Similar inequalities in the c*, cT, and 7 will be
obtained at all other boundary points chosen.
*= \x\ ^c^c,{2x\ -x\) Under the requirements c* > 0, c~> 0, and
y > 0, the problem of minimizing 7 is then
+ c3(&x* -2Ax\x\ + 3*) solved.
+
-(285/16)(C 3-C3)Q7 |Z(JC) -z(x) 1 < 0.0053, x in ft
REFERENCES
3-17
AMCP 708-192
CHAPTER 4
4-1 INTRODUCTION TO THE THEORY OF Definition 4-1: The first nonlinear pro-
NONLINEAR PROGRAMMING (NLP) gramming problem NLP, is: find xeR" to
4-1
AMCP 706-192
and
for NLP, and
gi + m+p(x)=- ft.(x), i= 1,...,p.
D' = { x^R" | g(x) < 0 and h(x) = 0 }
Now, NLP' is equivalent to the NLP:
for NLP'.
minimize f(x)
For convenience, Theorem 2-2, which was
stated previously in Chapter 2, is given here
subject to
(Theorem 4-1) as it applies to nonlinear
programming problems.
00 * 0,
Theorem 4-1: If f(x) is continuous on D
gi (*)
(D'Jand this set is closed and bounded ini?" ,
where g(x) =
then NLP (NLP') has a solution which is an
absolute minimum of/Cx) mD (D') . g
m+lp(X>
4-2
AMCP 706-192
h(x) = 0.
fM = (Xi -3)2 +(x2 -3)2
4-3
AMCP 708-192
the constraint set. The question arises, "Is this 4-1.3 LOCAL THEORY
relative minimum an absolute minimum?" In
general problems it is difficult to answer this Without convexity it is difficult to say
question. There is a class of problems, how- much about global properties of the solution
ever, in which this question is easily answered. of NLP or NPL'. Considerable theory is
This class is described by Definition 4-4. available, however, which characterizes local
minima. The approach in the local theory is
Definition 4-4: If D (D') is a convex set to suppose that/(x) has a relative minimum at
and fix) is convex onfl (D') then NLP (NLP') a point inD orD' and then find conditions on
is called a convex programming problem. fix), gix), and h(x) which must hold at this
point. In this way, many points inZ) and D'
Theorem A-l, Appendix A, guarantees that may be eliminated as candidates for a relative
if gj(x)> i= 1, , rn, are convex functions, extrema and perhaps relative extrema can
then the set D is convex. Since the equalities even be located using these conditions. Such
(Eq. 4-5) in NLP' define a surface inR" , it is conditions, therefore, are called "necessary".
clear that D' is the intersection of that In some problems it will be possible to obtain
surface with the set { xe^lg^x) =s 0, i = 1, a set of conditions that, if satisfied at a point,
..., m) . The surface is convex if and only if it guarantee that this point yields a relative
is a plane, or equivalently, if and only if each extremum. Conditions of this kind, of course,
hXx) is linear in x. Since by Theorem A-6, are called "sufficient".
Appendix A, the intersection of two convex
sets is convex, D' is convex if g((x), i = 1,..., As often happens in engineering, the engi-
m, are convex and hXx),j = 1,..., p are linear. neer needs a powerful result developed in
The class of problems NLP' which are convex mathematics to solve his problem. Proof of
is, therefore, quite restricted. this result, however, may be very complex
and, in fact, contribute very little to the
As will be clear from what follows, con- engineer's insight into his problems. This
vexity is a very desirable property. However, appears to be the case in many phases of
in the real world, many optimization prob- optimization theory, in particular, in the
lems are nonconvex. In spite of this fact, the study of necessary a.id sufficient conditions
study of convex problems is justified. Many in nonlinear programming. In the remainder
results which hold only in convex problems of this paragraph results will be borrowed
have led to constructive methods which are from mathematical developments.
effective for finding local extrema in noncon-
vex problems. Some of these methods would
Before meaningful results may be given for
probably never have been developed if only
NLP and NLP', the following conditions will
general nonconvex problems had been
be required of the constraint functions g(x)
treated.
and h(x).
One of the powerful results which follows
due to convexity is given in Theorem 4-3. Definition 4-5: (First-order constraint
qualification): Let x be a point in the
Theorem 4-3: A relative minimum in a constraint setD' (orZ> if there are no equality
convex programming problem is an absolute constraints) and let the functions gix) and
minimum. h(x) be differentiable at x. Then the first-
4-5
AMCP 706-192
order constraint qualification holds at x if In this result, and in fact, in the remainder
for any nonzero yeR" suchthat Vgj(x)y o 0 of this paragraph, the problem NLP' is de-
for each i with g.(x) = 0 and Vh(x)y = 0, scribed. It is clear, however, that putting/ = 0
then y is tangent to a differentiable arc in NLP' yields NLP. One of the principal
passing from x into the constraint set. results of nonlinear programming may now be
stated. For proof the reader is referred to Ref.
Geometrically, this definition says that if 1, page 20.
the vector y is a direction which, to first
order, appears to point from x into the Theorem 4-5: (Kuhn-Tucker Necessity
constraint set, then there is a curve with y as Theorem): Let the functions f(x), g(x), and
tangent which actually passes from x into h{x) be differentiable and let the constraint
the constraint set. The conditions W,(*0)j Q functions satisfy the first-order constraint
0 forg,U) = 0 and Vh(x) = 0 are just first qualifications at a point x mD' of NLP'. In
order perturbations of g{(x) and h(x) which order that x be a relative minimum for NLP'
indicate that a small move in the y-direction it is necessary that there exist multipliers
ought to do the right thing to gt(x) and veRm and we '' suchthat
h(x). This is illustrated in Fig. 4-2.
Vj T> 0, i = 1,..., m (4-6)
and
V L(x,v,w) =0 (4-8)
where
4-6
AMCP 708-192
minimum for L(x,v,w), so VL(x,v,w) = 0 and if for every nonzero yeR" such that
must hold. This argument is not valid. For a Vgj(x)y = 0 for v( > 0, Vgt(x)y < 0 ioig^x)
rigorous proof of Theorem 4-5 the reader is = 0 and v. = 0, and Vh(x)y = 0, it is true that
referred to Ref. 1.
y' V L{x,v,w)y > 0 (4-11)
Theorem 4-6 states additional conditions
which are required to hold if the functions then x yields an isolated relative minimum for
appearing in NLP' have two derivatives. NLP'.
4-7
AMCP 706-192
are, however, appropriate at this point. The from the theorems stated in the preceding
analytic necessary and sufficient conditions of paragraph.
par. 4-1 could be used to construct solutions
of NLP by solving systems of nonlinear 4-2.1 FINITE DIMENSIONAL OPTIMAL
equations. This is particularly true of the DESIGN PROBLEMS
results of Theorem 4-8. If one reads the
current literature, however, he is led to the The class of problems to be treated in this
distinct conclusion that iterative methods paragraph is, in a sense, a special case of the
based on successive improvements are too nonlinear programming problem NLP'. How-
effective to bypass in favor of methods that ever, by developing a theory for the new class
require solution of complicated, nonlinear, of problems which takes advantage of its
algebraic equations. special features, a more efficient solution
algorithm may be obtained.
Even if the results of par. 4-1 are never
used by the designer to construct solutions of The general optimal design problem must
nonlinear programming problems, they are have several of the features of NLP'. Namely,
still very powerful tools. Verification of the it is required to have a cost (return) function
hypotheses of one of the theorems may mean which is to be minimized (maximized) and a
the difference between going onto the com- set of constraints that describe the perfor-
puter with the comforting knowledge that a mance demanded of the object being de-
unique solution exists as opposed to the signed. It is in the representation of con-
frustrating experience of having computer straints that the optimal design problem
print-out which may be meaningless. differs from NLP'.
automobile shock absorber so that peak ac- optimal design problem (OD) is a problem of
celeration in the passenger compartment due determining beRk to
to road conditions is as small as possible.
minimize f{z,b) (4-12)
The coefficient of damping is the design
variable since it describes the object being subject to
designed, and its magnitude is to be fixed by OD
the designer. Acceleration on the other hand h(z,b) = 0 (4-13)
is a state variable since it describes the
behavior of the object being designed. <j>{z,b) < 0 (4-14)
Further, this state variable may be determined
by Newton's laws of motion. Note that the where
designer has no direct control over the state
ht(z,b)
variable. He may effect it only indirectly by
adjusting the design variable. This is typical of h(z,b)
state and design variables.
hn (z,b)
4-9
AMCP 706-192
from Eq. 4-13 as a function of b, is differ- linearly independent at z,b. Then there exist
entiable with respect to b. This fact will be multipliers AeJ?" and peRm, with > 0 such
needed later when constructive methods are that for
developed.
H=f(z,b) + \Th(z,b) +jiT<t>{z,b) (4-21)
4-2.2 LOCAL THEORY
dH
(z,b) = 0 (4-22)
Since it is very seldom that the state db
equations (Eq. 4-13) are linear in both z and
b, convexity of the constraint set and hence (z,6) = 0 (4-23)
the problem will be rare. For this reason, no oz
global results based on convexity will be
discussed. In case Eq. 4-13 is linear, however, and
global results may be obtained by applying
the Theorems 4-3 and 4-8. M/0/(z,6) = O, /=!,..., (4-24)
then the problem OD may be put into the In exactly the same way the second-order
form NLP'. According to Theorem 4-4, the necessary and sufficient conditions of
first-order constraint qualification will be Theorems 4-6 and 4-7, respectively, may
satisfied for OD (with xeR" + * as indepen- be stated for the problem OD. No essential
simplification of the statements of those
dent variable) if the row vectors
theorems occurs, however, so the theorems
are not restated here.
<iht bhi
i = 1, .... n (4-19)
8z~' db Theorem 4-9, just as Theorem 4-5, is
difficult to use in constructing solutions of
30, 30, OD. Considerable difficulty arises because one
-r-{z,b\ ~{z,b)
Oz db does not know which of the inequalities in
OD is an equality. For problems with a small
for; with 4>-(z,b) = 0 (4-20) number of inequality constraints this may not
be a difficult obstacle, particularly if the
are linearly independent. Theorem 4-5 may designer has a good intuitive idea of which
now be applied to the problem OD. constraints will be equalities. If, on the other
Theorem 4-9: (First-order Necessary Con- hand, there are a large number of inequality
ditions): Let all the functions appearing in constraints, then the number of combinations
OD be differentiable at a point z, b which of constraints which may be equalities is
satisfies Eqs. 4-13, 4-14, and 4-15. Further, large. It is, therefore, difficult to determine
let the vectors, (Eqs. 4-19, 4-20, and 4-21) be just which combinations will be equalities. An
4-10
AMCP 706-192
analytic solution is extremely difficult in this of Fiacco and McCormick (Ref. 1). Theoret-
case. ical results guaranteeing convergence are
presented here to indicate the level of the
Rather than attempt to use the necessary known theory of SUMT, rather than as a
conditions to construct candidate solutions, a complete treatment of the subject.
more direct approach will be followed. The
remainder of this chapter will be devoted to 4-3.1 INTERIOR METHOD
direct methods of solving NLP, NLP', and
OD. The interior SUMT is based on the idea of
using the constraint functions to erect a
4-3 SEQUENTIALLY UNCONSTRAINED
barrier at the boundary of the constraint set
MINIMIZATION TECHNIQUES (SUMT)
D of NLP by adding a penalty function to
fix) which approaches infinity as the bound-
A favorite method of solving difficult ary of D is approached from the interior.
problems, particularly among mathematicians,
Once the solution of the augmented problem
is to reduce a difficult problem to a sequence is obtained, the penalty function is altered so
of easy problems. Each of the easy problems as to effect fix) less in the interior of D.This
is solved and if the method is any good, the behavior is illustrated in Fig. 4-3.
sequence of solutions of easy problems will
converge to the solution of the difficult
f(x) + Penalty Function UK '
problem. As the title might imply, SUMT
follows just this pattern. It should be clear
I
that a central part of this method must be
results which guarantee convergence, at least / I
in cases where solutions are known to exist.
The method presented here essentially re-
duces NLP and NLP' to a sequence of Penalty ""^^^^
Function (1.) /
auxiliary problems which may be solved by \ /
the methods of Chapter 2. The cost function
of NLP or NLP' is augmented by a function ,ru
called a penalty function. The penalty func- (A)
tion is formed from the constraint functions
in such a way that as a parameter approaches
zero (or perhaps infinity) the unconstrained
minimum of the augmented cost function
converges to the solution of NLP or NLP'.
Two basically different ways of doing this are
presented here. Each has its computational
and theoretical advantages and disadvantages
that will be described later.
Due to the large body of theory concerning
SUMT, results will be presented in this para-
graph without proof. The reader is referred
for proofs and an extended discussion of
SUMT to the complete and well-written text Figure 4-3. Penalty Functions
4-11
AMCP 706-192
As illustrated in Fig. 4-3, when the penalty 1. I(x] is continuous and non-negative on
function is decreased on the interior of D, the the interior of the constraint set D and if
minimum of the second augmented cost | .^^ is any sequence of points in R"
functionx(2) is closer to the solutions than converging to x where g(x) = 0 for some/,
the minimum of the first augmented cost then^ /(**) = +
function J'. The idea, of course, is that the
sequence of points x(,) generated in this way 2. S(r) is continuous and if r{ > r2 > 0,
converges to x. then S(r1) > S(r2) 0 and if r. is a sequence
of numbers converging to zero, then j^ S(r.)
It should be clear why this approach is = 0.
discussed only for NLP and not NLP'. The
constraint set of NLP' can have no interior Probably the most common penalty func-
due to the equality constraints. It is possible tions I(x} and S(r) are
that NLP has no interior and in this case the
interior SUMT is not applicable. In what 1
I(x} = - S - (4-26)
follows, it is assumed that the constraint set D
of NLP has an interior.
and
The sequence of points xw which is to
converge to the minimum point is generated S(r)=r. (4-27)
by minimizing
4-12
AMCP 706-192
where S(r) and I(x) satisfy prop- where x is not an isolated point of
erties No. 1 and No. 2. Choose r D, (4-32)
> 0 and x^0) in the interior of the
constraint setD. { r. } be a strictly decreasing
sequence which converges to zero. (4-33)
Step 2. Beginning atx(0) minimize U(x,rQ)
without regard to constraints to Then for xW> sufficiently near x
obtain x^\ Any of the methods andr. sufficiently small,
of Chapter 2 may be employed for
this purpose. (4-34)
Further,
Step 3. For i = 0,1,2,..., choose r - j > 0
such that r{ < r.. Beginning at .1^ S(rt) I[xu)] = 0 (4-35)
x^!) minimize U(x,r. * t) without
regard to constraints to obtain
x^l+ l\ where i is the iteration in-
dex. (4-36)
4-13
AMCP 708-192
The functions f(x), g1(x), and g2(x) are The penalty function employed for the
exterior method will have the form
convex and by Theorem A-5, Appendix
A, so are 1/gjfx^and l/g2(x). Since r > 0,
P(t) E(x) (4-39)
U(x,r) is convex and thus has a unique
minimum. To find it, put w'here P{t)andE(x) are required to satisfy the
conditions:
dU
= 0= 1
bx. (x,)2 1. E(x) = 0 if x is in the constraint set, and
E(x) > 0 if A; is outside the constraint set.
dU
0=1- 2 2. P(t) is continuous and if t2 > tx > 0,
dx 2 (x,)
v
^2 then E(t2) > (/j) > 0. Further, if tj - +
then ._f F(r.) = +
so
,1/2
Probably the most common choice for P(l)
and E(x) is
,1/2
Pit) = t (4-40)
4-14
AMCP 706-192
Definition 4-8: The exterior point sequen- There be a relative minimum point
tially unconstrained minimization algorithm is x in that admissible domain!) such
given by the following: that f{x) < fix) for all x = x in
some neighborhood of x, where x is
Step 1. Make an engineering estimate x(0) not an isolated point of D. (4-45)
of the solution of NLP or NLP'.
4-15
AMCP 706-192
and
lim
ru, /.] =/(S) (4-50)
2 + 3/
[f[xM]} is monotone decreasing (4-5ij
As
{ E [xw ]) is monotone decreasing. (4-52)
subject to
Subtracting,
gt(x) <, 0,i= 1, ..., m (4-54)
4x2 2x\ =0,orx1 =2x2.
4-16
AMCP 706-192
where the set of all points which satisfy the m denoted x(,).
inequalities Eq. 4-54, has an interior. As
might be expected, the constraints, Eq. 4-54, Step 4. As r. - 0 and tf -+, if \\xO) -
will be dealt with using an interior point x(,'-1)|| and|/[*<'>] -/[^'-^Jlare
penalty function and the constraints, Eq. sufficiently small, terminate the
4-55, will be dealt with using an exterior process and take x(,) as the solu-
point penalty function. tion of NLP'. Otherwise return to
Step 3.
The penalty function used here will be
As might be expected from a study of the
two methods which were combined to form
the mixed method, a convergence result is
where S(r)I(x), P(t), and E(x) satisfy condi- given by Theorem 4-14.
tions No. 1 and No. 2 preceding Eq. 4-26, and
No. 1 and No. 2 of Eq. 4-39. It is understood Theorem 4-14: In the mixed point al-
that I(x) is a function of only the constraint gorithm let:
functions in Eq. 4-54 and E(x) is a function
of only those in Eq. 4-55. A general minimiz- gi(x), ..., gm(x) be continuous on
ing algorithm for NLP is now given in the nonempty interior of their con-
Definition 4-9. straint set and Rx),hl(x),..., h Ax)
be continuous for allx. (4-58)
Definition 4-9: The mixed interior-exterior
sequentially unconstrained minimization al- 5(r), I(x), P(t), and E(x) satisfy
gorithm is given by the following: conditions No. 1 and No. 2 preced-
ing Eq. 4-26, and No. 1 and No. 2
Step 1. Make an engineering estimate x(0) of Eq. 4-39. (4-59)
of the solution of NLP'.
There exist a relative minimum
Step 2. Choose rx > 0 and tx > 0 and point x in the admissible domain D'
obtain an unconstrained minimum of Eqs. 4-54 and 4-55 combined,
of such that f(x) < f (x) for all x = x
V(x,r1,t1)=f(x)+S(.r1)I(x) in some neighborhood of x, where
x is not an isolated point of D'. (4-60)
+ P(ti)E(x), (4-56)
The sequence ( ri J be strictly de-
(1) creasing to 0 and { t(} be strictly
denoted x .
increasing to+. (4-61)
Step 3. Continue with i = 2, ... by choosing
r. < rr_2 and t{ > ti_1 and starting
from x('_1* finding an uncon-
Then for x^ sufficiently close to x, r.
strained minimum point of
sufficiently small, and ti sufficiently large,
V(x,ri,ti)=f(x)+S(.rj)I(x)
lim
+ P(tt) E(x) (4-57)
S(r.)I[x^] =0 (4-62)
4-17
AMCP 706-192
lim
P(t.)E[xw] =0 (4-63)
2A: 2
x2=&-
and
Since g(xlfx2) < 0 is satisfied at all times, x2
lim (
V[je '\ rt, tt] =/(*). (4-64) > 1. Taking the limit as r -* 0, then, x2 -* 1.
4-18
AMCP 708-192
move from N into K; i.e., so that no con- derivative information to make successive
straint which was previously satisfied is vio- improvements in an estimated solution of
lated, but constraints which were previously NLP. A study of the problem NLP' will be
violated are satisfied. This may be accom- better included in the next paragraph.
plished by minimizing the unconstrained cost
function Geometrically, the method presented here
will first investigate the direction of most
rapid decrease in the cost function/(x). As
U(y,r.)= gt(y) + S(r.) 2 /[g 00)
I i<=N ' ' ieK ' seen in par. 2-4, this is yfT{x). This
direction is then projected onto the tangent
with respect to y, where S(r) and I(x) satisfy hyperplane to the boundary of the constraint
the four constraints No. 1 and No. 2 set at x. A small move in the resulting
preceding Eq. 4-26, and No. 1 and No. 2 of direction will then decrease f{x) and will not
Eq. 4-39 and r. is a strictly decreasing cause excessive violation of constraints. This
sequence. The result is denoted y^: process is repeated as long as f(x) may be
decreased.
As soon as JJ(;) is such that gt [j>(/) ] < 0 for
some i previously mN, that constraint func- Instead of basing the derivation of the
tion is switched to K. In this way, constraint method on a geometric argument, the work
functions fromiV may get to K but those mK will all be done analytically. The reason for
may never fall back to N. Once all the this is twofold. First, geometric ideas in
constraints in N are switched to K, the higher dimensions are not always as clear as
process is stopped and the resulting y^ is in those in two and three dimensions. Second,
the interior of the constraint set of Eq. 4-68. the analytical method used here will be
If the minimum of U(y,r) is found as r. -* 0 employed in deriving algorithms in con-
and there are still constraints in N, then the tinuous problems where geometric concepts
constraint set defined by Eq. 4-68 has no are much more difficult.
interior. In this case, NLP is infeasible (has no
solution) or certain of the constraints of Eq. Extensive use will be made of matrix
4-68 will have to be treated by exterior point calculus notation in this paragraph.
methods.
4-19
AMCP 706-192
will denote a change in x and a 6 in front of a i.e., the collection of indices of constraint
quantity which depends onx will denote the functions which are equalities at the point x.
first order change in that quantity due to the
change bx in x. For example, for the scalar The object is now to find a direction of
function/(JC), change bx, bxT bx = 1,suchthat bx = kbx for
sufficiently small k > 0 will decrease f(x)
without violating any constraints. The prob-
/to bx.
dx lem is then to find bx such that
k > 0.
[*<>]6f< 0, (4-72)
ox
(0)
In the remaining case, the point x is on
the constraint boundary sog;[x(0)] = 0 for bxTbx = 1 (4-73)
some i. For convenience define the set
It is assumed that at points where several
AM = { /1 gtM = 0 } , (4-70) gAx) = 0, the gradients are linearly indepen-
4-20
AMCP 706-192
_1
%- + X^ f; + 2X.W = 0, (4-74) 6x :
2X o
3L+9dg^ - = 0
bx bx ax ax
4-21
AMCP 706-192
-1
Step 3. Compute X in Eq. 4-75. For all X.
< 0, delete i from A [x) ] to form
H= P[u 11 J _ r 11 [1,-1]
L-! J
at*].
1/2 1/2
Step 4. Compute P in Eq. 4-77 and 8x in x [1,- 1] =
Eq. 4-78. ff P = 0, then this is the 1/2 1/2
solution of NLP.
Finally,
4-22
AMCP 706-192
There are many techniques presented in the if df/dx[xa + l)]8x <0. The process in the
literature for determining the size of step to algorithm for the direction of steepest descent
be taken in the constrained direction of is repeated from Step 2, and a new step size k
steepest descent. Three of these techniques is computed as above. If, on the other hand,
are presented here. The first technique applies dfldx[xU+1)]5x > 0, then a relative mini-
to a specialized class of problems in which the mum has been bypassed. To locate this
constraint functions are linear. The second relative minimum, do a oneKmensional
and third methods apply to the general search in the direction 8x starting at x^ to
nonlinear problem. obtain x(J+1).
4-4.2.1 ROSEN'S METHOD FOR LINEAR This process may be summarized inRosen's
CONSTRAINTS A Igorithm :
Step 1. Compute
ff the constraint functions are linear, then
once the direction of steepest descent 8x is mm
found, it may be followed without leaving the T bi-Gfx^'
k = IG( 8X =0
constraint boundary until a constraint,-(*) = Gjbx
0, for i not in A [x^ ]. This algorithm,
therefore, can lead to rather long step sizes. Step 2. Compute
G,.r[x(/) + k8x] - b. = 0
put
7\f
mm [x< +k8x]8x > 0,
dx
j Gfbx > 0 G:TSx
I i4A[x<-')] then find A: so as to minimize
4-23
AMCP 706-192
4-24
AMCP 706-192
< 0, delete i from/4 [x^] to form vided f [ - ( * )kbx ] is decreasing, and con-
A[xV>]. straints do not exceed the given error toler-
ances. When either fails to hold, the resulting
Step 4. Compute P in Eq. 4-82 and Sx in point is called x^+1'.
Eq. 4-83.
If the process is stopped because a con-
Step 5. Compute xu+1) = x(i) + Sx. If any straint is violated in excess of its given
constraints are violated excessively, tolerance, the method of par. 4-3 is used to
use the method of par. 4-3 to get obtain a new point in the constraint set and
from x*'+1' back into the con- the process is repeated until the minimum
straint set. point is located.
Step 6. ff |/[x(/+1)] -/[*<] | andll^'+1) This method should be most effective when
JC(/)|| are sufficiently small, ter- constraint functions are easily evaluated but
minate. Otherwise, return to Step derivatives are costly in computer time. The
2 (possibly with altered 2 and W). basic idea of the method is to prevent an
excessive number of calculations of the con-
strained direction of steepest descent.
4-4.2.3 STEEPEST DESCENT WITH CON-
STRAINT TOLERANCES
4-4.3 A STEEPEST DESCENT METHOD
In par. 4-4.2.2 it was noted that a step may WITH CONSTRAINT ERROR COM-
be made so large as to violate a constraint in PENSATION
excess of an admissible error. The method of
choosing step size presented here will prevent
In previous subparagraphs, steepest descent
this difficulty.
methods were given which at boundary points
generated steps parallel to a constraint bound-
Let reasonable tolerances e(. be assigned to
ary in a direction which decreased the cost
constraint functions gt(x). The object here is
to move in the constrained direction of function as rapidly as possible. Due to non-
steepest descent until some constraint func- linearity of the constraint functions, and the
tion g{(x) violates the tolerance g(.(x) > ep or finite step size, however, some constraints will
until a minimum of invariably be violated. It is the object in this
paragraph to present a new method motivated
f[xW+k8x] by the article (Ref. 6) which automatically
corrects for violation in constraints.
is reached.
Let A[xiJ>] = { i\ gt[x(i>] > 0 } be the
A uniform step size in k may be chosen and indices of constraint functions which are zero
steps taken, checking or are violated. As in the preceding develop-
ment of this paragraph first-order Taylor
gt[xW +kSx] approximations will be used to approximate
functions appearing in NLP. The linearized
for each ieA [x(/) ] and each step in k. The version of NLP at an approximation to the
multiplier k is increased monotonically pro- solution, x^\ is
4-25
AMCP 706-192
20 dx \ dx dx dx dx
df dgT
(4-88)
ox ax
dg- I dg- df Ag
ax Wax -m-
dx
and
(4-90)
Sx-Agf) =0, ieA.
\ax This expression for 8x could now be substi-
tuted into Eq. 4-87 to find 0. To be more
This set of equations is nonlinear in A and general, however, put 1/(20) = 7 > 0 and define
4-26
AMCP 706-192
df*_
Step 1. Make an engineering estimate of
dx \dx dx I dx. dx
the solution of NLP'.
JCU+1) = XW + 8x.
< 0
9x
A method of choosing 7 still has not been Step 7. If l/[x0+1)] -/[*<] I and II Sx ||
given. This parameter is interpreted as a are sufficiently small, terminate
step-size and may be determined by one- the process. Otherwise return to
dimensional search or any other scheme Step 2.
chosen by the designer. In different applica-
tions, different methods have proved effec- 4-5 STEEPEST DESCENT SOLUTION OF
tive. No single scheme has been found that THE FINITE DIMENSIONAL OPTIMAL
seems best. The choice of 7 at this time DESIGN PROBLEM
constitutes an art as much as a science.
In this paragraph a steepest-descent method
The use of this method may now be of solution of the problem OD is developed.
4-27
AMCP 708-192
In many ways, the method of this paragraph used to solve Eq. 4-95 for z- The object now
is similar to the method of par. 4-4. Here, is to determine a change in M0), denoted bb,
however, a distinction is made between design such that
and state variables, and the two types of
variables are treated quite differently. ^D = M> + 8b (4-97)
The problem to be solved here isjust as in will be an "improved" design. The meaning of
par. 4-2: Choose beRk and zeR"to minimize "improved" will be made clear as the analysis
progresses. If the new design variable fc(1)
(4-94) were substituted into Eq. 4-95, this equation
could be solved for the corresponding new
subject to state variable z(1). Since the matrix
9fc/3z[z(\M* ] is nonsingular, the implicit
h(z, b) = 0 OD (4-95) function theorem, Ref. 4, page 181, guaran-
tees that if II 6b || is small, then z(1) -z(0)
and will be small. The change inz is denoted bz so
that
<t>(z, b) < 0 (4-96)
Z(D = z(0) + fe- (4-98)
T
where h(z,b) = [hl(z,b), ..., hn(z,b)] , and
<t>(z,b) = [<t>xiz,b), ..., <t>m(z,b)]T. The state 4-5.1 AN APPROXIMATION OF THE
equations, Eq. 4-95, are put into vector form PROBLEM OD
here in order to take advantage of the The basic idea in the approach to OD
compact matrix calculus notation. presented here is to construct an approxima-
tion of OD which can be solved to obtain an
The steepest descent algorithm for OD is improvement 6b in M0'. The approximate
developed here by first approximating the problem is obtained by making linear approxi-
nonlinear elements of OD by linear expres- mations to nonlinear functions in OD. Linear
sions in the various variables. The difference approximations to the changes in f(z,b),
between the method presented here and that h(z,b), and <pz,b) due to the small changes
of par. 4-4 lies in the treatment of the state bb in () and bz in z^0' are, by Taylor's
variable. In a sense, the state variable is a Formula, Ref. 7, page 56:
nuisance since it does not really describe the
system being designed. The algorithm pre- 5/[z(0)j(0)]^[z()j(0)]fc
sented here is obtained by first eliminating
the state variable from the linearized problem (4-99)
and then solving an explicit problem for an
optimum improvement in the design variable. bb
4-28
AMCP 706-192
4-29
AMCP 708-192
for small and W a positive definite matrix. dence on 8z is to solve Eq. 4-111 for 8z as a
The matrix W will be used in particular function of 8b. This, however, requires the
problems to assign weights to the various inversion of the matrix dh/dz. The preceding
components of 8b. This is often necessary approach of applying necessary conditions
when the components of b represent different was scuttled for just this reason, so another
physical quantities that may be of different method of eliminating 8z must be found.
orders of magnitude. Usually W is diagonal. Note that if the terms (df/dz) 8z and
(30/dz) 8z could be found in terms of 8b,
To summarize the approximate problem, then dependence on 8z would be eliminated.
8b and 8z are to be chosen to minimize This is the approach that will be taken here
and also in a later chapter on infinite dimen-
sional problems.
8z + 6b (4-1 10)
dz db
Define the column matrix XJ as the solu-
subject to the constraints tion of
dh , dh
8z+6b=0, (4-111)
dz db (4-1 14)
dz dz
8bTW8b< *. (4-113)
Note that X* is a matrix whose columns are
solutions of
4-5.2 SOLUTION OF THE APPROXIMATE
PROBLEM dhT ^ = 3^
(4-1 16)
dz dz
Necessary conditions of Theorem 4-9 could
now be applied directly to the approximate for j&A. Note that Eqs. 4-114 and 4-115
problem, Eqs. 4-1 lOthrough 4-113. If this require the repeated solution of equations
course of action is followed, however, an with the same matrix on the left and different
explicit inverse of dh/dz must be computed. right-hand sides. There are efficient computa-
Since the dimension n of this matrix is often tion codes which can construct all the solu-
quite high, this operation would be very tions simultaneously.
costly. Instead of applying necessary condi-
tions immediately, Eq. 4-111 will be used to To see how these newly defined matrices
eliminate the dependence of the remaining are helpful, compute the transpose of both
functions of the problem on 8z. Necessary sides of Eqs. 4-114 and 4-115 and multiply
conditions may then be easily applied for the through on the right by 8z to obtain
determination of 8b.
(4-1 17)
The obvious method of eliminating depen- dz dz
4-30
AMCP 706-192
and
X^if^ is not empty
where
would uniquely determine 8b, and there
would be no optimization problem.
\J (4-122)
db db
The constraints, Eqs. 4-120 and 4-121, will
and be treated differently, so different multiplier
4-31
AMCP 706-192
notation in Theorem 4-9 will be used for 4-121 and the problem is again solved with
each. First, define the reduced number of constraints.
In any method of solution of the approxi-
jT + T T T
H = S. db il ^ 6b+v8b W8b. mate problem, no information is gained if v =
0. Therefore, in the following v > 0 will be
assumed.
Theorem 4-9 requires that
Solving Eq. 4-124 for 8b,
M
=0=ZJ +jiT$ + 2v8bTW (4-124)
dSb
Sb = -W-'i^+^il). (4-127)
2v
where r- > 0 and v > 0
It is now assumed (to be checked later) that
AJ.(fi*'6ft-A0i)=o,i&4 (4-125) Eq. 4-120 is an equality. Substituting for 6b
from Eq. 4-127 into the equality Eq. 4-120,
and
1 r
v{8bT W8b -42) = 0. (4-126) * IV"1 aJ + Z$ji) = A<.
2v
4-33
AMCP 708-192
Step 3. Form the vector of constraint the designer should be familiar with the
functions $ in Eq. 4-105 and solve method of obtaining the given algorithm. In
Eqs. 4-114 and 4-115 for XJ and this way, problems with peculiar features
often can be treated by altering the general
algorithm slightly.
Step 4. Compute 27 and * in Eqs. 4-122
and 4-123.
There are two steps in the algorithm of par.
4-5.3 which are not complete. They are Steps
Step 5. ChooseA0 in Eq. 4-107.
8 and 10. In Step 8, a parameter 7 is to be
chosen, but no analytical method of choosing
Step 6. ComputeM^ inEq. 4-108.
it is given. This is the classical difficulty with
steepest-descent methods. They give a direc-
Step 7. Compute 8b' and 8b2 in Eqs.
tion but, unfortunately, they do not allow
4-131 andEq. 4-132.
analytical determination of a step size (l/(2c)
in this case).
Step 8. Choose 7 > 0 and evaluate A in Eq.
4-129. If any components of are
negative, take the corresponding A simple technique for choosing 7 which
elements out of 4> and return to has worked well in a number of problems is
Step 3. given here as a candidate scheme. Since it is
the 8b1 component of 6b which tends to
Step 9. Compute reduce f, the step size determination will be
based on 8b1. The basic idea is to choose 7 in
order to obtain a certain percentage reduction
bV+D=bU) sb1 +5b2.
2v in f. Let A/ (a negative quantity) be the
desired reduction in / for a single iteration
Step 10. If |/[x(/+1)] - /[*<>>] | and (perhaps a 5% to 10% reduction). Since for
II Sb1 || are sufficiently small, ter- A<?~0,
minate. Otherwise, return to Step
2.
8f=-2jT 8b1 (4-134)
2v
4-5.4 USE OF THE COMPUTATIONAL AL-
7 is chosen as
GORITHM
REFERENCES
1. A.V. Fiacco and G.P. McCormick, Non- Solution of Problems of Equilibrium and
linear Programming: sequential Uncon- Vibrations", Bull. Amer. Math. Soc, 49,
strained Minimization Techniques, John 1943, pp. 1-23.
Wey & Sons, New York, 1968.
6. 1.0. Melts, "Nonlinear Programming
2. J. Abadie, Ed., Nonlinear Programming, Methods for Optimizing Dynamical Sys-
John Wiley & Sons, New York, 1967. tems in Function Space",Automation and
Remote Control, No. 1, January 1968,pp.
3. D.J. Wilde and C.S. Beightler, Foundations 68-73.
of Optimization, Prentice-Hall, Englewood
Cliffs, New Jersey, 1967. 7. C. Goffman, Calculus of Several Variables,
Harper and Row, New York, 1965.
4. W. Rudin, Principles of Mathematical
Analysis, McGraw-Hill, New York, 1953. 8. G. Zoutendijk, Methods of Feasible Direc-
tions, Elsevier Publishing Company,
5. R. Courant, "Variational Methods for the Amsterdam, 1960.
4-35
AMCP 706-192
CHAPTER 5
problem must be considered simultaneously result, the minimum weight design problem is
with all other aspects of system design. It is often stated with explicit constraints on
not practical to expect, therefore, that one structural deflection, natural frequency, buck-
will find lightweight structural design special- ling load, and strength. A central part of the
ists operating independently of designers con- design problem, then, is representation of
cerned with other aspects of the weapon weapon system performance requirements
development. A technology is needed which that have an impact on structural design. It is
will allow trade-offs concerning weapon- often required that in doing structural design,
weight to be integrated into the overall dynamic weapon performance must be ana-
weapon design process. lyzed to assure that the proper constraints are
included in the structural design problem.
The objective in this treatment has been to
formulate the minimum weight structural
5-1.2 WEAPON DEVELOPMENT PROB-
design problem with constraints realistically
LEMS ASSOCIATED WITH LIGHT-
reflecting the performance requirements of
WEIGHT REQUIREMENTS
the weapon system. A detailed formulation
and solution of this structural design problem
is presented in this Chapter as well as Chap- To further explore some of the trade-offs
between lightweight and weapon system per-
ters 7 and 9.
formance, several typical problems en-
5-1.1 LIGHTWEIGHT VS STRUCTURAL countered in weapon development will be
PERFORMANCE TRADE-OFFS discussed in this paragraph. The discussion
here is presented to highlight some typical
Normally, achieving a lightweight structure problems, not necessarily to identify all light-
requires a reduction in the amount of material weight structural design problems faced in
used. The consequence is an increase in weapon development.
structural flexibility that causes increased
deflections, decreased natural frequencies,
5-1.2.1 AIRCRAFT ARMAMENT
and decreased buckling loads. Consequently,
failure modes that were not previously criti-
Some of the most critical lightweight struc-
cal, may now become limiting factors in
tural development problems in weaponry
design. For example, in gun supporting struc-
today are in the field of aircraft armament.
tures, increased deflection often reduces This is due to the very high priority placed by
effectiveness of the weapon system by in- the Army on improved air mobility and the
creasing dispersion. There are many ways in need for minimum weight weapon systems to
which such changes in structural performance be carried by helicopters. The combination of
can have an impact on overall system lightweight structural requirements and the
behavior. extreme environment under which the struc-
ture must perform in helicopter application,
The only effective approach to minimum generates a very difficult class of minimum
weight structural design is to formulate the weight structural design problems. The weap-
structural design problem to include con- on developer's interest in structural design for
straints on performance which are dictated by aircraft armament lies primarily in the area of
functional use of the weapon system. As a weapon and weapon support structures.
5-2
AMCP 706-192
The key structural requirement in this gun barrel development, particularly for
application is accurate aiming of an automatic infantry automatic weapon application. With
weapon during firing. Dynamic response of a great deal of emphasis being placed on
the weapon support structure due to inputs lightweight infantry weapons, the barrel is a
from the weapon and from the airframe, natural component in which to seek weight
which vibrates due to aerodynamic inputs, reduction. This is particularly true for rapid
must be considered in the design problem. fire weapons in which heavy barrels have
The most difficult feature of the minimum traditionally been used to alleviate tempera-
weight structural design problem for aircraft ture problems. For a particular barrel con-
weapon applications is the variety of per- figuration, decreased mass tends to cause
formance and failure constraints which must elevated temperatures and stresses. To com-
be treated in the design process. Constraints plicate matters, material strengths are highly
must generally be placed on stresses arising in temperature dependent, making stress con-
the structure, angular deflection of the struc- straints difficult to handle. Another potential
ture at the gun mount, and natural frequency problem, as one tends toward optimality in
of the supporting structure. These constraints barrel design, is the possibility that material
generally appear in the form of inequalities. yield properties will become critically depen-
For example, stress is required to be less than dent upon strain rates and require their
or equal to the allowable stress for the explicit inclusion in the design process.
material. This kind of constraint is very
realistic, from an engineering point of view, Another problem, which can arise in re-
but makes the solution of the optimal design duced weight design, is barrel deflection with
problem rather difficult. resulting reduction in weapon accuracy.
Deflection constraints must, therefore, be
In addition to altering the geometry and considered.
distribution of material in the structure to
obtain desirable performance, it is also possi- The objective of the barrel design problem
ble to induce damping into the structure and is to choose barrel dimensions and structural
to use active feedback control devices to material to minimize barrel weight in the
reduce response. These two methods of re- presence of constraints on dollar cost, tem-
ducing dynamic response will require addi- perature, stress, and perhaps strain rate. The
tional weight on board the helicopter. There optimal design problem must then include
is a trade-off between design of the structure equations of state of stress and temperature as
and design of other means of obtaining a function of time, both depending on the
improved weapon system performance. These barrel design features.
trade-offs, then, require that we treat the
aircraft weapon design problem as a system
problem, explicitly accounting for the inter- 5-1.2.3 TOWED ARTILLERY
action between structoral behavior, damping,
and active feedback control. The principal objective in towed artillery
design is to provide support for a large-caliber
5-1.2.2 GUN BARREL DESIGN tube that will, upon firing, transmit momen-
tum to the earth without doing damage to the
A second area in which lightweight struc- support structure and without undue dynamic
tural design is of critical importance is that of response. The fundamentals of the design
5-3
AMCP 706-192
problem then lie in the field of mechanics some of the more complex lightweight struc-
and, in particular, are highly dependent upon tural design problems faced in weapon devel-
the weight distribution within the artillery opment. They are simplifications of the real
piece. problems but are difficult enough to illustrate
the need for research in development of
In traditional artillery design, the support design methods. In view of the current em-
structure is flexible but has been quite heavy phasis within the Army on air mobility and
and stiff in the past so that the flexibility of lightweight systems, new design methods are
the structure was a higher order effect. Also, required which are capable of solving these
heavier carriages reduced the severity of the and many more lightweight design problems.
dynamic response problem due to their higher
mass. Recent developments, such as the 5-1.3 PLAN FOR TECHNIQUE DEVELOP-
Ml02, 105 mm Howitzer, have resulted in a MENT
weapon that weighs approximately 3200 lb,
as compared to the older Ml01 which The remainder of this chapter will be
weighed 4500 lb. As a result of the reduced devoted to formulation and application of a
weight, problems have arisen in providing a method of structural optimization. As noted
firm support for the artillery piece on soil. at the beginning of par. 5-1, an in-depth
More recent design efforts, including the treatment of lightweight structural design
XM164, 105 mm Howitzer, and XM198, 155 provides insight into application of the gen-
mm Howitzer, have resulted in weapons eral methods of Chapter 4.
which are considerably iighter than their
predecessors. As a result of the reduced For a comprehensive review of structural
:tructurai weight of the weapon, dynamic Optimization through 1967, the reader is
.response in both of these weapons became referred to Refs. i ana 2. Several of the major
critical and had to be treated as a key design classes of optimal structural design problems
constraint in development of the recoil mech- are outlined in Ref. 2. Some of the key papers
anism. For a discussion of a particular prob- which have appeared in the literature since
iem, :he reader Is referred to the artillery i 967 are listed in Refs. 3 through i 8.
design example of par. 8-5.
Although these are primarily mechanical 5-2 ELEMENTS OF THE ELASTC STRUC-
system design problems, they have arisen due TURAL DESIGN PROBLEM
to the lightweight design criterion. For this
reason, when one considers lightweight struc- A class of optimal structural design prob-
tural design he must be willing to fit his lems in which the structure must remain
structural design problem into a larger system elastic is treated in this paragraph. The objec-
design program and clearly understand the tive of this paragraph is to show how the
interfaces arising between structural and other optimization methods of Chapter 4 can be
system performance characteristics. used to solve realistic optimal design prob-
lems. No attempt is made here to present a
5-1.2.4 OTHER WEAPON PROBLEMS complete theory of optimal structural design
that is capable of solving all problems.
The example problems cited in par. 5-1.2
are meant to illustrate the essential features of The reader should note that, even for the
5-d
AMCP 708-192
class of problems considered here, it is not ment, buckling loads, and natural frequency.
possible to blindly apply the techniques of The collection of all variables required to
Chapter 4. A certain amount of knowledge of describe this response due to applied load will
structural analysis is required before a reason- be denoted by the state variable vector z. The
able statement of the design problem and a manner in which z is related to the design
method of solution can be obtained. Even variables and applied loads will be discussed in
more important, the structural designer needs some detail later in this paragraph.
to have a thorough knowledge of the opti-
mization methods of Chapter 4 and their The cost of the structure must now be
development. As will be seen, in some cases it described as a real valued function of the
is required that parts of the design problem be design and behavior variables. In keeping with
interpreted in light of the derivation of the the preceding notation this function will be
optimization method. In this way the method denoted as
may be adapted for solution of a particular
class of problems. / = /(*, b, r> (5-1)
5-5
AMCP 708-192
chapter to structures which obey Hook's law, These constraints can generally be written in
i.e., stress and displacement are determined the form
by linear equations. It is clear, however, that
the design variables play a large part in the 0(z, b, r) < 0 (5-4)
response of the structure to loads. The depen-
dence on the design variables enters these where Mz.b.S) = [*x(*.*,), .,4>,{zM)\T
linear equations through the coefficients. The The inequality constraints, Eq. 5-4, are
equations for z will be denoted required to be satisfied for each of the states
z' due to different applied loads P'.
A(b)z=P (5-2)
It is clear that the Eqs. 5-2 and constraints,
where P is a matrix of loads and Eqs. 5-4, fit into the formulation of the finite
dimensional optimal design problem of par.
A(b)=[alf(b)]nx (5-3) 4-5. Treatment of the restrictions imposed by
Eq. 5-4, however, must be delayed until
is a matrix whose elements depend on the similar restrictions due to other behavior
design variables. constraints are accounted for. The entire
problem will be treated in par. 5-3.
In this formulation of the problem, z and P
may be generalized state and load variables.
Eq. 5-2 may be obtained through direct 5-2.3 NATURAL FREQUENCY AND
application of equilibrium and compatibility BUCKLING
conditions or through application of varia-
tional criterion for equilibrium. In today's As pointed out in par. 5-1, the desire to
structural analysis technology, Eqs. 5-2 are obtain lightweight structures has led to
very likely to be obtained by finite element resonance problems and, likewise, buckling
methods (Refs. 19, 20). If the structural problems. It is necessary, then, that a mean-
analysis problem is properly formulated, the ingful optimal design methodology be capable
matrix A(b) is nonsingular and z may be of enforcing constraints on eigenvalues
obtained by solving Eq. 5-2. It is assumed that associated with the system response. The sort
the elements of the matrix A(b) are differ- of constraint considered here is
entiable with respect to b.
f> fo (5-5)
In most real-world structural design prob-
lems the structure is required to carry a whole where is buckling load or natural frequency
family of loads that occur at different times and f0 is a lower bound on that eigenvalue.
in the life of the structure. The treatment More general restrictions than those of Eq.
here will be limited to a finite number of 5-5 are included in the general constraint, Eq.
loads, denoted P', i = 1,..., s. Associated 5-4.
with each load is a state z' determined by Eq.
5-2. Much as in Eq. 5-2, the equations of
vibration or buckling may be written in the
Constraints on behavior of the structure form
due to each of the applied loads P may
include bounds on stresses and displacements. K(b)y = tM(b)y (5-6)
5-6
AMCP 706-192
where y = fri ,.. .,yn ]T is an eigenvector of this quotient is the smallest eigenvalue. A
which plays the role of a state variable, direct method of minimizing the Rayleigh
quotient is discussed in par. 2-8.
w) = [k..mnxn (5-7)
5-7
AMCP 706-192
dh d
Since the cost and constraint functions [A(b)z]
Tb ' Yb
depend on z, b, and f, the first order
perturbation in these functions due to small
and
changes 8z, 8b, and Sf in z, b, and f is
dh
Sf=5LSz + KSb+v6i (510) 7z=A
oz ob ob
From Eqs. 4-117 and 4-1 18, and the It is assumed that the eigenvalues and
perturbed state equation we obtain, just as eigenvectors of
Eq. 4-1 19,
K(b)y = W(b)y (5-16)
9/ ,T 9
5z = -X/ --[A(b)z]8b (5-12)
dz db depend continuously onb and further, that to
first order, the following perturbation equa-
and tion is accurate
30
6z = -X* [A(b)z]8b (5-13) K(b)8y+ -i [K(b)y]8b = 8$M(b)y
z ob ob
(5-17)
J
where \ and X* are determined by + [M(b)y]8b + W(b)8y
ob
A\J =~ (5-14)
dz where y and f satisfy Eq. 5-16.
5-8
AMCP 706-192
necessary to solve the adjoint eigenvalue tion that Eq. 5-17 holds is highly questionable
problem from an operator theoretic point of view.
Under reasonable assumptions on the finite
K1 (b)y = W1 Wy (5-18) dimensional eigenvalue problem treated here
however, Eq. 5-19 is shown to hold (Ref. 23);
that has the same eigenvalue f asEq. 5-16 but i.e., even though the justification given here is
a different eigenvector y. Rearranging and not mathematically rigorous, the result, Eq.
premultiplying by y T this is 5-19, holds for a large class of problems.
Defining
yT[K(b)-^M(b))8y+yT [K{b)y]8b
ob
-yTt^r [M(b)y]8b
ob
"=- >^''^
= yTS$M(b)y + | ~/lyTM(b)y] \
lid )r_
Since the first term is a scalar, x | [K(b)y] y
\ \ ob )
Assuming y T
M{b)y = 0 which will generally \x^l/[y^M(b)y]
ob I
be the case. or
, 0, if $ is empty.
T
6? = \ y [K(b)y]
\ 6b Eqs. 5-10 and 5-11 become
(5-19)
and
Derivation of the perturbation formula, Eq.
5-19, has been strictly formal. The assump- 50 = *r5. (5-23)
5-9
AMCP 706-192
The linearized problem is now to minimize -M~* (*T W1 & + 2vA$). If any
bJ, Eq. 5-22, subject to constraints component of is negative, re-
move the corresponding row from
T $ and return to Step 3.
$ bb < A$
Step 8. Compute
where A0 is the desired correction in con-
straint error and
8b1 = W1 (/ - 9?M[\ S?' W1 nJ
T 2
bb Wbb< %
and
where W is positive definite and % is small.
This is precisely the same problem in par. 4-5, bb2 = W-l$M-];A A<
Eqs. 4-119 through 4-121, so the theoretical
results and steepest-descent algorithm of that and form
paragraph apply with proper interpretation.
bb^-^-bb1 +bb2.
5-3.2 STEEPEST DESCENT ALGORITHM 2v
FOR OPTIMAL STRUCTURAL DE-
SIGN Step 9. Compute
5-10
AMCP 706-192
oscillation occurred. By monitoring the dot number of constraints with positive compo-
product, 8b^ * 6_1), oscillations were nents of M was always less than or equal to the
sensed when negative values of the dot prod- number of design variables of the problem.
uct occurred. Thus, step size, l/(2i>), was This procedure of adjusting the constraint set
divided by two when negative values of the has worked very well and has minimized the
dot product occurred on two successive itera- possibility of divergence of the algorithm.
tions. Finally, the most effective method of
adjusting step size was to monitor successive The method presented is relatively auto-
reductions in cost function after feasibility matic in the sense that, for the computer
had occurred. Once insignificant reductions program developed, the input data given is the
occurred, the step size was reduced to obtain only pertinent design information required
finer convergence. for solution of the problem. All the necessary
matrices and their derivatives are automati-
The Power method used to compute the cally generated in the computer. Any person
smallest eigenvalue performs quite well. At with a reasonable knowledge of FORTRAN
every iteration, the starting value for the language should be able to handle the pro-
eigenvector is taken from the previous itera- gramming without any difficulty. The method
tion which manifested a very rapid rate of is developed to meet simultaneously displace-
convergence. An accuracy of 0.1% in each ment, strength, and frequency requirements
component of the eigenvector was used to on the structure. The technique, therefore,
compute the new eigenvector. The stiffness can be made user oriented.
matrix for the structure was inverted by the
Gauss-Jordan elimination procedure.
5-4 OPTIMIZATION OF SPECIAL
PURPOSE STRUCTURES
Another comment that is appropriate here
concerns the sign check on the Lagrange
Several special purpose structural optimiza-
multiplier vector p, called for in Step 7 of the
tion problems are solved in this paragraph on
computational algorithm (par. 5-3.2). The
an ad-hoc basis to illustrate the method of
algebraic sign of each component of the
par. 5-3. Subsequent paragraphs will treat
Langrange multiple vector p was checked at
large scale problems in a more unified man-
each iteration. If some of the components
ner.
were negative, then the matrix $ and the
vector A$ were adjusted accordingly. This
procedure is particularly useful whenever 5-4.1 A MINIMUM WEIGHT COLUMN
there were redundant constraint violations. In
some cases, the number of constraints vio- A column is to be constructed by making
lated is more than the number of design its cross section piecewise uniform as shown
variables of the problem, yielding a singular in Fig. 5-1. The objective of the design
matrix coefficient of p. In such cases numeri- problem is to choose the element areas so that
cal noise yielded a solution such that some of the column will support a vertical load P0
the components of the vector p were always without buckling or yielding under compres-
negative, indicating that the corresponding sive load. For the purpose of the present
constraints would be strictly satisfied in the problem the geometric shape of each column
next iteration. In numerical examples, the element is fixed and symmetric about two
5-11
AMCP 706-192
,+i^Po-P<0. (5-26)
12 ~6L{ - 12 -6Lt
Second, in order to insure that the column -6L. 4i(2 6Lt 2L)
Eabf
material does not yield under the applied load (5-29)
'(*)=-
P0, it is necessary that - 12 6Lt 12 2Lt
5-12
AMCP 708-192
T
3 1 3 1
fifi -dli-= [o,...,o,-p0/b;,o,...,oi,
5L. ~2" 20 db
1 i i= 1,..., k (5-33)
20 15 20~ 60
D\b)=2 (5-30) since 0. does not depend on P, i = 1,..., k and
3 1 3 1
20 si; 20 { 9
[K(b)y]\ y
db
i 1
20 60 "20 15
p (yTDy),
\7b[Dib)y]\Tyi
Summing the total potential energies of all
the elements from Eq. 5-28 and defining a if^+1 > 0
new variable
[0],if^+1 < 0. (5-34)
y= [yi.yi, -,^atl7-
The computations required in Eq. 5-34 are
= [u\, u\,u\,u\, ..., uk, uk]T messy but they can be programmed for
automatic computation.
the total potential energy PE of the column
may be written All expressions required for direct applica-
tion of the steepest descent algorithm of par.
PE=-yT K(b)y - P-yT D(b)y 5-3 are now available. Numerical results and
profiles of optimum columns are shown in
Tables 5-1 and 5-2, and Fig. 5-3. Numerical
where K{b) and .0(6) are made up of elements data for the example problems are E = 3.0 x
of K'(b) and D'(b) and are symmetric. Apply- 107 psi, a = 0.079577, amax = 20,000 psi,
ing the theorem of minimum total potential and L = 10.0 in. Computation in each case
energy given in Appendix B, the governing required approximately 0.1 sec per iteration
equations of buckling are
Eq. 5-31 is now in the form of Eq. 5-6, with Volume of Volume of
proper interpretation of notation. Optimal Uniform* Material
P, lb Column, in? Column, in? Savings, %
In order to implement the computational
500 0.806 0.923 12.7
algorithm of par. 5-3, the following vectors 1000 1.143 1.300 12.1
are required: 1500 1.411 1.600 11.8
2000 1.640 1.840 10.9
4000 2.412 2.600 7.2
,T dJ
= = t7Ll,7L2
3 '""7i*] (5 32)
" 'Lightest uniform column which will support load P.
5-13
AMCP 706-192
TABLE 5-2
CROSS-SECTIONAL AREAS OF OPTIMUM COLUMNS
Element
No. i P = 500 lb P= 1000 1b P= 1500 lb P= 2000 1b P = 4000 lb
ab. (5-35)
i I'I 1111 ri
5-14
AMCP 706-192
where p is material density and L(. is the -6/^ 2/A2 6/,i, 4/,4
length of the ith section.
(5-40)
By neglecting compression of the beam, For harmonic motion of the structure, y(t) =
deformation of a typical element is shown in y sin ut, where y isjust a constant vector, / is
Fig. 5-5. By Appendix B, the kinetic energy time, and w is natural frequency. Substituting
into Eq. 5-41 and defining f = co2, the
eigenvalue equation is
77 ').
K(b)y = W(b)y (5-42)
M\b) -
As a numerical example, the beam problem
54 -13L. 756 227,..
420 was solved with the data E = 3 x 107 psi, L =
10 in., a = 1.0, and p = 0.00208 lb-sec2 /in3.
137,, -3L) 227,, 4L*
The computational algorithm required about
(5-39) 0.6 sec per iteration on an IBM 360-65 system
and approximately 15 iterations to converge.
Results for a range of natural frequencies are
Likewise, the potential energy of the ith given in Table 5-3 and the profile of an
element is u'TK'(b)u'/2, where, from Eq. B-4 optimum beam is shown in Fig. 5-6.
5-15
AMCP 708-192
TABLE 5-3
COMPARISON OF OPTIMUM BEAMS
Volume of Optimum
Frequency Uniform Volume, Material
rad/sec Beam*, in? in? Savings, %
A portal frame as shown in Fig. 5-7 is to be where L. is the length of the ith member and
proportioned so that it weighs as little as the element deformation variables are shown
possible and has its fundamental frequency at in Fig. 5-8. The potential energy PE of the ith
least as large as a specified frequency coQ. element is
Each member of the planar frame is formed
from several uniform sections whose areas are
to be determined as design variables. As in the t
preceding problems, the cross-sectional geom-
etry is taken as fixed and all dimensions of
cross sections varied proportionally. The
second moment of the cross-sectional area y ')
about a centroidal axis is/; = ubf where b. is
the cross-sectional area of the ith element.
5-16
AMCP 706-192
where u denotes time derivative of and where p is density of the structural material.
Taking into account the lateral rigid body where f 0 is the lowest allowable eigenvalue of
motion of Member 2, the total kinetic energy
Eq. 5-47, r0 = w2 .
of the structure is
The steepest-descent algorithm may now be
KE = X-ui Af (&,.)'' + - MI (5-46) applied directly. Data for the specific prob-
1 2 2
lems solved are given in Table 5-4. The results
for an aluminum portal frame are given in
where M is the mass of Member 2 and uA is
Tables 5-5 and 5-6, with a typical profile
the horizontal velocity of points .
shown in Fig. 5-9. The design variable bi
shows the distribution of material for a
Requiring harmonic motion with frequency
minimum weight frame whose frequency of
co, the displacement vector y{t) made up of
vibration must be greater than or equal to a
all displacements is
5-17
AMCP 706-192
*>1
1.577 1.964 2.907 4.020
b. 0.883 1.604 2.484 3.321
b, 0.552 1.416 1.912 2.622
b, 0.374 0.866 1.290 1.725
b, 0.350 0.360 0.671 0.836
be 0.350 0.350 0.350 0.350
of cross section varying proportionally. Thus, where the matrix D is derived from the
I. = abj where b, is the cross-sectionalareaof shortening of Member 2 as in par. 5-4.1 and, as
the ith element. The stiffness matrix K{b^) of in the previous problem, K(b) is a stiffness
a typical element, Fig. 5-11, can be written as matrix.
in par. 5-4.3 with generalized displacements
defined by The cost function to be minimized in this
problem is the structural weight of Members 1
u' = [t/j, U2, M3, u4]T. and 2 which is simply
S~\ / = 7 2 btLt
c
The weight of the frame is to be minimized
h subject to the following constraints:
5-19
AMCP 706-192
*w+/+2 =
X ~amax2 < (5-56)
for bj in Member 2,
TABLE 5-8
VOLUME OF OPTIMUM FRAME
q, lb/in.
10 15 20 25
Optimum
Volume, 504.2 533.4 558.0 656.0
in.3 Figure 5-13. Rectangular Plate
5-20
AMCP 706-192
d2D 32w
+ V2Z)-V2w -(1 -v)
3x2 3y2
= h pco2 w (5-58)
Eh3(x,y)
D(x,y) = (5-59) In the steepest-descent algorithm, the cost
12(1 -v2)
function that is to be minimized is
fcCx,>0 is the thickness of the plate, which is
j = p(\A . 2 h(Xj,y) (5-61)
the design variable, and p is the density of
plate material.
where AA is the area of the grid squares.
When the function w(x,y) is represented in
the form The constraints imposed on the design are
(5-60) and
?o - ? < 0 (5-63)
the eigenvalue problem can be solved approxi-
mately by numerical methods. The problem
where h0 > 0 and f0 > 0 are lower limits on
posed here is solved using a collocation
plate thickness and eigenvalue of Eq. 5-58,
technique, i.e., the differential equation is
respectively.
satisfied at discrete points in the region, Fig.
5-14.
The steepest-descent computational al-
gorithm applies in a direct way. It should be
The number of discrete points is chosen noted that the collocation method for ap-
equal to the number of terms in the truncated proximate solutions of the equations for
series of Eq. 5-60. The drivatives of the natural frequency yields nonsymmetric
function D(x,y) at the grid points are evalu- matrices K and M in Eq. 5-16. Thus in this
ated by the use of finite differences. For a formulation of the plate optimization prob-
given set of design variables, i.e., h(x,y), the lem, the adjoint eigenvalue problem, Eq. 5-18,
lowest eigenvalue, J = pco2, and the associated must be solved along with the original eigen-
eigenvector {Amn I , which plays the role of value problem. If finite element methods for
y, are determined. plate analysis had been used, symmetric
5-21
AMCP 708-192
matrices would have been obtained. In this 5-5 GENERAL TREATMENT OF TRUSS
example, as well as in the preceding, a DESIGN*
minimum effort was expended to make com-
putations efficient. The emphasis has been The theory presented in pars. 5-2 and 5-3
placed on getting results. A subsequent effort will now be applied to the case of general
will be devoted to making algorithms more plane and space trusses. These types of
efficient. structures are encountered quite frequently in
practical situations. Most common among
The minimum weight plate problem was these are buildings, transmission towers,
solved by the algorithm of par. 5-3 with bridges, cooling towers, aircraft structures,
E = 3.0xl07 psi, p = 7.43 xlCT4 lb-sec2/in.4 , and lightweight military structures. In all these
v = 0.30, and co0 = 1375 rad/sec. The uni- cases, it is desirable that the structure simul-
form plate with t = pw2 = 1400 was taken as taneously should meet strength, deflection,
the initial estimate to the optimization prob- and frequency requirements and be of mini-
lem. The dimensions of the plate are 10.0 in. mum weight. In this chapter, all these con-
by 10.0 in. and the value of h0 =0.1 in. The straints will be considered.
material is assumed to have a constant density
and so minimum weight is equivalent to 5-5.1 SPECIAL PROBLEM FORMULATION
minimum volume. The volume of the uniform
plate is 11.44 in.3 and the volume of the In the problems to be considered here,
optimal plate is 10.8 in.3 which is a 5.6% geometry of the truss is assumed to be
material savings. Fig. 5-15 shows 25 colloca- specified and the loads are applied only at the
tion points. The numbers in the network are joints. The objective function for the problem
is taken as the total weight or the volume of
a the truss, and the design variable for each
0 2 member is taken as its cross-sectional area.
0.124 0.116 0.100 0.100 0.100 The objective function of Eq. 5-1 in this case
0.116 0.103 0.100 0.100 0.100 is a linear function of m design variables and
0.100 0.100 0.100 0.104 0.111
may be written as
0.100 0.100 0.104 0.121 0.128
h 0.100 0.100 0.111 0.128 0.136
2
<' /= 21 PiLtbt (5-64)
Kt
K(b) (5-68)
K
2\ v4
where m is the total number of elements in
Figure 5-16. Description of a Truss the truss and K. is the stiffness matrix for the
ith element of the truss. The stiffness matrix
for the ith element may be written as
1 I
K.. (5-69)
L
' i 1 1
/= ['K(jb)]z
= K(b)z (5-70)
Figure 5-17. A Truss Element
where
5-23
AMCP 708-192
element of the russ and is given by Assembling the matrix C* of Eq. 5-21 is
more tedious. It requires a formation of
2 1 constraint set $ as in Eq. 4-105 which will be
PMt
M,=- (5-74) discussed in detail now. The constraint set $
1 2
may be divided into five subsets, namely,
frequency, stress, buckling and displacement
Any nonstructural mass that is attached to constraints, and lower and upper bounds on
the trass may also be added to the mass the design variables. Explanation of these
matrix of Eq. 5-72 and it may be written as subsets follows one by one and, for each
subset, matrices 9#/9?, 9#/9<?, and 9$/9> are
M(b)=TM(b)+M0 (5-75) computed.
5-24
AMCP 706-192
5-4, Eq. 5-78 may be written as inequality (Eq. 5-82) holds in such cases. For
tension members, a. and a? are negative;
$,&, z, f) = ro - f < 0 (5-79) therefore, Eq. 5-82 is written as a? s a.. The
expressions that follow are written for the
where s is a number assigned to this con- case of compression members. For tension
straint. If this constraint is violated, then members similar expressions can be readily
A<?s = _(ro _f), fy/db =(0, ,0),3#,/3? written. Inequality Eq. 5-82 may be written
= - l,andd#s/3z =(0, , 0). as
Since the matrix K{b) is available, Eq. 5-70 where s is an index assigned to this constraint.
can be solved for the unknown displacement In all subsequent constraint subsets, subscript
vector z. Element forces can then be com- s on $ will have the same meaning. If Eq. 5-83
puted from Eqs. 5-65 and 5-66. Substitution is violated, then A$s = - (a. - af), 9</3? = 0,
of Eq. 5-65 into Eq. 5-66 yields
3tf>s 3#. /1 *F, 1
F= Sz (5-80) = 0 for i=j, " =1 - F. ,
dbf db; \bt bbi b2. '''
5-25
AMCP 706-192
cross-sectional geometry of the ith member. where za is the maximum allowable /th
This is a convenient way of expressing the component of displacement. If a particular
moment of inertia in terms of the cross-sec- component of displacement is positive, then
tional area of a member, because the constant Eq. 5-89 is written as z- z? < 0;and if it is
a. can be specified by the designer quite negative, then it is written aszi?- z- < 0. The
readily. Therefore, Eq. 5-84 may now be expressions that follow are written for the
written as case of positive displacement, and similar
expressions can also be written for the case of
negative displacement. In terms of the nota-
(5-86) tion of Eq. 5-4 the constraint for the positive
displacement may be written as
where 0. =n2Eiaj/Lf. Eq. 5-86 may be writ-
4>s(b,z, f) = zj-z < 0 (5-90)
ten in terms of the critical buckling stress a*
as
If thejth displacement component exceeds an
Lp=0,.b,.. (5-87) allowable limit, then
Now the buckling constraint for the ith A0s=-rz/.-zp) - =0,^ =0, ,0),
compression member may be written as
w)--o;-
and
i,{b.z,f)=ot -of Q 0. (5-88)
*r=(0'
If this buckling constraint is violated, then
All the displacement components are checked
A0s=_(a. -a),30,/3f =0,
and any other violation is treated in a similar
way.
s = 0 for i # /, 5
dh. db,. b, db< ui '' '
5-5.1.5 BOUNDS ON DESIGN VARIABLES
5-26
AMCP 708-192
(1) Lower Bound o n Design Variables: Problem par. 4-3.1, the present formulation
handles it without any difficulty.
This constraint is written as bLt < bt or in
terms of notation of Eq. 5^ After all the constraints have been con-
sidered, the matrices d(j>/db, 90/3?, and
4>s(b,z,f) =-bLt-b < 0 (5-92) 90/9z are available and X* can be solved
from Eq. 5-15. This still does not allow
Violation of this constraint yields, the matrix * of Eq. 5-21 to be assem-
bled. The following matrices must also be
A0S=-O>'' -K), 3*f = 0, computed
9 9 r -
it may be noticed here that the cross-sectional 2 l K
area of any member of the truss may be
assigned a predetermined value by putting the
same upper and lower bound on it. This
situation may be encountered in practice due (=i db
(tfTk,ff)2
to various reasons, and as shown in Example
(5-97)
5-27
AMCP 708-192
It should be noted here that the summation constraints. This procedure of taking into
sign in Eq. 5-97 represents the summation of consideration all the loading conditions has
m matrices of dimension (n x m). The worked out quite satisfactorily in the example
quantity inside the differentiation sign is an problems.
n-dimensional vector whose components may
be dependent upon the design variable vector 5-5.3 EXAMPLE PROBLEMS
b. Therefore, the quantity inside the summa-
tion sign is an n x m matrix for each index i Several trusses are designed by applying the
However, in the present case, since Kt is a procedure presented in this paragraph. A
function of only bf, the computation of Eq. computer program, based on the algorithm
5-97 is greatly simplified. Consideration of stated previously, was written in FORTRAN
each i in Eq. 5-97 generates a (n x m) matrix IV. The computations were performed on the
whose only nonzero elements are in its ith University of Iowa IBM 360-65 computer.
column. Computation of Eq. 5-97 is per- The stiffness matrix or the structure was
formed quite readily and automatically in the inverted by the Gauss-Jordan elimination
computer. procedure, and the power method was used to
find the smallest eigenvalue.
All the information being available, the
matrix 2* of Eq. 5-21 may now be assembled Results for three typical trusses are pre-
and the algorithm of par. 5-3.2 may be used sented here. All these structures were de-
to solve actual problems. signed with stress, displacement, buckling,
and frequency constraints. Examples 4-1 and
5-5.2 MULTIPLE LOADING CONDITIONS 4-2, par. 4-1.1, are compared with results in
Ref. 25. These were designed with and with-
Most structures are designed to withstand a out frequency and buckling constraints in
multiple loading environment. This is quite order to compare the results with Ref. 25.
reasonable, because only a certain combina- Example 4-3, par. 4-3.1, is treated in Ref. 26,
tion of loads may act on the structure at a and it was also designed with only stress
particular time. This situation is handled in constraints in one case to compare results
the par. 5-5.1 formulation by expanding the with Ref. 26. All sample problems had lower
state variable vector z to include all states. limits on areas of the elements and Example
The element force vector / is also expanded 4-3 had upper limits. The program is general
accordingly. Formulation of displacement, enough to handle different lower and upper
stress, and buckling constraints must also take bounds on stresses in an element, elements of
into consideration all states of the system. different materials, and a different buckling
This is handled in the manner that follows. parameter a. for each element. The examples
While formulating a particular displacement follow:
constraint, the value of displacement for each
loading case is checked and each violation is 1. Example 5-1. Five-node Four-bar Truss
entered into the reduced constraint vector 0.
After this, the procedure of calculating the Fig. 5-18 shows the geometry and the
matrices d<j>/db, 90/3f, and 9<A/3z is the same dimensions of the truss. Input and output
as explained earlier. An exact same procedure information is given in Table 5-9. In order to
is followed in treating stress and buckling compare the results with those of Ref. 25, the
5-28
AMCP 706-192
Figure 5-18. Four-bar Truss (Example 5-1) 2. Example 5-2. Transmission Tower
trass was first designed for stress constraints, Fig. 5-20 shows the geometry and the
Fig. 5-19(A), and second for stress and dimensions of the tower. This example also is
displacement constraints, Fig. 5-19(B). It treated in Ref. 25. In this problem, the
may be noted that the results presented here cross-sectional area of each member of the
are at least as good as those presented in Ref.
25.
The trass was also designed by including 140 /Cv > Starting Point 1
buckling and frequency constraints along with
other constraints. Two different starting
points were used in optimizing this trass. fini 1 h h li
Starting Point 1 was infeasible and Starting Iteration Number
Point 2 was feasible. The final design weight (B) With AII Constraints
beginning at Starting Point 1 was 113.48 lb
and at Starting Point 2 was 113.77 lb. The Figure 5-19. Iteration vs Weight Curves for
slight difference in the two weights was due Example 5-1, Four-bar Truss
5-29
AMCP 706-192
TABLE 5-9
Design Information: For each member, Young's Modulus of Elasticity Ejt the specific weight p/r lower limit on
area of cross sectionbL. ,and the constant a(. are 104 kips/in.2, 0.10lb/in.3, 0.10in.2,and 1, respectively. There is
no upper limit on member size. The resonant frequency for the truss is 284.6 Hz. For Output 1, the stress limits on
each member are 25.0kips/in.2 and the displacement limitsat node five are, 0.0, 0.3in., and 0.4in. inthex,-,
x2-, andx3-directions, respectively. For Output 2,the stress limits for each member are 15.0kips/in.2, and the
displacement limitsat node five are 0.15 in. in all three directions. There are three loading conditionsfor the truss;
they are: in positivexj-, x2-, andx3- directions, 5, 0, 0; 0, 5, 0; and 0,0, 7.5 kip, respectively, applied at node five.
Weight, Weight,
10.19 9.09 34.86 14.28
lb lb
Weight, Weight,
69.72 113.48 257.68 113.77
lb lb
5-30
AMCP 706-192
trass is treated as an unknown design variable, cycles. The values of final design variables
and the results obtained are given in Table compare quite well with those in Ref. 25. At
5-10. The tower was designed first with only the final design point all constraints were
stress constraints. The final design weight in satisfied within 0.006%.
this case was 91.13 lb with a computation
time of 38 sec for 12 iterations. The final The tower was also designed with stress and
design weight reported in Ref. 25 was 91.14 displacement constraints and, finally, with all
lb with a computation time of 9 sec for 5 the constraints included. The design weight in
5-31
AMCP 706-192
TABLE 5-10
Design Information: For each member of the structure, the modulus of elasticity ), the specific weight pi, the
constanta/, and the stress limits are 104 kips/in.2, 0.10lb/in.3, l.Oand 40.0kips/in.2, respectively. The lower
limit on the area of cross section of each member is O.lOin.2 for the case with stress constraints only and 0.01 in.2
for other cases. There is no upper limit on the member sizes. The resonant frequency for the structure is 173.92 Hz,
and the displacement limits are 0.35 in. on all nodes and in all directions. There are six loading conditions, and they
are as follows (all loads are in kips):
output:
5-32
AMCP 706-192
Output: (Cont'd.)
Weight,
132.37 91.13 772.24 546.18 669.80 590.32
lb
the first case was 546.18 lb with a computa- optimized for a single loading condition. The
tion time of 47 sec for 17 iterations, and the design information and the results are shown
maximum constraint violation was 0.00011%. in Table 5-11. In order to compare the results
The comparable design weight reported in with Ref. 26, the truss was first optimized
Ref. 25 was 555.11 lb with a computation with stress constraints only. The final design
time of 24 sec for 7 cycles. This shows that, weight was 2,993.37 lb with a computation
when displacement constraints are also in-
cluded, the results obtained with the new
780
gradient projection method are slightly better
than those of Ref. 25. For a design with all
the constraints included, the final weight was /W:*h All Constraints
590.32 lb with a computation time of 129 sec Optimum Weight = 590.32 ,b
for 36 iterations, and the maximum violation
of constraint was 0.028%. Fig. 5-21 shows With Stress and Displacement
variation of the cost function with respect to I Constraints
520
3. Example 5-3. 4 7-Bar Plane Truss 4 8 12
Iteration Number
The schematic diagram of the structure
with dimensions is shown in Fig. 5-22. This Figure 5-21. Iteration vs Weight Curves
for Example 5-2, Trans-
example is also treated in Ref. 26 where it is mission Tower
5-33
AMCP 706-192
6.72K 18 10 3 17 @) 13 14 6.72K
2 ft 6 in.
2 ft 6 in.
time of 115 sec for 17 iterations. At this 9th iteration for which the design weight was
point the stress in member 18 was violated by 2,998.88 lb; maximum constraint violation
0.24% and all other violations were less than was 0.10% for stress in 7th member and all
0.035%. Another feasible design occurred at other violations were less than 0.016%. The
5-34
AMCP 706-192
TABLE 5-11
Design Information: For each member of the structure, the modulus of elasticity Er the specific weight p:, and
the constanta. are 3.Ox 104 kips/in.2, 0.284lb/in.3, and 1.0,respectively. The resonant frequency for the structure
is 16.0 Hz and the displacement limits are 1 in. on all nodesand in all directions. There is one loading condition for
the truss which is shownon Fig. 5-22 .Allowable stress in tension for all members is 21.28kips/in.2
output:
5-35
AMCP 708-192
output: (Cont'd.)
final weight reported in Ref. 26 was 3,328.5 that for practical purposes, convergence oc-
lb which is considerably higher than the one curred in approximately 6 iterations.
reported herein. This may be attributed to the
fact that in Ref. 26 the members are divided
into eight groups so that there are only eight 4000
of elastic, working stress analysis. The AISC where At is the area of cross section, Zf is the
Steel Construction Manual (Ref. 27) is used section modulus, r. is the least radius of
for the properties of these sections. The gyration of the ith element of rigid frame, and
constraints considered are stress, buckling, a, ct, and df are constants. These constants
displacement, natural frequency, and restric- can be found by plotting curves of area of
tions on design variables. cross section, section modulus, and the least
radius of gyration versus the moment of
5-6.1 PROBLEM FORMULATION inertia of various economical beam and
column sections. These curves have been
In the problems considered here, the drawn by Nakamura (Ref. 28) for wide flange
geometry of the frame is assumed to be sections of AISC Steel Construction Manual
specified, i.e., lengths of the members or the (Ref. 27), and the same values are used in this
joint coordinates are not treated as design handbook. This approach of obtaining con-
variables. Multiple loading conditions for the tinuous relationship for area of cross section,
structure are treated by the procedure ex- section modulus, least radius of gyration, and
plained in par. 5-4.2. The moment of inertia the moment of inertia has also been used by
for each element is treated as the design other researchers in their work (Refs. 29, 30,
variable; therefore, ft is a vector whose ith 31).
component b{ is the moment of inertia of the
ith element. In calculating weight or volume The objective function, Eq. 5-1, for this
of the structure, element direct stresses, ele- problem is again taken as the total weight of
ment bending stresses, area of cross section, the frame which may be written as
and the section modulus of each element
must be known. Also, in order to calculate
the allowable compressive stress for an ele- /= 2 pAJi^V P^pA,1'1 (5-101)
/=; ' ' ' i=i ' ' ' '
ment, its least radius of gyration r. must be
known. These quantities are required as con- The displacement method of structural analy-
tinuous functions, rather than discrete num- sis is used, and nodal displacements of the
bers, in the present formulation. Since the frame are considered as basic state variables.
moment of inertia of each element is its only Therefore, the jth component of the state
design variable, the quantities area of cross variable represents the jth displacement com-
section, section modulus, and the least radius ponent of the frame. Fig. 5-24 shows a simple
of gyration must be expressed in terms of the scheme for designating joints, members, and
moment of inertia of the element. These displacement components of a frame in the
relationships of the ith element are written as structure coordinate system. Fig. 5-25 shows
follows: a frame element in the member coordinate
system with the sign convention to be used on
A, =a1b1l/2 (5-98)
element forces and deformations. It may be
5-37
AMCP 706-192
z2 z5 Z
8 noted that FJ and FJ are direct forces on the
L
element, F[ and Fs' are shearing forces, and
a F'z and F'6 are the moments at the end of an
element. The structural analysis equations
developed in par. 5-4 and in Appendix B are
1 3 5
also used here. The element forces are com-
<
j
F = Sz (5-102)
Figure 5-24. Description of a Frame
where
"2
S = K(b) (5-103)
'F'Z2
U'
.w kips/in. Dimensions of the matrices K(b) and are
I I I f\ 4 t I \ t I I t I F'4' F'4 adjusted for the case of frames and F is a
M r
vector which consists of element forces for all
the elements of the frame. The stiffness and
>*3 ~/Ai mass matrices for the frame element are
different from the truss element. They are
Figure 5-25. A Frame Element given by the following matrices:
1/2 1/2
fii 0 0 a
fii 0 0
6/. 4 0 -6/Lj 2
K.= (5-104)
1/2 -1/2
aibi 0 0 aibi 0 0
0 6/Lj 2 0 - 61L. 4
5-38
AMCP 706-192
140 0 0 70 0 0
5-39
AMCP 706-192
F, =0.60F (5-109)
Members subjected to both axial com-
pression and bending stresses shall be pro-
2. Bending:
portioned to satisfy the following require-
ments:
Fb = 0.66 Fy (5-110)
(1) When//F, < 0.15,
where F is material yield stress, F, is
allowablec tensile
1 stress, and Fb is allowable
f f
bending stress. Js. +J < 1.0 (5-116)
a b
3. Compression:
(2) When/a/Fa > 0.15,
On the gross cross-sectional area of axially
loaded compression members, when kL/r, the cmfb
largest effective slenderness ratio of any un- < 1.0 (5-117)
braced segment, is less than C (\ -^-\F.
F'
5-40
AMCP 706-192
5-41
AMCP 706-192
be derived for various types of loadings and given by Eq. 5-112 includes an allowance for
members, but the values recommended in both of these factors and is adjusted to
Eqs. 5-120, 5-121, and 5-122 are conservative account for their varying influence. For short
and are used in the present work. For a columns, Eq. 5-112 approaches the basic
detailed development and discussion of these safety factor in tension (1.67); and, at e = 1,
equations the reader is referred to Ref. 32. it becomes 15%higher (1.92), a value which is
then used in the case when kL/r exceeds Cc.
The allowable compressive stress formula, Eq. 5-112 is an approximation of a quarter
Eq. 5-115, is derived based on the basic sine wave between the two limits, the curve
theory of column buckling. It is obtained by used in the specification as best representing
dividing the Euler buckling stress by a factor the influence of the two factors. For a
of safety of 1.92. Therefore, F = ir2 7/1.92 detailed discussion of the these factors, the
x (kL/r)2 ] and, taking E = 3.0 x 104 Ksi, Fa reader is again referred to Ref. 32.
= 1.49 x 10s KkL/r)2. Eq. 5-115 is applicable
when the largest slenderness ratio kL/r is The effective length factor k for each
greater than or equal to Cc. Experiments have member of the frame is found from the
shown that when kL/r < Cc , the values of the differential equation
failure stress predicted by the Euler critical
stress formula are seldom attained (Ref. 32).
dly Py n
This is due to the presence of residual stresses T + =0, (5-123)
and other imperfections in fabrication of the dx2 El
members. Therefore, when kL/r < C , the where P is the buckling load, and / is the
values of the allowable stress F are found second moment of the cross-sectional area.
from Eq. 5-111 which is derived based on the
parabolic approximation of the curve-critical The solution of this equation is given by
stress Fa versus the slenderness ratio kL/r in
the range kL/r < Cc. This approximation is
chosen based on the experimental results P P
y{x)=Dx sin/ x + D2 cos /- x
obtained at Lehigh University (Ref. 32). The yj El TJ El
value of the constant C, is found by assuming (5-129)
that the Euler critical stress formula holds
until the critical stress isF12. Therefore, In rigid frames, two cases must be discussed:
(1) frames without sidesway, and (2) frames
a2E
F 2
OTCc = (kL/r)l = y/27r2E/F with sidesway. The transcedental equation
y' ~ {kL/r)\ that comes from Eq. 5-124, while satisfying
where L and r must be expressed in the same the boundary conditions for a member of the
units. frame without sidesway, is given by Ref. 32
1 + GB + G.GJn/k)2 -1 Or/*)
The factor of safety is used to account for (G.A
2
small imperfections of form and loading, and
variations of support and restraint conditions
x sin (ir/k) ~(GA +G)(n/k)2 + 2
from those assumed in computation, which
cause the true effective length to be different
from that calculated. The factor of safety x cos (ir/k) + 2 = 0 (5-125)
5-42
AMCP 706-192
where GA and Q are given by the following equations is used in finding the roots of Eqs.
equations: 5-125 and 5-128 in the present work.
[GAG, i-nlk)2 -36] sin(V^) Eq. 5-132 is used in computing the maximum
bending stress required in the interaction
- 6{GA + GB )(ir/k) cos (v/k) = 0, equations. Now, the implementation of the
interaction equations will be considered one
(5-12; by one and the vectors 9<s/9z and 90s/d6 will
be computed in each case. For the sake of
where GA and G, are given by Eqs. 5-126 simplicity, let ./V be the direct force on the
and 5-127, respectively. Thus, for the case of element, M max be the maximum bending
a frame with sidesway, Eq. 5-128 must be moment, kj be the effective length factor, and
solved for k for each member of the frame. L- be the length of the member to which the
The secant method of nonlinear algebraic ith element belongs.
5-43
AMCP 706-192
N Mmax
-1.0< 0. (5-133)
dM
max max bb}
*,= ZF rr) (5-139)
AiFa i bb 4b. bb
Mmax
m/IV =F'3OTF'6
(5-136) b^
x(l-ej)[l-_j
bb
and
3
Vf e.+-
bM_ bFk bFl
(5-137) k. (F.S.) I ' 8 (F.S.)
bz bz bz
M
(1 ^ i '
If Mmax occurs at a point other than the 1 --e2 ] ' (5-142)
ends, then Eq. 5-132 gives its value, and 2 ' I bb
5-44
AMCP 706-192
('-} (5-150;
N M
n max (3) Interaction Eq. 5-118:
0 = + - 1.0 0 (5-145)
AtFa
Next, consider Eq. 5-118, which may be
where written as follows for the maximum stress in
the ith element
*i = (5-146)
(' -#'F"
N . Mmax
r 7 + -1.0 0..(5-151)
0.6FyAi ZtFb
If this constraint is violated, then one must
compute A$s, 30S/3J, bifrjbz, and bfijdb. In
this case 3<_/3f = 0 and Therefore, 30s/3f = 0 and
,1 / " CM \ N
Ai=J
S +ggf-i.0\ (5-152;
\0.6FyAf ZtFb ]
Differentiating Eq. 5-145 with respect to z Differentiating Eq. 5-151 with respect to z
andb, one obtains and b, one obtains
5-45
AMCP 706-192
It may be noted in the previous equations 54 and 5-5 that follow also are treated in
that Ref. 28 and were first designed for only stress
constraints in order to compare results with
db! (ith) those of Ref. 28.
-(0,.....0,1,0, ,0) (5-155)
ab
1. Example 5-4. Simple Portal Frame
The value of the vectors such as dN/dz,
Fig. 5-26 shows the dimension of the
dNIdb, dF'2/dz, Fib can be found directly
frame. The moment of inertia for each ele-
from the Eq. 5-102. The vectors dCm/dz and
ment of the frame is treated as an unknown,
dCm/db are zero for cases prescribed by Eqs.
and the results obtained are shown in Table
5-120 and 5-122. For the case prescribed by
5-12. The frame was first designed with only
Eq. 5-121, they are computed using the chain
stress constraints. The final weight in this case
rule of differentiation. It remains to find
value of the vector dk./db. This vector can be was 3050.5 lb with a computation time of
computed by differentiating Eq. 5-125 or Eq. 3.74 sec for 13 cycles. At the final design
5-128 with respect to the design variable point, the maximum constraint violation was
vector b. However, due to the fact that both 0.012% for stress in element 2. Optimal
GA and GB arefunctionsof b, this computa- weight reported in Ref. 28 was 3206 lb,
tion is quite tedious and time consuming on which is higher by approximately 5%.
the computer. Another approach that may be
followed for computing akjdb is to use the The frame was also designed by including
method of finite differences; but this ap- all the constraints. The resonant frequency
proach is equally time consuming on the limit for the structure was 25.0 Hz and the
computer. Moreover, it has been observed in final weight obtained in this case was 3803.0
lb with a computation time of 14.60 sec for
the numerical computation that the value of
31 iterations. At the final design point, the
kt does not change appreciably from one
maximum constraint violation was 0.0073%
design cycle to another. Therefore, without
significant loss of accuracy, the value of k. in
a particular design cycle is treated as a
constant. However, at the start of each design
cycle, k values for all the members of the -240 in.-
5-46
AMCP 706-192
TABLE 5-12
Design Information: For each element of the frame, the modulus of elasticity, the specific weight, and the yield
stress are 3 x 10* kips/in.2, 0.2836lb/in.3, and 36.0kips/in.2, respectively. The constants a,-, c- and dj are 0.58,
0.58,and 0.67,respectively. The lower limit on the moment of inertia of each element is 1.0 in.4 and there is no
upper limit. The resonant frequency is 25.0 Hz and the displacement limits are 0.5 in. at nodes2, 3, and 4 in both
xv and x2 -directions. There are three loading conditions for the frame; first is uniformly distributed load of 0.5
kip/in. on elements 2 and 3, second is a load of 45.0 kips inx-direction at node 2,and the third is a load of 45.0
kips inx,-direction at node 4.
El. Starting Values, Final Values, El. Starting Values, Final Values,
No. in.4 in.4 No. in.4 in.4
for stress in element 2. Fig. 5-27 shows the in only 5 iterations in the first case and in 7
variation of the objective function as the iterations in the second case. However, in the
iterations progress. It may be noted that, for second case, the cost function continued to
practical purposes, convergence was obtained reduce for a few cycles beyond the 7th
iteration without correcting the constraints.
This was due to the fact that the step size for
4400 1 i r- the problem was too large.
With All Constraints
Optimum Weight = 3803.0 lb
3900
2. Example 5-5. One-bay Two-story Frame
i With Only Stress Constraints The present example is also treated in Ref.
Optimum Weight = 3050.5 lb 28. Fig. 5-28 shows the dimensions and the
loading conditions for this structure. Input
and output information for this example is
given in Table 5-13. This frame was first
designed for stress constraints only. The final
0 4 8 12 16 18 weight in this case was 8292.0 lb with a
Iteration Number computation time of 21.47 sec for 32 itera-
tions. Maximum constraint violation at the
Figure 5-27. Iteration vs Weight Curves for
Example 5-4, Simple Portal design point in this case was 0.27 percent for
Frame stress in element 3. The comparable final
5-47
AMCP 706-192
weight reported in Ref. 28 was 8810 lb, The frame was also designed while en-
which is again higher by approximately 5.8%. forcing all constraints. The resonant frequen-
cy limit in this case was 15.0 Hz. The final
, 6K/ft weight obtained in this case was 9722.5 lb
/ 45K with a computation time of 48.84 sec for 32
45K uiuuuniirn]
3 0' 4 III iterations. Maximum constraint violation was
0.38 X 10"3 % for displacement of node 3 in
the x{ -direction. Fig. 5-29 shows variation of
180 in.
6K/ft the cost function with respect to iteration
JL
45K
mimnimii 45K number, and it may again be noted that
convergence was obtained in 8 cycles in both
III
the cases.
180 in.
3. Example 5-6. Two-bay Six-story Frame
0)
Figure 5-30 shows the geometry and
240 in. - dimensions of the frame. This frame has 21
joints, 30 members, and 54 degrees of free-
Figure 5-28. One-bay, Two-story Frame dom. The frame was designed for four loading
(Example 5-5) conditions, and the input and output informa-
TABLE 5-13
Design Information: For each element of the frame, the modulus of elasticity, the specific weight, and the yield
stress are 3 x 104 kips/in.2, 0.2836lb/in.3, and 36.0kips/in.2, respectively. The Constantsa., c;, and d,.are 0.58,
0.58,and 0.67,respectively. The lower limit on the moment of inertia of each element is 1.0in.4 and there is no
upper limit. The resonantfrequency for the frame is 15.0Hz and the displacement limits are l.Oin.at nodes2, 3,4,
5, 6, and 7 in both x, -andx2 -directions. There are three loading conditionsfor the structure, and they are as shown
on Fig. 5-28.
El. Starting Values, Final Values, El. Starting Values, Final Values,
No. in.4 in.4 No. in.4 in.4
5-48
AMCP 708-192
2 3
1
lion for the problem is given in Table 5-14.
3
1
4
2
5
t
144 in
The frame was first optimized by imposing
the stress constraint only. The optimum
5 6 \
6 7
weight in this case was 21706.6 lb with a \
8 9 10 144 in.
computation time of 8.32 min for 21 itera- ''
8 9
tions. At the final design point, the maximum 7
11 12
constraint violation was 0.025% for stress in 13 14 15 144 in.
element number 25. Next, the frame was
designed by imposing all the constraints. The
10
16
11
17
12 fI
optimum weight in this case was 24290.1 lb 18 19 20 144*1.
with a computation time of 8.7 min for 20 14 15 \
iterations. At the final design point, the
maximum constraint violation was 0.0072%
13
23
21
24
22
25
I
144 31.
5-49
AMCP 706-192
TABLE 5-14
Design Information: For each element of the frame the modulus of elasticity, the specific weight, and the yield
stress are 3x 104 kips/in.2, 0.2836lb/in.3, and 36.0 kips/in.2, respectively. The constantsa^c., andd.are 0.58,
0.58,and 0.67 respectively. The lower and the upper limits on the moment of inertia of each elementare 394.5in.4
and 6699.Oin.4, respectively. The resonant frequency of the structure is taken as 4.0Hz and the displacement limits
are 2.Oin. at all nodes in bothx!- and x2-directions. There arefour loading conditionsfor the frame: (1) Uniform-
ly distributed load of -4.0 kips/ft on element 1, 7,11, 17,21, and 27,and 1.0kip/ft on elements2, 6,12,16,22,
and 26; (2)Uniformly distributed load of - 4.0kips/ft on elements 2, 6, 16, 22, and 26,and 1.0kip/ft on
elements 1, 7,11, 17, 21,and 27; (3)Uniformly distributed load of - 1.0kip/ft on elements 1, 2,6,7, 11, 12,16,
17, 21, 22, 26,and 27, and loads of 9.Okips each at nodes 1, 4, 7, 10, 13, and 16 in direction of the x-axis; (4)
Uniformly distributed load of - 1.0kip/ft on elements 1, 2,6,7,11, 12, 16,17,21,22, 26,and 27, and loads of
- 9.Okips each at nodes 3,6, 9,12,15, and 18 in direction of xt axis.
El. Starting Values, Final Values, El. Starting Values, Final Values,
No. in.4 in.4 No. in.4 in.4
5-50
AMCP70G-192
5-51
AMCP 708-192
5-52
AMCP 708-192
design parameters will not cause significant designer a clear picture of the manner in
violation in specified performance constraints which he should change his design parameters
such as stress limits and deflection limits. to reduce total weight, subject to stress
constraints. He can now choose the desired
To illustrate these ideas, consider the reduction A/ in weight and take the resulting
simple structural design problem in Fig. 5-34. design change 8b, or if he wishes, he can input
modified design changes through an inter-
active computer terminal.
5-53
AMCP 706-192
5-54
AMPC 708-192
j-jj
AMCP 706-192
5-56
AMCP 706-192
TABLE 5-15
1 2 3 4 5 6 7 8
Max.
Defl, in. 0.00689 0.02390 0.00766 0.02559 0.005 0.005 0.005 0.005
2. Example 5-8. Transmission Tower computation and the Curve 1 of Fig. 5-41
shows the variation of cost function with the
Fig. 5-40 shows the geometry and dimen- number of iterations. The computations of
sions of the transmission tower to be studied. this case were monitored to determine which
This problem has been considered by cross sections went to their lower bounds.
Venkayya and others (Ref. 39). The tower
has 25 members, 10 joints, 18 degrees of One set of members which attained their
freedom, and is designed for 6 loading condi- lower limits of cross-sectional area were num-
tions. The structure is indeterminate, with a bers 10, 11, 12, and 13. It was observed that
degree of indeterminacy of seven. these members carried small forces and could
be removed without causing collapse of the
The tower was designed by first imposing tower, so they were removed from the tower.
only stress constraints, and then by imposing The final values of areas of cross section of
stress, displacement, buckling, and natural the resulting structure are given in Column 2
frequency constraints. Design information is of Table 5-17. Curve 2 of Fig. 5-41 shows the
given in Table 5-16, and the final results variation of cost function with respect to the
obtained are shown in Tables 5-17 and 5-18. design cycle. The final weight in this case was
Table 5-17 shows the results when only stress slightly less than the previous case.
constraints are considered, and Table 5-18
gives the results for the corresponding cases The next member that reached its lower
when all the constraints are considered. For limit was number 1, so it was also removed
results given in Column 1 of Table 5-17, all from the structure. The results of this case are
the members of tower were included in the given in Column 3 of Table 5-17 and Curve 3
5-57
AMCP 706-192
75 in.
5-58
AMCP 708-192
TABLE 5-16
For each member of the structure, the modulus of elasticity E., the specific weight p/V the constant a. (moment
of inertia of /th member,/. = ai>.2), andthe stress limitsare 104 kips/in.2, 0.10 lb/in.3, 1.0, and 40.0 kips/in.2, re-
spectively. The lower limit on the area of cross section of each member is 0.10 in.2 for the case with stress
constraints only and 0.01 in.2 for other cases. There is no upper limit on the member sizes. The resonant frequency
for the structure is 173.92Hz andthe displacement limitsare 0.35 in. on all nodesand in all directions. There are six
loading conditions and they are as follows (all loads are in kips):
of Fig. 5-41. The final weight in this case was 16, and 17 reached their lower bounds but
86.94 lb, which is given slightly less than the removal of all of these members rendered a
previous case. Finally, members 14, 15, 16, structure that was geometrically unstable.
and 17 were at their lower limits of cross- However, members 14 and 16 or 15 and 17
sectional area. Removal of any of these could be removed without causing the col-
members, however, would cause collapse of lapse of the structure. Results with members
the structure. Members 2 and 5 or 3 and 4 15 and 17 removed are given in Column 5 of
could be removed to make the structure Table 5-17 and similar results are obtained by
determinate. The results for a statically omitting members 14 and 16 from the com-
determinate structure, obtained by removing putation. The next set of members that were
members 2 and 5, are shown in Column 4 of at their lower bounds and could be removed
Table 5-17. The final weight in this case was without making the structure unstable were
106.97 lb. It may be noted that this statically numbers 1, 12, and 13. These were also
determinate structure yielded only a local removed from the structure and the results
optimum, Curve 4, Fig. 5-41. obtained in this case are given in Column 6 of
Table 5-17. Two other members could be
Another sequence of removing the mem- removed from the structure to make it
bers that reached their lower limits of area of statically determinate. Results obtained by
cross section was also tried. Members 14, 15, removing members 4 and 5, and then numbers
5-59
AMCP 706-192
TABLE 5-17
El.
No. 1 2 3 4 5 6 7 8
Max.
Defl. in. 2.288 2.305 2.311 3.489 2.486 2.453 3.614 3.615
2 and 5 are given, respectively, in Columns 7 tained by removing members 1,2, J, 15, 16,
and 8 of Table 5-17. Computations were also 19, and 20 was optimized. The cross-sectional
carried out by removing members 2 and 3, areas of various members at the optimum
and members 3 and 4 along with members 1, point were as follows: 3,4(0.100); 6 to
12, 13, 15, and 17. Results obtained in these 9(0.779); 10,11(0.182); 12,13(0.446);
cases were the same as those shown in 14,17(0.302); 18,21(0.775); 22,25(0.537);
Columns 7 and 8 of Table 5-17. For this and 23,24(0.751). The optimum weight in
reason, these results are not reproduced here. this case was 118.1 lb and the maximum
Finally, another determinate structure, ob- deflection at this point was 3.861 in.
5-60
AMCP 706-192
TABLE 5-18
Max.
Defl, in, 0.350 0.350 0.350 0.350 0.350 0.350 0.350 0.350
All these tower configurations were also mum weight of the tower increased as more
optimized by imposing all constraints; i.e., redundant members were removed from the
stress, displacement, buckling, and natural structure.
frequency. The results of these cases are given
in Table 5-18. Curves 1 to 4 of Fig. 5-42 show
the variation of cost function with respect to 5-7.4 INTERACTIVE COMPUTING CON-
the iteration number for results of Columns 1 CLUSIONS
to 4 of Table 5-18. It can be observed from
the results of Table 5-18 that, for the casein Computing times for this interactive com-
which all constraints were imposed, the opti- puting approach are considerably shorter than
5-6"1
AMCP 706-192
140
870-1
Curve 4
* 100-
2 4 6
Figure 5-41. Iteration vs Weight Curves for Iteration Number
Example 5-8, Transmission
Tower With Stress Constraints Figure 5-42. Iteration vs Weight Curves for
Only Example 5-8, Transmission Tower
With Al I Constraints
had been experienced when the same prob-
lems were solved in the batch mode. Second,
and probably more significant, interactive
computing allows the designer to alter the
right. For the case when only stress con-
structural configuration in a systematic way
straints are imposed, results of Table 5-15
to seek the global optimum design. This is not
indicate that minimum weight designs for
to say that a mathematically precise method
trusses with multiple loading may be statically
of obtaining a global optimum has been
determinate. However, the results of the
found, for no such method is known. It
appears, however, that the technique pre- second example given in Table 5-17 indicate
that all statically determinate trusses may not
sented here makes strong use of the designer's
be lighter than the indeterminate trusses.
knowledge and intuition, and gives him a tool
with which to seek a global optimum in an
organized way. For the case when all constraints are
imposed, results of Tables 5-15 and J-18 show
The results presented for the two examples that statically indeterminate trusses are lighter
solved in par. 5-7 are of interest in their own than the determinate trusses.
REFERENCES
6. R. Luik and R. J. Melosh, "An Allocation 14. W. R. Spillers and J. Farrell, "On the
Procedure for Structural Design", Pro- Analysis of Structural Design", J. Math.
ceedings AIAA/ASME 9th Structures, Anal, and Appl., Vol. 25, 1969, pp.
Structural Dynamics and Materials Con- 285-295.
ference, April 1968.
15. W. J. Woods, "Substructure Optimization
7. W J. Woods and J. H. Sams, III, "Geo- in the Theory of Structural Synthesis",
metric Optimization in the Theory of Proceedings AI A A 7 th Aerospace
Structural Synthesis", Proceedings Sciences Meeting, January 1969.
AIAA/ASME 9th Structures, Structural
Dynamics and Materials Conference, 16. M. J. Turner, "Optimization of Struc-
April 1968. tures to Satisfy Flutter Requirements",
Proceedings AIAA Structural Dynamics
8. R. L. Barnett and P. C. Hermann, High and Aeroelasticity Specialist Conference,
Performance Structures, NASA Report, April 1969.
NASA CR-1038, May 1968.
17. C. P. Rubin, "Dynamic Optimization of
9. M. J. Schrader, An Algorithm for the Complex Structures", Proceedings AIAA
5-63
AMCP 706-192
Structural Dynamics and Aeroelasticity 27. Manual of Steel Construction, Sixth Edi-
Specialist Conference, April 1969. tion, American Institute of Steel Con-
struction, 1967.
18. R. L. Fox and M. P. Kapoor, "Structural
Optimization in the Dynamics Response 28. Y. Nakamura, Optimum Design of
Regime: A Computational Approach", Framed Structures Using Linear Pro-
Proceedings AIAA Structural Dynamics gramming, Master's thesis, Department of
and Aeroelasticity Specialist Conference, Civil Engineering, M.I.T., Cambridge,
April 1969. Massachusetts, 1966.
5-64
AMCP 706-192
5-65
AMCP 706-192
CHAPTER 6
6-1
AMCP 706-192
(6-4)
6-2
4MCP 706-192
J(x). The admissible class of functions here spaces may be described in a similar manner
plays a role similar to the constraint sets of as
Chapters 3, 4, and 5.
C'(a,b) = {x(t), a-i t< b \ x(t) has i
The idea of classes of functions required continuous derivatives ( . (6-6)
here is basic to the mathematical field called
Functional Analysis. Classes of functions in It should be understood here that x(t) may be
this field are called function spaces. Consider, a vector valued function and the differ-
for example, the collection of all continuous entiability requirement in Eq. 6-6 refers to
functions x{t) on0< (< 1. The graphs of each component.
several such functions are shown in Fig. 6-3.
Function spaces may be thought of as sets
of elements, where elements in the function
space are really curves or functions. In this
way the problem of minimizing J{x) may be
viewed as picking the element (curve) in the
appropriate function space that makes J(x) as
small as possible. This approach makes mini-
mization of a functional sound very similar to
the programming problems of Chapter 2. With
this mental analogy one may begin his study
Figure 6-3. Examples of Continuous Functions of the calculus of variations armed with a
powerful intuitive tool.
It is clear that there are infinitely many The basic ideas of function space theory
continuous functions but that not all func- are presented very clearly in Ref. 1, Chapter
tions are contained in this class. For example 2.
6-3
AMCP 706-192
f"
1/2 X(t) - 5
x\\ = x2(f)df (6-11)
J ,u
This simply says that J(x) has a minimum in a Examples 6-1 and 6-2 have features in
sufficiently small neighborhood of 2. It is common that allow for the formulation of an
interesting to look at a neighborhood of a entire class of problems containing these two.
curve in C(t,tl) where norm is defined by For the sake of generality, let the variable x(t)
Eq. 6-10. In this case Eq. 6-12 simply be a vector valued function of the real
demands that x(t) be within 6 of x(t) for all t variable t, i.e.,
in t Q t ss t1. The neighborhood of x in this
case is simply the collection of all continuous x,(0
curves which can be drawn between x(t) + 6 x(t) = (6-13)
andx(t) 6, as shown in Fig. 6-4.
xn(t)_
The present chapter will be devoted almost
exclusively to the theory of the calculus of where x;(f) are real valued functions of t.
variations and optimal process theory. Con-
structive methods for these problems will be The problem considered here may be for-
treated in the chapters to follow. A knowl- mulated as Definition 6-1.
edge of this basic theory is essential for
successful application of the theory of opti- Definition 6-1 (Fundamental Problem of
mal design. It has been the experience of the the Calculus of Variations): Find a function
author that most real-world problems require x(0 inC2^0,/1) which satisfies
6-4
AMCP 706-192
x.(t) = x , for some indices k ia such candidate that must then be the solu-
tion. If there are several candidates, other
x- (tl) = x J, for some indices 1 <s ;' s n methods must be used to choose the solution.
This problem will be discussed later.
(6-14)
Graphically, the method of obtaining con-
and which minimizes ditions on the solution, x{t), of the funda-
mental problem will be to allow small changes
J(x) = \
f Fit, x, x')dt (6-15) inx(0 and examine the behavior of J(x). An
admissible, small perturbation is illustrated in
Fig. 6-5. The equation for this curve is x(t) +
where F is a real valued function of all its
arguments and
' dxx
dt
(6-16)
(6-18)
tion, x(t), inC2(?,f') first will be assumed. A
set of conditions that x(t) must satisfy then Recall that x(t) is a local minimum of J(x)
will be derived. These conditions then may be subject to Eq. 6-14, i.e., any small change in
employed in particular problems to find x{t) increases J{x). For any given function
functions x(t) that are candidate solutions of q(t) mC2{t,t') and satisfying Eq. 6-17, x{t)
the problem. Hopefully, there will be just one + er?(0 is in C2(t,tx) and satisfies Eq. 6-14
6-5
AMCP 706-192
for all E. Therefore, for this r\(t), J(2 + 617) is a be treated independently, i.e., each is required
real valued function of e. Further, for E - 0, to be zero. One of the major results which
J(x + erj) has a relative minimum and it is follows is a direct application of Lemma 6-1.
assumed that F(t,x,x') is twice continuously
differentiable in x and x' so that J(2 + erj) is a Lemma 6-1: If M(t) is a continuous func-
twice continuously differentiable function of tion on t <i t < t1 and if
E. Theorem 2-2 then applies, and it is required
that
n
J(x + eri)
ae k=o
0. (6-19)
f M(t) 77(f) dt = 0 (6-22)
ae ie=o
V+ r}'}dt = G, (6-20)
X X I
4 (
where the arguments in the partial derivatives Figure 6-6. Graphical Proof of Lemma 6-1
of / in Eq. 6-20 are x(t) and x'{t). It is
important to remember that Eq. 6-20 is
required to hold for any rj(t) in C2(t,tl) t* < t1 , is shown where M^t*) = 0. The curve
which satisfies Eq. 6-17. 77.(t) is then constructed so that neither
function is zero in the interval a < t < b.
Integrating the second term in the Their integral over the entire interval is then
integrand of Eq. 6-20 yields nonzero which is a contradiction of Eq. 6-22,
so M{(t*) = 0.
oF d OF
; Vdt Since the two terms in Eq. 6-21 must each
dx dt ox'
be zero,
dF
I! dx " dt \ox')
r}dt = 0 (6-23)
Since the behavior of 77 inside the interval t for all 77(f) in C2(t,tl). In any subinterval of
< t < tl and at its ends are independent, the t < t < tl where x(t) is continuously
integral and boundary terms in Eq. 6-21 may differentiable, the quantity
6-6
AMCP 706-192
[dF/dx d/dt (dF/dx')] is continuous. forallt?(?), r\{tl) satisfying Eq. 6-14, and
Therefore Lemma 6-1 implies
f-[f_0,Jc(/-0),Jc'(f-0)]
dF dF dx
=0 (6-24)
dx dt \dx
ML fff 0,x(f+0),x'(f+0)] (6.28)
- f'
17'" J
dF
dx
dt + C. (6-25)
called the Weierstrass-Erdmann corner condi-
tion.
6-7
AMCP 706-192
Rather, when solutions of Eq. 6-26 are This is simply an algebraic equation be-
determined, Eq. 6-30 is used to eliminate tween t and x. Since there will be no
unsuitable functions, i.e., it may very well constants of integration, it will not generally
disqualify a function which satisfies Eq. 6-26. be possible to pass the resulting curve through
particular points. This means that a solution
The derivation of necessary conditions for to such a problem generally will not exist.
the fundamental problem is only very lightly
covered here. Further, the theory of sufficient Example 6-3: Minimize
conditions is completely neglected. For out-
standing and complete treatments of these
topics see Refs. 2, 3,4, and 5. 1 x2dt
J 0
6-2.2 SPECIAL CASES AND EXAMPLES
for
In many problems the form of the function
x(0) = 0, x(l) = 1.
Ftt,x,x') allows for simplification for the
Euler-Lagrange equation, Eq. 6-26. In any The condition, Eq. 6-33, is
case, Eq. 6-26 may be written, using the chain
rule of differentiation and the notation
2x = 0.
3F d2F
F Fx , and But it is, therefore, impossible to satisfyx(l)
* ~?)x' ' ~dx"F'x' dtdx
= 1 so the problem has no solution.
2
F
F To get an idea of what has gone wrong,
x'x a \ax
note that since x2(t) > 0 for each t,
to obtain
T TF
1x2 (t)dt > 0
Fx-Fx;-x' Fl'x-x" *'*
0
and this is not even a continuous function. Since OO2 > 0, [ 1 + (x')2 ] * 0 and x'(t) is
The class of functions xn (t) are illustrated in required to be continuous so [ 1 + (x')2 ] =
Fig. 6-7. and it, therefore, is required that
x"(t) = 0
or
x{t) =at + b,
Example 6-4: Using the formulation of Eq. 6-26 is, in this case,
Example 6-1, find the shortest curve in the t-x
plane which passes through the points (0,0)
Fxx.(t,x') = 0
and (1,1). dt
6-9
AMCP 706-192
or This reduces to
6-10
AMCP 706-192
A"
de Jf
F(t,x + eSx,x'
T 2
d F
r ~^ {t-x'x')bxT
x
+ e8x')dt T
e=o b2F
n-
+ 28x , (t,x,x')bxf
dxdx
3F
(t,x,x')bx
dx
T 2
d F
+ 8x' (t,x,x')bx' dt
dx2
dF
+ ,(t,x,x)bx dt.
dx'
Define
Note that all this does not require that x(t) d2F
be the solution of the fundamental problem.
dx2
If, however, x(t) = x(t) is the solution of the
fundamental problem, then it is clear from
d2F
Eq. 6-19 that it is necessary that 5=2
dxdx'
8J(x) = 0, (6-41)
and
for all 8x(t) for which x(t) + ebx(t) satisfy the
d2F
end conditions in the fundamental problem. C=
dx'
In a way quite similar to the definition of
With this notation,
the first variation, the second variation may
be defined as
d2
b2 J(x) = J(x + eSx)
de E=0
b2J(x)=\
I"
If 0
(bxTASx + oJBbx'
+ 8x'TC8x')dt
Performing the differentiation, this is
If F(t,x,x') has three derivatives, then by
J(x)--
d2
T" I F(t,x +e8x,x' +i 8x') dt
e=o
Taylor's formula
J(x+ebx)=J(x) + \
de
le=o
Sx1
d2J d*J
e< +
"de2 "d?
bF1 e=o e=e
(t,x +ebx,x' +e5x')
dx
(6-42)
T
dFT
+ 8x' (t,x + ebx.x + ebx') dt
dx' e=o where 0 < e < E. If we computed d3J/de3, it
6-11
AMCP 706-192
would involve a sum of terms each containing which has large derivatives. One might, there-
third degree terms in 8x and 8x'. For 8x and fore, be led to believe that the derivative term
8x' sufficiently small, this term may be in the inequality of Eq. 6-43 is dominant.
neglected to obtain a second-order approxi- This would then require C to be positive
mation when E = 1 so that semi-definite.
J(x + 8x)~J(x) + 8J + 82J. To show that this is the case, assume that
there is a point t", t < t < t1 and a nonzero
It is clear then that 6J and 82J play the vector h such that hTC(t*)h = - 2 < 0. For
role of differentials in the theory of func- any continuous bx'(t) such that dx'ft*) = h,
tionals. there is an interval
This is just / OL
h sin I \,t*-a< t< t*+a
7T
2
8 J( x)> 0 Sx(t) = '
0, elsewhere
or
so that
(8xTA8x+8xTB8x' I \ir{t - r*)l ^
h cos , t* <x< t < t + a
Sx'(t)
+ 8x'TC8x')dt> 0 (6-43)
0, elsewhere
6-12
AMCP 706-192
b2F
Isin0 II cos 0 \d6 - [t,x(t),x'(t)]
dx
J it
6-13
AMCP 706-192
lim
J[x<n>] =J(x) = Z. (6-45) x<n>(t)= 2 a.4>t(t) (6-46)
It is clear, however, that a sequence which where the 4>/t) are chosen so that x^nHt)
satisfies Eq. 6-44 may very well fail to satisfies the end conditions associated with
converge to an admissible function (?/ This the fundamental problem. This expression k
must necessarily be the case if no solution of then substituted into J(x) to obtain
the fundamental problem exists. From an
engineering point of view, one may not be too J[x<n>] =
concerned with existence of a limit of the
sequence [x<nl (t)] .Provided it is possible to
successively reduce /, consistently better re- I'1
sults are being obtained and the process will FL.Z afaU), arfft^dt. (6-47)
be continued until no further meaningful
reduction in / may be achieved. For an The object now is to choose the coefficients
outstanding treatment of convergence of di- ap i = 1, ..., n, so that J[x^"^ ] is as small as
rect methods, see Ref. 2, page 192, and Ref. possible. For this purpose, it should be noted
3, page 127.
that the right side of Eq. 6-47 is simply a
function of parameters. The problem is now
The problem of primary interest to the
to minimize this function without any other
engineer is the construction of a minimizing
constraints. For this purpose, any of the
sequence. There are many ways of generating
methods of Chapter 2 may be used.
such a sequence, only two of which will be
treated here. These methods are known as the
Ritz Method and the Method of Finite Differ- The property
ences.
J[x(" + l>} < J[X("1]
6-2.4.1 THE RITZ METHOD
follows readily from the method of deter-
The Ritz Method is based on the idea of mining the af. It is clear that by choosing
representing functions by using linear com- an+l = 0, x(n+1) (0 = x'"'(t). However, by
binations of known functions; i.e., given 4>/t), allowing a+l to be nonzero, a larger number
i = 1,2, . . . which preferably form a complete of functions are available as candidates for
set, a function is represented by minimum of J[x<-n+1)] than J[x<") ]. The
minimum of /[x(" + 1)] will, therefore, cer-
tainly not be greater than that of J[x<n> ] and
y(t) = 2 a.<t>.(t)
/= l ' ' this is the desired result.
6-14
AMCP 706-192
J(x)-
i: [x'2 _(i + t2)x2 -2x]dt
(6-48)
x<2)(t) =75fe(3969 - 4200r2 + 23lr4).
In particular,
subject to the end conditions x(\) = x(l) =
0. x<2)(0) = 0.93344.
6-15
AMCP 706-192
ment of derivatives and the integral by finite chapters that these side conditions generally
approximations of these continuous opera- include both equality and inequality con-
tions. A grid, t = t0, ti , ..., tn+1 = t' , is straints. An extension to inequality con-
placed on the interval t < t < tl and the straints will be given in par. 6-4.
value of xft) only at the grid points is sought,
i.e., only the parametersx( =x(tj), i = 1,..., n, The problem to be treated here is given in
and perhaps x(t0) or x(tn+l), are sought. Definition 6-2.
J(x,)= 2 t;X.,
J = g0(b,t',x>)
1=0 ' ' t.i+i-h
(6-50)
x Oi+l-ti)) + f0[t,x(t),u(t),b]dt
('t"
The problem is now simply an uncon- subject to the conditions
strained minimization problem in a finite
dimensional space and may be solved by the dx
methods of Chapter 2. f(t,x,u,b), t < t< t" (6-5 1)
lit
6-16
AMCP 706-192
and t < t' < tv, where (t',x') are inter- The allowed discontinuities of f0, f, La,
mediate points, / = 1, ...,77 - 1. and 4>a play an important role in many
real-world problems. This feature allows for
For the problem considered here it will be completely different forms of state equations,
required that the conditions, Eqs. 6-53, shall constraints, etc., for different ranges of state
not determine any component of x(t) ex- and time. It is, therefore, possible to routinely
plicitly. This is equivalent to requiring that account for sudden changes in system be-
the rank of the matrix havior such as reverse in direction of frictional
force, motion of objects in a space where
physical barriers or restraint surfaces exist,
30,
(t, x, u, b) (6-55) logic built into the system which changes
Lbuk configuration as in staging of rockets, etc. It
q X m
should be clear that these features are re-
shall be q for all admissible values of the quired in order to treat many realistic prob-
arguments. In case some constraint function lems.
should depend only on x(t) and t; this
constraint is called a state variable constraint. For a discussion of the effect of these
This kind of constraint will be discussed in a discontinuities on more detailed necessary
later paragraph. conditions and sufficient conditions, see Ref.
8.
The vector variable x(t) is called the state
variable, u(t) is called the design (or control) 6-3.2A MULTIPLIER RULE
variable, and b is called the design (or control)
parameter. Eqs. 6-52 contain the boundary As mentioned in par. 6-3.1, real-world
conditions on the state variable and functions optimal design problems require at least the
which determine the end points of the inter- complexity of the Bolza problem of Def. 6-2.
val, t and /". The independent variable t In fact, the system designer requires all the
may be time or a space-type variable, depend- tools the mathematical theory of optimal
ing on the problem being considered. processes can give him. This requirement
points out one of the obstacles to engineers in
The functions f0,f,La, and <j>a are assumed utilizing the modern theories of mathematics.
to be continuously differentiable at all points This text cannot possibly present the mathe-
except (t'.x1), j = 1,..., TJ 1. At these points matical theory required of the research math-
the functions may have jump discontinuities; ematician who is developing the theory of
i.e., the functions will have limits along any optimization. The approach taken here to
path, but limits along different paths may by-pass this obstacle is to accept a key
have different values. In general, even for theorem of Functional Analysis and then
problems with very regular functions, u(t) proceed to develop the tools required for
may havejump discontinuities. Therefore u(t) solving problems of optimal design. A very
6-17
AMCP 706-192
6-18
AMCP 706-192
J=\0g0(b,t',x') + 2 laga{b,t',x')
I {\0fo(t,x,u, b) + 2 \,.(f)
J=G +
n dt
\T(t)-H]dt
dt
(6-60)
\dxt
, dx
+ 2, \La(t,x,u,b) (O-r -H] dt.
dt
+ r> 3C . . , 9G . 3G
i> * r n *
+ 5?" +8b
9r" 9
f5-J8)
G - Kgo(b,t>,x>) +2 -v (b t1 x1')
LI a? 9JC
n
H,x,u,bXy,n) = \TU)fU,x,u,b)
. T, , dSx 9//
- ~k0fo(t,x,u,b)
9// ,. 9//
"^7 ('.*..*) dt
du ab
6-19
AMCP 706-192
dHT
-Hit7' -0,x, u, b) lot".
J'
Integrating the first terms in each integrand
dh
dt x
Tfor t = t! (6-61)
9G n dG A dG p . 1/"
0 = Ax + ... + Ax" +- 5f dH
dx dx* dt dt = 0 (6-63)
dG , dG
+ ...+T5f* +^-(5ft
I
3r> 36
l_r-\(r) = o
3x
1
t'
T
L dt dx dG
+ \(tr>) = 0 (6-64)
~dx^
dH dH
+ 5+ 8b dt
du db r + \(f> -0J-\(t' +0)=0
dx'
If r f"
dt L+H(t +0)=0
6-20
AMCP 706-192
Even though Eqs. 6-61 through 6-67 are For proof of this theorem see Refs. 8 and
very complicated, it is interesting to note that 13.
they provide just the right number of equa-
tions to solve for all the unknowns. Eqs. 6-51 Another useful result is the following iden-
along with Eq. 6-61 form a system of 2n tity:
first-order differential equations for x{t) and
X(f). Further, the first and last members of dH__dH_
Eq. 6-64 may be considered as 2n equations ,fort*f'. (6-69)
dt at
in boundary conditions onX and x. This is the
proper number of boundary conditions. The
This condition is useful in case H does not
second equation of Eqs. 6-64 provides any
depend explicitly on/. Then H is constant
jump conditions in \{t) at the intermediate
between the points t>', and at these points it
points t', 0 < j < r). Eqs. 6-65 may be
may have discontinuities governed by the
interpreted as determining t>, j = 0, 1,..., 77,
third equation in Eq. 6-65.
and Eq. 6-66 determines t*. Eq. 6-67 simply
states that X is continuous even at jump
To prove this relation holds, compute
discontinuities in u. Finally, Eq. 6-63 deter-
formally
mines the design parameter b.
It should be clear that this argument only dH M m du dH dx dH dh
+ + +
shows that there are the proper number of dt at du dt ax dt ah dt
6-21
AMCP 706-192
y=v
In order to obtain familiarity with the
Bolza problem, several examples will be con-
. F (6-70)
sidered. In order to illustrate the basic ideas u = cos 0
associated with the Bolza problems, these m
examples will be elementary. In real-world . F
problems the engineer should be prepared for v = sin 6 g,
complexity that will probably force him to
use a numerical method of solution. For
examples of the Bolza problem in the field of where
dt
aerodynamics, a field which contributed
greatly to optimal design theory, see Refs.
15,16, and 17. The projectile is fired from a gun at time t
= 0 with x(0) = y(0) = 0 and initial velocity
Example 6-6: Maximum Range Rocket- w(0) = V cos 0O, "(0) = V sin 0O, where V is
assisted Projectile the muzzle velocity of the projectile. The
problem at hand is to choose B0 and d(t) so
A projectile of mass m is acted on by a that at some future time T, the projectile will
fixed force F as shown in Fig. 6-8. The angle hit the earth as far as possible from the launch
of 0(t) is measured from the x-axis, where the point, i.e., y(T) = 0,x(T) = maximum.
J = go(l>,t',x') = -x(T).
Figure 6-8. Particle in Motion Boundary conditions on the state variables are
6-22
gl =x(0) = 0 dG
= 7i = \(0)
g2 = y(0)=Q 3x(0)
F dG
H = \u +X v+\ cose
dy(D
= x5 = \(T)
K
(isind^
(6-78)
(6-73) dG
= 0 = \(T)
where variable named subscripts are used for du(T)
the A's.
dG
0 = \(T)
Theorem 6-5 yields as necessary conditions dv{T)
dH dG
= H(T-0) (6-79)
dT~
dH and
= 0
\ = dy
(6-74) \ cos + Xv sin 0 < X cos 6
dH
X = = -X
du + \ sine (6-80)
dH for aU admissible .
$*-
dG | dH
dt = 0 73 Vsin0
ae
de0 - J o
6-23
AMCP 706-192
X =i2(.T-t) 1 TF
T(VSm8, -gT+smdo)=0.
2 2m
and 2m
_ (T _ t) sin 8 + 2 (T- t) cos 0=0. The one condition which has not been
used is Eq. 6-79. By substitution of Eqs. 6-81
and 6-83 into Eq. 6-73, Eq. 6-79 becomes
For all t = T, this is
TF
V+ m-J cos d0 + i-i I ysin 8,
- sin 8 + 2 cos d = 0.
h
These equations imply TF
+ sin0 ~gT
m
2 =tan8, By use of Eq. 6-82 this becomes
(6-82)
8(0=60
V+ ) cos2 60 + [V+ I sin2 0O
6-24
AMCP 706-192
T=-
3FV
2 m
-t
4m2
4P ( A
\2m2 2 /
taken as
2(
-^ (6-87)
. v2 M h{t)F sine
u = - *
Substituting T from Eq. 6-87 into Eq. 6-85
(6-88)
then gives an easy equation for 6 0.
uv , h(t)F cos 6
v = f
While the results of this problem are not
particularly useful, the solution does illustrate
the use of the various conditions in Theorem m =h(t) 4
6-5 in generating a candidate sclution of the
problem. The reader, however, should not be where
led to believe that all Bolza problems may be
solved in closed form as in this example.
6-25
AMCP 706-192
r(T)=R, u(T) = 0
(6-89)
+ X.
MV h (t)F sin <p
1
v(r> = (M/^)1/2 + \nh(t)q.
w = radial velocity
i*f)-4f
7 = tangential velocity K = -X+x
m = mass of spacecraft
\.=-\A )+ X, (6-90)
fi = gravitation constant ft
MO-F sinift
F = thrust \=\, L W
4 = mass flow rate during thrust
ft(0-F COS0
+ X.
<t> - thrust orientation
h(t)F cos<p
Since m0 m(t) is the amount of fuel o = x
consumed up to time t, the object here is to m
minimize (6-91)
, h(t)F cos (p
J - m0 m(T).
XM(D = -1 (6-92)
6-26
AMCP 706-192
The prospect of solving this set of equa- Definition 6-3 (Problem cf Optimal De-
tions in closed form is dim indeed. A general sign): The optimal design problem is a prob-
procedure can be discussed, however, and the lem of finding u(t), b, x(t), t <s t < tn, which
actual solution can be obtained using numeri- minimize
cal methods discussed in a later paragraph.
J = g0(b,t'\x')
From Eq. 6-91, <p{t) may be determined as
c
order differential equations for the state and
adjoint variables. Eqs. 6-89 and 6-92 form a
ga{b,t>,x>)+ I La [t,x{t),u{t)Mdt
set of eight boundary conditions for these
variables. Eqs. 6-93 show that the adjoint
variable is continuous and Eqs. 6-94 deter- = 0, a= 1, ...,r (6-98)
mine tx and t2. A numerical procedure may
be used to solve this problem. The resulting
adjoint variables may then be substituted into ga(b,t',x')+ I La [t,x(t),u(t),b]dt
Eq. 6-95 to obtain the explicit design (or
control) variable. A problem of this kind is
discussed in Ref. 18. The method used there < 0, a/ = r + 1,..., Y (6-99)
to construct a solution is completely different
(p(,t,x,u,b) = 0,= 1, ...,q'
from the one proposed here.
(6-100)
t a to t*
6-4 PROBLEMS OF OPTIMAL DESIGN
AND CONTROL and
6-27
AMCP 706-192
reduction, define the slack variables va, a = r' required to have rank q. For this purpose, the
+ 1,..., r and w At), = 4' + 1,..., q by design vector must be considered as (uT,
wT)T, where wT = (w -+1, W
J-
ga(b,t',x')+ |
I. I[/,x(0,K(f),*]df
(6-102)
The matrix, Eq. 6-55, becomes
90t 901
+ v2 =0,SL = /+ L...,i 0, ..., 0
3, 3
and
^(U,,i)) + ws2(O = 0, 9^
,0,...,0
(6-103) 3! 3
)3 = </'+!,...,? (6-104)
is of full row rank. This simply says the and the conditions, Eqs. 6-1 lOthrough 6-113,
gradients of all constraint functions (which are unchanged. Further, Theorem 6-6 yields
are equalities) with respect to the design
variable must be linearly independent. Assum- H[t,x(t),U,bMt),yfl,v,w]
ing this is the case, Theorem 6-5 may be
applied. < H[t,x(t),uV),bMt),0,v,w] (6-1 15)
Define
//=X^/_X0/0_ 2 yaLa_ Su K
a~ 1 =1 If w = 0, then by Eq. 6-103 <f> = 0. If w
J=0,<t>< 0 and /* = 0. Therefore, Eq. 6-116
(6-108) is equivalent to
(6-109) (6-117)
d\ _ u=r' + 1,.... r
(6-110)
dt dx
M or
=0 (6-111)
du
2yava(] +(i - t) = 0, a = r'+ 1, ...,/-.
ffl'
=0 (6-112)
Since 1 + ?" _ f =0,
"J,f
dG d_H_ 7ava =0, a =r' + 1,. (6-118)
dt = 0 (6-113)
db db
6-29
AMCP 706-192
dGT
7a ga(b,t',x')
3GT
La[t,x(t),u(.0,b] dt = 0. + X(r*)=0 (6-124)
3x"
9G
db
6-30
r dH
db
dt = 0 (6-123)
//[r,x(f),u(f),,X(/),7,0]
It should be noted that, just as in Theorem plicitly on u(t), then this equation is of the
6-5, the number of conditions here is just form required in the problem treated in par.
equal to the number of unknowns, so one 6-4.1. If not, then differentiating through this
might be led to believe that a solution may be equation with respect to t and using the chain
found. Existence of a solution is, however, a rule of differentiation
very difficult question that is treated in Refs.
10 and 11. d2K *K 2*fl
+ 2
""** bt2 Utax
6-4.2 STATE VARIABLE INEQUALITY
CONSTRAINTS
r
24> df
+/ " f + 3v
(6-133)
In many meaningful design problems con-
straints may involve restrictions only on the where all the arguments are omitted. If the
state variable. This is the case when some <p right side of Eq. 6-133 depends explicitly on
of Eq. 6-101 depends only on t, x, and b. To u{t) then this equation is of the form treated
study this problem, just one such constraint in par. 6-4.1.
needs to be considered, i.e.,
This process continues until
<j>M,x,b)< 0,t < t< r" . (6-132)
Let t~ < t< t+, t~ < t+, be an interval in O-/0 [t,x{t)M (6-134)
which <S> of Eq. 6-132 is an equality. It is dt"
clear that b(j>Jbu = 0, so the matrix, Eq.
6-105, has a zero row in this interval and involves u(t) explicitly in its right side and
hence cannot be of full row rank. Theorem u(t) can be determined as a function of x(t)
6-7 cannot be applied directly, so further
and b, as in par. 6-4.1. The integer i> A 1
analysis is required.
is defined to be the first integer for which this
is true. The constraint, Eq. 6-132, is then
In the interval t < t< r = 0 so it is
called a f^th order state variable inequality
necessary that
constraint.
b<t> b<j> dx
= 0 From the theory of ordinary differential
dt bt dx dt
equations (Ref. 14), Eq. 6-134 throughout
From Eq. 6-97, dx/dt may be replaced by / t~ < t < f+ and
and this relation becomes
At-,x(r),b] =0 (6-135)
b<t>[t,x{t)M
dt bt
d0
- [r,x(r),b] =0, i = \,...,p._.
d<t>M,xU),b] dt1
+ f[t,x(t)Mt),b]
ox (6-136)
If the right side of this equation depends ex- are equivalent to 0 = 0 throughout t~ <
6-31
AMCP 706-192
t < t+. This, of course, requires that <j> Theorem 6-8: If [x{t), u{t), b, t!, x'] is a
have v piecewise continuous derivatives, solution of the optimal design problem with
and / have v&l piecewise continuous state variable inequality constraints, then
derivatives in t~ < t < t+. The point there exist multipliers X0 > 0, \.(/), i = 1, ,
t~ plays the role of a t' in the problem n, ya, a = 1, ..., r, ya > 0, a = r' + I, ..., r,
stated earlier in this paragraph. Hv(t),= l,...,qi,M/J(r)>0,j3 = q'+l,...,4>and
TT i = 1,..., v and associated with a state
variable constraint and 4> = 0 in?" < t< t+,
It will be assumed that when the right side
of Eq. 6-134 is used in place of </> in d\ 3HT
computing the matrix, Eq. 6-105, this matrix r= -- ,foi t=t', r, t+ (6-141)
at ux
has full row rank. In this case Theorem 6-7
may be employed. To utilize this theorem, ML-
define 0, for t t1, r, t+ (6-142)
3u
,.*<o
V/=l d
(6-137)
3G
3b
+
LJ:
3G
3b
3H_
3b
dt = 0 (6-143)
<t>, = 0
3GT
dt -\(t) =0
3x
where v& = 0 if 0 involves u explicitly, 3GT
dx*
G = XQg0+ 2 \ga (6-138) 3GT
et=l
+ \(t> -0)-\(t> +0)=0 <6-144)
~3xi~
V
- *~l \{ dl
% [t-,x(t),b]
-, )\ (6-139)
G=2 2 T- +\(r -0)-\(r +0)=0
1=0
dt1 ax-
where this sum on/3 is extended only over the \{t -0)-\(t+ +0)=0
indices associated with state variable in-
equality constraints, T~ are multipliers, and +H(t+0)=Q
d- -Hit" -0)=0
ff = X'/-X0/0- 2 yaLa
a=l dt*
(6-140)
6-32
AMCP 706-192
H(t* -0)-HU* + 0)=0 (6-146) these subintervals is not known before the
solution is computed. The generality of the
X(t* -0)-X(t* +
0J=0 (6-147) problem makes it difficult to discuss all its
intracies without resorting to special cases and
lt(t)4>V,x,u,b) = 0, 0=1, ., q (6-148) examples.
6-33
AMPC 706-192
a tan e
X2 V cos x3
A2 (0 = f2
G=\0T+7l [Xl(0)-x\
and Eq. 6-156 implies J, =0. The last
+ 72 lx2(0)-x2] +y3[x3(0)-x] equation in Eq. 6-154 is then
6-34
AMCP 706-192
X3 = -S2 ^cosx3. Since x < 7r/2, forf small, either 0(f) =0O
or 0(f) = - 0O. From Fig. 6-11, it is
Using the first equation in Eq. 6-152 to reasonably clear that 0(f) = 0O and Eq. 6-152
replace V cosx3, this is can be integrated to obtain
Therefore, sin x ]
Since the last two terms in H are zero, the x1(t)=x*-R [sin (x*-bt) '
only explicit dependence of H on 0 is through
the term \3{t)a tan 0(f). The inequality, Eq. sinx.
6-131, states that 0(f) must maximize//. It is
clear then that if X3(f) = 0, then x2(f) = x| +R [cos (X? -of) V (6-163)
6-35
AMCP 706-192
and this slope is continuous. This means that These equations yield
segments of the optimal path where 6 = d0,
0, or 60 must be tangent where they intersect. x* = x\-R sinx +\R
2
-{xl2 -R
With this information, the solution of the
problem may be constructed geometrically.
-x2 -R cosx)2 1/2
**=T(*i ~
R sin
*3 +xf + R sinxj) Figure 6-13. Extremal Arcs With Straight
Section
6-36
AMCP 708-192
that they may be optimum. It is clear that for this case, Eq. 6-122 provides no information.
x\ < s there is only one possible solution of It is then required that the inequality, Eq.
the problem. For x\ > s this is not the case as 6-131, must be used to determine the design
shown for x\ = x\. Both the extremals variable. For a complete treatment of this
leading to the path x2 - x2 satisfy the subject, see Ref. 20.
necessary conditions of the theorem. It is,
geometrically, relatively clear that these are The problem treated in this paragraph is
the only two possibilities so the one with the not as complicated as most optimal design
shortest time required to get to x2 = x\ is to problems occuring in the real-world. It does,
be chosen. The test, Eq. 6-151, may eliminate however, illustrate some of the features and
one candidate. It seems clear that when the difficulty encountered in most realistic
extremal with straight line exists, it is best. optimal design problems. This problem should
convince the reader that the solution of
It should be noted that if x\ > s + 2R, it is optimal design problems is not simply a
impossible to intersect x2 = xl with only two matter of plugging numbers into formulas.
circular arcs so the extremal with a straight Even though analytical methods will be
section is required. stressed in subsequent work, the effective
solution of this class of problems requires a
This problem illustrates many of the basic sound understanding of the theory of optimal
ideas and complexities involved in optimal design.
design and optimal control theory. Some of
the features are worth noting because they Example 6-9: A Constrained Brachisto-
will arise later: chrone Problem.
2. Multiple solutions. As seen in the fore- This problem will be treated as an optimal
going problem, more than one candidate design problem. On the assumption that there
solution may be constructed. Condition, Eq. are no discontunities in the velocity vector,
6-13 1, must then be used to choose the best conservation of energy yields
of these candidates.
1
3. Singular arcs. It occasionally happens, as -mv = mgx2
in the vehicle steering problem, that there will
exist a set of values of the state variables and or
multipliers such that the function // does not
V = (2gX2)l/2.
depend explicitly on the design variable. In
6-37
AMCP 706-192
1/2 1/2
0 = (2g*2) sin H - (2gx2) tan a cos ^
0 (6-167)
Xl{0) = x2(0) = 0
Necessary conditions from Theorem 6-8 are
(6-165)
Xi{T)=x\.
X, =0
The object is then to find u(t), xt(t), and
1/2
x2(t) such that a particle starting at rest at X2 = -g(2gx2) [Xj cos
(0,0) reaches x(T) = x\ in minimum time T. (6-169)
The path is required to satisfy the constraint + X2 sin /x(cos u
1/2
Since the constraint of Eq. 6-166 does not (2gx2) [-X, sinw+X2 COSJ;
involve the design variable u explicitly, the
problem contains a state variable inequality + ^(sin u +tan a cosu)] = 0 (6-170)
constraint. Computing <j> and substituting
from Eq. 6-164 yields \2(T)=0 (6-171)
6-38
AMCP 706-192
/IT
2
Wi(, 0< t < t
It might be expected that when 0 = 0, then Fig. 6-15 shows solutions for tan a = 1/2
the curve is a cycloid as in Example 6-5. and several values of h
Whenever <p = 0 it is clear that u = a. This is,
in fact, the case and as presented in Ref. 21 The reader may very well get the impres-
the solution is a cycloid for sion from these examples that analytical
solutions of general optimal design problems
h 2
> -IT - a tana are extremely difficult to obtain. This is
\2 indeed the case. Therefore, either numerical
methods must be used to solve the equations
i.e., the optimum path does not touch the given as necessary conditions in the theorems,
constraint surface. or some direct computational method must
be used to solve the optimal design problem.
For Some numerical methods of solving the neces-
sary conditions are presented in the next
paragraph. Several optimal structural design
1- a ) tan a
problems are solved in Chapter 7 to illustrate
6-39
AMCP 706-192
6-40
AMCP 706-192
Eq. 6-179 is introduced to illustrate the The initial value problems, Eqs. 6-182 and
dependence of the final values of y on|. 6-183, for i = k + 1,..., n may be integrated
from t to tl to obtain the derivatives
It is clear that a solution of the problem required inEq. 6-181.
can be obtained if % can be found so that Eqs.
6-179 are equalities. To simplify notation, Once these derivatives have been deter-
define the column vectors mined, the new estimate |_ in Newton's
Method is given by
Ht1 ;D = iyjU1 ;)] for those; inEq. 6-177
and 1=?0 W1;*0)
[y(t1;H0)-y1
a?
y1 =
lyj ] for the same/.
(6-184)
The conditions which are to determine are
The process is repeated with ' playing the
yV1;Z)=y1. (6-180)
role previously occupied by .
Any numerical method of solving algebraic
equations may be used to solve Eq. 6-189. If a This method of finding the partial deriva-
scheme like Newton's Method or a Gradient tives is direct in nature but requires the
Method is to be used, it must be possible to solution of Eq. 6-182 n k times. Further,
compute both the differential equations, Eq. 6-176 and
6-182, must be programmed.
dy
as (t'\?)
(6-181) A second method of constructing the par-
tial derivatives of Eq. 6-181 (or approxima-
where is an estimate of the solution of Eq. tions of them) is to use a difference quotient,
6-180. These partial derivatives may be ob- i.e., Eq. 6-176 is solved for and + 5 where
tained or approximated in a number of ways. 5 = (0, ..., e, ..., 0)r where i indicates the ith
position and E is small. Therefore
The first method of determining the deriva-
tives in Eq. 6-181 is to observe that y(t) = <W ;6) ^ y1 ;$ + S) - yjt1 ;Q
y(t-) and further, that the dependence on
is very regular (Ref. 14) so that by{.t&)$ (6-185)
exists. Differentiating formally with respect
to inEq. 6-176,
Once these approximate derivatives are
determined, the algorithm, Eq. 6-184, may be
zy\ a/ ay (6-182) used.
dt
6-41
AMCP 706-192
computation is performed with the same set The Generalized Newton Method for
of differential equations. This method is solving Eqs. 6-186 and 6-187 is similar in
illustrated in the problems of pars. 7-2 and philosophy to the Newton method of solving
7-3. algebraic equations. An estimate of the solu-
tion, y<0Ht), is made and the right side of Eq.
A third scheme which makes use of differ- 6-186 is expanded about ;v(0)(f) using Tay-
entiation formulas for definite integrals is lor's formula to obtain
developed in par. 7-4.
^- =^-lt,y^\t)]y^+f[t,/Ht)]
6-5.2 A GENERALIZED NEWTON METH- dt dy
OD
(0)
Ityv'VHy^'V) (6-188)
A second method which is used to solve the dy
necessary conditions for optimization prob-
lems is a Generalized Newton Method of where y^Ht) is required to satisfy
solving boundary-value problems. It has been
pointed out in the foregoing that Bolza >'J1)(r)=>',0,forthoseiinEq. 6-187
problems and optimal design problems may
be reduced to nonlinear boundary-value prob- y)l)(tl )=y}, for thosej in Eq. 6-187
lems. The method employed here was devel-
oped forjust such problems (Refs. 22, 23). (6-189)
6-42
AMCP 706-192
Since the discussion in this paragraph is on The argument used in applying the General-
ways of solving optimization problems, the ized Newton Method to the problem of
Generalized Newton Method will be applied determining x{t), u(t), and \(t) from Eqs.
more directly to this class of problems. For 6-193, 6-194, and 6-195 through 6-197 as
the present, consider only the following prob- follows:
lem:
1. Solve Eq. 6-187 for
minimize J
I' f(t,x,u)dt (6-192) u =u(t,x,\) (6-198)
6-43
AMCP 706-192
It is clear that with the new variable, Eq. The Generalized Newton Method presented
6-200 is equivalent to Eq. 6-199. The neces- here has been discussed by many authors and
sary conditions of Theorem 6-5 may now be generally has received favorable comments.
applied and the Generalized Newton Method For a more detailed discussion and examples,
utilized just as in the preceding case. see Refs. 23, 25 through 28. An outstanding
treatment of the Generalized Newton Method
In case the optimal design problems with also appears in book form (Ref. 29). A very
state variable inequality constraints, a differ- rigorous treatment of existence and con-
ent technique for elimination of inequalities vergence properties of the method is given
has proved effective. For constraints of the which applies to the control problems dis-
form cussed. The reader should note that some
writers follow Bellman in calling the method
4>,(t,x) < 0 (6-201) described here, "Quasilinearization".
6-44
AMCP 706-192
REFERENCES
2- I.M. Ge If and and S.V.Fomin, Calculus of 12. L.S. Pontryagin, V.G. Boltyanskii, R.V.
Variations, Prentice-Hall, Englewood Gamkrelidze, and E.F. Mishchenko, The
Cliffs, New Jersey, 1963. Mathematical Theory of Optimal Pro-
cesses, John Wiley & Sons, New York,
3. N.I. Akhiezer, The Calculus of Variations,
1962.
Blaisdell, New York, 1962.
9. GA. Bhss, Lectures on the Calculus cf 18. D.S. Hauge, "Solution of Multiple-arc
Variations, University of Chicago Press, Problems by the Steepest-Descent Meth-
Chicago, 1961. od", in Recent Advances in Optimization
Techniques, A. Lavi and T.P. Vogl, Eds.,
10. E.B. Lee and L. Markus, Foundations of John Wiley & Sons, New York, 1966.
Optimal Control Theory, John Wiley &
Sons, New York, 1967. 19. H.M. Hung, Time-Optimal Steering of a
Ground Vehicle, presented at the 1968
11. W.W. Schmaedke, "The Existence Theory Spring Meeting of SIAM, Toronto,
of Optimal Control Systems", inAd- Canada.
6-45
AMCP 706-192
20. H.J. Kelly, R.E. Kopp, and H.G. Moyer, Blaisdell, Waltham, Mass., 1968.
"Singular Extremals", in Topics in Opti-
mization, G. Leitmann, Ed., Academic 25. G. Paine, The Application of the Method
Press, New York, 1967. of Quasilinearization to the Computation
of Optimal Control, Report No. 67-49,
21. A.E. Bryson Jr., W.F. Denham, and S.E. Department of Engineering, UCLA,
Dreyfus, "Optimal Programming Prob- August 1967.
lems with Inequality Constraints I: Nec-
26. R.E. Kopp and H.G. Moyer, "Trajectory
essary Conditions for Extremal Solu-
Optimization Techniques", in Advances
tions", AIAA J., Vol. 11, November
in Control Systems, Vol. 4, C.T. Leondes,
1963, pp. 2544-2550.
Ed., Academic Press, New York, 1966.
23. P. Kenneth and R. McGill, "Two-Point 28. B.D. Tapley and J.M. Lewallen, "Com-
Boundary-Value-Problems Techniques", parison of Several Numerical Optimiza-
in Advances in Control Systems, Vol. 3, tion Methods", J. Opt. Theory and Appl. ,
C.T. Leondes, Ed., Academic Press, New- Vol. l,No. 1, 1967, pp. 1-32.
York, 1966.
29. PL. Falb, Some Successive Approxima-
24. H.B. Keller, Numerical Methods for tion Methods in Control and Oscillation
Two-Point Boundary- Value Problems, Theory, McGraw-Hill, New York, 1971.
6-46
AMCP 708-192
CHAPTER 7
7-1
AMCP 706-192
Another important method of design devel- conditions that the solution of the optimal
oped principally by Prager and Drucker (Refs. design problems must satisfy. Once this task is
6,7,8) is limit analysis. In this method of complete, the design problem is reduced to
design, the structure is allowed to reach a the determination of solutions of the neces-
state of collapse due to plastic action of the sary conditions that are candidate solutions of
material. The resulting design is, therefore, the optimal design problem. The term "in-
safe for application of the given loads even direct" seems to describe this process quite
though permanent deformation of the struc- well.
ture results. If the loads must be applied
many times in the life of the structure, it will As discussed in par. 6-5, any method of
generally be required that all material in the solving the nonlinear boundary-value problem
structure must remain in the elastic range at contained within the necessary conditions is
all times. For this reason, methods had to be admissible. In this chapter, two problems will
developed for elastic design. be solved by shooting techniques. The prob-
lems of par. 7-2 are treated by the shooting
In 1960, Joseph B. Keller published an technique of par. 6-5. The problems of par.
article on column design (Ref. 9) which 7-3, however, are treated by a modified
renewed interest in elastic, minimum weight shooting technique.
design. Several papers have subsequently been
published by Keller and his associates in 7-2 A MINIMUM WEIGHT COLUMN
which a class of eigenvalue problems is treated
(Refs. 10,11,12). The methods employed in A lightweight column of length T is to be
these papers are elegant but are not easily designed to support a given load P. The
adapted to realistic engineering problems. material is specified and has yield strength
a
max- The particular support considered is
A new method of optimal design has been shown in Fig. 7-1. In problems considered
developed by J.E. Taylor and W. Prager since
1967 (Refs. 13,14,15). This method is based
on an energy representation of the structural
element under consideration. A particularly
nice feature of the method is the ability to
obtain sufficient conditions for certain classes
of design problems. However, no unified
method of constructing solutions has been
presented.
7-7777-7- / / ff f
7-1.3 METHODS EMPLOYED
(A) Undeflected (B) Deflected
The theorems of Chapter 6 will be em- Figure 7-1. Column Under Consideration
ployed here for the solution of optimal design
problems, Use of the results of Chapter 6 to here, the cross section is assumed to depend
construct solutions of optimal design prob- on only one design variable, u(t), 0 < to T.
lems is called an indirect method of solution. The problem is to determine u(t) that mini-
This is so, because one first obtains a set of mizes the weight or, equivalently, the volume
7-2
AMCP 706-192
A(u) = area of the cross section The system, Eq. 7-5, reduces to
7-3
AMCP 708-192
P/A(u) = a (7-7) dx
x(0) = 0, (T)= 0
at
In the remaining case, n = 0 and condition,
Eq. 6-122, of Theorem 6-7 is where the choice on the right side of Eq. 7-12
must correspond to the selection inEq. 7-11.
2Px2
Eau*
(7-10) 1.
dt2
EaP
or
7-4
AMCP 708-192
Step 3. Make the adjustment in C The results for this design problem are
given in Table 7-1 and Fig. 7-2. For loads
dx _ /<# above 6794 lb, the cross-sectional area is
Cl = C0 (P / (71
dt / dr ' determined by ^4 = P/omiX and the resulting
column is stable. A meaningful optimal design
Step 4. Return to Step 2 with new esti- problem then exists only for P < 6794.
mate C= Ct , and repeat.
7-3 A MINIMUM WEIGHT STRUCTURE
The equations derived here must be WITH ANGULAR DEFLECTION RE-
changed only slightly to solve column prob- QUIREMENTS
lems with other end conditions and other
forms of cross section. 7-3.1 STATEMENT OF THE PROBLEM
For a numerical example of this problem, The problem considered in this paragraph is
let the cross section be circular with variable the design of a portable communication tower
radius. In this case a = l/(47r). For the exam- of height L which will support a line-of-site
ple, let omax = 20,000 psi, = 3x10' psi, and transmission unit, a laser transmitter, for
7= 10 in. example. In order for the transmission beam
TABLE 7-1
Volume of
P, lb Volume, in.3 Uniform Column', in. Saving,'.
7-5
AMCP 706-192
A
r The general configuration of the tower is
shown in Fig. 7-3. Three vertical members
with cross-sectional area A{t) are arranged on
the vertices of an equilateral triangle of
altitude hit). Here t is a coordinate measured
along the length of the tower. In order to
maintain the spacing of the vertical elements,
small cross members are inserted.
A(t)
tional requirement must be placed on the is minimum. The objective in the design
tower. In transportation and election it must problem is to choose/4(f) and h(t) for 0 < t <
be strong enough so that it is not damaged by L, so that V is as small as possible and the
rough treatment. Therefore, it is required that given conditions are met.
the moment of inertia of the cross-sectional
area of the tower be greater than a prede- Lateral deflection of the tower due to the
termined limit I0 everywhere. lateral wind load is determined by elementary
7-6
AMCP 706-192
beam theory. The differential equation for considered in par. 6-4, define
displacement is (Ref. 14)
XI ^
I)=-A(t){h(t)]2. (7-16)
TABLE 7-2
CONSTANTS
In order to prevent damage in handling, it is
L = 360 in. 6 = O.OOOlrad
required that
Q = 8.35lb/in. = 0.25in.
I(t) > I0 > 0, 0 < t < L.
E = 3x 107 lb/in. / = 172.8in?
or in the notation of par. 6-4,
7-7
AMCP 706-192
V=
r
I
J 0
[A+h(t)]dt
par 7-3.1. Eqs. 7-7 7 become
x, X-i = fi
=AL+
r
Jo
h(t)dt.
to minimization of
*,(0) = 0
J=
r
Jo
h{t)dt.
x2(D = e
(7-21)
G = yiXl(0)+y2x2(0) + y3[x2(L)-d].
A', = 0
(7-22)
X2 = Xi
X1(I)=0. (7-24)
where t is measured upward from the bottom
of the tower; the other symbols are defined in The general solution of Eq. 7-22 is
7-8
AMCP 706-192
It
1/3
Xa(0 = -a -fi*- 3gagtf-Q2
h or
24
Condition, Eq. 7-24, implies 1 = 0, so (7-27;
1/2
34
Ai (0 = 0 24
f7-2j;
MO = -$a where by Eq. 6-131, the choice in Eq. 7-27
must be made which makes
For the determination of A<7), two cases
must be considered:
3tag(I-Q2
H=-h
4&4A2
Case 1: 4> = 0. In this case
a maximum.
2
-Ah -I0 =0
Let /* be a point of transition from one
expression in Eq. 7-27 to the other. Since
so Theorem 6-7 requires H to be continuous,
1/2 when the two expressions of Eq. 7-27 evalu-
3^o ated at t* are substituted into//, a common
/! = = Ai
2A value must occur.
Case 2: <t> < 0. In this case Eq. 6-128 is Evaluate H as a function of both h2 and
hi-
M0 = O
h
If X0 = 0, then, 2 = 0 and from Eq. 7-26, A, --
l
Xi = X2 = 0. This, however, violates the
condition stated in the first sentence of
Theorem 6-7. Therefore, X0 = 0 and it is H(h2) = H2 =-h2
permissible to choose X0 = 2.
3Q(-Q2 = -A2
A = = A, 2 /*
2,4
7-9
AMCP 706-192
Since h(t) is defined as the maximum of two Integrating Eq. 7-29 first from 0 to f* (as
continuous functions, h is continuous. It given by Eq. 7-28) and then from t* to L
follows that all points t* of transition from yields
one value of Eq. 7-27 to another can be found
by equating the two expressions of h(t).
3L5/3 / 3Q\1/3 -2/3
(L)= b
The point t is then determined by ** 15" I IS)
1/2
3/p 1/
V0/12(23/4x3+10/0/6)_r3/2
hi(t*) =
2A
1/3
(24)1/4101/2//4
_ 3{aQ(j,_/)i (7-30)
2fe = ,(f*).
Eq. 7-50 is solved numerically for 2. Once
This solution yields two values of f*. The 2 is determined, then Eqs. 7-27 and 7-28
requirement 0 s ;* < L results in a unique completely specify the tower. Results are
value of/* shown in Table 7-3(A) and Fig. 7-5(A).
7-10
TABLE 73(A) TABLE 73(B)
WEIGHTS OF SIMPLY SUPPORTED WEIGHTS OF GUY-LINE SUPPORTED
TOWERS, ONE DESIGN VARIABLE TOWERS, ONE DESIGN VARIABLE
TABLE 73(C)
WEIGHTS OF TOWERS
Number of
Design
Variables 0 12 0 1
h = 63.7 in. h
max = 914in hmax = 80 2 in h = 46 in. hmax = 46 5 in h
max = 36 55 in
A = 7.96 in.2 6.97 in.2 10.03 in.2 A = 3.84 in.2 4.434 in.2 4.95 in.2 >
3
o
o
1
o
O)
(O
M
AMCP 706-192
x x
i ~ i /i
, _ Qt{f - L) _ (7-31)
=/2
2EI
*!(())= 0
jca(0) =e } (7-32)
(A) One Control Variable (B) Two Control Variables
x1(L)=0
Figure 7-5. Tower With Base Rigidly Fastened
to the Earth
The quantity to be minimized is still given
by Eq. 6-17. In the problem considered here,
d\2 _ dH
= -X,
Define xt = x and x2 = x[; this is equiva- dt ~ dx-.
lent to
so
Xj(0 = -b -iit.
Also,
\2(L) = 0
7-12
AMCP 708-192
^Ah2 -/0=0
so
1/2
37o
1 (7-33)
2A
Xo =
2^
7-13
AMCP 706-192
along the tower and play the role of design is permissible to take X0 = 1- The system is
variables. then two equations for h and A with solution
1/4
Eqs. 7-16 and 7-17 remain the same in the 2
h(t) - (3iQ a -t)V
present problem. However, fromEq. 7-14 2E
1/3
) (7-37)
I/2
f0=A(t) + h{t). 3t2Q'< (1-0
A{t)
H[I0 - \ Ah2
"-(- . (7-39)
The equations for Xt and X2 are just as in the
3/0|32
preceding work, so again, A{t)
7-14
AMCP 708-192
Xi =n
hit)
Ait) =
3/oY/3
40
3/o0
I (7-43)
Xj ={,(-/)
(7 40)
-- A,*i3Qftf_Q' 1 " 7-3.4 DISCUSSION OF RESULTS
dA ' 4EA2h2
For both types of tower considered (simply
+ p^ h2 =0 supported and towers with top supported by
guy lines), the variables A and h could be
As before, two cases must be considered: fixed at constant values, large enough that
deflection requirements are met. For a given
Case 1. <j>< 0. This implies M = 0. configuration of the tower, there is one pair
of constant values A and h which yield a
From Eq. 7-40, it is clear that if X0 =0, then tower at least as light as any other combina-
i = 0, and Xi = X2 = 0. This contradicts the tion of constant A and h. For both types of
theorem, so X0 =0 and it is permissible to tower, finding these values is a matter of
take X0 = 1. The system is then a set of two simple algebra. Results for both towers are
equations for A and A which yields given in Table 7-3(C), referred to as towers
1/4
with no design variables. With/4 held constant
3ig ,1/4 ( _ 0l/2
and h{t) treated as a design variable, the
hit)
2E problem can be solved for several different
(7-41) values of A for both types of tower. Sum-
1/4
7-15
AMCP 706-192
over the case where only the spacing h(t) is maximum deflection at a given point could be
allowed to vary, of 6.1% and 6.7% for the among these.
simply supported and guy-line supported
towers, respectively. Similarly, these towers Note that Figs. 7-5 and 7-6 are not scale
represent respective reductions in weight over drawings of the towers, but are representative
the completely uniform (no design variables) of the general shape of the respective towers,
tower of 18.8% and 19%. as viewed on one face. Sample profiles of the
four possible structures are presented in Table
7-3(A), (B), and (C).
All four configurations of the structure
described in the preceding subparagraphs have 7-4 MINIMUM WEIGHT DESIGN OF
been successfully prescribed by a digital com- BEAMS WITH INEQUALITY CON-
puter approach. An IBM 360-65 Computer STRAINTS ON STRESS AND DEFLEC-
was used and the programs employed Runge- TION
Kutta integration with the Newton's Method
described in the text. Convergence depended The problems treated thus far in this
on getting a good starting value for the chapter have only design variable inequality
multiplier . This was made more difficult by constraints. In engineering design one often
the fact that has no physical significance so encounters problems in which it is required
that, consequently, engineering intuition was that the state of the system satisfies inequal-
no help. To find a sufficiently close starting ity constraints. As seen in Chapter 6, state
value for , (i.e., a value for which the error is variable constraints are more tedious to treat
of order two or less) several different values and have features not encountered in prob-
of were investigated, each increasing from
lems without state constraints. A class of
the previous value by a factor of 10, and the
beam design problems including state variable
first value very close to zero. Once a starting
constraints is presented in this paragraph to
value that would allow Newton's Method
illustrate some of the features and difficulties
iterate effectively was found, convergence
that can arise in this difficult class of prob-
occurred in ten or less iterations, taking less
lems. While the problems solved are of limited
than two minutes of computer time. This
practical value, they do illustrate typical
time could be reduced with increased sophisti-
features that can arise in state variable con-
cation of the computer program.
strained problems.
(7-44) V2(t)Q2U,d2(t)f
I2{t)b2[t,di(t)],
7(0 =I{u(t)\ (moment of inertia),
respectively, are the maximum principal
(7-45) stresses that occur in the cross section at t.
The values dt and d2 may be determined by
b(t,d) = b[u(t),d] (width of cross section the methods of ordinary calculus, for each t.
atd),
The problem is to determine u(t) so that
(7-46) the beam, subjected to a given ioading,
contains as little material as possible and still
Q{t,d) = Q[u(t),d] (first moment of area satisfiesthe following conditions:
above d, about the
neutral axis), 1. Principal normal stress is less than or
equal to some allowable normal stress
(7-47)
7-17
AMCP 706-192
In structural design problems, it is fre- for all f in (0,T), where M(t) is bending
quently sufficient to require only that the moment;
maximum flexural stress be less than <Jmax
T
and the maximum direct shear stress be less pW-< Tmax (7-53)
than 7-max. These conditions are considerably
easier to enforce than the conditions on for all t in (0,70 where V(t) is shear;
maximum principal stress.
/[(/)] >/o. (7-54)
If the beam is subjected to several loadings,
then the problem is more tedious but is no where I0 is a constant greater than zero; and
more difficult mathematically. Corresponding for all/in (0,71,
to each loading there is a a deflection curve,
bending stress, and shear stress that must Xi(t) < x(t) < X2(t) (7-55)
satisfy the stated conditions.
It is assumed that the functions appearing
Further, since the beams are made of above have the following properties:
homogeneous material, the weight of a beam
will be minimum if and only if its volume is 1. q has a piecewise continuous derivative
minimum. Therefore, in the following the in (0,70.
quantity to be minimized will be volume.
2. A, I, C, and Q are piecewise twice
The given problem is now stated mathe- continuously differentiable.
matically: A vector function w(f) is sought
which causes the functional 3. X, and X2 have continuous second
derivatives in (0,70.
at all but a finite number of points in (0, T), In case only maximum bending stress and
7-18
AMCP 706-192
maximum direct shear stress are to be In the statically determinate case, the
bounded by amax and rmax , respectively, differential equation, Eq. 7-50, and the
constraints, Eqs. 7-52 and 7-53, are replaced boundary conditions, Eq. 7-5 1, reduce to
by
d2x Mjt)
(7-58)
\M{t)\ C[u(t)] dt2 EI{u)
|o(r)| = (7-56)
I[u(t)]
and
and
g ,[x(0),x'(0),jc(r),*'(O] =0,
W(t)\Q[u(t),d3(t)]
\T(t)\ = (7-57)
I[u(.t)]b[u(t),d3(t)] 5= 1,2. (7-59)
for all t in (0,T) where d3(t) = d3 [t,u{t), The boundary-value problem, Eqs. 7-58
M{t), V(t)\ is the distance from the neutral and 7-59, is equivalent to a boundary-value
axis where the absolute value of the direct problem with a system of first-order equa-
shear stress is largest. The distance d3{t) may tions. The new problem may be written as
be determined by the methods of ordinary
calculus. dxx
(7-60)
~
In the case of beam design with multiple
loading requirements, there is still just one dx2 Mjt)
(7-61)
design variable u{t). However, c rresponding ~~dT EI{u)
to each loading there is an additional state
variable (deflection) that must satisfy condi- and
tions identical in form to Eqs. 7-50 through
7-55. The problem is to determine u(t) so that gs[x1(.0)>x2(0),xl(D,x2(T)] =0,
the functional, Eq. 7-49, is minimum subject
to the condition that Eqs. 7-50 through 7-55 s = 1, 2 (7-62)
are satisfied for each loading.
w here x t is defined to be x. In terms of this
notation, Eq. 7-57 becomes
7-4.2 NECESSARY CONDITIONS FOR THE
BEAM DESIGN PROBLEM jr,(r)< JC,(O< Jr2(0.
The treatment which follows applies only It is now clear that the beam design
to statically determinate beams, i.e., beams problem is contained in the class of problems
loaded in such a way that reactions at all to which par. 6-4 applies. The quantities
supports (and hence also shear and bending appearing in par. 6-4 will now be identified
moment) are determined completely by the with the physical quantities associated with
conditions for equilibrium of the beam. beam so that necessary conditions for the
Changes in formulation of the problem which beam design problem may be stated.
are necessary to consider statically indeter-
minate problems will be indicated below. Conditions, Eqs. 7-52, 7-53, and 7-54,
7-19
AMCP 706-192
/l =*2 (7-63)
(7-66)
M(t) (7-64)
/=
EI(u) *2 (0,04=0
EI(u)
xi =X1{t) (7-68)
mr(*a -x1)=X"2(t)-x'i(t)
or
X'i(t) +
EI(u) X2{t) (7-69)
In terms of the notation of Eq. 6-101, At each t~, the deflection curve becomes
tangent to one of the curves, Eqs. 7-68 or
*'=Vf)-%x* 7-69. Further, there is a neighborhood of t~
in which Eq. 7-55 is a strict inequality to the
* P max left of t~, and Eq. 7-68 or Eq. 7-69 holds to
7-20
AMCP 706-192
Mit)
M4 04,2 ~Ms 05,2 A.Q A2
du. du. Eliu)
G= 2 7sgs + ZyB+a(t-o>a)
s - 1 a
- [Ml 01 +M2 02 +M3 03 (7-73)
au.
M7-70)
G = T0|4>6 04 (t ) + T7]4)6 04 Op
+ M4 04.2 +M5 05,2l =0,/= 1, ..., m
+ T
+ T.S.r 05 0,") l,5,r 05 (0 , M4 04.2 = MS 05,2 =0 (7-75)
Theorem 6-9 and Eq. 7-70 yield Theorem at all but a finite number of points in (0,7);
7-1.
X.(a+0)-Va -0) = 0,
Theorem 7-1: Necessary conditions for the
minimum weight beam problem are: i= 1,2 (7-76)
7-21
AMCP 708-192
7-22
AMCP 706-192
implied by Eqs. 7-73, 7-74, and 7-76. The finite number of unknown constants, or
roles played by Mi and 2 in Theorem 7-1
would simply be combined in a new variable 2. The beam is supported in such a way
M. This result may be stated as Corollary 7-1. that an infinite number of constants are
required to specify the reactions (e.g., abeam
Corollary 7-1: Let there be a scalar design on an elastic foundation).
variable u(t) and assume that any two of the
inequality constraints, Eqs. 7-52, 7-53, and In the first case, the unknown constants
7-54 are equalities at t*. If the first partial appear in the expressions for M and V. By
derivatives of these two constraint functions defining new state variables, x( with i > 3, to
with respect to u are not zero at (*, then be these parameters, the following differential
Theorem 7-1 holds. equations must be satisfied:
7-23
AMCP 706-192
6-8 now applies to statically indeterminate T1 .+ 3 rcan be eliminated and the tr deter-
problems of the second type. mined as functions of the parameters t and
2. Note that 1 and 2 to the left of t~ need
7-4.4 SOLUTION OF THE EQUATIONS OF not equal 1 and 2 to the right. In exactly
THEOREM 7-1 the same way, the t~ are determined by Eqs.
7-79, 7-80, and 7-82. Continuity of H at t*
The Lagrange multipliers, X0, Xu X2, are and t* determines the ft and t* as functions
not uniquely determined by Theorem 7-1. of 1 and2-
However, if the solution, x(t) and u{t), is
normal (Ref. 17, p. 214) for the problem, it is The equations previously enumerated
permissible to put X0 = 1. Theorem 7-1 then determine u-, t* tr, f^, and t* as functions of
determines the remaining \i uniquely. Abnor- t, 1; and 2. The problem would be solved
mal solutions are peculiar in that there may by direct integration of Eqs. 7-71 and 7-72
be no other functions, x(t) and u(t), near and application of g{ = g2 = 0 if |t and 2
them which satisfy the conditions, Eqs. 7-50 were known.
through 7-55. The procedure adopted in this
subparagraph is to assume there is a normal The conditions which determine ^ and 2
solution and then attempt to solve for it. If are Eqs. 7-83, 7-84, 7-85, and 7-86. Eqs. 7-85
this fails, there is either no solution or the and 7-86 after elimination of7i and y2> yield
solution is abnormal, in which case a special two equations in j and 2. If there are w of
analysis is required. In the following, X0 will the points tr and t~ , they subdivide (0,T) into
be taken as 1. w + 1 subintervals. Since these are the only
possible points of discontinuity of Xj and X2 ,
In order to determine u{t), consider any there are just w + 1 pairs, 1 and 2, which
interval in which z 0 Q Z < 4, of the represent 2w + 2 unknown. Thus, there are
inequalities, Eqs. 7-52 through 7-55, are 2w + 2 equations from which the 2w + 2
equalities and the remaining 4 z are strict values of ] and 2 may be determined. If this
inequalities. Eqs. 7-74 and 7-75 show that the is not the case, the problem has no solution or
4 z multipliers corresponding to the 4 z the solution is abnormal.
strict inequalities are zero. Then, Eq. 7-73 is a
system of m equations in the m functions It is assumed now that u{f) and points t'r
uAt) and the z nonzero multipliers. Further, and t~ are known functions of t and 2. A
the z equalities of Eqs. 7-52 through 7-55 numerical method is developed which can be
yield z equations in the ut). Thus, there are used to solve the equations given above for 1
m + z equations which are to determine the m and 2. A numerical solution is required since,
+ z unknowns. The nonzero Xt) are first even for very simple problems, the function
eliminated and the u,{t) are then found as fi(t&\ ,2) is far too complicated to integrate
functions of t and the parameters t and 2. in closed form.
At points tr one part of Eq. 7-55 is an Expressions, Eqs. 7-85 and 7-86 generally
equality, say the jth part (j ~ 1 or 2). In this yield two easy relations between j and 2.
case, only T0i/ + 3|,. and Tij; + 3,r can possibly Eqs. 7-83 and 7-84, however, require succes-
be nonzero. Eqs. 7-77, 7-78, and 7-81 are sive integration of /20,i ,2). To complicate
three equations from which i"0/ + 3r and matters, some limits of integration are the
7-24
AMCP 706-192
points t~ and t~, which are themselves func- First, the equations of Theorem 7-1 are
tions of 1 and 2. Therefore, Eqs. 7-83 and written out in detail and simplified for the
7-84 form a system of nonlinear, finite case of beam with rectangular cross section of
(nondifferential) equations ini and 2- variable depth. In pars. 7-4.5.1, 7-4.5.2, and
7-4.5.3, three specific examples are con-
A Generalized Newton Method (Ref. 18, p. sidered. These examples range from an easy
220) is used to solve this set of equations. A problem in par. 7-4.5.1 to a rather complex
generalization of Leibniz' Rule is used to one in par. 7-4.5.3.
compute the required derivatives of integrals
with variable limits of integration. This rule is The cross section considered here is shown
(Ref. 16, p. 80). in Fig. 7-9. For this particular cross section, if
d_ (sAr.
h(t,r)dt
dT
)gl(T.
g
^T) bf2(t,T)
I
dt
dr
(7-87)
dg2 (T)
+ f2 [g2(T).T]
dr
7-25
AMCP 706-192
Qit)=~h2(t) (7-92)
bh3 (t)
O
12 -Io
and
so
Ibhit)
It is worthwhile to note that in order for
so Eq. 7-97 to hold the product \2it)M(t) must
be positive. That is, X2(0 andA/(/) must have
3|K(/)| the same algebraic sign throughout any inter-
*(') =
26T val in which Eq. 7-97 holds.
7-26
AMCP 706-192
[x2;-owa;-o)]1/4
/(f) = if/= 4,
1 1 (7-99) = [x2(/; + 0W(/; + 0)]"4 . (7-101)
3/4
-c3ix2wr ' if T
l l< rmax
Eq. 7-98 and / > 0 imply M(t;) # 0, so if
x \M(t)\ lh a M(t) is continuous at t;, then Eq. 7-101
' l l< max>
reduces to
k x sgn [M{t)}, and I > I0,
X2(f;_0)=X2(f;+0) (7-102)
32b2T3m ax
Eq. 7-96 shows that points t* and t~ can
Ci =-
9E occur only in intervals where M(t) (hence also
7-27
AMCP 706-192
()
[X2()M()]
\a | -a max < 0
1
3/4
V{Q) I4 64b2 T*
and 0.
M)M() 9E
/ - KO (7-105)
By putting
are all negative at points where M(t) = 0 = l'/4
V(t). The result stated just below Corollary K()
7-1 then shows that points of intersection of LM)M)J
intervals in which one or more of the above
and
inequalities is an equality may be determined 1/4
by the condition C = (64b2 max/
-9E-
W -o; = w; + o) (7-10-/;
Eq. 7-105 becomes
7-28
AMCP 706192
where
-1 1/4
~M{S)~
p= dg i(a)
U2(S)J x /tei(a), a]
Ja
and
1/4
+ rp.oo d*2 (a)
/(Tj,a)dT)
da
(7-109)
Therefore, the only real solution of Eq.
Leibniz' Rule, Eq. 7-87, is used repeatedly to
7-103 at t = Sis
obtain Eq. 7-109.
JH(S)=l-=^-IX2(S). (7-107)
7-4.5.1 A PROBLEM WHICH CAN BE
SOLVED ANALYTICAL LY
In deriving Eqs. 7-106 and 7-107, it was
assumed that Q and S were not equal to any As a first example, the cantilever beam of
<ja. The coa are, therefore, possible choices
Fig. 7-10 is considered. This problem is simple
for Q and S and must be checked in Eq.
enough that a solution can be obtained
7-103.
analytically.
In particular problems, the following two
Boundary conditions for this beam are
identities are used:
*i(0)=*2(0) = 0. (7-110)
+ I 1 f{y\)dr\dv
)
n:
+ I I f(n)dt\dv, (7-108)
Figure 7- 70. Simple Cantilever Beam
7-29
AMCP 706-192
and
and application of Eq. 7-1 lOyields
V(t) = 0.
h(t) =<
([" bo max _
if|a| = <7
Eq. 7-88 is an equality is t = T. Since f2{t) <
0, Eq. 7-114 implies x1(t) < 0. Therefore, if
1/4
36X2(.t)lM Eq. 7-88 is an equality, it must be Xi(T) =
if la I < a_ A. Further, since there are no t", t\, t~, or
Eb2
wa, \i and \2 are continuous and there is
(7-111) only one pair of constants, 1 and %2, to be
determined.
Also, Eq. 7-99 becomes It is assumed first that xx(T) > A. In this
case, Eq. 7-86 implies
2ftc \1/2
^H , if | a | = a Xi(D = f1=0
3E2M ) ' ' max
/2(0=< and
4i2M\1/4 - 3/4
|Xj(OI
9E *i(T) = t2 -7-^=0
dXf
= x-. Ci(r)=
IF -(-5^rJ T
7-30
AMCP 706-192
On physical grounds, it is expected that the The derivative of the right side of Eq.
beam should be stiffest near / = 0 in order to 7-117 with respect to t* is zero at t* = T and
reduce the deflection at T efficiently. Also, positive everywhere else. This means that Eq.
7-117 has at most one solution. Eq. 7-116
since X2 is largest at t = 0, the second part of
Eq. 7-111 would tend to stiffen the beam then determines 1 and the problem is solved.
there. It is assumed, therefore, that there is
just one point t* having I a I < amax on its As a numerical example, the beam of Fig.
7-10, having the following properties, is con-
left and I a on its right. Eq. 7-107 for
sidered :
r* is
2 T= lOin.
M I(a max \|t (T-t*). (7-116)
b= 1 in.
Eq. 7-114 with t = T may be integrated
ff
using Eqs. 7-108and 7-112 and becomes max = 30,000 lb/in.2
7-31
AMCP 706-192
E = 107 lb/in.2 q(t) > 0 for all t in (0,7). The load q(t) of this
form implies V(t) is non-negative at zero and
and decreases monotonically in (0,T). M(t) is zero
at both t = 0 and t = T. Further, on either side
M = 450 in.-lb of the point where M(t) has its (non-negative)
maximum it is monotone.
If A > 1 in., then the beam of minimum
weight is uniform with h = 0.30 in.
For a more meaningful problem, A = 0.5
in. is considered. Eqs. 7-117 and 7-116 yield
fTTTTTTr^
rFT
t* = 7.7 in. and |i = 2.17. The precise shape
of the optimal beam is given by Eq. 7-111. By
putting Eqs. 7-111 and 7-90 in Eq. 7-49 and Figure 7-12. Simply Supported Beam With
Positive Distributed Load
performing the indicated integration, the vol-
ume of the optimal beam is found to be 3.59
in? A plot of the profile of the optimal
beam may be made by direct substitution into
q(t)dt # 0, thenA/(0 and V(t) cannot
Eq. 7-111. This profile is shown in Fig. 7-11. Jo the same point. There is, therefore,
be zeroat
no danger that the optimal beam will have
h(t) = 0 at any point. For this reason, the
constraint, Eq. 7-54, is not imposed here.
0.431 in. |.30 in.
7-32
AMCP 706-192
II 11 V V ii ti
1 __ .1- 1 -
o
-^^
~
t- c
I
1
re
1 1
The boundary conditions for this problem Eq. 7-102 for f3 yields
arc
t3 =
n2
xt(0) = 0 TU + fj
X2(0) = \2OT = 0.
and
The constants ?i and f2, which are intro- It is noted that Eqs. 7-120 through 7-123
duced in Eqs. 7-118 and 7-119, are now to be can be solved easily for f1 and f2, but, in
determined. general, not so easily for the t(. In the
7-33
AMCP 706-192
I
The conditions that are to determine the
unknown t{ are Xi(t3) = A and x2(t3) = 0. + (7"-/s)| /i(ij)di?
However, for computational reasons, a more
convenient set of equivalent conditions is
*i(0) = 0
and
(7-124)
If fi(V,$*)dvdv
*,(D = 0
written explicitly as
where .Ri and i?2 are introduced for nota-
tional purposes.
i?,= A + r, P f (v)dr) + t P
fi I| 2 | f (v)dr)
2 2
It is assumed now that q{t), b, E, T, <7max,
Joo j, /.
t, and Tmix are given. The equations that
determine the t( are different in four distinct
ranges of the deflection requirement.' These
ranges of A are described in the following:
{
+ t3\ f2{r\,U)dn
7-34
AMCP 706-192
value of A, say A = A,, for which either t2 or method of solution will be described for the
ti, first coincides with t\ or ts, respectively. Case 2.
For definiteness, assume t2 = t, when A =
A,. An iterative method called Generalized
Newton Method (Ref. 18) is used to solve for
3. For A slightly less than Aj, points /s t2 and t4. The procedure begins by estimating
and ?4 are determined by Eqs. 7-126 and values i2 and ?4; and then making a correc-
7-127. tion according to the formula
This explanation of the behavior of the ^ is where [ ]"' denotes matrix inverse, and t2
not the result of a mathematical analysis. It is and f4 are improvements on the estimate.
expected on physical grounds and has been
vahd in each case treated. Eqs. 7-102 and 7-107 determine t3 =
f3(f i ,?2), i = f i U2), and f2 = f2(r4). By use
The values of A { and A2 could be obtained of this information, the derivatives in Eq.
analytically. However, their determination 7-128 are computed by the chain rule of
would be of the same order of difficulty as differentiation. For example,
the optimization problem considered in this
paragraph.
dR i
= *i.r,
9r,
A, and A2 can be determined by a trial (7-129)
and error scheme. For example, to determine + (bRt tbRt bt3\ adf,
A,, t2 is put equal to tx and Eq. 7-107 3, dr3 3f, ) dt2
determines fi. Then, t3 is guessed and Eq.
7-102 solved for f2. Numerical integration of where
Eqs. 7-60 and 7-61 indicates the correction
that is to be made in t3. When t3 is located Rl , =t2[f2{t2 -0)-/2(/2+0)]
accurately, the resulting deflecting at 13 is A{.
A similar procedure is used to determine A2 is the partial derivative of R t with respect to
(t4 is put equal to /5). t2 with all variables in R , taken as indepen-
dent,
The solution of Eqs. 7-126 and 7-127 for
the t( differs only in certain details depending
on whether the given value of A is in the dt)
3fi
range described by Cases 2, 3, or 4. The
7-35
AMCP 706-192
40 in.
G?7?dV
b 0.25 in.
15,000 lb/in.2
= t3f2(t3-0).
and
Similar expressions for the remaining deriva-
a
tives in Eq. 7-128 are derived with the aid of max = 30,000 lb/in.2
Eq. 7-109.
For this problem, it was found that
The iterative procedure for determining t2
and f4 is: A0 = 0.774 in.
7-36
o
9
TABLE 74
RESULTS FOR SIMPLY SUPPORTED BEAM WITH q (t) = t
Vol. of
Unif.
A, fi. h. t3. t. ts. 5i U Vol., Beam', Saving, %
in. in. in. in. in. in. in.3 in.3
>0.775 0.053 39.89 14.00 18.15 29.6
0.75 0.053 10.21 19.42 22.54 39.89 2.77 104.7 14.12 18.15 28.6
0.65 0.048 0.048 19.99 28.41 39.89 3.60 144.0 14.75 18.15 23.0
0.50 0.040 0.040 20.32 37.79 39.89 5.27 217.7 16.08 18.58 15.5
0.40 0.034 0.034 20.32 39.90 39.90 7.12 294.1 17.33 20.01 15.4
0.30 0.028 0.028 20.32 39.92 39.92 10.45 431.7 19.07 22.03 15.5
0.20 0.022 0.022 20.32 39.94 39.94 17.91 740.0 31.82 25.21 15.5
0.10 0.014 0.014 20.32 39.96 39.96 45.17 1866.0 27.50 31.78 15.6
1
Volume of lightest beam of constant depth which satisfies all the requirements of the problem.
AMCP 706-192
t7 (7-135)
I - I
f,
f
f t,
f If I-
and
EM _1
Figure 7-15. Subdivision of the Beam T 1 (7-136)
r.2
Points f2 and tn, where Case 3 occurs, are In obtaining the expressions for t3 and f6>
tr, so they are determined by Eq. 7-102. use was made of the fact that t3 < A < t6.
Further, xl(t2) = A and *i(?7) = A. Points The result, \2{t)M{t) > 0, from Eqs. 7-97
fi> ts, t6, and t% are <* and are determined by and 7-98 shows that this is true. To prove
Eq. 7-107. this, assume for definiteness t3 > A. Since
M(t) changes sign at A and X2 (t) is con-
In this problem, t2 and t-, may be points of tinuous there, X2W) = 0. But, since X2(f) is
discontinuity of X4 and \2. The conditions, continuous near A, it is arbitrarily near zero
Eqs. 7-85 and 7-86 on K2 are X2(0) = X2(D = in a neighborhood of A. Eq. 7-130 then shows
0. Therefore, X2 may be written as that hit) is arbitrarily near zero in a neighbor-
hood of A and this violates the condition \o\
< cm ,v. Likewise,' A < t6.
max
, if 0 < f < ?2
X2(0 = < , if f 2 < ^ < 17 Conditions that determine the f;. are
and
Solving Eqs. 7-102 and 7-107 for the t.
yields the following: x1(r7)=-A,A;2a7) = 0.
ti =- (7-132) Xi(0)= 0
Si +!
x2(f7) = 0
1 EM \ J_
ti =-. (7-133)
xt(t7) + A = 0
7-39
AMCP 706-192
and
x,(D = 0
+ (?7-^6)\
i; /2(7)rfT?
Rt =xl(0) = A + t1 \
J no
f2(n)dn
n;
+i l f2(n)dndp
+ t2\
r fMi)dti
+ 1 I h (?,f 2 ,f 3)dvdv = 0
(7-139)
+1
n
'o
1 hW)dndv
J
o
and
4 = *(7} = -A
rf i
i *i
fMi)dqdv = 0
s:
2=X2(r7)=\ /2(7?,?2,f3)^
n: f2(V,U)dvdv
rr
+ \ /2<u)ftj
f2(xi)dr\dv = 0 (7-140)
/2(r?,f2,f3)dT? = 0 (7-138)
where the i?; are introduced for notational
purposes.
A3 = x,(f7) + A = 2A
Generalized Newton Method is used to
solve Eqs. 7-137 through 7-140 for the fr An
initial estimate (, / = 1, 2,5, 4, is made; and a
+ (t1-ti)\
C /2(T),?2)f3)dij
correction is computed according to the
formula
7-40
AMCP70G-192
W= 10,000 lb/in2
Ti~ }f R'I
and
?2 fa -1 R2
[a/?,] m,x = 20,000 lb/in?
F3 fa L *> J R3
It was noted that thre distinct situations
may occur depending on the value of A. In
. /- RA
this example, the problem breaks down as
(7-141) follows:
1. If A > 0.509, then*! (t) < A for all?,
where terms on the right side of Eq. 7-141 are
computed in terms of the r The corrected 2. If 0.156 < A< 0.509, then there isjust
guess ff then takes the place of t and the one point / for which \xx (t) = A, and
process is repeated. 3. If A < 0.156, then there are two values
of t for which |xx (?) I = A.
The derivatives of the Rt with respect to
the f are computed by the chain rule of The numerical example given was solved
differentiation, Eq. 7-109, and Eqs. 7-137 for eleven different values of A. The results of
irough 7-140. Just as inEq. 7-129 several of these calculations are presented in Table 7-5.
these derivatives must be determined by The first column of this table consists of the
successive numerical integration. values of A considered. The following ten
columns give information that, when
The matrix of derivatives which appears in substituted into Eq. 7-130, completely speci-
Eq. 7-141 has sixteen elements. Twelve suc- fies the optimal beam. The next column gives
cessive definite integrals appear in one or the volume of this optimal beam. The final
more elements of this matrix. Therefore, two columns give the volume of the lightest
considerable computation is involved in each beam constant depth that satisfies the condi-
iteration. All this computation was incor- tions of the problem and the percent saving
porated in a single computer program. realized when the optimal beam is used
instead of this uniform beam. Dashes have
As a numerical example, the beam of Fig. been inserted in the table when the quantity
7-14 having the following properties is con- to be tabulated does not exist.
sidered :
For each value of A, the iterative procedure
used to solve the problem required approxi-
M = 1,100 n.-lb mately two minutes per iteration on an IBM
1410 Computer. However, three to five itera-
T = 40 in. tions were sufficient to obtain convergence of
the residue to seven decimal places, so com-
b = 0.5 in puting time was not excessive. This rapid
convergence is, again, characteristic of the
E = 107 lb/in? Generalized Newton Method.
7-41
>
I
4^ 2
o
-o
u
o
e
TABLE 7-5
RESULTS FOR BEAM WITH S-SHAPED DEFLECTION CURVE
Vol. of
Unif.
A, fa- 7, Si f2 t3 U Vol., Beam',
in. in. in. in. in. in. in. in.3 in.3 Savinqs, %
XD.509 16.3 16.3 0
0 40 8.27 13.59 23.93 3.35 69.00 16.7 17.6 5
0 30 5.56 12.75 25.00 4.94 92.91 17.4 19.4 12
0 23 4.18 11.94 25.00 6.57 111.96 17.9 21.1 18
0 16 2.84 10.96 25.00 9.69 146.37 18.8 23.9 28
0.15 2.32 11.56 24.51 31.02 32.02 33.90 11.84 8.45 234.5 1 180.41 19.0 24.3 29
0.14 2.11 11.54 24.56 29.66 31.93 35.09 13.10 9.75 263.99 225.17 19.3 25.0 30
0.10 1.31 11.53 24.58 27.93 31.77 37.50 21.02 16.45 431.98 440.45 21.3 28.0 31
0.075 0.88 11.49 24.74 26.95 31.67 38.42 31.10 24.89 643.33 695.92 23.3 30.7 32
0.05 0.51 11.45 24.89 26.15 31.58 39.12 53.56 43.62 1113.10 1256.79 26.5 35.2 33
0.02 0.15 11.40 25.00 25.37 31.49 39.75 182.14 150.63 3794.73 4457.51 35.9 47.7 33
'Volume of lightest beam of constant depth which satisfies all the requirements of the problem.
AMCP 706-192
example concludes the present subparagraph. optimal beams with deflection requirements
A plot of volume (of optimal beam) versus greater and less than 0.156 have considerably
deflection requirement for the problem con- different form. The beam profiles of Figs.
sidered here is given in Fig. 7-16. From this 7-17 and 7-18 illustrate this difference graph-
ically. As A decreases toward 0.156, the jump
in h(t) at / = 25 (see Fig. 7-17) becomes more
40n pronounced. However, for A very slightly less
than 0.156, the profile is continuous, much as
30-
in Fig. 7-18.
7-4.5.4 CONCLUSIONS
20-
n
1.14
All dimensions in inches
T
_L
0.81
I I I
0 3.2 25 40
1.21
IT 0.81
7-43
AMCP 706-192
However, for applications in which weight is a forming nonuniform beams is not prohibitive,
premium, such as in aerospace work, fabrica- such as in the manufacture of reinforced
tion of minimum weight structural members concrete beams, then nonuniform optimal
may be quite feasible. Further, if the cost of beams may be used to advantage.
REFERENCES
7-44
AMCP 706-192
7-45
AMCP 706-192
CHAPTER 8
-1
AMCP 706-192
8-2.1 THE PROBLEM CONSIDERED (p(t,u)<z 0, = q'+ l,...,q, t < t< t1.
(8-1)
subject to the conditions + \o>[t,x(t),u(t),b] | *=0 (8-6)
i
+1 LAt,x(t)Mt),b]dt = 0,
a= \,...,r'
gration, discontinuities in functions of the
problem (u(t) may still be discontinuous),
intermediate conditions on x(t), and state
variable inequality constraints.
>(8-3)
= g(b,x,xl)
8-2.2 EFFECTS OF SMALL CHANGES IN
DESIGN VARIABLES AND PARAM-
f
+ 1 L [t, x(t),u(t),b] dt< 0, ETERS
8-2
AMCP706192
dX df
db dx dx1 X + h(t) (8-11)
dt dx
dhx df
=
~T~ T~ x + 8u + 8b I
dt dx du db I -d-(Ar 5x) = 6x + AT = - 47 6x
(8-8) dt dt dt dx
ae 1
i&x + s-bx =0, s= I, ...,n I
dx 0 dx1 / + hTSx + XTKSx + XT^8u
dx du
d
8
ft. 8a dga
*=-^
az>
6b +
T7o
dx *+-
dx1
*'
+ XT~-8b = hT8x + XT^-8u
db du
ox + 8u + X^6b.
J
\ \ dx du db
t"
8-3
AMCP 708-192
(8-12) (8-14)
9/ a/
J: fcrx + X7"-^-5w + X7'-^-5Z>
9u 9
dt. Likewise, put h(t)
and define
- bLr[t,uw,b(0)]
dx
\Va(t) as the solution of Eq. 8-11 with
By choosing the function ft(/> and the boundary conditions onX a(t) and X "(t1)
boundary conditions on \(t) apprqpriately, determined by
the identity Eq. 8-12 will yield the desired
relationships. First, put hit) = df0T/dx
U(0\b(0)] and define \J(t) as the solution
of Eq. 8-11 with boundary conditions on
\J(t) and XJ(t1) determined by Sx + - Sx1 (8-15)
9x 9x'
8-4
AMCP 708192
8-2.3 A STEEPEST DESCENT APPROACH Note that the column vector 0(f) may have
different components at different points in t
The problem of determining 8u(t) and 8b < t < t'. In order to assure that constraints
to minimize (maximum negative of) 6J must are satisfied, it will be required that
deal with the inequality constraints, Eqs. 8-3
and 84. The argument to be used here is: 84/a =-a4/a, a= 1,..., r'
simply ignore a constraint function that is
negative before an iteration begins. If, on the 8ipa < -a\j/a, a = r'+ l,...,r (8-19)
other hand, a constraint function is positive,
and a&A
it is required to be reduced. For example, if
4>a > 0 or <p (f) > 0 for some t, then it is
required that 8i>a = a\jja , a = 1,..., r' and and
S'Pa < ~axlJ<X' a = r' + 1>- rand 4>a > Oand
50^(0 = - c^W, 0=1,.... q and S<^(0 < 6fy(/) = - c0(f), =h-,q' '
- c4>(t), = 4' + 1,..., 4 and <j>g(t) > 0 where
0 < a 1 and 0 < c < 1. The magnitude of a 60,(0 < -c4>(t), = q'+l,...,q (8-20)
and c are chosen so that the required changes
and GB(t).
5\jja and 84 At) are not excessively large. If
$a and 4>e(t) are not so large that the linear
approximation is violated with a = l,orc = 1, Finally, define
then a or c are chosen as one.
A= {a|*a[x<0>,u<0>,<>] >o}
dt
and a* 1 L a*
Bit) = { \<p0[t,u(o\t)] > 0} .
(8-22)
.8-5
AMCP 706-192
oH jT T ,
8J = S.jT8b+ \ Aji (t)8u dt (8-25) -=0 = _\0A' -TrA*
.fjiT(trfi>--2y08uTW, (8-30)
du
8^ = ^ A* (t)8u dt . (8-26)
(8-27)
(8-31)
where Wb and Wu{t) are chosen as positive In the following development, it will be
assumed that the problem is normal so that it
definite weighting matrices and dP is to be
is permissible to put X0 = 1. Solving Eqs. 8-30
chosen small enough that 8u{t) and 6b are
and 8-31 for5(f) and 8b, respectively, yields
sufficiently small.
H=[-\0A'T-yTA*T(t)-nT(t)daj
* In order to complete the determination of
8u(t) and 8b, p(t) and y must be eliminated
-y08uTWu8u + firc<j> from Eqs. 8-32 and 8-33. A direct analytical
(8-28) elimination of MC) and y is not feasible at this
8-6
AMCP 706-192
2y0 du "
HT(t)[b4>(t) + ci>{t)) =0
and some components of 5^ +a$ and 50(0 + AJ(t) + A* 0)7 + -v- MO) + c0 = O
9u
c$0) will be zero. A certain amount of logic is
required to find 7 and p(t). Since only small It is assumed that at points where B(t) is not
Su(t) and 8b are admitted, if \p and 00) are
empty, 90/9 has full row rank, i.e., all
zero for (0)O). *(0) then very likely they constraint functions which are zero or posi-
wlalsobe zero for(0)0) + 8u(t), *(0) +5*. tive are independent. Since Wu(t) is non-
Following this line of reasoning, it will be singular, the matrix
assumed first that
90 90r
8\p +a\p = 0 u (8-34)
au du
and
is nonsingular. Therefore,
500) + c0O) = 0 . 0 _1
MO) = -A O)
or and
i
bu{t)=_w-,
, T , dd>J _i
M 4><t>
x(/ A (8-40)
-a * Mw~\
Eq. 8-37 becomes
y
x(A +A*y)
TT (^/+Af^7) + cM|t =ai. (8-41)
-cW-t-T- A* * (-36) Z7
3
where/ is the identity matrix.
Since W (r) is positive definite so is W"1,
Substituting 6u(t) and bb from Eqs. 8-36 and there is a nonsingular matrix 5(f) such
and 8-33 into Eq. 8-26 and then enforcing d\p that W~x(t) = sT(t)s(t). By direct multiplica-
= a\j/, tion, it may be verified that
1 T f''
-^T* W-i (y + *7)-cl A*r
yTM^y=yTl+TW-1 Z*y
f1
i
x"'--1 ^A#_1**-2^-^ A*'
3 + \ yTA* Wj1
/ 30r -I 30 . \
xr" 1 \ / A* - w" 1 /)
Zu
(8-37*
"-- *-'-),*
Defining
-yTZ* W'1 9*y
T
A*TW~l
J fo
/ 30r -i 30 , \ , $>-*-
(8-38)
x
-farsT jsA^y
M
i,i, =n*Tw-1 if +[
i A* Vr1
3 du
x sA*y dt> 0.
(8-39)
(8-42)
8-8
AMCP 706-192
(8-43)
6uH = -Wj>[l-&A*-lwu-
It should be noted that if the set of indices A
is empty, $ does not exist soM, . is not even
.A*M;l<fi4,-cM)
defined. If, in this case, M, , is defined as
one and 0 zero, then y = 0 in Eqs. 8-41 and (8_45)
" a A* 0
8-36 reduces, appropriately. In this way, a
single mathematical analysis holds in all cases.
8-9
AMCP 706-192
In fact, if 0 and if are zero or empty, 8u2 {t) = of Steepest Descent. First, Relation 1 states
8b2 = 0. It appears that 8u2(t) and 8b2 may that the changes Su'Cr), 8bl and 8u2(t), 8b2
be interpreted as making corrections in con- are orthogonal. Relations 2 and 5 show that
straint errors, or keeping constraint functions 8u2(t),8b2 provides the requested reduction
from being violated. Actually, this and more in the constraint functions. Relations 3 and 4
is true. show, as might be expected due to the
orthogonality of Relation 1, that 51 (f), 8bl
Theorem 8-1: The following identities has no effect on the constraint functions in 0
among 6u'(0, 8u2{t), 8b1, and 5b2 of Eqs. and \p. Finally, Relation 6, along with
8-44 through 8-47 hold: Eqs. 8-48 and 8-49, simply states that if 4> = 0
and 00) = 0 then 8u(t), 6b provides a
reduction inJ.
1. 8b1 Wb8b2 + \
C u1
T W 8u2dt = 0
u
30
4. du1 =0 (8-50)
3
and
30
5. 8u2 =-c4
au (8-51)
it is required that
8-10
AMCP 706-192
p
a^a =0. <*=r'+ l,...,r (8-55) Further conditions from Theorem 6-7 are,
fromEq. 6-124,
$(0*fl(f) = 0, = q'+\,...,q. (8-56)
dG1
1 - _w(/) = 0 (8-62)
Corresponding to the definition of 4 and 4> dx
in Eqs. 8-17 and 8-18, define 3, L, i, and fas
and
containing only components of v, L, g, and ,
corresponding to elements of ty and 0. In this 9Gr
+^) = 0 (8-63)
notation - and due to Eqs. 8-55 and 8-56
Eqs. 8-52, 8-53, and 8-54 become
Multiplying Eqs. 8-62 and 8-63 by fix0 and
d fix1 yields
-^-Zir!L+zh.+yT ^- (8.57)
dt dx dx ax
|l SXO+PTJ8 8xo_uT(to)5xo=0
9i dx" dx
T of d/o -7- ;r 4>
du du
"' du
"" ou (8-64)
(8-58) and
(8-59) ^L8x+^t>x>
dx dx1
Substituting from Eq. 8-11 into Eq. 8-57
yields + i>T(j*8xO+j%8x\
\dx dx1 I
cfw d\J dX*
+ + P - wr(f)5x0 + u>T(.tl )Sx1 = 0 . (8-66)
dt dt dt
\jT(t1)Sx1 -AJT(t)8x
(w + X7 + X* v) =
+ PT\\* T(t1 )5x' - X* r(r)5xI
df
(w+X-7 +X*). (8-6 1)
ax - uTU)ox + cjr(?' )fixJ = 0 (8-67)
8-11
AMCP 706-192
for all 6x and dx1 satisfying the second Premultiplying the transpose of Eq. 8-70 by
equation of Eq. 8-8. By collecting terms, Eq. (30/9") W"1 yields
8-67 becomes
30 , 30r -
/7 7 T l
o/tf') + A V ) + V {t )JSx' 3 " 3 c
\ du du du
HJ + 9* v = 0 . (8-74)
T 30
(8-70)
Substituting for % in Eq. 8-73 into Eq.
8-70,
and
30r A-i 30 ,
+ - A* W"1 A* 3 = 0
du du "
(8-71) (8-75)
8-12
AMCP 706-192
or,
Premultiplying Eq. 8-75 by A* W~l and
integrating yields
V bu bu " I
Mr] M. , (8-78)
Step 4. Solve the differential equation
8-11 with h and the boundary
conditions of Eqs. 8-13 and 8-15
Substituting!/from Eq. 8-78 into Eq. 8-75, to obtain XJ(t) and X a(t), re-
spectively.
A7 -h*M~\ M, ,
Step 5. Compute A/(/)) 7, A* (t),and *
in Eqs. 8-21 through 8-24 and
u
bu bu A*(OinEq. 8-34.
1
+ ML A*- te-W7iA+M7iM Step 6. Choose the correction factors a and
l
**m*j=0
du bu ein Eqs. 8-19 and 8-20.
8-13
AMCP 708-192
Step 7. Compute M^j, M, ,. and M^ ^ in Substituting these expressions into Eq. 8-25
Eqs. 8-38, 8-39, and 8-40. and using Eqs. 8-44 and 8-46,
5/=
Step 8. Choose y0 > 0 and compute y - Wo [/rpv(/-*^;*v)
and MO) of Eqs. 8-43 and 8-35. If
any components of 7 with a > r',
or /J(0 with 0 > q', are negative,
redefine $ and <ji(f) by deleting J ,0
where
u(i>(0=u<>(f)- = bul{t)
'To
2
+ Su (0 MJJ=2,1W^Z'+ [' AjT W^
8-14
AMCP 706192
a= !,...,/'
LEM
> (8-84)
8-3.1 THE PROBLEM CONSIDERED
*=*(*.''.*'>
The basic optimal design problem with
fixed endpoints, no discontinuities, and no 1 La[t,x(t)MO,b]dt<0,
intermediate constraints was treated in the
preceding paragraph. The problem considered a = r' + 1,..., r
here will be a generalization of that problem
to include features such as variable endpoints, and
discontinuities, and intermediate constraints.
The basic idea of the method of solution will *,(*.) = 0, t Qt< f\
be the same as in the preceding paragraph.
Accounting for the additional features of this
problem, however, introduces some com- (8-85)
plexity into the derivation of equations. <t>(t,u)< 0,t<t< /",
dx
f(t,x,u,b), t < t< t*,tt>\ {oj[t,x(t)MO,b]
dt
H8-82 (8-86)
+ \w[t.x{t)Mt),b]\) dt = 0.
d (t.x.t7'.xr>) = 0.s= 1 n 1
Constraints of the form of Eq. 8-85 are easily
treated in a direct manner so they need not be
ni(ti,xi) = 0,i=\,...,v (8-83) reduced to the form of Eq. 8-86.
8-15
AMCP 708192
Just as in the method of par. 8-2, the where Ax1 is the total change in x' at the
idea here will be to estimate u(0)(Y),&(0) point t', Since x{t) is to be continuous before
and t, .... tn, and then to allow small and after the variation, the total change in
changes 8u(t) and bb. The object is to x(t) must be continuous at each point t. This
determine 8u(t) and 6b which yield the requires that
greatest reduction in J and which satisfy
the constraints. 8xi(ti -0)+f(f -0)8f = AxJ
8Q= de8b+ de de 3e
Tb -ir^0+-+^ (> +
,
5/" = 0 (8-91)
3r"
Su(t) and 8b which are to be determined, Or, using the definition of Ax', this is
introduce the adjoint variable X(t) just as in
Eq. 8-91, ('* i
1 dx 8xdt = -XT(t + 0)Ax
J t
dh _ lf_ dFT
(8-93)
dt dx ax
+ Xr(f +0)f(t +0)5/
Integrating the identity below Eq. 8-91 from
t' to t'+1, 0 < /,/ + 1 < qr,one obtains,just as + ... + [\T(t> - 0)
in Eq. 8-92,
-XT(t'+0)]Ax'
Xr(^+1 -0)8x(t'+1 -0)
- [XV - o)/o> - o)
-XT(t' + 0)8x(t> +0)
-XT(t'+0)f(t'+0)]8t'
8u + XT~8b) dt
) \ du db +... + + F(t' - 0) - F(f' + 0)
dt>
8-17
AMCP706192
+xra* -o)/(/" -o)jsf for all Ax' and St' satisfying Eqs. 8-90 and
8-92.
9x
+xr(f) W + For proof of this Lemma see Ref. 3, page
12.
- Xr(f + 0)/(r + 0) St
^o-^l,^
9g
+ +
912
Ax
+ 7o
Ix5"
T
- F(?/ + 0) + \ (t> - 0)f(t' - 0)
an'
+ 7-
7 ~ Ax'
> bx>.
^_\T{tn _0)+ 2T "
Zg r 3x" 3x"
+ ... + X (f - 0)
3x"
an"
+7 =C (8-102)
n '" 3x"
be s 9^"
+ 2 r s ~ +y Ax"
i=i 3*" " dx"
-^ - F(t + 0) - Xr(f + 0)/(f + 0)
+ +
+ F(f> - 0) + Xr(f - 0)/(f - 0)
3fi
be
08 rr. . bl>
_XT(^-0) + XT(r'+0)+ 7,^ = 0 + -*-/[?' 0, x(f'' 0), u(f'' + 0), */
3x' 9x' 9x!
(8-101) (8-107)
8-19
AMCP 706192
ae
e flf o,x(t' 0), u(t' 0),b]
+ -LL. g(P - 0) + F(P - 0) + 2 T
A('" - )
dx' s=l
+ 2 Td(t +0) (8-1 10) In order to use Eqs. 8-111 and 8-112 as
explicit conditions on \(t) at t and P, the
parameters rs must be eliminated. This may
Substituting Eq. 8-1 lOinto Eq. 8-100 yields be accomplished by alegbraic manipulation in
9^ " deT particular cases. To illustrate this idea on a
problem which has been treated extensively in
the literature (Refs. 5,7,8,9), consider the
case in which a full set of initial conditions
n(t0 +o) Ug(t + 0) is given, i.e.,
8-20
AMCP 706-192
(^T^Y>>- <8-"7
\git'0) + Fit'-0)
n'it'0)
-Fit' +0)
- \T(t' - 0) \fit> 0)-fit' - 0)] This equation meets the objective of this
subparagraph, namely, determination of the
dependence of SQ on 8b and Suit) explicitly.
+ \Tit' + 0) [fit' 0)- fit' + 0)] [ .
Since Q was any functional, this result can be
applied to the particular functionals of the
Substituting this expression into Eq. 8-101, present problem, / and <i . To obtain 6J and
8-21
AMCP 706-192
g = g0andF=/0forX/(0 (8-118)
J? (8-122)
and
and
a
g=aandF = Ifor X* (/). (8-119)
In this notation
5/ =
db ' 3 f0 La* jT
5/ = % 6b+\ AjT (t)8u(t)dt (8-124)
J ,0
+.X7 (t)\dt \ 6b
and
In 9
8\jj = %'"<* 8b + (t)8u(t)dt .
,7 3/
6u{t)dt (8-125)
du
8-22
AMCP 706192
8-23
AMCP 706-192
B(t)={[<t>[tMHt)\ > 0}
i;
AjT\t)bu{t)dt (8-134)
By the argument of par. 8-2, it will be The algorithm of par. 8-2 may now be
required that given with references to equations of this
paragraph.
Using the notation of Eqs. 8-122 and Step2. Estimate t, t', .... ?", and solve
8-123, define the matrices Eq. 8-82 forx(t).
8-24
AMCP 706-192
Step 5. Solve the differential equation, Eq. Step 13. Adjust f, t1 ,..., tn as required by
8-93, with boundary and inter- the discussion below Eq. 8-127.
mediate conditions of Eqs. 8-111, Return to Step 2 with uw, M>
8-114, and 8-116. The solutions being replaced by (1) and (1).
corresponding to the functions,
Eqs. 8-118 and 8-119, yield \J(t) For an alternate development of the al-
and X a (t) .respectively. gorithm in the special case of a full set of
initial conditions, see Refs. 5 and 7. Several
Step 6. Compute ZJ, AJ(t), * , A* (?), and example problems are solved in Ref. 5.
A*(f), in Eqs. 8-120, 8-121, 8-132,
8-133, and 8-34, respectively.
8-4 STEEPEST DESCENT PROGRAMMING
Step 7. Choose the correction factors a FOR A CLASS OF SYSTEMS DE-
and ein Eqs. 8-130 and 8-131. SCRIBED BY PARTIAL DIFFEREN-
TIAL EQUATIONS
Step 8. Compute M^j, M, ,, and M. in
Eqs. 8-38, 8-39, and 8-40. 8-4.1 THE CLASS OF PROBLEMS CON-
SIDERED
Step 9. Choose 7o > 0 and compute y and
(t) of Eqs. 8-43 and 8-35. If any Thus far, all problems considered have had
components of y, with a > r', or their state variable specified by algebraic
(t), with > q' are negative, equations or boundary-value problems in
redefine \p and <j>(t) by deleting ordinary differential equations. It is possible,
corresponding terms and return to however, that the state of the system being
Step 6. considered is governed by a boundary-value
problem with partial differential equations. In
Step. 10. Compute 5u'(0, 8u2(t), 8bl, such cases, the state and design variables are
and 8b2 of Eqs. 8-44 through functions of more than one independent
8-47. variable. One may then think of the design
variable as being distributed over an area,
Step. 11. Compute volume, or higher dimensional space. For this
reason, such problems have been described as
u^Kt)=u^Ht)--^ bul(t) distributed parameter systems.
27o
A great deal of work has been done on
+ a2o) distributed parameter systems which have a
time-like variable (Refs. 12,13); i.e., a variable
M >=*> - 8b1 + 8b2 . which makes the governing differential equa-
27o
tion hyperbolic or parabolic. In this para-
Step 12. If the constraints are satisfied and graph, consideration will be limited to static
S1^) and 8b1 are sufficiently problems such as equilibrium of plates, shells,
8-25
AMCP 706-192
8-26
AMCP 706-192
A,L(x,u,b)[z] =L(x,u,b)[z]
" U
92,[x,u(0),b<0)]Su
Once the dependence of P on changes in u, v
Bu
and b is determined, the result may be applied
directly to J and \j/a.
3b (8-144)
To first order terms,
forjcfi and
z + 5v + 6b I dr
oz dv db
0 ( 0 m w r V
B[v'- \b^](bz)+At,B[v - \b ][z ]bv '/(
bF
+ A&5[v<0>,b<0)][z(0)]Sb XT 02
bz + ^ bu + *P 6b)dJ2.
3 3b 7
a? (8-148)
U,v<>,b(0>]5v
dv
In order to make use of Eq. 8-148 in the
determination of bu, Si>, and 6b, it is desirable
+ /"x/),b]5b
db to eliminate explicit dependence on bz. This
is done through use of the adjoint operator
(8-145) L * defined by
8-27
AMCP 706-192
- 8z - A\\]TC[8z] (8-152;
8P
bg
8z~A[\T}C[8z] +-8v
bv
+ -& 8b\ dV
for X(x).
+ ZTSb
+ 8b \ d,
bb where
Substituting from Eq. 8-144 for L{u,b)[8z] ,
bFT bQT
this is A(x) = - -AuL(.u.b)[z]\ + -2-\
bu " by
bF
8P- \T&L(u,b)[z\
bu Tl(x) = coefficient of 8v in Eq. 8-15 1 after
n
substitution
+ \T -$-\ 8u
l"
bFT
bF -AfcL(u,ft)[z]X
+ { -\TAbL(u,b)[z] bb
+ A' dtt bb
ty
A
dg
+ \l -r- 5z~A[\)TC[8z] + -&5v after substitution] dV
(8-151) (8-153)
+ 8b}dr.
bb
8-28
AMCP708192
By putting Y
a
(8-158)
F= (8-154)
g=go> fo LaGA
and where
respectively
n*(x)=( n*;ae^) > (8-161)
* = ( je* ; aEA ) .
The expressions, Eqs. 8-156 and 8-157, give
the desired explicit dependence of 6J and In this notation,
8^a on 5u,Sv, and 8b. The problem is now
reduced to determination of 8u, 8v, and 6b
which give the greatest reduction in J subject 8\p= ff A* (x)T8u dl
to the constraints of the problem.
Likewise, define
The procedure will now be to choose 8u,
8v, and 8b so as to minimize 5/ subject to the
constraints Eqs. 8-141 through 8-143, just as
*(*) = (8-163)
in pars. 8-1 through 8-3. In order to insure
iED(x)_
Eq. 8-141, define
8-29
AMCP 706-192
0 < C2 < 1 and 0 < C, < 1 for all bu, 8v and 8b, where
30
6$(x) = (x)u(x) (8-169)
du 57 = Jj r-X0A/r(jf)-7rA*7'(x)
and
9j
Sw (x) = - (x)8v(x) (8-170) 7o 8uT W..
ov
(8-173)
8-30
AMCP706192
Using 5/ of Eq. 8-173 inEq. 8-172 At this point it is assumed that the problem
is normal so that X0 = 1 may be chosen. Eqs.
5(8/) = 0= Cf\~\0AjT(x) 8-175 through 8-177 yield
L
si
-yTA*7(x)
-2yQ8uTWu
8u{x)= 2^-W-Hx)
T,-^ d<t> 2
- ' (x) 8 udSl
du
-AJ(x)-A+(x)y- ^~ /*(*)
du
J - \0lljT(x)
xen (8-178)
-yTn*T(x)-27o5vTWv 8v(x)
2To
3d)
82vdV
.v'ixy
dv x ["ny(x)-n^7
A,7'
"$>4> 3d)-,T
v{x) ,xer (8-179)
2y08bWh 82b ,
and
(8-174)
6b
=^yTWb1 <-fi/-*7)- (8-180)
for all 82uQc), xefi, 82v(x), xsr, and 82b.
This implies
Assume for the present that Eqs. 8-167 and
J
- \0 (x) - A* (x) -. 27o Wu 8u{x) 8-168 are equalities. Substituting Eqs. 8-178
and 8-179 into Eqs. 8-167 and 8-168 yields
30 T
(x) = 0 (8-175)
du
3wr and
v(x) = 0 (8-176)
dv
1 3c , , r
W~l - llJ - IT* 7
forx^V, and 27o ?>V
3>r
7
-X0e -8*7-2T0W6 = 0. (8-177) c =-c,w, xer.
3v
8-31
AMCP 706-192
9v 1 i
~TZ~ M, j - -* M, , y - C2M, -
-"-7) , *er (8-182)
where C3JW, . =-Ci^ (8-185)
r
9cZ) , 9w
A" = 3v ^ 3v
7 A
^-Wo^1 LI " 3u ' ^=ffA 3
30 -r
30 A
W-1 }
du " /
x (-AJ -A*?)
>Vv-ir A*
30 r 6 ..
A@-'27oC20j', xES~ x
9u 37TI)n'dr
8-32
AMCP 706-192
30 _,, and
X
-^K )A*da
8b = 5ft1 +8b2 (8-193)
dT 27o
^ p
\ 3v
where
!".-'!'
" \ du du " /
+ 8* H/;1^ (8-187)
x (AJ - A*M~l Alexen (8-194)
*.- ^K'^^'i*
(8-188)
and "V du du " )
M, . = I TVi>Tw-, dT W
A *dr.
3v
(8-189)
+ C,M ^ U>
>] (8-195)
(8-190)
dv 3 "
8-33
AMCP708192
J
Sbl = Wri (i -ft*M~\Mn) (8-198)
4. -du2 =~C2<t>, in
du
and
90 ,
5. m-Su1 = 0, in $2
&b>=W-b1S.*M-\(-C1+ + C2Mii$
+ C
3^> (8-199) , 9w ,
6. -6v2 =-C3w, onT
3v
2. 2*T8b2 + S IT^Sv^r
Step 4. Solve the differential equation, Eq.
8-150, subject to the boundary
+ SS A* 8u2dSl = -Clip conditions generated by Eq. 8-152
a
with g and F given by Eqs. 8-154
and 8-155, to obtain \J and A*01,
3. Q"/T5b1 + S n*T8v1dr
r respectively.
+ SS A* 8u1d. = 0
n Step 5. Compute A/(x), nJ(x), ZJ, A* (x),
8-34
AMCP708192
11* (x), and * in Eqs. 8-153 and As in par. 8-2, if after several iterations the
8-161. constraints are all satisfied, 8u2(x), 8v2(x),
and 8b2 will all be zero. In this case,
Step 6. Choose the correction factors Ct,
C2, and C3 in Eqs. 8-160, 8-167,
5/= \Mlr-MTl M-\M, ) (8-200)
and 8-168.
2
+ 5M (JC)
8-5 OPTIMAL DESIGN OF AN ARTIL-
LERY RECOIL MECHANISM"
v<l>(x) =v<>(x) 8v*(x)
2To
An artillery weapon mounted on tires or
2
+ 8v (x) tracks has some undesirable features. Unlike
the hard mount (weapon rests on a base
plate), the flexible mount will have a pitch
fe(D = bW 8bl +8b2. motion. During the recoil stroke, when the
2To
weapon is fired at 75-deg elevation, the tires
load up or compress; and when counterrecoil
Step 11. If the constraints are satisfied and
begins, the tires act like a spring and unload
u'fx), 8v1(x), and 8b1 are suf-
sending the tires off the ground. It is quite
ficiently small, terminate. Other-
obvious that, when the weapon comes to rest,
wise, return to Step 2 with
u<>00, V<>(JC), and Z>(0) re- the likelihood of it being zeroed in for the
placed by u(1)(*), v(')(~)%nd *The results of this paragraph represent the work of Mr.
6(1) .respectively. T. D. Streeter, Ref. 15.
8-35
AMCP 706-192
next round has been reduced considerably, modeling for the high-speed digital computer.
especially for high rate of fire weapons. This To do this, the steepest descent, numerical
phenomenon is known as a secondary recoil technique is used to minimize the hop or
effect, because an additional acceleration pitch motion of the weapon and, at the same
term enters into the recoil equations. Because time, to determine the necessary control
of this secondary recoil effect, the recoil rod design that will minimize hop.
mechanism design becomes much more dif-
ficult. For short recoil, the orifice areas in the The recoil equation for a rigid mount is of
recoil mechanism are designed at maximum the form
elevation (75 deg). Therefore, when elevation
is mentioned throughout the remainder of x+f(x)x2 + g(x)=h(t) (8-201)
this report, it refers to maximum elevation.
The weapon positioned for high-angle fire is where x is the displacement of the recoiling
shown in Fig. 8-1. parts, g(x) is a restoring force, and h(t) is the
The purpose of this paragraph will be to *The control rod in a hydraulic recoil mechanism is a rod of
develop a systematic recoil mechanism design variable cross section which moves through a larger orifice
during recoil and varies the area of the orifice to control
procedure characterized by mathematical recoil force level.
-36
AMCP 706-192
8-37
AMCP 706-192
At time t2 the velocity of the pitch motion intermediate and terminal constraint func-
must be zero (necessary condition for maxi- tions occur; xs and x3 are the velocities of
mum pitch), and the displacement of the the recoiling parts and pitch motion, respec-
pitch motion is to be a minimum. Note that it tively; x2 and xA are the displacements of the
will be possible for t2 to vary between tl and recoiling parts and pitch motion, respectively;
t3. Therefore, the hop or pitch motion will be i^i = 0 is the constraint on the displacement
minimized for the entire counterrecoil stroke. of the recoiling parts such that at the end of
At the final time t3, which is the end of the recoil stroke the displacement will be
counterrecoil, the recoiling parts must return exactly equal to ??max inches. \fr2 = 0 is the
to their original position, and the velocity of constraint demanding that the recoiling parts
the recoiling parts will be some specified value return to the latch position at the end of
V3. This is to insure that the recoiling parts counterrecoil. ^ 3 = 0 is the constraint which
return to the latch position. It will also be requires that the velocity of the recoiling
demanded that the total cycle time be equal parts come into the latch position at a
to cT seconds. velocity V3 inchespersecond.fi1 =0 defines
the time at which the end of the recoil occurs;
Formulating this problem into the mathe- Q,2 = 0 defines the times at which the pitch
matical notation of par. 8-3 yields velocity is zero and the one with the largest
displacement is selected, thus defining the
Minimize/ = x^(t2) =g0 (8-202) time at which maximum hop occurs; and fi3
= 0 defines the total cycle time to be exactly
subject to the equality constraints: equal to cT seconds.
8-38
AMCP 706-192
8-39
AMCP 706-192
M = mass of elevating parts less recoil- q2 = distance from center line of trun-
b
ing parts nion to front of cradle (horizon-
tal)
i)b = distance from center line of trun- =
nion to mass center of Wb (hori- f2 distance from center line of trun-
zontal) nion to R(t) application (verti-
cal)
f6 = distance from center line of trun-
nion to mass center of Wb (verti- ct = distance from center line of trun-
cal) nion to bottom of rail (vertical)
/a =
traverse moment of inertia of Wa H,Z = axes fixed in cradle
about its own CG
8-40
AMCP 706192
_/i(|S, \+\S2 |) sgn(77) (8-206) tively, for the XM164 Howitzer. Eqs. 8-208
and 8-209 determine the guide friction.
JAfa(77 +yt COS7 + zf siny)2 + Affi [(rjft COS7
For small 0 the following approximations
+ 2
yt-$b sm7) + 0?6 snry + z, are made:
sin0 = 0
02
+ Id ! 0 + 2A/STJ0(T? + jr COS7 + zf sin7) COS0 = 1 s-
-g\Ma(T)+y, cosy +zr siny) From these approximations and the follow-
ing definitions, Eqs. 8-206 and 8-207 can be
xcos(7 + 0) +Md(yd cos0 zrf sin@) simplified. For simplification, define:
+ CON1 = Ma
Mb [yt cos0 zf sin0 + r\b cos(7 +0)
8 41
AMCP 706-192
CON32 = ^\Mfl-CON8
+ [R{t) +CON3] -CON6 +CON18-7?
CON33 = g-M -CON4-CON8
+ CON19-7T02 +CON37-02
CON34 = M,-g-nb-com (8-211)
8-42
AMCP 706-192
+ CON38-0+CON32-5?0
The optimal design problem can be stated
as follows: Determine the design variable
+ CON39 -c<p + CON16-02i?
R(t) in the interval 0 < t < t3 so as to
+ CON17-02 +(f)-CON7
minimize J = x^(t2) (8-216)
+ [(0+CON3]'CON6
subject to the constraints
+ CON18-r?+CON19-T?-02 1>i=x2(t1)-rio+vntx=0 ^
iP3=xl{ti)-V} =0
Eq. 8-212 can now be written as
(8-217)
1 1
ft =xl(t ) = 0
V = (v13-v22^v12v23)/(v11-v22))
(8-213)
n2 =x3o2)=o
<t>=v23/v22 .
a3 =?3 -C-r =o
By making the following definitions, Eqs.
8-213 can be put into first order form:
8-43
AMCP708192
4>=R(t)-R
r
* ' max
< 0 (8-2 18) 3v,
'22 \ / 3v. 1
M
"iil^-^l +"22'
dx. 3*.
and satisfying
l(viivl2) i= 1,2,3,4
x=f (8-219)
ML -v12x3(CON13-x2 +CON14)
where the components of the vector f are dx 1
given in Eq. 2-215 and initial conditions are
/("nv22)
*i(0) = X3(0) = x4(0)=0, x2(0) = r)o
ML VuV22[2Mav13(x2 +CON4)
dx2
The minimization problem stated here
starts with an estimated design for R(t),
+ Mav22xl - "12 (COm3>Xlx3
analyzes it, and then improves on the design.
The steepest descent technique of par. 8-3 is
+ CON32-x4+CON16-xi
used here to solve the design problem stated.
The first step in implementing the computa- + CON18 + CON19MC4)]
tional algorithm of par. 8-3.3 is computation
of auxiliary variables required for the al- -(V13V22 ~vl2v23)
gorithm.
^[2Mavll{x2+COm)]\l{v\lv\2)
8-5.3.1 DETERMINATION OF THE AD-
ML = iv22{2Max2x3 + 2-CON12jc3)
JOINT EQUATIONS bx3
-C+2'CON16-x3x2
"a/" 3
x=- X, 0 < t * t
dx
+ 2-CON17-*3)]/(v11vaa)
v
"(Sr)-vi;
+ 2-CON37-*4)]/(yll()
9/2
dvu dxx ax2 dx3 3x4
8/3 3v2;
- (vi3y22 - v12v23) dx: bx)~V23\bx '?)>'
8-44
AMCP 706-192
3/4
3A "0, ^0, 3/4 3/4
7T--U ^~0. It is the object of this paragraph to
ax, OJt 0JC3
2 0X4
determine the initial conditions at t3, t2~ ,and
tl~ for the four different integrations per-
formed on the adjoint equations, i.e., for i>x,
The adjoint equations now become
i//2, ^3, and/.
3^_ 3/a
Since fi3 of Eq. 8-203 does not depend
3*! 3xx
explicitly on x, dl3/dx = 0 and Eq. 8-114
reduces to
3/3
3x4 3jcd
x^a3) = (o,o,o,o)r
(8-221)
a3) = (o,i,o,o)r
where the partial derivatives are as computed
in this paragraph. x*ja3) = (i,o,o,o)r
8-45
AMCP706192
2
gl(t -0)=0 g*(fi -0) = 0
8-46
AMCP 706-192
where
Eq. 8-220 may now be integrated from t3
to t, withjumps at t2 and t1 defined by Eqs. 1, if 0 < 0
8-223 and 8-224. This completes computation d(t)
required by Step 5 of the algorithm of par. 0, ifd>> 0
8-3.3.
A'W-^A'W
and from Eq. 8-35
8-47
AMCP 708-192
Finally, the improved design is provided by important factor in reducing the hop; i.e., the
Step 11 of the algorithm, where entries are faster the recoiling parts accelerate during this
computed directly from Eqs. 8-44 and 8-45. period, the greater the reduction in hop. As
While these formulas are a bit messy, they are one would expect, an increase in allowable
routinely programmed matrix computations recoil length also reduces hop significantly.
which are no real challenge to the computer. An increase in the maximum rod force will
also reduce hop, for example, if the recoil
8-5.4 RESULTS AND CONCLUSIONS force is allowed to obtain the value 24160 lb
in constraint set of Eq. 8-225, the hop can be
Fig. 8-6 represents the optimal recoil force reduced an additional 0.32 in. Fig. 8-7 shows
to minimize hop at 75-deg quadrant elevation
< 0.08
t
=S
4V %;
^
0 10 20 30 40 50 60 70 8090 100110120130140 o 0.06 V ^
o
Time, msec (3ro ove s foi Minimum N \
0.04 iop foi R(f) < 22,000 It)
Figure 8-6. Optimal Rod Force
0.02
'V
s\
0
j ^
0 2 4 6 8 10 12 14 16 1820 22 24 26 28
with the following constraints: Recoil Length, i
8-48
AMCP 706-192
recoiling parts were 250 in. away from the to optimize tire characteristics for the final
latch position and the required final velocity configuration in the tire itself. In order to
of 6 in./sec, was 96 in./sec. In approximately obtain optimum weapon performance for
14 iterations, convergence was obtained flexible mount systems, such information as
which seems to be very fast if one considers tire performance could be incorporated into
the complexity of the equations involved. the mathematical model and perhaps tire
characteristics could also be optimized in the
Results from firing tests show a significant environment for which they are being used.
reduction (50% or more) in hop can be
achieved simply by increasing the tire pres- The technique used here has the capability
sure. Because tire performance information is to optimize many design parameters simulta-
not presently available, it was assumed neously. If there exist other sensitive param-
throughout this analysis that the spring rate eters, consideration should be given to opti-
of the tires was constant (linear spring). mize them along with the design variable R(t).
Therefore, it is not known what results would This study clearly indicates that weapon
be obtained under a nonlinear spring model. performance can be improved by using
Tire manufacturers are investigating methods methods of optimal design.
REFERENCES
8-49
AMCP708192
11. E. B. Lee and L. Markus, Foundations of 15. T. D. Streeter, Optimal Weapon Stability
Optimal Control Theory, Wiley, New By a Steepest-Descent Method, Tech.
York, 1967. Rept. SY-R2-69, Systems Analysis
Directorate, U.S. Army Weapons Com-
12. P. K. Wang "Control of Distributed mand, Rock Island, 111, Aug 1969..
Parameter Systems", in C. T. Leondes,
Ed., Advances in Control Systems, Vol.
16. J. W Frantz and R. H. Coberly, Design of
1, Academic Press, New York, 1964.
Control Rod for XM44 Recoil Mecha-
13. W. L. Brogan, Optimal Control Theory nism (XM1 64 Howitzer) Using Coupled
Applied to Systems Described by Partial Rotational and Translational Equations
Differential Equations, Report No. of Motion, Technical Notes, U.S. Army
65-29, Department of Engineering, Weapons Command, Rock Island, Illinois,
UCLA, Los Angeles, 1965. August 1968.
8-50
AMCP 706-192
CHAPTER 9
9-1
AMCP70G192
dimensional optimal structural design prob- T is the boundary of that region. L and B are
lem is given, and the computational algorithm differential operators on Q, and T, respective-
based on theory of Chapter 8 is developed. In ly. The functions Q and q are generally
following paragraphs, this theory and com- related to loads.
putational algorithm are applied to example
problems. Alterations in the general theory To better fix the idea of operators, con-
are made as they are required in the solution sider the simply supported beam of Fig. 9-1.
of individual problems.
9-2
AMCP 708-192
which 2 is a subset of two dimensional space, Constraints of the form i}(x,z,u,b)< 0 forxS2
see par. 9-7. will have to be reduced to functional con-
straints as in Eq. 8-6.
In addition to response due to static load,
it is necessary to treat the response of a Beginning with an engineering estimate of
structure to dynamic loads. One important the design variables u(x) and b, Eqs. 9-3
characteristic of a structure, which is classi- through 9-6 may be solved for z(x),y{x), and
fied as dynamic response, is natural fre- f. Aperturbation, {bu, 5b), in the design leads
quency. Another response, which must be to perturbations in the cost and constraint
treated, is buckling. Both buckling loads and functions
natural frequencies are determined, in the
present class of problems, by linear eigenvalue
problems. Again using operator notation,
5/ =
db aj- f& bz
and 90.
60; = 1#- 5u, xGfi, i= !,...,/ (9-11)
Cy = 0, xer (9-6)
de, de..
de,. r nog,
where f is the eigenvalue and y(x) the 5.//,
v =_L hb+L 5f + U 8z
associated eigenfunction or mode shape. The > db 3f J J\fa
operators K andM generally relate to stiffness
and mass, respectively. In conservative prob-
lems they will be symmetric (Ref. ^(formal- + ^Et 6U +^L 5b) dl,
bu db J
ly self-adjoint) but in nonconservative sys-
tems, they will not be symmetric. The more /= l,...,s (9-12)
general unsynimetric case is treated here.
The object, as in preceding work, is to
The optimal design problem is that of eliminate explicit dependencies on 5z and 5f.
minimizing First, the perturbation equation for 8z is
9-3
AMCP 706-192
6/= % B ff\
of i+ n \ o + lK(u,b)y]8b
do
m
C*jF = 0, JCGT
ae7
Premultiplying Eq. 9-19 by TlT ,
y integrating, db db
and rearranging terms yields
[L{ujb)z\ \J
T
ffy My<m\b$ =
a '
?( [M{u,b)y]) y \\da
-$yT [M(u,b)y] \ bu da
(9-22)
W yT-^ [K(u,b)y]
A'/ = Ml
du
+
dQ1
du
// [yTK8y - $yTM8v] da
x [i {Kiu> b)y] y
= ff [8yT(K*y - $M*y)] da = Q
n
S [ [M(u,b)y] ) y
by definition of the adjoint operator. It must (9-23)
9-5
AMCP706192
yTMydSl (9-28)
(9-24) (9-30)
and Defining A<j>(x) and Ai// to be desired
7- reduction in constraint error, the linearized
ae problem is to choose 6u(x) and Sb to mini-
3w 9
mize
ij fJyTMydSl subject to
Z.T
A* 6 - A(j>< 0, forxJ2 (9-32)
[[K(u,b)y]) y
fi*r5fc+ //A*"r6u c/n-A^<0
LL [M(u,b)y]J y (9-33)
(9-25) and
A;r dV M_,
Steepest Descent Algorithm for Optimal M i><t>
-I! " du
MT} *4>dto
"">
Structural Design
30 . 30r
Sul(x)=Wl7l [I ^~ M-}
du <*"
T x T 1
M.>jij
r = ^ W- & + jJA'" W-
-W
du "
u
\ du ">> du )
9-7
706-192
A minimum weight column problem has In its present form, the boundary-value
been solved in pars. 5-4 and 7-2 to illustrate problem is just as in Eqs. 9-5 and 9-6 and is
the use of two optimization techniques. The self-adjoint, so y = y in par. 9-2. For use in
same problem is solved in this paragraph, by the steepest-descent algorithm, Eq. 9-23 is
9-8
AMCP706192
(9-42)
fundamental frequency at least as large as a
predetermined frequency o>0. Further, so that
the beam can support a minimum level of
bending moment, it is required that the
The computational algorithm of par. 9-2 second moment of its cross-sectional area
now applies directly to the present problem. shall always be at least as large as a positive
The solution of the eigenvalue problem, Eqs. constant I0.
9-39 and 9-40, was obtained using the finite
element analysis technique outlined in par.
/77T7
5-4. The solution of this continuous problem ~&%h
required approximately the same time per T w
iteration as the discrete technique but fewer
iterations were generally required for con-
Figure 9-2. Simply Supported Vibrating Beam
vergence. Exactly the same results as given in
par. 7-2 were obtained. The reader is referred
As in the column problem of the preceding
to that paragraph for a tabulation of results.
paragraph and par. 7-2, the geometry of the
cross section is fixed and all dimensions are
9-4 A MINIMUM WEIGHT VIBRATING
allowed to vary proportionally. If the area is
BEAM denoted u(t), then
9-9
AMCP 706192
TABLE 9-1
L,
COMPARISON OF OPTIMAL BEAMS
Volume of Weight
2
r
Uniform Reduction
Beam of Optimal or
Frequency, Length Volume, Material 1 3
rad/sec 10in? in3 Savings,
%
X
Defining
y2=EIiw'l
j>4 =EI2w'2'
ys =w3
9-11
AMCP 706-192
the differential equations for vibration of the The perturbation boundary conditions
frame are from Eq. 9-51 are
/.'-
lky2 0 8y\(0) = 0 8y'1(L)=8y'3(0)
8y'2(L) + 8y'6(0)
(9-50)
where f = pco2. Boundary conditions are
y'2(L)+y'6(0)>
H. 8u2{x)dx yx(L)
-
M ?| u2(x)dx ViiL)
9-12
AMCP 706-192
i 2y22
Eoi1uj
5! in Table 9-2 for several frequency require-
ments, and the profile of an optimum frame is
shown in Fig. 9-5.
6wo
:;J
TABLE 9-2
5, I t/jc
WEIGHT OF OPTIMUM FRAMES
Optimum
sr: fc !
Weight, lb 1.73 2.56 3.59 4.69
3^5 )rf*.
2yl
-'O
8u7
L^aj! + ryi(i-)
2^1 Su3 \ dx
a*ui
(9-54)
J:
1 T
5? = l' Af 6W dx (9-55) Figure 9-5. Profile of Optimum Frame
9-13
AMCP 706-192
[Ea,u21(x)w"]" = -q{x)
Figure 9-6. Laterally Loaded Frame
and the boundary conditions are
w(0) = 0
allowed to vary along the length of the first
and second members but the geometrical w'(0) = 0
shape of the cross section is fixed. Thus, ' (9-58)
w"(L)=0
Ii(x) = a.uf(x)
3EI3w(L)
- (Ea,uiw")' (L)= ;
i>3
where a; depends on the cross-sectional geom-
etry and Ufo) is the cross-sectional area of the
ith member at the point x. The size of the To get the boundary-value problem, Eqs.
third member is fixed and all members are 9-57 and 9-58, into the form of Eq. 9-1,
taken the same length. define
z2 and
~
Lz- aiUf(x)+I0 < 0 /= 1,2 (9-67)
j" - [l/Ealu21]z2_
The constraint, Eq. 9-65, requires that the
(9-60)
'(*) axial load 7' in the second member be less
=e than or equal the buckling load. A limit S is
0 placed on horizontal deflection of the top of
the frame in Eq. 9-66. The constraints, Eq.
9-67, are included to insure that member
and cross section does not go to zero anywhere. A
more realistic constraint would be on bending
MO) "0~ stress but this would require a constraint of
the form of Eq. 8-6. This constraint will be
zi(0) 0 included in subsequent work but will not be
Bz = treated here.
22 (L) 0
>(0)"
It remains to obtain expressions for5zi (L)
Q- (9-63)
and 6P explicitly in terms of du.(x). From Eq.
9-21, SP may be expressed in terms of 5w2 (x)-
J = y\
i
'0
[ul{x) + u2(x)]dx (9-64)
/
fL
X7L8z dx yields
T
-L
The constraints to be enforced are j \ L8z dx= j( SzTLX dx
\T
-'0 Jn
+ \28z[ -X252,)
<p2=-Zi(L)-S< 0 (9-66)
9-15
AMCP706192
Choosing X such that IX = 0, substituting for Eqs. 9-1 and 9-5, are formally self-adjoint so
LSz from Eq. 9-13, and using the linearized L* = L, K* = K, and M* - M in the general
boundary conditions of Eq. 9-61, this be- theory. Since the boundary-value problem,
comes Eqs. 9-62 and 9-63, is self-adjoint, y = y and
r
the computation required in Step 2 of the
2Zn steepest descent algorithm is considerably
buidx =
ECLyU\ reduced.
(9-70)
TABLE 9-3
Optimum
Volume, 502.1 531.1 556.0 653.S
2El \l(L)-\2(L) = 1 in3
(9-71)
\1(0)=-K\(0)=\2(L)=0
9-7 A MINIMUM WEIGHT VIBRATING
the identity, Eq. 9-70, becomes
PLATE
f f 2z2(jc)X2(x)
5i(x)rfx In order to illustrate the use of the
steepest descent method in higher dimen-
(9-72) sional problems, a minimum weight vibrating
plate problem will be solved. A rectangular
Thus an explicit relationship between 5u and plate, Fig. 9-9, is specified by its thickness
Sz1 (Z) has been found and may be substi- function h(xx, x2) over the plate. The object
tuted directly into the linear constraints, Eqs. here is to choose h{xx, x2) such that the
9-68 and 9-69. With proper choice of nota- weight of the plate is as small as possible
tion, these inequalities fit into the form of subject to the constraint that the natural
Eq. 9-33. frequency of lateral vibration is greater than
or equal to a given frequency to. Further, due
In this problem the differential equations, to applied loads, a constraint of the form is
9-16
AMCP 706-192
3T57 w ,,
-*-*2
>>>>>/ / // / f
enforced. In the present problem, h0(xlt x2) yi(a, x2) = 0,^,(0, x2) = 0
is taken as a constant h,.
y*(a, *2) = 0,y4(0, x2) = 0
Denoting bending moments (Ref. 4) by yx (9-75)
=
MX, y2 ~ M and y3 = Af and the lateral y2(x1,0) = 0,y2(x1,b) = 0
displacement by y4 = w, the equations
governing lateral vibration may be written y4(x1,0) = 0,y4(x1,b) = 0
(Ref. 4) in self-adjoint form as
<>2yt 12
-0,2 -^i) To complete the formulation of this $rob-
ax: Eh3
K(h)y -;
lem in terms of the preceding theory, a cost
=f = JAW0.V
V. "6(1 + M) function is defined by
y
Sx1dX1 Eh3
9-17
AMCP 708192
This problem is now in the form of the (Ref. 5). Data for this problem are a = b = 5.0
general problem of par. 9-2. The domain 1 in in., E = 3.0xl07 psi, p =7.43xl(T4 lb-sec2/
this case is simply the rectangular region of in.4, v = 0.30, and co0 = 1375 rad/sec. A
the plate that is of dimension two. Further, uniform plate withf = pco2 = 1400 was taken
since the boundary-value problem, Eqs. 9-74 as the initial estimate. The volume of the
and 9-75, is self-adjoint,^ = y in the theory of optimum plate is 10.71 in? Double sym-
par. 9-2. metry of the optimum plate was observed
about the axes through (a/2,6/2). One quarter
Using the definition of K and M, <p h0 of the optimum plate with contour lines is
h{xux2) in Eq. 9-73, and e;.(f) = poo2, -f in
i shown in Fig. 9-10.
Eq. 9-77, from Eq. 9-23,
A/=7 (9-78)
A '
p6(yj 2w1y2+y21)+im+IJ)yl
Eh"
-H/n
a
hy\dxxdx2
(9-79)
REFERENCES
9-18
AMCP 706-192
APPENDIX A
CONVEXITY
A-l
AMCP 706-192
of R" . Then/"(x) is called a convex function if All these desirable properties of convex
for any points z andj inZ), functions will go to waste unless one is able to
test a given function for convexity. The
f[z+0(y-z)] </(z) + fl[/(y)-/(z)] following three theorems provide useful tests:
REFERENCE
A-2
AMCP 706-192
APPENDIX B
B-\
AMCP 706-192
which is or
(B-6)
f
Jo
*2 dw2\
KE= + |e?x
ft- (6x2 _6x)
(B-7)
2
i- (6x - 6fcc)
x- \ . x 3
-"l I ~ )+2
+ 5-(3x2 -411X+K.2)
(B-5)
or
(2x3 -3fcc2)
o o
o o
3 3 1 1
+ (x3-2k' +fcc) oo -h
58 --h
5S ^r
20 ^
20
u. (B-8)
3 3 1
o o 5S 20 "20
1 1 e 12
0 0-^- -^r
20 20 15 "60
1 1 e
+ ^f(x3-x2) rfx 0 0
20 20 ~60 75
B-2
AMCP 706-192
In most structural analysis work it is more Equilibrium states of a structure can now
convenient to use variational principles to be characterized in terms of the total poten-
describe the state of the structure than it is to tial energy. For unconstrained conservative
use Newton's laws directly. For use in analysis systems, a necessary and sufficient condition
of the structures considered here, the basic that u be an equilibrium state is that
variational principles are stated. These prin-
ciples apply to systems that are unconstrained
n W{u)
in the particular coordinate system chosen; bV(u) = 2 bu. = 0 (B-10)
1 = 1 out '
i.e., once the coordinates, up i = 1,..., n,that
describe the state of the system are chosen,
for all bu.. This is equivalent to
there are no algebraic relationships between
these variables. Further, it is required that all
dV(u)
force-state relationships for external or in- = 0, i= l,...,n. (B-ll)
du.
ternal forces are continuous, i.e., small
changes in state yield only small changes in
forces.
This result is proved in Ref. 2, page 23.
Let W denote the work done by all forces
on a structure due to an admissible displace- A second result is that a conservative,
ment. Then, the system is called conservative unconstrained system is in stable equilibrium
if the work W done in any displacement that at a state u if and only if the total potential
returns to the original state is zero; i.e., the energy is a relative minimum at u. This result
system is conservative if no energy is required is proved in Ref. 2, page 30. For the kind of
to change the state of the structure and bring structural systems considered here, it is
further shown, Ref. 2, page 211, that at a
it back to the original state along any path. A
buckling load P , the second variation must
structure would be nonconservative, for
be positive semi-definite, i.e.,
example, if sliding friction or viscous damping
were present.
b2V{u) = buT-&-f(u)bu> 0 (B-12)
Starting from some reference state of the
structure u0, define the stored energy in the
structure at state u to be U(u). Note that and further that there is a bu such that
since the system is conservative, U(u) depends
only on u and not on the path the state
_~ 92V
variable followed in getting from u0 to u. bzV{u)=buTT(u)bu =0. (B-13)
bur
Likewise, the negative of work done by the
external forces acting on the structure due to
the change in state variables will be denoted The inequality, Eq. B-12, shows that zero is
Sl(u). Again, 2(u) depends only on the final a relative minimum and Eq. B-13 shows that
state and not on the path from u0 to u. With relative minimum is attained for bu = bu. For
this notation, the total potential energy V(u) 52 V(u) treated as a function of bu, then, it is
is defined as necessary that
B-3
AMCP 706-192
1 p ' /
l62V(u)] =0, i= l,...,n m '
3/
35 i 2 /=i dZ 3Z
REFERENCES
B-5
AMCP 706-192
INDEX
INDEX (Con't.)
1-2
AMCP 708-192
INDEX (Con't.)
1-3
AMCP 706-192
(AMCRD-TV)
JOHN LYCAS
Colonel, GS
Chief , HQ Admin Mgt Ofc
DISTRIBUTION:
Special