Numerical Methods Using Trust-Region Approach For Solving Nonlinear Ill-Posed Problems
Numerical Methods Using Trust-Region Approach For Solving Nonlinear Ill-Posed Problems
Numerical Methods Using Trust-Region Approach For Solving Nonlinear Ill-Posed Problems
1147-1157
NUMERICAL METHODS USING
TRUST-REGION APPROACH FOR SOLVING
NONLINEAR ILL-POSED PROBLEMS
SUNYOUNG KIM
ABSTRACT. Nonlinear ill-posed problems arise in many application in-
cluding parameter estimation and inverse scattering. We introduce a
least squares regularization method to solve nonlinear ill-posed problems
with constraints robustly and efficiently. The regularization method uses
Trust-Region approach to handle the constraints on variables. The Gen-
eralized Cross Validation is used to choose the regularization parameter
in computational tests. Numerical results are given to exhibit faster
convergence of the method over other methods.
1. Introduction
Differential equations describing many real-life models must be solved
numerically, This involves discretization and transformation to a system
of nonlinear algebraic equations,
(1) F(x, y) = 0, 0 ~ x ~ b
where the dimensions of vectors F, x, y depend on the number of dis-
cretization. Let x E Rm, F and y ERn, where m ~ n. Then, (1)
becomes
(2)
Problem (2) is said to be well-posed if: (i) for any y ERn, there ex-
ists a solution x E Rm; (ii) the solution X is unique; (iii) the solution
Received April 11, 1996. Revised September 4, 1996.
1991 AMS Subject Classification: 49M35, 65FI0.
Key words and phrases: nonlinear problems, ill-posed problems, constrained opt-
imization.
Research supported by Korean Ministry of Education, BSRI-96-1430.
1148 Sunyoung Kim
x depends continuously on the data y. Otherwise, the problem is ill-
posed. Examples of nonlinear ill-posed problem are inverse problems of
differential equations (i.e., parameter estimation) [6], inverse scattering,
nonlinear Fredholm first kind equation [8].
The purpose of this paper is to solve nonlinear ill-posed problems effi-
ciently, which, in most cases, is impossible with usual numerical methods.
In many modeling problems, obtaining accurate solutions is very impor-
tant. One way to achieve this is by discretizing the differential equations
with small intervals. This, in turn, results a large number of equations
and parameters. As the number of parameters and equations of problem
(2) increases, it tends progressively difficult to solve the problem. There-
fore, it is essential to develop a stable numerical method that provides
accurate solutions.
Before we present numerical methods to solve (2) for any value of
y that corresponds to 0 ~ x ~ b, we discuss Trust-Region approach
briefly. Trust-Region approach has been well-known for unconstrained
optimization problems .
minf(x)
x
where f : Rn ~ R is smooth [1]. It usually generates a new iterate
x k+l = x k +S from the current approximation x k by solving the sub-
problem
(3) minM(x) subject to IIsII2 ~ (32,
where M(x) is a quadratic approximation to f(Xk+S) and (3 > O. If the
solution S for (3) decreases the value of f(Xk), viz., f(Xk +x) < f(Xk),
then Xk+l = Xk+S. Otherwise, the trust region parameter (3 is decreased
and solve (3) until either f(Xk+S) < f(Xk) or {3 ~ 0, when the iteration
is terminated. In order to solve (2), we develop a method utilizing the
Trust-Region approach that handles restriction on x properly. We first
discuss how (2) has been solved by outlining Algorithm. 2 in [rpt] and
then show how the Trust-Region approach can be used to circumvent
the ill-conditioning and bounds on x.
Algorithm 1[11] (y fixed, x variable)
1. Given y, assume an initial value for x.
Numerical methods using Trust-Region approach 1149
2. Compute F(x, y). If IIF(x, y)112 < to, where to is a small number, or
too many iterations have been done, then stop; else 'compute
dF F(x +toej,y) - F(x, y) .
-dej = , J = 1, ... , m,
x to
where ej is the jth column of the identity matrix of order m.
3. Solve
(4)
dF
dx 6x = -F(x,y)
for 6x, and let x = x +6x. Go to step (2).
Step (3) of the above algorithm involves the solutions of a system of
linear equations. Let J == ~ and y == -F(x, y). Since x E R
m
and
J is a n X m matrix, solving equation (4) is equivalent to the linear
least-squares problem
(5)
min IIJbx - Y l l ~ .
6x
One of well-known methods for solving (5) is Gauss-Newton method
[9J. Instead of solving Jbx = y in Algorithm 1,
is solved for ox. However, Gauss-Newton method fails to provide a
solution for many problems due to its computational aspects [2J. It
often exhibits large oscillations. One of the features of nonlinear ill-
pose problems is that J in (5) is often ill-conditioned and therefore its
condition number is very large. For these reasons, Gauss-Newton method
and other methods for (5) are unsuccessful. Let us look at (5) carefully.
Problem (5) is called discrete ill-posed [3.5J if the following conditions
are satisfied:
1. The singular values of J decay gradually to zero.
2. The ratio between the largest and the smallest nonzero singular
values is large.
1150 Sunyoung Kim
For this problem, standard methods such as LU, Cholesky, QR fac-
torization may give inappropriate results with large oscillations. One of
the popular methods for ill-posed problems is called Tikhonov regular-
ization method [12], from which many numerical methods for ill-posed
problem have originated. In t h ~ next section, we mention a regulariza-
tion method using generalized singular value decomposition and present
a method using Trust-Region approach. We also modify the problem so
that a meaningful solution can be determined. Then, we will present
numerical results showing better performance of our suggested method
over other methods.
2. Numerical Methods
In Tikhonov Regularization method, we define the regularized solu-
tion x as the following minimizer of the weighted combination of the
residual norm and the side constraint
min{IIJc5x - Y I I ~ + -y
2
I1L(c5x - c5x*) lin,
6x
where c5x* is an initial estimate and L is typically either identity matrix
or a p x n discrete approximation of the (n - p)th derivative operator.
The successful numerical tools for nonlinear ill-posed problems are
the Singular Value Decomposition (SVD) of J [4] and the Generalized
Singular Value Decomposition (GSVD) of the matrix pair (J, L). It
is known that the SVD reveals the difficulties associated with the ill-
conditioning of the matrix J while the GSVD of (J, L) yields important
insight into the regularization problem involving J and L [5]. For this
reason, we discuss the GSVD and use it to present a new algorithm.
The numerical method using the GSVD is called modified regularization
method.
Modified Regularization Method (Rl) Minimization problem (5)
is modified to
(6)
Numerical methods using Trust-Region approach 1151
where L is a discrete approximation to some derivative operator and ,
is a free parameter. Let L be an (m -1) x m matrix of rank of (m -1).
Problem (6) is equivalent to solving the least squares solution to the
overdetermined linear system [13]
which leads to the normal equation
(7)
Since, is a free parameter, equation (7) must be solved for various values
of ,. But, it is pointed out in [13], that solving (7) with different ,'s is
not as effective as the regularization method solving
This method is a special case of (6) when L = I. Since we are interested
in solving (6), we use the Generalized Singular Value Decomposition
technique. The Generalized Singular Value Decompositions of J and L
are
J = UDaP-l, L = VDcp-l,
where U and V are, respectively, n x n and (m - 1) x (m - 1) orthogonal
matrices, D
a
= diag(a}, ... , am) and
o
Cm-l
In view of the above equations, from (7) it follows that
1152
which leads to
(8)
Sunyoung Kim
where Pi is the ith column of P. If the modified regularization method
is used to solve ill-posed problem, the following algorithm can be used.
Algorithm 2
1. L is given. Get an initial guess Xo.
2. for k = 1,
3. Compute the Jacobian J(Xk).
4. Find the GSVD with J(Xk) and L.
5. Choose, from GCV (Generalized Cross Validation) fun-ctional.
6. Compute Xk+l = Xk +6x using (8)
7. for i = 1, ... ,m
if Xk+l(i) > b(i), then Xk+l(i) = b(i).
if xk+l(i) < O(i), then Xk+l(i) = O(i).
Algorithm 2 is simple conceptually and easy -to implement with
respect to handling the bounds on Xk. However, setting Xk to the value
of the bounds when Xk is too large or too small may cause divergence
of the iterates from a solution. This may happen more than often for
solving ill-posed problems because the changes to iterates must be made
carefully. This leads us to explore other possibilities, e.g., Trust-Region
method, for the bounds on Xk.
A Regularization Method using Trust-Region Approach (R2)
We would like to develop a regularization method with the General-
ized Singular Value Decomposition and Trust-Region approach. We first
transform (6) to a constrained minimization problem. Let JXk +Y== z.
Since
we have
(9)
min{IIJXk+l - z l l ~ +,2II LXk+l lin
Xk+l
subject to 0 ~ Xk+l ~ b
Numerical methods using Trust-Region a.pproach 1153
Notice that it minimizes on Xk+I' It is based on the fact that we only
know the bounds on Xk's. One way to solve (9) is to include Trust-Region
approach for constraints on Xk+l' Let XHI == x. Then, the problem (9)
is equivalent to the constrained problem:
where 0: = IIb!l2. H we use the GSVD described in the modified regular-
ization method, we get
Our approach for solving the above problem is to use the first-order
necessary conditions [7],
(11) (D; + ,.,? D ~ Dc)P-1x - DQU
T
Z + AX =0,
subject to !Ix1l
2
:5 0:
2
, A{/lxll
2
- 0:
2
] = 0, A~ 0.
For computation of a solution Xk of (11), we utilize a routine in NETLIB
for constrained minimizations. An algorithmfor the regularization meth-
od using Trust-Region approach is given as follows.
Algorithm 3
1. L is given. Get an initial guess Xo.
2. for k = 1""
3. Compute the Jacobian J(Xk).
4. Find the GSVD with J(Xk) and L.
5. Choose "y .from GCV functional
6. Compute (D: +,'lD'[Dc)p-l and DQUTz.
7. Solve (11) for Xk+l using a NETLIB routine.
In Algorithm 3, we do not set the values of XHI to 0 or b as Algo-
rithm 2. Rather, we solve (11) with the bounds on Xk. This is possible
by transforming the problem (6) to the constrained minimization prob-
lem (11).
1154 Sunyoung Kim
3. Computational Results
We apply the numerical methods in the previous section to a severely
ill-posed problem arising in a modeling problem of physiology [10]. We
describe the problem briefly. The differential equations for the model
are:
d1?iv )
dw +Jiv(W = 0,
d1?ivCik () . -l
dw +J ik W = 0, Z I 4,
d1?4k
dw + J4k(W) =0,
dC
4k
D4k dw - F4k +F4vC4k = 0,
4 4
LJiv(w) = 0, LJik(W) = 0,
i=1 i=1
where J iv(w) and J ik(w) are, respectively, volume and solute fluxes,
D
4k
are the diffusion coefficients and F4k(w) are the axial solute flows,
J
iv
(w) and J ik(w) are functions of only Cik(w), C
4
k(w) and x( w)( a
parameter vector, e.g. water and solute permeabilities). This problem
is transformed to a system of nonlinear equations using the trapezoidal
rule. We can denote the system of nonlinear eqution as
4>(x,y) = 0,
where x E R
18
and Y E R
63
The direct problem is: Given x, determine
Y from
min 114>(x, y)II.
x
And the inverse problem is: Given y, determine x from
(12)
min 114>(x,Y)II, ~ x ~ b.
x
See [10,11] for additional detail.
Numerical methods using Trust-Region approach 1155
In order to test methods Rl and R2 for (12), as in [10,11], we used
the model with parameters xm and xl given in [14]. Let us denote the
y's that correspond to xm and xl, respectively, by ym and yl which
can be obtained from solving the direct problem. We use these values
to provide initial values of y for the inverse problem. The intermediate
values obtained from ym and yl are taken as inputs in (2) by
(13) y = yl +8(ym - yl).
We expect to have xm as a solution of the inverse problem for ym
obtained from 8 = 1 in (13) and xl for yl from 8 = O. Therefore, in the
computional tests, we initially vary 8 from 0 to 1 and then check whether
we can find a solution for other values of 8. We could get a solution for
8 0 to 2 as given in Table l.
We have computed solutions using methods Rl and R2. In Rl and
R2, we took
-1
-1
1
1 -1
The term in (10) is included to control the smoothness [5] of
the solution x. The regularization parameter, controls the weight given
to relative to IIJx - The parameter, in (10) was chosen by
using Generalized Cross Validation technique. It is very popular and suc-
cessfully incorporated in many regularization methods such as Tikhonov
Regularization [12] and Truncated Singular Value Decomposition [4].
The GCV fUIlctional is
IIJx - +
G(,) = {Tr[I _ J(JTJ + ,2LTL)-lJT]p'
In algorithm 2 and 3, we compute the G(i) for various values of i and
choose, that gives the smallest value of (10).
All computations were performed on Sun Sparc-20. The initial guess
for x was given as xl. Without using the GCV functional, the conver-
gence was much slower and sometimes failed. We used Rl because Rl
1156 Sunyoung Kim
has been the most successful method so far solving the problem men-
tioned above. We used 500 as the maximum number of iterationS for
Rl and H2. In the experiments, both Rl and R2 do not provide better
solutions after 500 iterations. We obtained smaller values of IIFII for the
various values of y by using R2. The values of i for each () are given
in Table 1. The time difference of Algorithm 2 and Algorithm 3 is
from step 6. Rl takes less time in each iteration than R2. However,
R2 always converges to the values near IIFII shown in Table 1 earlier in
iterations, say, after 5 iterations than R1. That is, R2 converges faster
than Rl in IIFII. Moreover, R2 gives more accurate solutions as shown
in Table 1. Hence, we can conclude that R2 shows better performance
to find a solution than R1.
Table 1. Rl VB. R2
R2 R1
Iter.
II
F
II 1
Iter.
II
F
II 1
0.10 500 4.88851234-04 4.0363-08 500 9.9305909Q..05 4.0395-09
0.25 500 7.75893815-04 3.9897-08 500 1.19343469-04 3.9919-09
0.50 500 1.14674364-03 3.8983-08 500 1.60958269-04 3.9005-09
0.75 500 1.48632481-03 3.8658-08 500 2.07426879-04 3.8687-09
1.00 500 1.84542734-03 3.8787-08 500 2.56120387-04 3.8807-08
1.50 500 2.42974140-03 3.9110-08 500 3.36237531-04 3.9120-09
2.00 500 3.29750461-03 3.9430-08 500 4.58714961-04 3.9436-09
References
1. J. E. Dennis and R. B. Schnabel, Numerical Methods for Unconstrained Opti-
mization and Nonlinear Equations, Prentice-Hall Inc., Englewood Cliffs, New
Jersey (1983).
2. C. Fraley, Computational behavior of Gauss-Newton Methods, SIAM J. Sci. Stat.
Comput. 10 (1989), 515-532.
3. V. N. Joshi and R. P. Tewarson, On Solving Ill-conditioned Systems of Linear
Equations, Transactions of New York Academy of Sciences 34 (1972), 565-57l.
4. P. C. Hansen, The 1Tuncated SVD as a method for Regularization, BIT 27 (987),
543-553.
5. Per Christian Hansen, Regularization Tools: A Matlab Package for Analysis and
Solution of Discrete nt-posed Problem, Numerical Algorithms 6 (1994), 1-35.
6. C. Kravaris and J. H. Seinfeld, Identification of Parameters in Distributed Pa-
rameter Systems by Regularization, SIAM J. Control Optim. 23, 217-241 yr 1985.
Numerical methods using Trust-Region approach 1157
7. H. W. Kuhn and A. W. Tucker, Nonlinear Progmmming in proceedings of the
Second Berkeley Symposium on Mathematical Statistics and Probability, J. Ney-
man(editor), University of California Press, Berkeley and Los Angeles, Calif.
(1961), 481-492.
8. F. O'Sullivan and G. Wahba, A Cross Validated Bayesian Retrieval Algorithm for
Nonlinear Remote Sensing Experiments, J. Comput. Phys. 59 (1985), 551-555.
9. J. Stoer and R Bulirsch, Numerical Analysis, Springer-Verlag, New York, 1980.
10. R. P. Tewarson, Inverse Problem for Kidney Concentrating Mechanism Analyses,
Appl. Math. Lett. 4 (1993), 63-66.
11. , Models of Kidney Concentmting Mechanism: Relationship Between
Core Concentmtions and Tube Permeabilities, Appl. Math. Lett. 6 (1993), 71-73.
12. A. N. Tikhonov, Solution of Incorrectly Formulated Problems and the Regular-
ization Method, Dokl. Adak. Nauk. SSSR, 151 (1963), 501-504.
13. J. M. Varah, A Practical Examination of Some Numerical Methods for Linear
Discrete Ill-Posed Problems, SIAM Rev. 21 (1979), 100-111.
14. A. S. Wexler, R. E. Kalaba and D. J. Marsh, Three-dimensional Anatomy and
Renal Concentrating Mechanism I. Modeling Results, Am. J. Physiol., 260, (Re-
nal Fluid Electrolyte Physiol. 29), F368-F383, (1991).
Department of Mathematics
Ewha Womans University
Dahyun-Dong, Seoul 120-750, Korea
e-mail: [email protected]