An Improved LU-SGS Implicit Scheme For High Reynolds Number Flow Computations On Hybrid Unstructured Mesh
An Improved LU-SGS Implicit Scheme For High Reynolds Number Flow Computations On Hybrid Unstructured Mesh
An Improved LU-SGS Implicit Scheme For High Reynolds Number Flow Computations On Hybrid Unstructured Mesh
Abstract
The lower-upper symmetric Gauss-Seidel (LU-SGS) implicit relaxation has been widely used because it has the merits of less
dependency on grid topology, low numerical complexity and modest memory requirements. In original LU-SGS scheme, the
implicit system matrix is constructed based on the splitting of convective flux Jacobian according to its spectral radius. Although
this treatment has the merit of reducing computational complexity and helps to ensure the diagonally dominant property of the
implicit system matrix, it can also cause serious distortions on the implicit system matrix because too many approximations are
introduced by this splitting method if the contravariant velocity is small or close to sonic speed. To overcome this shortcoming,
an improved LU-SGS scheme with a hybrid construction method for the implicit system matrix is developed in this paper. The
hybrid way is that: on the cell faces having small contravariant velocity or transonic contravariant velocity, the accurate deriva-
tive of the convective flux term is used to construct more accurate implicit system matrix, while the original Jacobian splitting
method is adopted on the other cell faces to reduce computational complexity and ensure the diagonally dominant property of the
implicit system matrix. To investigate the convergence performance of the improved LU-SGS scheme, 2D and 3D turbulent
flows around the NACA0012 airfoil, RAE2822 airfoil and LANN wing are simulated on hybrid unstructured meshes. The nu-
merical results show that the improved LU-SGS scheme is significantly more efficient than the original LU-SGS scheme.
Keywords: LU-SGS scheme; hybrid unstructured mesh; Navier-Stokes equations; flux Jacobian; convergence performance; tur-
bulent flow
used to determine Wim,L and Wim,R. The D(Wim,L, Wim,R) where, in unstructured grid case, M is a large, sparse
in Eq. (3) represents the dissipation term added to pre- and generally non-symmetric matrix, which consists of
vent non-physical oscillations, and the iconic features DOF×DOF blocks. Here DOF represents the degrees
of different convective flux evaluation schemes mainly of freedom, which equals the total number of control
lie in the specific forms of this term. The evaluation of volumes in finite volume method.
the viscous flux term Fv is relatively simple. The vis- The LU-SGS scheme is based on the LDU factoriza-
cous flux is usually discretized by the standard central tion of the implicit system matrix, which is given by
scheme because of the elliptic nature of this term. This Ω
means that the flow quantities and their first deriva- ( I + M ) = L+ D +U = ( L+ D ) D −1 (U + D) − LD −1U
Δt
tives on the faces of the control volume, which exist in
the formula of viscous flux term, can be obtained by (9)
the simple average, and then be used to calculate Fv where L only consists of block terms in the strictly
straightforwardly. lower triangular matrix, U only consists of block terms
To simplify the expression, the residual of Eq. (2) is in the strictly upper triangular matrix and D is a block
denoted here by diagonal matrix.
Suppose that the LD−1U is a small term and its con-
Ri (W ) = ∑ ( (Fc )im − (Fv )im ) Sim + Ωi Qsource (4)
tribution to the implicit system matrix can be neglected,
m∈N ( i )
then Eq. (8) can be replaced by the following equation:
When fixed-volume computational grid is used,
Eq. (2) becomes ( L + D) D −1 (U + D)ΔW n = − Rn (10)
dWi Equation (10) can be inverted in the following for-
Ωi = − Ri (W ) (5) ward and backward sweep procedure:
dt
⎪⎧ DΔW = − R − LΔW
* n *
⎪ Mi, j =
∂R i (W n ) ⎩⎪ ⎝ ∂Wj
⎪ ∂W jn where ∂ (Ωi Qsource ) ∂Wi n only lies in the turbulence
⎪
⎪ ⎡ M1,1 M1,2 L M1,DOF ⎤ model equations and its expression is the derivative of
⎨ ⎢ M
⎪ ⎢ 2,1 M 2,2 L M 2,DOF ⎥⎥ the source term in terms of the conservative variables.
⎪ M = The viscous flux Jacobians ( Av+ )ij = ∂(Fv )ij ∂Wi n and
⎢ M M M ⎥
⎪ ⎢ ⎥
⎪⎩ ⎢⎣ M DOF,1 M DOF,2 L M DOF,DOF ⎥⎦ ( Av− )ij = ∂(Fv )ij ∂W j n in Eq. (12) have complex ex-
and pressions even if the thin shear layer approximation is
used. In order to reduce the computational cost, the
ΔW n = W n +1 − W n viscous flux Jacobian can be approximately counted by
n+1
Ri(W ) in Eq. (6) is substituted by the linearization using its spectral radius:
term in Eq. (7), then the following implicit system is ( Av±)ij ≈ ±(λ v)ij I =
obtained:
Ω Sij ⎡ ⎛ 4 γ ⎞ ⎛ μL μ ⎞ ⎤
( I + M )ΔW n = − R(W n ) (8) ± ⎢ max ⎜ , ⎟⎜ + T ⎟ ⎥I (13)
Δt rij ⎢ ⎜ 3ρij ρij ⎟ ⎝ PrL PrT ⎠ ⎥
⎣ ⎝ ⎠ ij ⎦
· 36 · WANG Gang et al. / Chinese Journal of Aeronautics 25(2012) 33-41 No.1
where γ, μ and Pr denote the ratio of specific heat coef- lute values of the above eigenvalues represent the pro-
ficient, dynamic viscosity coefficient and Prandtl num- pagating speed of corresponding waves. It is clear that
ber respectively, and rij is the distance between grid ( Ac+ )ij and ( Ac− )ij given by Eq. (14) can reflect the pro-
cell i and cell j. In original LU-SGS approach, the con-
pagating directions of perturbations correctly. But they
vective flux Jacobians ∂(Fc )ij ∂Wi n and ∂(Fc )ij ∂W j n cannot represent the propagation velocities of pertur-
are not defined with the exact derivatives based on bations accurately, which leads to considerable distor-
Eq. (3), but approximately evaluated by the splitting of tions in evaluating the variations of fluxes caused by
convective flux Jacobian with its maximal eigenvalue the fluctuations of flow states. If the values of
⎧ ∂ ( Fc )ij (|Vij・nij | + cij ) |Vij・nij | and (|Vij・nij | + cij ) (|Vij・nij | −cij )
1
⎪ n
≈ ( Ac+ )ij = ( A + λ max I ) and are very large, serious distortions on the variations
⎪ ∂ W i 2
⎨ (14) of fluxes may be introduced by using Eq. (14). And
⎪ ∂ ( Fc )ij ≈ ( A− ) = 1 ( A − λ I ) such kind of cases occurs not only in the low Mach
⎪ ∂W n c ij
2
max
flow simulations but also commonly lies in the high
⎩ j
Reynolds number flow computations. The reason is
where A = ( ∂ Fc ∂W )ij , and λmax is the maximal ei- that the absolute value of Vij・nij could be very small on
the cell faces which are parallel to the wall boundary
genvalue of A, namely λmax = Vij・nij + cij . Here, the faces or near the bottom region of boundary layer.
subscript ij denotes the face which is shared by grid Hence, too many distortions may be introduced into
cell i and cell j, c is the sonic speed, V is the velocity. implicit system matrix with the above construction
The ( Ac+ )ij in Eq. (14) only has non-negative eigenval- method based on the maximal eigenvalue splitting, and
these distortions are harmful for the convergence be-
ue, and ( Ac− )ij non-positive eigenvalue. According to the havior of non-linear time marching iteration.
theories on propagation of perturbation, ( Ac+ )ij reflects In order to construct more accurate implicit system
the variations of fluxes caused by the increment of flow matrix, ∂( Fc )ij ∂Wi n and ∂(Fc )ij ∂W j n need to be
variables on grid cell i, and ( Ac− )ij the variations of evaluated by using the derivation of Eq. (3), namely
fluxes received from the fluctuation of flow variables on ⎧ ∂ ( Fc )ij 1 ∂Fc (Wij ,L ) ∂D(Wij ,L ,Wij ,R )
⎪ = ・ + ≈
neighboring grid cell j. Hence, there has certain rational- ⎪ ∂Wi
n
2 ∂Wi n ∂Wi n
ity for using ( Ac+)ij and ( Ac− )ij instead of ∂( Fc )ij ∂Wi n ⎪ 1 ∂Fc (Wij ,L ) ∂D (Wij ,L , Wij ,R )
⎪ ・ +
and ∂(Fc )ij ∂W j n . ⎪⎪ 2 ∂Wij ,L ∂Wij ,L
In fact, there are two kinds of iterative processes ⎨ (16)
contained in the implicit time marching scheme of this ⎪ ∂ ( Fc )ij = 1・∂Fc (Wij ,R ) + ∂D(Wij ,L , Wij ,R ) ≈
⎪ ∂W n 2 ∂W j n ∂W j n
paper. One is the non-linear time marching iteration for ⎪ j
solving Eq. (6) with backward Euler method, the other ⎪ 1 ∂Fc (Wij ,R ) ∂D (Wij ,L , Wij ,R )
one is an inner iteration on each time level for solving ⎪ ・ +
⎪⎩ 2 ∂Wij ,R ∂Wij ,R
the linear equation system Eq. (8) with LU-SGS
method. In constructing an efficient implicit scheme, where ∂D(Wij, L,Wij, R) /∂Wij, L and ∂D(Wij, L,Wij, R) /∂Wij, R
convergence and stability performances of the above are defined according to the spatial discretization
two iteration processes need to be considered on bal- scheme. For example, if Roe scheme is used, and let
ance. A Roe represent Roe average Jacobian matrix, then we
The construction of implicit system matrix with Eq.
get
(14) is favorable to the inner iteration. In detail, two
distinctive benefits can be identified. Firstly, the block ⎧ ∂D(Wij ,L , Wij ,R ) 1
diagonal matrix D in Eq. (10) degenerates into a pure ⎪ = ARoe
⎪ ∂Wij ,L 2 ij
diagonal matrix, which can be evaluated and inversed ⎨ (17)
with very little CPU time. Secondly, it ensures the dia- ⎪ ∂D(Wij ,L , Wij ,R ) = − 1 A
gonal dominance of implicit system matrix for very ⎪ ∂Wij ,R 2
Roe ij
⎩
large time stepping. This benefit not only guarantees
the convergence of LU-SGS iteration, but also makes In fact, the construction of the implicit system ma-
the neglected term LD−1U really negligible compared trix with the above accurate convective flux Jacobian
to the entire matrix system. not only brings a notable increase in computational
However, we all know that convective flux Jacobian cost, but also causes the implicit system matrix M to
matrix A has five eigenvalues: lose its property of diagonal dominance as the time
λ1 = λ 2 = λ3 =Vij・nij , λ 4 =Vij・nij -c, λ 5 =Vij・nij +c (15) step size Δt takes a great value. It is well known that
the convergence of the LU-SGS inner iteration cannot
Their signs express the propagating directions of be guaranteed if the system matrix is not diagonally
waves relative to the surface normal Sim, and the abso- dominant. In order to achieve diagonal dominance, a
No.1 WANG Gang et al. / Chinese Journal of Aeronautics 25(2012) 33-41 · 37 ·
relatively small Δt should be used in the time marching flow simulation, the improved LU-SGS will be identi-
process. In such case, the evolution speed of flow field fied with the block LU-SGS method. The optimal values
is considerably decreased and more time marching of ε1 and ε2 are hard to be known beforehand because
steps are required to achieve the convergence state. they vary with different flow conditions and meshes.
Another way for overcoming the limitation of diagonal According to the practical experience, ε1 = ε2 ≈ 0.25 is
dominance is giving up the LU-SGS scheme but using recommended in high Reynolds number flow simula-
some other inner iteration algorithms which have better tions.
stability, for example the GMRES scheme. The cost of Finally, it should be pointed out that the existence of
this choice is that the computational complexity will be viscous flux Jacobian, which is computed by using
increased significantly. Eq. (13), offers favorable term for the diagonal domi-
nance of implicit system matrix. This means that the side
2.4. Improved LU-SGS scheme effects of the usage of Eq. (18) on the stability of
LU-SGS inner iteration can be eliminated partly in the
In the above analysis, we find out that the approxi-
boundary layer region of the high Reynolds number flow.
mate convective flux Jacobian based on maximal ei-
Hence, the maximum allowable time step size will not be
genvalue splitting is helpful for the inner linear itera-
decreased obviously if the above improved LU-SGS
tion while the accurate convective flux Jacobian based
scheme is used.
on exact derivation is beneficial to the outer non-linear
time stepping. If we can make the advantages of them
complement each other, then the convergence speed of 3. Test Cases and Results
LU-SGS implicit time marching scheme will be im-
proved further. For this purpose, a hybrid method for Several typical test cases are selected here to illus-
constructing the implicit system matrix is proposed and trate the practicality and computational efficiency of
described in the following way. the improved LU-SGS implicit time marching scheme.
For the cell faces on which the contravariant speed These test cases include the low Mach number and
supersonic viscous flows around NACA0012 airfoil,
has small value or close to sound speed, ∂( Fc )ij ∂Wi n transonic viscous flow around RAE2822 airfoil and 3D
and ∂(Fc )ij ∂W j n are computed by using Eq. (16) so transonic flow around LANN wing. The S-A turbu-
lence model, second order Roe scheme and the local
that the excessive approximations on implicit matrix
time stepping [12] are chosen as common settings for all
could be avoided; on the other cell faces, the classical
computational cases. The convergence performances of
maximal eigenvalue splitting method, namely Eq. (14)
original LU-SGS and improved LU-SGS are analyzed
is used to evaluate the above convective Jacobians with
and compared in each test case.
the aim of reducing computational cost and strength-
ening the diagonal dominance of implicit system ma-
trix. According to this principle, the hybrid method 3.1. NACA0012 airfoil
for determining ∂( Fc )ij ∂Wi n and ∂ ( Fc )ij ∂W j n can be As shown in Fig. 1, a hybrid unstructured mesh for
expressed as viscous flow simulation on NACA0012 airfoil is gen-
If Vij・nij cij ≤ ε1 or 1 - ε 2 ≤ Vij・nij cij ≤ 1+ε 2 , erated with the method described in Ref. [16]. It con-
sists of 200 wall boundary nodes, 7 991 computational
⎧ ∂ ( Fc )ij 1 ∂Fc (Wij ,L ) ∂D(Wij ,L , Wij ,R ) field nodes and 11 552 mesh cells. Near the boundary
⎪ ≈ ・ + layer region, 22 layers of quadrangle mesh are distrib-
⎪ ∂Wi
n
2 ∂Wij ,L ∂Wij ,L
(18) uted and the spacing of the first layer normal to the
⎨
⎪ ∂ ( Fc )ij ≈ 1・∂Fc (Wij ,R ) + ∂D(Wij ,L , Wij ,R ) wall is chosen as 3.0×10−6 times of chord length.
⎪ ∂W n 2 ∂Wij ,R ∂Wij ,R Firstly, the incompressible viscous flow is computed
⎩ j
with an angle of attack α = 5º, a Mach number Ma =
otherwise, 0.15 and a Reynolds number Re = 3×106. The same
⎧ ∂ ( Fc )ij 1 Courant-Friedrichs-Lewy (CFL) number 30 is used for
⎪ n
≈ ( A + λ max I ) the computations with the improved and the original
⎪ ∂Wi 2 LU-SGS schemes. The convergence histories of the
⎨ (19)
⎪ ∂ ( Fc )ij ≈ 1 ( A − λ I )
maximal residual in terms of the time iteration numbers
⎪ ∂W n max are shown and compared in Fig. 2. As one can see, the
⎩ j 2
convergence speed of the maximal residual computed
In Eq. (18), ε1 and ε2 represent non-negative empirical by the original LU-SGS has slowed down quite mark-
coefficients. The smaller values are given to ε1 and ε2, edly after it decreased five orders. While in the com-
the more characteristics of the original LU-SGS will be putation with the improved LU-SGS, the convergence
assigned to the improved LU-SGS method. On the other rate is constant. In practice, we are more concerned
hand, the larger values are given to ε1 and ε2, the more about the convergence speeds of aerodynamic coeffi-
characteristics of the block LU-SGS method [4, 6] will be cients. In Fig. 3 the convergence curves of life coeffi-
represented. For example, if ε1 = 1 is used in subsonic cient are plotted. The dash-dotted lines in this
· 38 · WANG Gang et al. / Chinese Journal of Aeronautics 25(2012) 33-41 No.1
the improved LU-SGS with ε1 = ε2 = 0.25 can be both results agree very well with the experimental data.
given as 50. The convergence histories in Fig. 5 show
that due to the stability limitation on CFL number, the
improved LU-SGS scheme based on pure accurate con-
vective flux Jacobians even has less efficiency compared
with the same scheme based on the hybrid definition of
convective flux Jacobians, its convergence rate is 1.5
times faster than that of the original LU-SGS based on
the maximal eigenvalue flux Jacobian splitting.