Session 10516 WebSphere Application Server Z - OS L2 Update
Session 10516 WebSphere Application Server Z - OS L2 Update
Session 10516 WebSphere Application Server Z - OS L2 Update
• MQ APAR IZ94777
• WebSphere App Server looping using high CPU
• MQ connectivity lost
• high GC in CR
• Issue seen at MQ-JMS 7.0.1.4 level (zWAS 7.0.0.17)
• Fixed in MQ-JMS 7.0.1.6 level (zWAS 7.0.0.21)
• Several error symptoms listed in IZ94777
• How to tell what level of MQ-JMS is running:
• BBOO0222I: WMSG1611I: The installed level of the WebSphere MQ
messaging provider is 7.0.1.5.
• Techdoc from MQ:
• http://www.ibm.com/support/docview.wss?uid=swg21248089
4
DB2 APAR PM56361 can cause
WebSphere native out of storage
5
WLM APAR OA38367
WAS SERVERS NOT PROCESSING
TRANSACTIONS AFTER POLICY ACTIVATION
are classified
• New WLM policy installed and activated
• Work in the new service class workload times out
• Service Class is bound to a Servant Region
• New Service Class is not getting bound to a Servant
Region
6
LE APAR PM38867 - DB2 04E ABEND, SIGABND
SIGNAL NOT RAISED BY LE
• DB2 shows:
+DSNX908I DSNX9TIM PROCEDURE OR FUNCTION xxxxx WITH LOAD MODULE
xxxxx EXCEEDED CPU RESOURCE LIMIT SSN= xxxx PROC=DB2xxxxx
ASID=nnn WLM_ENV=DB2xxxxx
• WebSphere SR ABENDS
03.12.14 STC25715 BPXP018I THREAD 21FF9E0000000046, IN PROCESS 66477,
ENDED WITHOUT BEING UNDUBBED WITH COMPLETION CODE 0404E000,
AND REASON CODE 00E50013
7
zFS APAR OA37950 can cause Poor
performance / High GCP usage in WebSphere
8
zFS APAR OA37950 can cause Poor
performance / High GCP usage in WebSphere
• Symptoms:
• WebSphere App Server restart times increase
• WebSphere App Servers show higher zAAP_On_GCP usage in
RMF Mon III
• ZFS shell command 'zfsadm query -vnodecache' reports higher #
of Vnodes usage than the actual configured size (66k vs. 5k)
• zFS shell command ‘zfsadm query –usercache’ reports ‘zero’ or
very small number of allocated segments in the end section
Dataspace Allocated Free
Name Segments Pages
-------- ---------- ----------
ZFSUCD00 0 4000
ZFSUCD01 1 3999
9
Problem Prevention tips from zFS L2
10
Inside (WebSphere) factors
PE APAR PM58377
12
PE APAR PM58377
• Local Fix #1
• rebuild console-defs.idx using iscdeploy.sh -restore
• If this is a base server, then stop the application server and perform the
following:
1. cd /<WAS_HOME>/AppServer/profiles/default/bin
2. ./iscdeploy.sh -restore
3. Copy the output to a text file.
4. Restart application server
• If this is a ND environment, then stop the deployment manager
and perform the following:
1. cd
/<WAS_HOME>/DeploymentManager/profiles/default/bin
2. ./iscdeploy.sh -restore
3.Copy the output to a text file.
4.Restart the deployment manager.
13
PE APAR PM58377
• Local Fix #2
• relink console-defs.idx in the config root and install root.
Note: these commands should be entered on one line
1. rm <config_root>/systemApps/isclite.ear/isclite.war/WEB-INF/console-
defs.idx
2. ln –s <install_root>/systemApps/isclite.ear/isclite.war/WEB-INF/console-
defs.idx <config_root>/systemApps/isclite.ear/isclite.war/WEB-
INF/console-defs.idx
14
PM58366 – WebSphere V8 Server Startup
may hang in shell utility (z/OS only)
Workaround 1:
It is safe to use /bin/kill -9 against the PID (process ID) for
the /bin/chmod utility. This will terminate the chmod command
without causing harm to the processing being performed by
applyPTF.sh. The server will then complete its startup.
Workaround 2:
Examine the number of files in directory
<WAS_HOME>/profiles/default/properties/service/productDir/PreConfigActions/logs
Each server startup will leave a file in this directory of the form
postinstallerConfigActions#############.log.
16
Loop during migration job BBOWMG3B during
PREUPGRD step
• interpreted as a ‘/’ and an infinite loop gets triggered when searching for
‘config’ since ‘c:’ analyzed first and finds the intended file
• Work around by renaming or deleting any directories with a name like a:, b:, c:
etc that may reside under the profile home directory.
17
WebSphere Application Server creates
files with 660 permission
• FixPack 7.0.0.17
• Temp files generated by WebSphere applications may no longer be
readable by other applications
18
WebSphere Application Server creates
files with 660 permission
• http://www.ibm.com/support/docview.wss?uid=swg21572240&acss=wasz121511
19
Idle Server using CPU ??
• Application Server
• Node Agent
20
Idle Server using CPU ??
• Node Agent
• Automatic file Synchronization
• Node Agent and Deployment Manager
• Security NEEDS sync
• Propagate updated Certs / LTPA Changes
• Application Deployment
• you will have to remember to sync manually
• High Availability Manager
• HA runs in every App/Proxy Server, Node Agent and Deployment
Manager in a cell
• cells can be divided into several high avail domains aka. core groups
• Disable IF you can.. some WAS services/features use HA Manager
as well as some stack products
21
Idle Server using CPU ??
• Application Server
• Start components as needed
• Dynamic cache service background processing
• EJB cache and pool background processing
• Application
• Class Loading and Update Detection
• JSP (Java Server Pages) Reloading
• good for development, production apps should be stable
22
Idle Server using CPU ??
Servlet class Deployment.xml ibm-web.xmi Load new servlet class
Reload enabled reloadingEnabled
reloading interval >0
false false no
Cause:
WebSphere V6.1 default enabled Trustmanager is IbmX509
WebSphere V7.0 default enabled TrustManager is IbmPKIX
For some customers....
Revocation checking was enabled in V6.1 for the IbmPKIX TrustManager,
but not enforced since the IbmX509 TrustManager was in use.
Solution:
•Disable Revocation checking if not needed (most common solution)
•Diagnose why the revocation status could not be determined (multiple reasons)
http://www14.software.ibm.com/webapp/wsbroker/redirect?version=…
24 …compass&product=was-nd-zos&topic=csec_sslx509certtrustdecisions
Migration - IbmPKIX Trustmanager
Revocation Checking
Prior to migrating to WebSphere V7.0,
if revocation checking is enabled, disable if not needed
26
Repeat (from last SHARE) factors
Versions, Dates, and Service Levels...
• http://www.ibm.com/support/docview.wss?uid=swg21570083
• http://www.ibm.com/software/support/lifecycle/index_a_z.html
28
Transaction Partner Logs
• When they have entries in them, can cause problems
• At server startup, checks to see if there are any to recover
• If so it will try to recover them, and KEEP TRYING until it can
• has been found to cause high cpu (how many are out there)
• BBOT0009I: TRANSACTION SERVICE RESTART UR STATUS COUNTS
FOR W6SR02A: IN-BACKOUT=0, IN-DOUBT=0, IN-COMMIT=0
• If there are entries, you will have to resolve them with RRS
• STOP the WebSphere App Server
• delete UR’s associated with this server
• delete partner logs(log1 and log2)
<WAS_HOME>/profiles/default/tranlog/cellname/clustername/servername/transaction/partnerlog/
• Start the WebSphere App Server
may see message about ‘epoch mismatch’ now that RRS and WAS logs out of sync
30
JESSPOOL management
• http://www.ibm.com/support/techdocs/atsmastr.nsf/WebIndex/TD103695
• Techdoc describes how to implement these facilities
• Includes a sample python script to update the WebSphere variables.
31
QUESTIONS ??