Isilon Troubleshooting Guide File System Locking
Isilon Troubleshooting Guide File System Locking
Isilon Troubleshooting Guide File System Locking
This troubleshooting guide applies to OneFS 7.2 - 8.0 Revised December 5, 2016
IMPORTANT!
If you arrived at this
Start guide from a Protocols
guide, consult a coach
or SME.
No
Is the cluster
unresponsive to Yes Go to 2A
isi commands?
No
No
No
2A
Refer to:
OneFS: Node expectedly reboots and/or any
of the following errors are seen in messages:
Did the command "Double failure detected for txn_p" "txn
return anything Yes (X:xxxxxxxxxx) is not resolved" "error =
other than zero? 98dexitcode: XXXX: EJDEADLOCK",
467837
No
Refer to:
Isilon OneFS: Nodes that have run for more than 248.5
consecutive days may restart without warning which may lead to
______
potential data unavailability, 462835 Caution:
After initiating a Code Red Engagement, per the
and previous KB, do not make changes to the cluster until
you get a response to your escalation.
UPDATE: ETA 202452: Isilon OneFS: Nodes that have run for
______
497 consecutive days may restart without warning, 301837 Continue through this guide, checking for known
issues and gathering as much information as you can.
Go to 3A
Note CAUTION!
Two common symptoms of deadlocks: If isi commands are not responding when you receive
isi commands are unresponsive the case, do not run any additional isi commands as
Clients cannot access the cluster you try to troubleshoot.
Go to:
Is more than one OneFS: How to recover
from a cluster-wide Did documentation
node in the cluster Yes Yes End
deadlock?, 303990 solve the problem?
unresponsive?
No
No
Do hangdumps appear in
the /var/log/messages Yes Go to 4A
log?
No
4A
Verify that a shared merge lock is failing by running the following command:
5A
nilp
Go to 6A
6A