VMware Vsphere: Troubleshooting
VMware Vsphere: Troubleshooting
VMware Vsphere: Troubleshooting
Lab Manual
ESXi 7 and vCenter Server 7
Copyright © 2021 VMware, Inc. All rights reserved. This manual and its accompanying
materials are protected by U.S. and international copyright and intellectual property laws.
VMware products are covered by one or more patents listed at
http://www.vmware.com/go/patents. VMware is a registered trademark or trademark of
VMware, Inc. in the United States and/or other jurisdictions. All other marks and names
mentioned herein may be trademarks of their respective companies. VMware vSphere®
vMotion®, VMware vSphere® High Availability, VMware vSphere® ESXi™ Shell, VMware
vSphere® Client™, VMware vSphere®, VMware vSAN™, VMware vRealize® Log Insight™ for
vCenter™, VMware vRealize® Log Insight™, VMware vRealize®, VMware vCloud Director®,
VMware vCloud Director® for Service Providers, VMware vCloud®, VMware vCenter®
Server Appliance™, VMware vCenter Server®, VMware View®, VMware Horizon® View™,
VMware Verify™, VMware Horizon® 7, VMware Horizon® 7, VMware Horizon® 7 on VMware
Cloud™ on AWS, VMware vSphere® Storage I/O Control, VMware PowerCLI™, Project
Photon OS™, VMware Photon™, VMware NSX®, VMware vCenter® Log Insight™, VMware
Go™, VMware ESXi™ and VMware ESX® are registered trademarks or trademarks of
VMware, Inc. in the United States and/or other jurisdictions.]
The training material is provided “as is,” and all express or implied conditions, representations,
and warranties, including any implied warranty of merchantability, fitness for a particular
purpose or noninfringement, are disclaimed, even if VMware, Inc., has been advised of the
possibility of such claims. This material is designed to be used for reference purposes in
conjunction with a training course.
The training material is not a standalone training tool. Use of the training material for self-
study without class attendance is not recommended. These materials and the computer
programs to which it relates are the property of, and embody trade secrets and confidential
information proprietary to, VMware, Inc., and may not be reproduced, copied, disclosed,
transferred, adapted or modified without the express written approval of VMware, Inc.
www.vmware.com/education
Typographical Conventions
• <ESXi_host_name>
www.vmware.com/education
www.vmware.com/education
Contents
v
Lab 5 Monitoring NIC Teaming During Failover ................................................................ 19
Task 1: Verify the Distributed Switch Configuration .......................................................................................... 20
Task 2: Verify Network Operation on the ESXi Host ........................................................................................ 21
Task 3: Monitor the ESXi Host When the Active Link Goes Down ............................................................. 21
Task 4: Monitor the ESXi Host When the Standby Link Goes Down ........................................................ 22
Task 5: Reconfigure the Port Group pg-SA-Production-01 ........................................................................... 23
Lab 6 Monitoring and Recovering Distributed Switches ................................................25
Task 1: Display Distributed Switch Information.................................................................................................... 26
Task 2: Disable the Network Rollback Option ......................................................................................................27
Task 3: Recover from a Distributed Switch Failure ............................................................................................27
Task 4: Enable the Network Rollback Option...................................................................................................... 30
Task 5: Migrate Management Network.................................................................................................................. 30
Lab 7 Applying the Troubleshooting Methodology ......................................................... 31
Task 1: Run a Break Script ............................................................................................................................................ 31
Task 2: Narrow the Scope of the Problem to a VM.......................................................................................... 32
Task 3: Narrow the Scope of the Problem to the ESXi Host ........................................................................ 33
Task 4: Resolve the Problem ..................................................................................................................................... 34
Task 5: Verify the Solution .......................................................................................................................................... 35
Lab 8 Troubleshooting Network Problems.........................................................................37
Task 1: Run a Break Script ........................................................................................................................................... 38
Task 2: Verify That the System Is Not Functioning Properly ........................................................................ 39
Task 3: Troubleshoot and Resolve the Problem ................................................................................................ 40
Task 4: Verify the Solution........................................................................................................................................... 41
Lab 9 Investigating Disk Issues on ESXi .............................................................................. 43
Task 1: Run a Break Script ........................................................................................................................................... 43
Task 2: Create a Virtual Machine .............................................................................................................................. 44
Task 3: Troubleshoot the Problem .......................................................................................................................... 45
Task 4: Resolve the Problem ..................................................................................................................................... 45
Task 5: Verify the Solution .......................................................................................................................................... 46
Lab 10 Troubleshooting Storage Performance Issues .................................................. 47
Task 1: Generate VM Disk Activity........................................................................................................................... 48
Task 2: Start esxtop Utility and Review Disk Statistics.................................................................................... 48
Task 3: Monitor Performance by Storage Adapter .......................................................................................... 49
vi
Task 4: Monitor Performance by Storage Device ............................................................................................. 50
Task 5: Monitor Storage Performance by VM ..................................................................................................... 51
Lab 11 Troubleshooting VM Power-On Problems............................................................ 53
Task 1: Create and Power On the VM .................................................................................................................... 54
Task 2: Troubleshoot Problems or Errors ............................................................................................................. 55
Task 3: Resolve the Problem ..................................................................................................................................... 56
Task 4: Verify the Solution.......................................................................................................................................... 56
Lab 12 Troubleshooting VM Snapshot Problems ..............................................................57
Task 1: Power On the VM............................................................................................................................................ 58
Task 2: Troubleshoot Problems or Errors ............................................................................................................. 58
Task 3: Resolve the Problem ..................................................................................................................................... 59
Task 4: Verify the Solution.......................................................................................................................................... 60
Lab 13 Working with VM Snapshots Using the Command Line .................................. 61
Task 1: Power On a VM ................................................................................................................................................ 62
Task 2: Create Snapshots and Monitor Their Creation .................................................................................... 62
Task 3: Monitor Snapshot Deletion.......................................................................................................................... 62
Lab 14 Troubleshooting Storage Problems ....................................................................... 63
Task 1: Run a Break Script ........................................................................................................................................... 63
Task 2: Verify That the System Is Not Functioning Properly ........................................................................ 65
Task 3: Troubleshoot and Resolve the Problem ................................................................................................ 66
Task 4: Verify the Solution.......................................................................................................................................... 66
Lab 15 Troubleshooting Cluster Problems.......................................................................... 67
Task 1: Create a Cluster and Power Off VMs ...................................................................................................... 67
Task 2: Run the Break Script Break-8-1.ps1 .......................................................................................................... 68
Task 3: Run a Break Script .......................................................................................................................................... 68
Task 4: Verify That the System Is Not Functioning Properly ........................................................................ 70
Task 5: Troubleshoot and Resolve the Problem ................................................................................................ 70
Task 6: Verify the Solution ............................................................................................................................................71
Lab 16 Resolving VM Power-On Problems .........................................................................73
Task 1: Run a Break Script ........................................................................................................................................... 74
Task 2: Troubleshoot the Problem ...........................................................................................................................75
Task 3: Resolve the Problem ......................................................................................................................................75
Task 4: Verify the Solution.......................................................................................................................................... 76
vii
Lab 17 Troubleshooting VM Problems .................................................................................. 77
Task 1: Run a Break Script ........................................................................................................................................... 78
Task 2: Verify That the System Is Not Functioning Properly ........................................................................ 79
Task 3: Troubleshoot and Resolve the Problem ................................................................................................ 80
Task 4: Verify the Solution.......................................................................................................................................... 80
Lab 18 Restarting ESXi Management Agents .................................................................... 81
Task 1: Restart Management Agents Using the DCUI ....................................................................................... 81
Task 2: Restart Management Agents from the Command Line ................................................................... 82
Lab 19 Troubleshooting ESXi Host Disconnection Problems...................................... 83
Task 1: Run a Break Script ........................................................................................................................................... 83
Task 2: Troubleshoot the Problem .......................................................................................................................... 84
Task 3: Resolve the Problem ..................................................................................................................................... 84
Task 4: Verify the Solution.......................................................................................................................................... 84
Lab 20 Troubleshooting vCenter Server Connection Problems ............................... 85
Task 1: Run a Break Script ........................................................................................................................................... 86
Task 2: Troubleshoot the Problem .......................................................................................................................... 86
Task 3: Resolve the Problem ..................................................................................................................................... 87
Task 4: Verify the Solution.......................................................................................................................................... 87
Lab 21 Troubleshooting vCenter Server and ESXi Host Problems .......................... 89
Task 1: Run a Break Script ........................................................................................................................................... 90
Task 2: Verify That the System Is Not Functioning Properly ......................................................................... 91
Task 3: Troubleshoot and Resolve the Problem ................................................................................................. 91
Task 4: Verify the Solution.......................................................................................................................................... 92
Lab 22 Appendix: Troubleshooting Network Communication Failures .................. 93
Task 1: Verify the IP Configuration ........................................................................................................................... 94
Task 2: Verify the VLAN Configuration ................................................................................................................. 94
Task 3: Verify the Speed, Duplex, or MTU Configuration ............................................................................... 94
Task 4: Verify the Uplink Configuration ................................................................................................................. 95
Task 5: Verify the Teaming Configuration ............................................................................................................ 95
Task 6: Verify the Network Link Status ................................................................................................................. 95
Task 7: Investigate a Host Failure ............................................................................................................................ 96
Task 8: Investigate a Network Failure .................................................................................................................... 96
Task 9: Investigate a Communications or Port Failure ..................................................................................... 97
viii
Lab 23 Appendix: Troubleshooting Storage Failures .................................................... 99
Task 1: Follow Storage Troubleshooting Procedures..................................................................................... 100
Task 2: Investigate a VM Disk Failure .................................................................................................................... 101
Task 3: Investigate an I/O Overload Problem .................................................................................................... 101
Task 4: Investigate an iSCSI Storage Failure ..................................................................................................... 102
Task 5: Investigate an NFS Storage Failure ....................................................................................................... 103
Task 6: Investigate a Fibre Channel Storage Connectivity Failure ............................................................ 104
Task 7: Investigate a FCoE Failure ........................................................................................................................ 104
Task 8: Troubleshoot a Path Failure ..................................................................................................................... 105
Task 9: Troubleshoot a Local Disk Failure .......................................................................................................... 106
Task 10: Troubleshoot a Storage Array Failure ................................................................................................ 106
Task 11: Troubleshoot a Storage Site Disaster ...................................................................................................107
Lab 24 Appendix: Troubleshooting Cluster Failures .................................................... 109
Task 1: Troubleshoot a vSphere vMotion Migration Failure .......................................................................... 110
Task 2: Investigate a Management Agent Problem ........................................................................................... 111
Task 3: Reset Migrate Enabled and Verify the Result ...................................................................................... 112
Task 4: Investigate an HA Configuration Problem ............................................................................................ 112
Task 5: Investigate an HA Resources Problem................................................................................................... 113
Task 6: Investigate Why DRS Never Migrates ...................................................................................................114
Task 7: Investigate Why DRS Rarely Migrates ...................................................................................................114
Task 8: Investigate DRS Erratic Behavior .............................................................................................................114
Lab 25 Appendix: Troubleshooting Virtual Machine Failures ...................................... 115
Task 1: Investigate a CID Problem ...........................................................................................................................116
Task 2: Investigate a Quiesced VM Problem .......................................................................................................116
Task 3: Investigate a General Snapshot Failure .................................................................................................. 117
Task 4: Investigate a Power-On Failure ................................................................................................................ 117
Task 5: Investigate a VM That Shows an Invalid or Orphaned State ......................................................... 118
Task 6: Investigate a VMware Tools Installation Failure .................................................................................. 118
Lab 26 Appendix: Troubleshooting ESXi Host and vCenter Server System
Failures.............................................................................................................................................. 119
Task 1: Investigate a Certificate Problem ............................................................................................................ 120
Task 2: Replace Self-Signed Certificate with CA-Generated Certificate ................................................ 120
Task 3: Restart the vCenter Server Service ...................................................................................................... 120
Task 4: Investigate a vCenter Server Database Free Space Problem ...................................................... 121
ix
Task 5: Investigate a vCenter Server PostgreSQL Problem ........................................................................ 121
Task 6: Investigate a Purple Diagnostic Screen................................................................................................. 122
Task 7: Investigate Why an ESXi Host Is Unresponsive................................................................................. 122
Answer Key ................................................................................................................................... 123
x
Lab 1 Using the Command Line
1. Verify that you are successfully logged into the student desktop.
NOTE
If not, log in to your student desktop by entering vclass\administrator as the user name and
VMware1! as the password.
1
Task 2: Validate the vSphere Licenses
You log in to the vCenter Server system and determine whether the vSphere licenses are valid. If
the licenses are expired, you add valid licenses to the vCenter Server system and ESXi hosts.
2. Select the vSphere Client (SA-VCSA-01) bookmark in the vSphere Site-A folder to connect
to vCenter Server Appliance at https://sa-vcsa-01.vclass.local/ui.
3. On the VMware vSphere Login page, enter the vCenter Server user name
[email protected] and password VMware1! and click Login.
4. Select Menu > Administration.
6. Click Assets.
9. If any license has expired, obtain new licenses from this link.
2
Task 3: Directly Access the DCUI of the ESXi Host
You directly access the ESXi host’s direct console user interface (DCUI).
3. Click in the console window, press F2, and log in to the host by entering root as the ESXi
host user name and VMware1! as the password.
4. Use the up and down arrow keys to view the menu selections.
6. If vSphere ESXi Shell is disabled, select Enable ESXi Shell and press Enter to activate it.
7. If SSH is disabled, select Enable SSH and press Enter to activate it.
9. Press Ctrl+Alt to release the insertion point from the ESXi console window.
10. Press Esc until you are logged out of the DCUI.
3
Task 4: Remotely Access the DCUI of the ESXi Host
You access the ESXi host’s DCUI from an MTPuTTY session.
a. If the connection does not automatically complete, log in manually by entering the ESXi
host user name root and password VMware1!
6. Press F2 to display the login screen and log in by entering the ESXi host user name root
and password VMware1!
a. Using the down arrow key, select Configure Management Network and press Enter.
b. Select IPv4 Configuration and view the IP configuration in the right pane.
8. Use the up and down arrow keys to view the other menu selections.
9. Press Esc until you are logged out of the DCUI and press Ctrl+C to exit the DCUI process.
4
Task 5: Use ESXCLI Commands to View Host Hardware Configuration
You use the CLI to view the hardware configuration of the vSphere environment.
1. If the SSH session closed, double-click the entry for the SA-ESXI-01 host in the MTPuTTY
utility.
a. Enter esxcli hardware clock get to view the time and date on the host.
b. Enter esxcli hardware cpu list | less to view the number of CPUs on
the host.
You must press the space bar to scroll through the output. When done, press q to exit
the less utility.
d. Enter esxcli hardware pci list and find VMkernel Name: vmnic7 to
identify the PCI address that it is listed under.
a. Enter esxcli storage vmfs extent list to view the number of VMFS
extents that are available to the host.
b. Enter esxcli storage core adapter list to view the SCSI host bus
adapters.
c. Enter esxcli storage core path stats get to view the SCSI path
statistics.
d. Enter esxcli storage filesystem list to view the boot partitions and the
datastores that are available to each host.
e. Enter esxcli storage nfs list to view the information about the NFS 3
datastores that are available on this host.
5
Task 7: Use ESXCLI Commands to View Virtual Switch Information
You use the CLI to view the virtual switch configuration of the vSphere environment.
a. Enter esxcli network ip dns server list to view the IP address of the
DNS server.
c. Enter esxcli network vswitch standard list to view that two standard
switches are available to the host.
d. Enter esxcli network vswitch dvs vmware list | more to view the
available distributed switches.
h. Enter esxcli network ip route ipv4 list to view the default gateway
address for the VMkernel interfaces on the host.
6
Lab 2 Using vim-cmd Commands
1. Get VM Information
3. Register a VM
4. Power On a VM
5. Unregister a VM
NOTE
For useful information related to this lab, see "VMware ESXi vim cmd Command: A Quick
Tutorial" at https://communities.vmware.com/docs/DOC-31025. Before starting the lab,
review this reference and then use the information, as needed, while performing the lab tasks.
7
Task 1: Get VM Information
You use vim-cmd commands to list information about the VMs that run on the sa-esxi-
03.vclass.local host. You also use vim-cmd commands to change the power state of a VM.
a. List information about the VMs that are registered on the ESXi host.
vim-cmd vmsvc/getallvms
b. Record the VMID for the Win-6 VM. __________
vim-cmd vmsvc/power
6. View the power state of Win-6.
7. Power on Win-6.
8
Task 2: Manage the ESXi Hosts
You use vim-cmd commands to place the sa-esxi-03.vclass.local host in maintenance mode,
take it out of maintenance mode, and view host configuration information.
vim-cmd hostsvc/maintenance_mode_enter
The operation times out because Win-6 is powered on, and the host does not belong to a
fully automated DRS cluster.
vim-cmd hostsvc/maintenance_mode_exit
Task 3: Register a VM
You use vim-cmd commands to register the Win-11 VM with the host.
vim-cmd vmsvc/getallvms
Win-11 should appear in the list.
a. In the Firefox bookmarks toolbar, click the vSphere Client (SA-VCSA-01) bookmark in
the vSphere Site-A folder.
9
Task 4: Power On a VM
You use vim-cmd commands to power on the Win-11 VM.
5. View the power state of Win-11 again and verify that this VM is powered on.
Task 5: Unregister a VM
You use vim-cmd commands to unregister the Win-11 VM from the host and the vCenter
Server system.
2. Unregister Win-11.
vim-cmd vmsvc/getallvms
Win-11 should not appear in the list.
Win-11 should be in an orphaned state. An orphaned VM is one that exists in the vCenter
Server database but is no longer present on the ESXi host.
a. If Win-11 is not in an orphaned state, refresh the vSphere Client to update the navigation
pane.
5. In the vSphere Client, remove Win-11 from the Hosts and Clusters inventory.
10
Lab 3 Using Standalone ESXCLI and
DCLI
2. Load the Digital Security Certificate from the vCenter Server System
3. Test the Digital Security Certificate from the vCenter Server System
3. If a PuTTY Security Alert dialog box appears, click Yes to accept and cache the server’s
host key.
11
Task 2: Load the Digital Security Certificate from the vCenter Server
System
You load the digital security certificate from the vCenter Server system into the Ubuntu VM for
use with ESXCLI commands.
With this digital security certificate, you can run commands on ESXi hosts without entering a
digital thumbprint for each ESXi host.
NOTE
1. To examine the CPU hardware on sa-esxi-01, enter the esxcli command from the
vSphere CLI VM.
esxcli -s sa-esxi-01.vclass.local hardware cpu list
2. Enter root for the user name.
This command fails. For security reasons, you are required to enter the thumbprint of the
target ESXi host. Instead of manually entering a long thumbprint, you will load the digital
certificate from the vCenter Server system.
3. Minimize the MTPuTTY utility but do not close it.
4. Return to the Firefox web browser, open a new tab, and go to https://sa-vcsa-
01.vclass.local.
5. Click Download trusted root CA certificates.
7. Open Windows File Explorer and go to the Downloads folder (select This PC > Downloads)
on the student desktop.
12
13. Click the WinSCP utility icon on the student desktop taskbar.
14. Select the Ubuntu-CLI site and click Login to open an SCP session to the Ubuntu-CLI VM.
15. If you see a security warning, click Yes to add the thumbprint to the cache.
Different operating systems use different folders and procedures to load the digital
certificates of certificate authority (CA) servers. The procedure used in this lab is required for
Ubuntu Linux servers. If you host vSphere CLI software on a different OS, you must look up
the required procedure and file location for that OS.
18. Select the sa-vcsa-01.crt certificate file in the left pane and click Upload.
Task 3: Test the Digital Security Certificate from the vCenter Server
System
You test the vCenter Server system's digital security certificate that you loaded into the Ubuntu
VM for use with ESXCLI commands.
1. Use the Ubuntu-CLI VM session in MTPuTTY and enter the command to change the
directory to where the certificate is stored.
cd /usr/local/share/ca-certificates/
You must either be in the same directory in which the certificate file is stored or use the full
path to the certificate file when you enter a command.
13
Task 4: (Optional) Add Credentials and Thumbprint for ESXCLI
Commands
You add the user name, password, and digital thumbprint of the sa-esxi-01 host into the Ubuntu
VM credential store for use with ESXCLI commands.
1. Return the MTPuTTY utility session to the Ubuntu-CLI VM and enter this command.
cd /root/vmware-vsphere-cli-distrib/apps/general
2. Try to display a list of the CPU hardware by entering this command.
The command fails, but it shows the thumbprint of the ESXi host.
4. Add the user name and password for the sa-esxi-01.vclass.local ESXi host to the local
credentials store.
When adding credentials to the credential store, you always add the user name and
password before you add the thumbprint.
1D:67:07:E9:58:FC:97:81:AC:17:8F:BF:0E:74:E9:8F:BD:61:27:D5
The thumbprint is case-sensitive and must match exactly.
14
6. Display a list of the CPU hardware.
You can use the following commands to manage the credentials store:
• ./credstore_admin.pl help
• ./credstore_admin.pl list
• ./credstore_admin.pl add
• ./credstore_admin.pl remove
• ./credstore_admin.pl clear
To remove a bad thumbprint, run this command:
1. Return the MTPuTTY utility session to the Ubuntu-CLI VM and enter the command to start a
DCLI interactive session to vCenter Server.
15
5. Enter y to save the credentials.
You can use the following commands to manage the credentials store:
• +credstore-list
• +credstore-add
• +credstore-remove
6. Enter exit to quit the DCLI.
16
Lab 4 ESXi Command History
NOTE
For information about vSphere ESXi Shell logins and commands, see VMware knowledge
base article 2004810 at https://kb.vmware.com/kb/2004810.
An administrator might run commands directly on an ESXi host that cause downtime or
disconnection. In the same session, you can use the up arrow key to find which commands were
previously run. However, if the session is closed or you log in as a different user, you must use a
different method to view the history of the commands that were previously run.
1. Use MTPuTTY to connect to sa-esxi-03.vclass.local.
2. Determine the most recent date and time that sa-esxi-03 was placed in maintenance mode
using the vim-cmd command.
a. Use /var/log/shell.log to determine the most recent date and time that sa-
esxi-03 was placed into maintenance mode using the vim-cmd command.
c. Record the date and time that the command was run. __________
3. Use /var/log/auth.log to determine the date and time that the user logged in and
the IP address from which the user logged in.
17
18
Lab 5 Monitoring NIC Teaming During
Failover
3. Monitor the ESXi Host When the Active Link Goes Down
4. Monitor the ESXi Host When the Standby Link Goes Down
NOTE
For useful information about the NIC teaming failover process, see the following references.
Review these references before you start the lab and use the information, as needed, while
performing the lab tasks.
Reference Link
19
Task 1: Verify the Distributed Switch Configuration
You verify that networking for the sa-esxi-01, sa-esxi-02, and sa-esxi-03 hosts is configured
correctly on the dvs-SA-Datacenter distributed switch.
2. In the vSphere Client, reset all the triggered alarms to return them to a normal state.
7. Click OK on the warning pop-up window to confirm that no active uplinks exist.
8. Add Uplink 5 as a standby uplink on pg-SA-Production-02.
NOTE
Ensure that you add the uplink to pg-SA-Production-02, and not pg-SA-Production-01.
9. Verify that the pg-SA-Production-02 distributed port group consists of two uplinks: Uplink 6
(active uplink) and Uplink 5 (standby uplink).
10. Verify that vmnic4 is assigned to Uplink 5 and vmnic5 is assigned to Uplink 6.
20
Task 2: Verify Network Operation on the ESXi Host
You verify that networking on sa-esxi-02.vclass.local is functioning properly by pinging the
gateway from the linux-a-07 VM.
1. Power on the linux-a-07 VM and open a web console from the vSphere Client.
Task 3: Monitor the ESXi Host When the Active Link Goes Down
You bring Uplink 6 (active link) down and monitor the behavior of the ESXi host sa-esxi-
02.vclass.local.
5. Return to the esxtop display and verify the uplink that the VM is using.
6. In the vSphere Client, check for messages related to vmnic5 being down on sa-esxi-
02.vclass.local.
21
7. View the log files on sa-esxi-02 for any entries related to vmnic5 being down.
Task 4: Monitor the ESXi Host When the Standby Link Goes Down
You bring Uplink 5 (standby link) down and monitor the behavior of the ESXi host sa-esxi-
02.vclass.local.
1. Using the vSphere Client, view the configuration on pg-SA-Production-02 to verify that
Uplink 5 is a standby uplink and Uplink 6 is an active uplink.
2. Enter the command to take down Uplink 5 (vmnic4) and monitor the behavior of sa-esxi-
02.vclass.local.
4. Return to the esxtop display and verify the uplinks that the VM is using.
Q1. Which uplink is now used by the VM?
A1. vmnic5, the active uplink
22
Task 5: Reconfigure the Port Group pg-SA-Production-01
Using best practices, you configure pg-SA-Production-01 to ensure network reliability if an
outage occurs.
3. Set Uplink 5 as the active uplink and Uplink 6 as the standby uplink.
23
24
Lab 6 Monitoring and Recovering
Distributed Switches
NOTE
For information about monitoring distributed switches and recovering from a distributed
switch failure, see the following references. Review these references before you start the lab
and use the information, as needed, while performing the lab tasks.
Reference Link
25
Task 1: Display Distributed Switch Information
You run the net-dvs command to display information about the dvs-SA-Datacenter distributed
switch configuration.
The command retrieves this information from the /etc/vmware/dvsdata.db binary file.
This file is maintained by the ESXi host and is updated at 5-minute intervals.
2. Display the output for the distributed switch configuration one page at a time.
net-dvs | less
3. Find information about the distributed switch.
The UUID is the long hexadecimal string that follows the word switch.
e. Verify that Cisco Discovery Protocol (CDP) is enabled for this switch.
CDP is enabled when CDP is set to listen, advertise, or advertise & listen.
26
Task 2: Disable the Network Rollback Option
In the vSphere Client, you disable the network rollback option. The network rollback feature
prevents the ESXi hosts from disconnecting from the management network.
By disabling this option, you force the ESXi host to disconnect from the management network.
2. In the Firefox bookmarks toolbar, select vSphere Client (SA-VCSA-01) from the vSphere
Site-A folder.
7. Click the filter icon next to the Name column to search for parameters with the word
rollback.
8. Change config.vpxd.network.rollback to false and click SAVE.
You recover connectivity to your ESXi hosts by creating a standard switch from the command
line.
The DCUI provides an option to create a standard switch, but this option is disruptive and can
cause you to lose much of your distributed switch configuration. Instead, you can manually create
a standard switch from the command line. By manually creating a standard switch, you can
control the vmnics and VMkernel interfaces that get migrated to the new standard switch.
Hint: Right-click dvs-SA-Datacenter in the inventory and select Add and Manage Hosts.
NOTE
27
3. Verify that you can no longer ping sa-esxi-02.
6. At the vSphere ESXi Shell login window, log in by entering root as the user name and
VMware1! as the password.
7. Enter the command to verify the status of the current distributed switch configuration.
28
e. Add the vmnic0 and vmnic1 uplinks to recoveryswitch.
services.sh restart
11. Verify that you can ping sa-esxi-02 again.
a. If sa-esxi-02 does not appear as Connected, right-click the host, select Connection, and
click Connect.
12. Log out of DCUI and select the STUDENT-A-01 console from the VM list to return to the
student desktop.
29
Task 4: Enable the Network Rollback Option
You enable the network rollback option.
1. In the vSphere Client, select sa-vcsa-01.vclass.local in the navigation pane and click the
Configure tab.
3. Click the filter icon next to the Name column to search for parameters with the word
rollback.
4. Change config.vpxd.network.rollback to true and click SAVE.
30
Lab 7 Applying the Troubleshooting
Methodology
5. To run the break script, enter .\Break-ts-method.ps1 and wait for it to finish.
31
6. In the vSphere Client, open a remote console on the linux-a-01 VM.
An end-user support request is filed: The linux-a-01 VM cannot ping its default gateway,
172.20.11.10.
NOTE
In the lab environment, having multiple VM consoles open at the same time might degrade
performance. Never open more than one VM console at a time in the lab. This problem does
not occur in production systems.
7. Log in to linux-a-01 by entering root as the user name and VMware1! as the password.
8. Verify that the linux-a-01 VM cannot ping the default gateway 172.20.11.10.
The problem is now defined. You continue using the troubleshooting methodology by
narrowing the scope of this problem to identify its cause.
3. In the vSphere Client, verify that the correct uplink (network) is connected to the VM.
4. In the vSphere Client, verify that the network link status is connected.
5. Power on a second VM (linux-a-02) to verify that the problem is not specific to linux-a-01.
6. Verify that the second VM is running on the same ESXi host as linux-a-01 and that it is
connected to the same network (pg-SA-Production-01 or pg-SA-Production-02).
Both the pg-SA-Production-01 and pg-SA-Production-02 network port groups are on the
same physical network.
32
7. Open a remote console on linux-a-02 and log in by entering root as the user name and
VMware1! as the password.
8. Repeat the same tests on linux-a-02.
1. In the vSphere Client, verify that a VMkernel adapter is not assigned on the ESXi host for the
Production network.
a. Select Hosts & Clusters > ESXi_host_name> Configure > VMkernel adapters.
2. Verify that the VLAN setting of any distributed switch is correct on vCenter Server.
a. In the vSphere Client, select Networking > dvs-SA-Datacenter > port_group_name >
Actions > Edit Settings > VLAN.
The VLAN setting should match the physical network VLAN setting.
If a VLAN is assigned where it should not be, or if the VLAN setting is incorrect,
communications do not work.
33
3. Verify that the speed and duplex setting of any ESXi host is correct on the vCenter Server
system.
a. In the vSphere Client, select Hosts and Clusters > ESXi_host_name > Configure >
Networking > Physical adapters to verify the setting.
4. In the vSphere Client, verify that the correct uplink (network) is connected to the virtual
switch.
a. Select Networking > dvs-SA-Datacenter > Configure > Topology and select the
individual port groups that you want to verify.
If a virtual switch is connected to the wrong uplink on any ESXi host, the distributed
switch does not work or behaves erratically. All standard and distributed switches must
be connected to the same uplinks, and the uplinks must be correct for the physical
hardware.
In this case, the pg-SA-Production-01 port group should be connected to Uplink 5
(vmnic4) on both sa-esxi-01 and sa-esxi-02. The pg-SA-Production-02 port group
should be connected to Uplink 6 (vmnic5) on both sa-esxi-01 and sa-esxi-02.
2. In the navigation pane, right-click the dvs-SA-Datacenter distributed switch and select Add
and Manage Hosts.
You select only the sa-esxi-01 host, which is the ESXi host with the configuration problem.
When troubleshooting, the best approach is to change only what needs to be changed to
resolve the problem. Otherwise, production systems that do not require changes might be
impacted.
6. Click Next.
34
7. Select the vmnic4 physical adapter and click Assign uplink.
2. Because this VM is configured with DHCP, enter the dhclient command to renew the IP
address configuration.
Because you powered on a second VM (linux-a-02) to troubleshoot the problem, you must
repeat these steps on this VM.
9. Because this VM is configured with DHCP, enter the dhclient command to renew the IP
address configuration.
10. Enter the ifconfig -a command in the linux-a-02 VM console to verify the IP
configuration.
11. Enter the route -n command to verify the default gateway address.
35
12. Enter the ping 172.20.11.10 command.
The ping should be successful.
If any warnings or alerts remain in the vSphere Client, you should clear them before
continuing to the next lab. Sometimes, a refresh of the vSphere Client clears stale warnings
or alerts.
36
Lab 8 Troubleshooting Network
Problems
37
Task 1: Run a Break Script
You run a break script to damage networking in the lab environment.
Several break scripts are provided to create network problems. Each script damages networking
in the lab environment in a different way. You can run the break scripts in any order. And you can
choose which problems to resolve.
IMPORTANT
The VMs that are impacted by each break script are listed in the Support Request table in
task 2. Before running a break script, verify that the impacted VM or VMs are powered on
with the guest operating systems online.
In the Difficulty column of the table, 1 signifies least difficult and 3 signifies most difficult to
resolve.
.\Break-6-1.ps1 1
.\Break-6-2.ps1 2
.\Break-6-4.ps1 2
.\Break-6-5.ps1 3
.\Break-6-8.ps1 3
.\Break-6-9.ps1 3
NOTE
After a break script completes, do not run another break script until after you complete
tasks 2 through 4 for each network problem. You must run the scripts one at a time.
4. Wait until the You are ready to start the lab message appears.
38
Task 2: Verify That the System Is Not Functioning Properly
You verify that networking is damaged in your lab environment.
1. Use the support request summary information to verify that you see the symptoms reported
for your break script and that your lab environment is not working.
NOTE
The first time you open a virtual machine console, you are prompted to click either Web
Console or VMware Remote Console. You must click Web Console.
.\Break-6-4.ps1 linux-a-01, linux- Users on the linux-a-01 and linux-a-02 VMs cannot
a-02 communicate with each other. The IP address for
linux-a-01 is 172.20.11.200. The IP address for linux-a-
02 is 172.20.11.201.
.\Break-6-5.ps1 linux-a-01, linux- Users on the following VMs report total network
a-02, communication failures: linux-a-01, linux-a-02, linux-a-
linux-a-03, 03, and linux-a-04.
linux-a-04
39
Break Script Impacted Support Request
Virtual
Machines
IMPORTANT
1. Use the available techniques and tools to troubleshoot and resolve the problem.
• Lab topology handout, which provides important information about the network,
storage, host, and VM configurations.
• Internet
40
Task 4: Verify the Solution
You verify that the virtual network is functioning properly.
2. Use the vSphere Client and VM web console, as needed, to verify that the problem is
resolved.
3. Leave the vSphere Client open until you complete all network troubleshooting problems.
4. After you verify that the problem is resolved, return to task 1 and run another break script.
41
42
Lab 9 Investigating Disk Issues on
ESXi
NOTE
For useful information about troubleshooting ESXi storage problems, see VMware
knowledge base article 1003564 at https://kb.vmware.com/kb/1003564. Review this
reference before you start the lab and use the information, as needed, while performing the
lab tasks.
cd /vmfs/volumes/sa-esxi-01-local/studentscripts
43
3. List the contents of the studentscripts directory.
ls /vmfs/volumes/sa-esxi-01-local/studentscripts
4. Enter ./script1.sh to run the script.
The script runs for a few seconds and returns to a command prompt.
1. In the Firefox bookmarks toolbar, select the vSphere Client (SA-VCSA-01) bookmark in the
vSphere Site-A folder.
Parameter Value
Name Win-1
Datastore Shared3
NOTE
The VM creation task fails. If you do not see an error message in the Recent Tasks pane,
click the Refresh icon at the top of the window.
44
Task 3: Troubleshoot the Problem
You analyze diagnostic messages and log files to identify the root cause of the failed task.
1. In the vSphere Client, review the error messages that appear in the Recent Tasks pane.
You must determine whether the cause of the problem is the VM, ESXi host, or storage.
2. In the vSphere Client, select the ESXi host's Monitor tab and review the Tasks and Events
list.
3. Identify tasks and events that provide insight into the cause of the problem.
NOTE
Focus on log entries that have a time stamp close to the time that the error occurred.
5. View the /var/log/hostd.log file and identify the log entries, if any, that provide hints
about the cause of the problem.
6. Using the information that you found in the vSphere Client and the log files, identify the root
cause of the problem.
1. Analyze each possible resolution and its impact, if any, on the vSphere environment.
45
Task 5: Verify the Solution
You verify that the VM creation task is successful.
1. Create a VM on sa-esxi-01.vclass.local.
Parameter Value
Name Win-1
Datastore Shared3
46
Lab 10 Troubleshooting Storage
Performance Issues
NOTE
For useful information about using the esxtop utility, see the following references. Review
these references before you start the lab and use the information, as needed, while
performing the lab tasks.
Reference Link
47
Task 1: Generate VM Disk Activity
You power on the Win-4, Win-5, and Win-6 VMs to generate disk activity.
After logging in to these VMs, a script runs on each VM to generate disk activity.
NOTE
Because the lab environment contains a small number of VMs, you cannot generate enough
load (IOPS) in the environment.
2. Enter esxtop.
3. Find out what the disk statistics mean and how they are useful in troubleshooting
performance issues.
48
Task 3: Monitor Performance by Storage Adapter
You view storage adapter (HBA) statistics on sa-esxi-03.vclass.local to determine which adapter
experiences the highest disk activity.
The asterisks next to the A, B, C, E, and G fields signify that statistics in these fields are
shown in the disk statistics display. These fields act like a toggle. If you enter a, the A fields
are shown in the display. You can turn on (and off) any of the fields by toggling the letter.
3. Verify that only the A, C, D, E, and G fields are selected (an asterisk should appear next to
the letter).
49
4. Press any key, such as Return, to get back to the disk statistics display.
The table shows the statistics that you see when you select the D, E, and G fields.
5. Interpret the statistics that are shown in the storage adapter display.
1. In the esxtop utility, enter u to view information about the storage devices (LUNs).
The table shows the statistics that you see when you select the F, G, and I fields.
50
4. Interpret the statistics that are shown in the storage device display.
6. Enter the command to view the datastore name of the affected storage device.
esxcli storage vmfs extent list
Q2. What is the datastore name of the affected storage device?
A2. Shared3
1. In the esxtop display, enter v to view information about the VM disk activity.
2. In the Fields menu, verify that only the B, C, D, E, I, J, and K fields are selected.
The table shows the statistics that you see when you select the I, J, and K fields.
Q2. What possible solutions can help you get better performance?
A2.
Also, check
Enable
Migrate one the
Storage DAVG value
I/O VMs
or two Control or latency
and the values
set datastore.
to another for5 the
value to ms. VM. Add another VMkernel port and vmnic for software iSCSI multipathing and set the multipathing policy to Round Robin.
4. In the vSphere Client, shut down Win-4, Win-5, and Win-6 and close each VM's console
window.
51
52
Lab 11 Troubleshooting VM Power-On
Problems
NOTE
For useful information about troubleshooting VM power-on failures, see VMware knowledge
base article 1014501 at https://kb.vmware.com/kb/1014501. Review this reference before
you start the lab and use the information, as needed, while performing the lab tasks.
53
Task 1: Create and Power On the VM
You create a VM and attempt to power on this VM to determine the power-on problem.
1. In the Firefox bookmarks toolbar, click the vSphere Client (SA-VCSA-01) bookmark in the
vSphere Site-A folder.
If you cannot create this VM, shut down all the other VMs stored on Shared storage.
a. Configure options for name, location, host, storage, compatibility, and guest OS with the
values in the table.
Option Value
Name linux-a-14
Location SA-Datacenter
Host sa-esxi-03.vclass.local
Storage Shared
Option Value
Memory 8 GB
Hard disk 2 GB
54
Task 2: Troubleshoot Problems or Errors
You view and analyze the warning and error messages that are generated in the vSphere Client
and log files as a result of the VM's failure to power on.
1. In the vSphere Client, find information that helps you determine the cause of the linux-a-14
VM's failure to power on.
a. Record your initial ideas about what might be causing the problem.
c. Determine how to verify your initial assumption of what might be causing the problem.
3. Verify whether your initial assumption is valid by viewing log files to find relevant information.
a. Find the log files in the /var/log directory that contain information related to the
linux-a-14 VM.
b. Focusing on the files that contain information about linux-a-14, examine each of these log
files to identify data that is related to linux-a-14.
NOTE
Even if the VM name is mentioned in a log file, the information in that file might not be
helpful when troubleshooting.
If you are new to troubleshooting, you might find it worthwhile to investigate all log files
to familiarize yourself with the types of information that each log provides.
As you gain more experience with troubleshooting, you can go directly to the most
useful log files.
c. Identify the log files in /var/log that contain information that is useful in determining
the problem's root cause.
55
Task 3: Resolve the Problem
You identify potential resolutions to the problem and apply the most appropriate resolution
based on your analysis.
1. Identify ways to resolve the problem and describe any negative impacts of these resolutions.
a. If the problem can be resolved in more than one way, list the potential resolutions and
explain how each resolution works.
b. If any of these resolutions might have a negative impact on the environment, describe
the possible negative impact of each.
NOTE
For purposes of this lab, do not choose resolutions that involve increasing the size of the
datastore.
5. After the linux-a-14 powers on successfully, power off the VM and delete it.
56
Lab 12 Troubleshooting VM Snapshot
Problems
1. Power On the VM
NOTE
For information about troubleshooting VM snapshot problems, see the following references.
Review these references before you start the lab and use the information, as needed, while
performing the lab tasks.
Reference Link
"The parent virtual disk has been modified since the child https://kb.vmware.com/kb/1007969
was created" error (1007969)
57
Task 1: Power On the VM
In the vSphere Client, you attempt to power on a VM called linux-a-13.
1. In the vSphere Client, find information that might help you to identify the cause of the linux-a-
13 VM's failure to power on.
a. Record your initial thoughts about what might be causing the problem.
c. Determine how to verify your initial assumption of what might be causing the problem.
58
3. Verify whether your initial assumption is valid by viewing log files to find relevant information.
a. Determine which log files in the /var/log directory contain information related to the
linux-a-13 VM.
b. Focusing on the files that contain information about linux-a-13, examine each of these log
files to identify data that is related to linux-a-13.
NOTE
Even if the VM name is mentioned in a log file, the information in that file might not be
helpful when troubleshooting.
If you are new to troubleshooting, you might find it worthwhile to investigate each of
these log files to identify the types of information about the failure that the logs provide.
As you gain more experience with troubleshooting, you can go directly to the log files
that are the most useful.
c. Identify the log files in /var/log that contain information that is useful in determining
the problem's root cause.
1. Identify ways to resolve the problem and describe any negative impacts of these resolutions.
a. If the problem can be resolved in more than one way, list the potential resolutions and
explain how each resolution works.
b. If any of these resolutions might have a negative impact on the environment, describe
the possible negative impact of each.
NOTE
Although the vSphere Client might be easier for you to use, practice resolving the problem
using the appropriate vSphere commands.
59
Task 4: Verify the Solution
You run vSphere commands to verify that the problem is resolved and that the linux-a-13 VM
powers on successfully.
a. If the task seems to hang, use the vSphere Client to determine whether a question is
pending on the VM.
60
Lab 13 Working with VM Snapshots
Using the Command Line
NOTE
For useful information about using the command line to create and delete VM snapshots and
to monitor these tasks, see the following references. Review these references before you
start the lab and use the information, as needed, while performing the lab tasks.
Reference Link
61
Task 1: Power On a VM
You use the vSphere Client to power on the Win-2 VM.
NOTE
1. Enter the vim-cmd command to create five VM snapshots, where each snapshot includes
the VM's memory.
2. Enter the watch command to monitor changes to the VM’s home directory.
Each new snapshot file should appear in the VM's home directory.
4. Enter the vim-cmd command to view details about the snapshot creation task.
b. Run the vim-cmd command to view details about the snapshot creation task.
1. Delete all the VM snapshots and monitor their deletion by running the watch command.
b. Enter the vim-cmd command to view details of the snapshot deletion task.
62
Lab 14 Troubleshooting Storage
Problems
Several break scripts are provided to create storage problems. Each script damages storage in
the lab environment in a different way. You can run the break scripts in any order. And you can
choose which problems to resolve.
1. Double-click the PowerCLI icon on the student desktop system to start a PowerCLI session.
63
3. Enter the name of a break script.
For example, you enter .\Break-7-1.ps1 to run the first break script.
In the Difficulty column of the table, 1 signifies least difficult and 3 signifies most difficult to
resolve.
.\Break-7-1.ps1 1
.\Break-7-2.ps1 2
.\Break-7-3.ps1 3
.\Break-7-4.ps1 3
.\Break-7-5.ps1 3
.\Break-7-6.ps1 3
.\Break-7-7.ps1 2
.\Break-7-8.ps1 2
.\Break-7-9.ps1 2
.\Break-7-10.ps1 3
IMPORTANT
After the break script completes, do not run another break script until you complete tasks 2
through 4 for each storage problem. You must run the scripts one at a time.
4. Wait for the You are ready to start the lab message to appear.
5. Leave the PowerCLI window open for the next problem and go to task 2.
64
Task 2: Verify That the System Is Not Functioning Properly
You verify that storage is damaged in your lab environment.
1. Use the support request summary information to verify that you see the symptoms reported
for your break script and that your lab environment is not working.
.\Break-7-1.ps1 A vSphere administrator cannot create any VMs on the NFS datastore.
The administrator also cannot migrate any existing VMs to the NFS
datastore.
.\Break-7-4.ps1 Storage paths have disappeared from one ESXi host iSCSI storage
adapter. The vSphere administrator did not specify which ESXi host or
storage had the problem. You might need to wait for 10-15 minutes, after
executing the script, for the problem to show up.
.\Break-7-6.ps1 End users report extremely poor performance on several VMs. All VMs
that were reported are stored on the Shared datastore.
65
Break Script Support Request
.\Break-7-10.ps1 A vSphere administrator reports that some VMs stored on the Shared
datastore are now marked inaccessible. The vSphere administrator did not
specify which ESXi host or VMs had the problem.
1. Use the available techniques and tools to troubleshoot and repair the problem.
• Lab topology handout, which contains important information about the network,
storage, host, and VM configurations
• Internet
2. Use the vSphere Client and remote consoles, as needed, to verify that the problem is
resolved.
3. Leave the vSphere Client open until you complete all storage troubleshooting problems.
66
Lab 15 Troubleshooting Cluster
Problems
This cluster is used by the break scripts. Without this cluster, the break scripts fail to complete.
1. Create a cluster called Lab Cluster and leave all features disabled.
3. Power off all VMs in the inventory before running any break scripts.
67
Task 2: Run the Break Script Break-8-1.ps1
You use PowerCLI to run the script called Break-8-1.ps1, which configures the cluster and impacts
Ramdisk use in the lab environment.
1. Double-click the PowerCLI icon on the student desktop system to start a PowerCLI session.
.\Break-8-1.ps1
After the script completes, do not run another break script until you complete tasks 4
through 6.
IMPORTANT
You must start with the first break script, Break-8-1.ps1. After you run the first break script
and solve that problem, you can run the remaining break scripts in any order.
You need to run the .\Break-8-1.ps1 script only once.
4. Wait for the You are ready to start the lab message to appear.
5. Leave the PowerCLI window open for the next problem, skip task 3, and go to task 4.
Several break scripts are provided to create cluster problems. Each script damages the cluster
configuration in the lab environment in a different way. You can run these break scripts in any
order, and you can choose which problems to resolve.
1. Double-click the PowerCLI icon on the student desktop system to start a PowerCLI session.
68
3. Enter the name of a break script.
For example, you enter .\Break-8-4.ps1 to run the first optional break script.
In the Difficulty column of the table, 1 signifies least difficult and 3 signifies most difficult to
resolve.
.\Break-8-4.ps1 2
.\Break-8-6.ps1 2
.\Break-8-8.ps1 2
NOTE: Before running this script, move sa-esxi-03.vclass.local
into the cluster that you created (Lab Cluster).
.\Break-8-10.ps1 2
IMPORTANT
You can run the remaining break scripts in any order. After the break script completes, do
not run another break script until you complete tasks 4 through 6 for each cluster problem.
You must run the scripts one at a time.
4. Wait for the You are ready to start the lab message to appear.
5. Leave the PowerCLI window open for the next problem and go to task 4.
69
Task 4: Verify That the System Is Not Functioning Properly
You verify that the cluster configuration is damaged in your lab environment.
1. Use the support request summary information to verify that you see the symptoms reported
for your break script and that your lab environment is not working.
.\Break-8-8.ps1 A vSphere administrator reports that CPU use is not balanced across
hosts in the cluster.
.\Break-8-10.ps1 A vSphere administrator reports that CPU use is not balanced across
hosts in the cluster.
1. Use the available techniques and tools to troubleshoot and repair the problem.
• Lab topology handout, which contains important information about the network,
storage, host, and VM configurations
• Internet
70
Task 6: Verify the Solution
You verify that the cluster is repaired.
2. Use the vSphere Client and remote consoles, as needed, to verify that the problem is
resolved.
3. Leave the vSphere Client open until you complete all cluster troubleshooting problems.
71
72
Lab 16 Resolving VM Power-On
Problems
NOTE
For useful information about troubleshooting a VM that fails to power on, and if you need
help while performing the tasks in this lab, see VMware knowledge base article 2001005 at
https://kb.vmware.com/kb/2001005.
Although this knowledge base article covers many possible similar issues and solutions, the
solution that you require might not be listed.
Search for other articles that are specific to the error message that you receive when you try
to power on the VM and it fails. It is important to understand the various factors and errors
that can cause a VM to fail when powered on.
73
Task 1: Run a Break Script
You run a break script on the ESXi host on which Win-4 is located.
cd /vmfs/volumes/sa-esxi-01-local/studentscripts
5. Run the script3.sh script, where <datastore name> is the datastore on which
Win-4 is located.
Script Running......
Powering off VM:
74
Task 2: Troubleshoot the Problem
You view and analyze error messages that occurred when the Win-4 VM failed to power on. You
view information in the vSphere Client and files on the sa-esxi-01.vclass.local host to determine
the root cause.
1. Find information in the vSphere Client that might give you hints about why the Win-4 VM did
not power on.
• Are there any other potential causes? If so, what are they?
• How can you verify your initial assumption of what is causing the problem?
3. Identify log files that might contain information about Win-4's failure to power on and view
them to find relevant information.
• Is there more than one way to resolve the problem? If so, list the potential resolutions
and explain why each resolution might work.
• Do any of these resolutions have a negative impact? If so, which ones and why?
3. Using the command line, apply the resolution that you selected.
75
Task 4: Verify the Solution
You use vSphere commands, instead of the vSphere Client, to verify that the problem is resolved
and that the Win-4 VM powers on successfully.
1. Use MTPuTTY to establish an SSH session with the ESXi host on which the Win-4 VM is
located.
3. Using the command line, verify that the Win-4 VM powers on successfully.
76
Lab 17 Troubleshooting VM Problems
77
Task 1: Run a Break Script
You use PowerCLI to run a break script to damage VMs in the lab environment.
Several break scripts are provided to create VM problems. Each script damages VMs in the lab
environment in a different way. You can run the break scripts in any order. And you can choose
which problems to resolve.
1. Double-click the PowerCLI icon on the student desktop system to start a PowerCLI session.
For example, you enter .\Break-9-1.ps1 to run the first break script.
In the Difficulty column of the table, 1 signifies least difficult and 3 signifies most difficult to
resolve.
.\Break-9-1.ps1 2
.\Break-9-2.ps1 2
.\Break-9-6.ps1 2
.\Break-9-7.ps1 2
.\Break-9-8.ps1 3
.\Break-9-9.ps1 3
NOTE
After the break script completes, do not run another break script until you complete tasks 2
through 4 for each VM problem. You must run the scripts one at a time.
4. Wait for the You are ready to start the lab message to appear.
5. Leave the PowerCLI window open for the next problem and go to task 2.
78
Task 2: Verify That the System Is Not Functioning Properly
You verify that VMs are damaged in your lab environment.
1. Use the support request summary information to verify that you see the symptoms reported
for your break script and that your lab environment is not working.
.\Break-9-7.ps1 An end user cannot power on a VM. The user did not report which
VM failed to power on.
79
Task 3: Troubleshoot and Resolve the Problem
You troubleshoot and resolve the problem with the VMs, drawing on relevant techniques and
tools.
1. Use the available techniques and tools to troubleshoot and resolve the problem.
• Lab topology handout, which contains important information about the network,
storage, host, and VM configurations
• Internet
2. Use the vSphere Client and remote consoles, as needed, to verify that the problem is
resolved.
3. Leave the vSphere Client open until you complete all the VM troubleshooting problems.
80
Lab 18 Restarting ESXi Management
Agents
NOTE
For useful information about restarting the ESXi management agents, see VMware knowledge
base article 1003490 at https://kb.vmware.com/kb/1003490. Review this reference before you
start the lab and use the information, as needed, while performing the lab tasks.
For troubleshooting purposes, it might be necessary to restart the management agents on your
ESXi host.
1. Log in to the DCUI for sa-esxi-01.vclass.local and open the pop-out console.
d. Press F2 and log in to the DCUI with user name root and password VMware1!
81
2. Restart the management agents on sa-esxi-01.
a. Use the down arrow key to select Troubleshooting Options.
b. From the Troubleshooting Mode Options menu, select Restart Management Agents.
The warning states that restarting the management agents disconnects all the remote
management software. You temporarily lose any SSH session opened to the ESXi host.
While the management services are restarting, you cannot access the ESXi host directly
from the vSphere Client. The ESXi host shows up as disconnected from the vCenter
Server system.
c. To proceed with restarting the management agents, press F11.
d. After the agents restart, press Enter.
e. Press ESC twice to log out of the DCUI.
f. Click on the CONSOLES tab to open a list of available consoles.
g. Click STUDENT-A-01 to switch to the console for the student desktop.
Management agents can be restarted from the local console or an SSH session.
1. Use MTPuTTY to log in to sa-esxi-01.vclass.local.
2. Enter the command to restart the management agents.
services.sh restart
Progress is output to the terminal and written to the /var/log/jumpstart-
stdout.log file.
NOTE
The services.sh restart command restarts all the services on the ESXi host. This
command must be used with care because it can cause downtime in a production
environment.
/etc/init.d/hostd restart
Instead of restarting all management agents at the same time, you can restart an individual
agent, such as hostd or vpxa.
82
Lab 19 Troubleshooting ESXi Host
Disconnection Problems
NOTE
For useful information about troubleshooting an ESXi host in a nonresponding state, see
VMware knowledge base article 1003409 at https://kb.vmware.com/kb/1003409.
1. In the vSphere Client, verify that the linux-a-01 and Win-4 VMs are powered on.
cd /vmfs/volumes/sa-esxi-01-local/studentscripts
4. Run the script.
./script2.sh
83
Task 2: Troubleshoot the Problem
You analyze diagnostic messages in the vSphere Client and log files to identify the root cause of
the ESXi host disconnection problems.
• What other tasks can you perform to assess the affected environment?
2. Drawing on the information that you gathered so far, determine what the root cause might
be.
3. Verify the log files to find additional information that can aid you in identifying the root cause.
4. Conduct any additional tests and analysis to identify the root cause.
NOTE
For the purposes of this lab, do not resolve the problem by rebooting the host.
3. Using the command line, apply the resolution that you selected.
1. Verify that you can successfully log in to sa-esxi-01.vclass.local with VMware Host Client.
2. Log in to the vSphere Client and verify that sa-esxi-01 is operating normally.
84
Lab 20 Troubleshooting vCenter
Server Connection Problems
NOTE
For useful information about troubleshooting vCenter Server connection problems, see these
references. Review these references before you start the lab and use the information, as
needed, while performing the lab tasks.
Reference Link
85
Task 1: Run a Break Script
You run a break script to introduce a problem on the vCenter Server Appliance instance.
5. When the script completes running, wait 15-20 seconds and log in to sa-vcsa-01.vclass.local
using the vSphere Client.
2. Review the log files for additional information that might help you to identify the root cause.
3. Perform any additional tests and analysis to determine the root cause.
86
Task 3: Resolve the Problem
You apply the resolution that you deem is the most appropriate based on your analysis.
NOTE
For purposes of this lab, do not resolve the problem by rebooting the system.
1. Verify that you can successfully log in to sa-vcsa-01.vclass.local using the vSphere Client.
87
88
Lab 21 Troubleshooting vCenter
Server and ESXi Host Problems
89
Task 1: Run a Break Script
You use PowerCLI to run a break script to damage your vCenter Server configuration or ESXi
host configuration in the lab environment.
Several break scripts are provided to create vCenter Server configuration and ESXi host
configuration problems. Each script damages the configuration in the lab environment in a
different way. You can run the break scripts in any order. You can choose which problems to
resolve.
For example, you enter .\Break-11-1.ps1 to run the first break script.
In the Difficulty column, 1 = least difficult to resolve, and 3 = most difficult to resolve.
.\Break-11-1.ps1 3
.\Break-11-2.ps1 2
.\Break-11-5.ps1 2
.\Break-11-7.ps1 3
.\Break-11-11.ps1 1
NOTE
After the break script completes, do not run another break script until you complete tasks 2
through 4 for each problem. You must run the scripts one at a time.
4. Wait until the You are ready to start the lab message appears.
90
Task 2: Verify That the System Is Not Functioning Properly
You verify that an ESXi host or vCenter Server configuration is damaged in your lab environment.
1. Using the support request summary information, verify that the symptoms reported for your
break script occur and that your lab environment is not working.
.\Break-11-2.ps1 Note: Power off all VMs in the inventory before running this
script.
1. Use the available techniques and tools to troubleshoot and resolve the problem.
• Lab topology handout, which provides important information about the network,
storage, host, and VM configurations
• Internet
91
Task 4: Verify the Solution
You verify that the vCenter Server and ESXi host configuration problem is resolved.
2. Use the vSphere Client and VM web console, as needed, to verify that the problem is
resolved.
3. Leave the vSphere Client open until you complete all vCenter Server and ESXi host
troubleshooting problems.
92
Lab 22 Appendix: Troubleshooting
Network Communication Failures
Troubleshooting Flowchart
The flowchart presents a logical sequence for troubleshooting failures related to network
communications.
Troubleshooting Tasks
To troubleshoot network communication failures, you might perform the following tasks:
93
Task 1: Verify the IP Configuration
A specific host (ESXi host, VM, or vCenter Server) on a specific network seems to have a
problem.
1. Verify that the IP address and subnet mask of the VMkernel ports (ESXi hosts) or assigned
NICs (vCenter Server system, VM) are correct.
2. Verify that the default gateway of the VMkernel ports (ESXi hosts) or assigned NICs
(vCenter Server system, VM) is correct.
3. Verify that the DNS settings of the VMkernel ports (ESXi hosts) or assigned NICs (vCenter
Server system, VM) are correct.
1. In the vSphere Client, verify that the VLAN configuration of any distributed switch is correct
on the vCenter Server system.
a. Select Networking > distributed_switch > port_group > Configure > Edit Settings >
VLAN.
2. Verify that the VLAN configuration of any standard switch is correct on every ESXi host.
a. Select Host & Clusters > ESXi_host > Configure > Virtual Switches > virtual_switch >
port_group > ... > View Settings > Properties > VLAN ID.
1. In the vSphere Client, verify that the hardware configuration for speed, duplex, and MTU of
any physical adapters is correct on every ESXi host.
a. Select Hosts > ESXi_host > Configure > Networking > Physical adapters.
94
Task 4: Verify the Uplink Configuration
An uplink configuration error on a specific host causes network connectivity to fail.
The correct uplink must be connected to the correct virtual switch on all ESXi hosts that are
connected to a standard switch or a distributed switch. VMs must be connected to the correct
port group.
a. In the vSphere Client, select Networking > specific_switch > Configure > Topology.
a. In the vSphere Client, select the VM and select VM Hardware > Network adapter.
The correct teaming and failover configuration must be set on every ESXi host connected to a
standard switch or a distributed switch.
1. In the vSphere Client, verify that the teaming and failover configuration is correct by
selecting Networking > switch > port_group > Edit Settings > Teaming and failover.
a. On each host, run the command to verify that link status is up on all network links.
a. In the vSphere Client, select the VM and select VM Hardware > Network adapter.
95
Task 7: Investigate a Host Failure
One host (ESXi, VM, or vCenter Server) seems to have a problem on all networks.
a. Use ping and other communication tools to verify normal communications everywhere
else.
2. Verify all local network configuration settings on this specific host from inside the host for
each network device.
b. Verify DNS, gateway, subnet masks, and firewall settings for each network device in
use.
A correct configuration on eth0 does not help if you are using eth1.
3. From the vSphere Client, verify all network configurations settings that are local to this
specific host on the network.
a. Verify VMkernel settings (if applicable), port settings, port group settings, NSX firewalls,
NSX routing, MTU settings, and so on.
b. Verify that the network device for this host is set to active (instead of standby or
unused).
5. Review logs for any indication of traffic overloads or DOS attacks on this specific host.
a. Use ping and other communication tools to Verify normal communications everywhere
else.
96
3. Verify whether a routing or gateway problem exists.
4. Verify all network configurations settings specific to this network, starting with virtual
hardware and then physical hardware (if applicable).
Configuration settings include VMkernel (if applicable) settings, port settings, port group
settings, NSX firewalls, NSX routing, MTU settings, and so on.
Is the network device for this host set to active (instead of standby or unused) on this
configuration?
5. Review logs for any indication of traffic overloads or DOS attacks on this specific network.
1. Verify that all hosts can communicate normally with other protocols.
2. Examine all network firewall configurations settings that are specific to this network.
97
98
Lab 23 Appendix: Troubleshooting
Storage Failures
Troubleshooting Flowchart
The flowchart presents a logical sequence for troubleshooting storage failures.
99
Troubleshooting Tasks
To troubleshoot storage failures, you might perform the following tasks:
1. Verify that all individual ESXi hosts can see all LUNs.
df -h | grep VMFS
All datastores should be visible and show free space.
4. If storage is working but performing poorly, look for overloads from specific ESXi hosts,
VMs, and so on.
b. If NAS storage (iSCSI, NFS, FCoE, and so on) is used, follow network troubleshooting
procedures to identify bandwidth overloads or other network configuration problems.
100
Task 2: Investigate a VM Disk Failure
A single VM has lost connection to one or more disk devices.
1. Verify that all the VMs that use the same datastore are operating normally.
2. If other VMs on the same datastore are experiencing problems, go to Datastore Failure.
3. View the vmware.log file to find error messages related to this VM.
4. In the vSphere Client datastore browser, locate all files for the VM.
a. Verify that no files are missing, especially VM disk descriptor files, VM configuration files,
and VM disk files (.vmdk and .vmx).
5. Verify that the VM configuration correctly identifies the VM disk files and has the correct
path to the VM disk files.
7. Attempt to edit the disk descriptor file with a text editor to resolve any CID mismatch errors.
1. If storage is working but performing poorly, look for overloads from specific ESXi hosts,
VMs, and so on.
2. If NAS storage (iSCSI, NFS, FCoE, and so on) is used, follow network troubleshooting
procedures to identify bandwidth overloads or other network configuration problems.
101
Task 4: Investigate an iSCSI Storage Failure
A network storage device connected by the iSCSI protocol is offline.
1. Verify that the iSCSI target array is supported and that it presents the LUN to the ESXi host.
2. Verify that the iSCSI storage adapter on each ESXi host is configured correctly.
• Port
• Authentication
• Port binding
3. Verify that the IP networking components on each ESXi host are configured correctly.
• VMkernel port
• Uplink
4. Verify that you can ping the VMkernel port from other devices on the IP storage network.
• VLAN
• Teaming
• Traffic shaping
• Port blocking
8. Verify that the virtual switch is configured correctly, including MTU and filtering settings.
9. Verify that iSCSI network traffic is not fighting congestion from other types of IP traffic on
the network.
An isolated storage network is recommended, and you might need to isolate iSCSI from NFS
traffic.
102
10. Verify that IP communications between the affected ESXi hosts and the target array are
working.
11. Verify that no firewalls are blocking TCP 3260 between the affected ESXi hosts and the
target array.
a. Run the command to verify communication between the ESXi host and the iSCSI array.
nc -z IPaddr 3260
12. Verify that the physical storage hardware on the iSCSI target array is functioning correctly.
13. Use the VMware vSphere On-disk Metadata Analyzer (VOMA) to verify VMFS metadata
consistency.
1. Verify that the IP networking components on each ESXi host are correctly configured.
• VMkernel port
• Uplink
3. Run ESXCLI and other commands from an SSH session or PowerCLI to verify all local
network configuration settings on the specific host from inside the host.
a. Verify DNS, gateways, subnet masks, firewall, and other settings for each network
device in use.
a. On ESXi hosts, verify that you are troubleshooting the correct virtual switch, port group,
VMkernel address, and uplink.
A correct configuration on eth0 does not help if you are using eth1 for storage.
5. Verify that network communications are correctly configured on the NFS Storage provider.
6. Verify that the NFS storage provider and the ESXi hosts are consistent on the NFS protocol
that is used (v3, v4.1, and so on).
8. Verify that time is synchronized between the ESXi host and the NFS storage provider.
103
Task 6: Investigate a Fibre Channel Storage Connectivity Failure
A connectivity problem occurs with Fibre Channel storage. Channel storage arrays are
connected with vendor-specific dedicated hardware.
1. Verify that no ESXi hosts can see the shared storage array.
If the problem is specific to a single ESXi host, it might be either a configuration error on the
storage adapter or a hardware failure.
3. Verify that the storage adapter on the ESXi host is configured correctly.
4. Verify that the VMFS metadata is consistent with the vSphere On-disk Metadata Analyzer
(VOMA).
5. Verify that the fibre switch zoning configuration permits the ESXi host to see the storage
array.
6. If your configuration requires an ESXi host reboot after the zone set change on the FC-SAN
array, reboot your ESXi host.
7. For more information about troubleshooting Fibre Channel storage connectivity, see
VMware knowledge base article 1003680 at https://kb.vmware.com/s/article/1003680.
1. Verify that Ethernet layer 2 connectivity between the ESXi host and FCoE storage array is
good.
2. Verify that the storage adapter on the ESXi host is configured correctly.
4. Verify that the VMFS metadata is consistent with the vSphere On-disk Metadata Analyzer
(VOMA).
104
Task 8: Troubleshoot a Path Failure
Path failure means one or more paths between the ESXi host and the storage device or storage
array are down. Either a device is permanently unavailable to an ESXi host (permanent device
loss or PDL) or all paths between the ESXi host and the storage device or array are down (all
paths down or APD). An APD condition is expected to be temporary.
Symptoms:
1. Verify that all other hosts can communicate normally to the device.
a. Use ping and other communication tools to Verify normal communications everywhere
else.
Settings include DNS, gateways, subnet masks, and firewall settings for each network device
in use.
A correct configuration on eth0 is not going to help if you are using eth1.
4. If multiple hosts lose communications with the same storage array, verify configuration
settings on the storage array.
Also verify that no network hardware between the hosts and storage array is misconfigured
(firewalls, VLANs, port blocking, MTU, and so on.)
5. Verify that teaming and failover on the required NICs is correctly configured and that the
correct uplink is active.
105
6. Verify that default APD handling is enabled on the ESXi host with the global setting
Misc.APDHandlingEnable = 1.
Also verify that the time-out setting is long enough. Misc.APDTimeout = 140 is
recommended (140 seconds).
7. Verify that the path selection policy is correctly configured on the storage adapter.
8. For NFS devices, verify that the correct NFS protocol (NFS 3 or NFS 4.1) is used.
9. After configuration problems are solved, you might need to reattach or remount the
datastore.
2. Verify that local storage is available and functioning using ESXCLI commands.
A local disk failure might be caused by a local hardware failure on the ESXi host.
106
Task 11: Troubleshoot a Storage Site Disaster
A disaster occurs at a storage site. All storage that is hosted at a specific physical site is offline.
2. Create a service recovery estimate for how long the facility will be offline.
3. Determine if storage hardware is physically damaged and must be physically replaced with
data restored from backups.
4. Drawing on the scope of the problem and the service recovery estimate, determine which
form of the disaster recovery or business continuity plan should be implemented.
107
108
Lab 24 Appendix: Troubleshooting
Cluster Failures
#Troubleshooting Flowchart
The flowchart presents a logical sequence for troubleshooting cluster failures.
109
Troubleshooting Tasks
To troubleshoot cluster failures, you might perform the following tasks:
1. Verify that the network configuration on the management network is correct by reviewing
the settings.
• IP address
• Subnet mask
• Gateway
• Uplink connections
• VLAN settings
2. Verify that the network configuration on the vMotion network is correct by reviewing the
settings.
• IP address
• Subnet mask
• Gateway
• Uplink connections
• VLAN settings
110
3. Verify that name resolution is working on all ESXi hosts and vCenter Server systems.
4. Verify that time is synchronized across the environment (ESXi hosts and vCenter Server
systems).
5. Verify that enough disk-free disk space is available on the target host. (Occurs only during
storage migration).
6. Verify that the reservation requirements (if any) on the VM can be met on the target host.
7. Verify that the log.rotateSize parameter is not set too low for the VM.
8. Restart the hostd and vpxa management agents on both ESXi hosts.
1. Verify that the network configuration on the management network is correct by reviewing
the settings.
• IP address
• Subnet mask
• Gateway
• Uplink connections
• VLAN settings
2. Verify that name resolution is working on all ESXi hosts and vCenter Server systems.
3. Verify that time is synchronized across the environment (ESXi hosts and vCenter Server
systems).
4. Verify that the log.rotateSize parameter is not set too low for the VM.
5. Restart the hostd and vpxa management agents on both ESXi hosts.
111
Task 3: Reset Migrate Enabled and Verify the Result
A reset of the Migrate.Enabled parameter can solve some vSphere vMotion migration
problems.
1. Verify that the Fault Domain Manager (FDM) agent is installed on all ESXi hosts.
• Poor network bandwidth can prevent the FDM agent from installing.
• Insufficient disk space in /root can prevent the FDM agent from installing.
• Verify that the /etc/opt/vmware/fdm directory exists and has the correct files installed.
Poor network bandwidth can prevent the FDM agent from installing. Insufficient disk
space in the /root directory can prevent the FDM agent from installing.
b. Verify that the /etc/opt/vmware/fdm directory exists and has the correct files
installed.
a. If the FDM is not running, restart it on an ESXi host after you determine what caused the
failure.
3. Verify that all ESXi hosts are connected to the vCenter Server system.
a. Test connectivity from the ESXi host back to the vCenter Server system using ping.
a. If you use DHCP, verify that the IP address for each host persists across reboots.
Although HA supports both IPv4 and IPv6, ensure that all HA network traffic is either
one protocol or the other, not a mixture of both protocols.
112
5. Verify that all hosts have at least one heartbeat network in common (management network
or vSAN network, if vSAN was first enabled on the cluster).
9. To ensure that any VM can run on any host in the cluster, provide all hosts with access to the
same VM networks and datastores.
10. Verify that you have a minimum of two ESXi hosts in a vSphere HA cluster.
12. Verify that all hosts are licensed for vSphere HA.
vSphere HA supports IPv4 and IPv6. However, a cluster that mixes both of these protocol
versions is more likely to result in a network partition.
13. Verify that all ESXi hosts can access the same networks.
14. Verify that all ESXi hosts can access the same shared datastores.
2. Verify VM reservations.
One or more VMs might have excessive reservations. Check VM bandwidth reservations.
113
Task 6: Investigate Why DRS Never Migrates
DRS does not migrate VMs, even when the cluster is badly imbalanced.
2. Verify that the DRS automation level is not set too low.
5. Verify that all VMs are not using local host resources.
2. Verify that the DRS automation level is not set too low.
4. Verify that some VMs are not using local host resources.
3. Verify that the DRS automation level is not set too high.
114
Lab 25 Appendix: Troubleshooting
Virtual Machine Failures
#Troubleshooting Flowchart
The flowchart presents a logical sequence for troubleshooting VM failures.
115
Troubleshooting Tasks
To troubleshoot VM failures, you might perform the following tasks:
1. Examine the vmware.log file associated with the VM to identify the specific disk chain
that is affected.
For multiple .vmdk files, the CID and parentCID that are referenced in the files should
match.
1. Verify that you can take a normal snapshot with Quiesce deselected.
3. Verify that appropriate services are running and startup types are correct.
5. Verify that all the VSS writers are stable and not reporting errors.
116
Task 3: Investigate a General Snapshot Failure
User cannot create or commit a snapshot.
3. Verify that you have permission to create or commit snapshots, including permission to write
to the datastore.
4. Verify that the -delta.vmdk file does not have an associated descriptor file that is
missing.
5. Verify that the snapshot file size does not exceed the maximum size supported by the
datastore.
a. Run the vmkfstools command to identify which ESXi host is locking the file.
b. Run the lsof command to identify the process that is locking the file.
4. If a file is locked and you cannot stop the process that locks it, migrate all VMs to a new host
or reboot the ESXi host that is locking the file.
6. Verify that the ESXi host is online and connected to the vCenter Server system.
a. Verify that the ESXi host can respond to a network ping on the management interface.
b. Open the direct console to the ESXi host and look for a purple error screen.
117
Task 5: Investigate a VM That Shows an Invalid or Orphaned State
VMs might show as Invalid or Orphaned. This problem might be caused by a vCenter Server
system failure or a restart during a migration process.
1. If the vCenter Server system was rebooted, wait until it is completely back online and stable.
4. Examine the Recent Tasks pane to verify that the VM is being migrated.
5. If the VM is registered on one of the ESXi hosts, restart the management processes on that
ESXi host.
9. If the files are missing (files deleted outside of vCenter Server system), restore files from
backup and reregister the VM.
2. Verify that the correct VMware Tools ISO image is available and is not corrupt
118
Lab 26 Appendix: Troubleshooting
ESXi Host and vCenter Server System
Failures
Troubleshooting Flowchart
The flowchart presents a logical sequence for troubleshooting ESXi host and vCenter Server
failures.
119
Troubleshooting Tasks
To troubleshoot ESXi host and vCenter Server problems, you might need to perform the
following tasks:
3. Verify that all ICA and root CA certificates are published into the trusted store in the VECS.
For more information about restarting vCenter Server services, see VMware knowledge base
article 2109881 at http://kb.vmware.com/kb/2109881.
b. Click Services.
120
2. Restart the vCenter Server service from an SSH session.
1. In the vSphere Client, examine database settings to verify that your configuration is not
trying to record too much data in the database.
5. Verify that the last time the rollup jobs ran is not more than 24 hours past.
6. Verify that the vpx_hist_statl table does not include more than 10 million rows.
7. If you use an internal PostgreSQL database and it is out of space, shut down the vCenter
Server Appliance VM, expand the VM's hard disk, power on vCenter Server Appliance, and
run the vpxd_servicecfg storage lvm autogrow command.
121
Task 6: Investigate a Purple Diagnostic Screen
An ESXi host stops responding and displays a purple diagnostic screen.
4. If VMware Technical Support determines that the issue is a hardware problem, contact your
hardware vendor.
1. Verify that an ESXi host is not responding by performing tasks on the host.
• Monitor network traffic from the ESXi host and its VMs.
If any of the verification tasks are successful, your ESXi host should be at least minimally
operational.
2. In the ESXi host's DCUI, press ALT+F12 to display VMkernel messages on the screen.
5. After hardware problems are corrected, reinstall and configure the ESXi host, using your
most recent backup to ensure that faulty hardware did not corrupt the disk.
6. Install the latest patches and updates for the ESXi host.
122
Answer Key
123
A2. The default router for the Production network should be 172.20.11.10. However, in a
DHCP network configuration, no router is assigned if a network problem occurs.
Q3. Does the host have the correct network configured?
A3. The network should be configured as either the pg-SA-Production-01 or the pg-SA-
Production-02 network.
Q4. Does the host have a network link status of connected?
A4. Yes, the network status is connected.
Q1. Does the host have the correct uplink configured?
A1. Uplinks are not correct. The sa-esxi-01 host does not have uplinks configured on the pg-
SA-Production-01 and pg-SA-Production-02 port groups.
Lab 10 Troubleshooting Storage Performance Issues
Q1. Which HBA might be the cause of slow storage performance?
A1. vmhba65, because this HBA shows high IOPS.
Q2. What condition is degrading storage performance?
A2. A high number of read commands are being issued from vmhba65.
Q1. Which storage device seems to be affected?
A1. The device with the storage identifier naa.60003ff44dc75adcaf760d6a0ac8e3fe
Q2. What is the datastore name of the affected storage device?
A2. Shared3
Q1. Which VM or VMs might be contributing to slow storage performance?
A1. Win-4, Win-5, and Win-6 are running several read commands per second. However,
these VMs do not seem to be causing a significant amount of latency because the load is
still less.
Q2. What possible solutions can help you get better performance?
A2. Add another VMkernel port and vmnic for software iSCSI multipathing and set the
multipathing policy to Round Robin. Also, check the DAVG value or latency values for
the VM. Enable Storage I/O Control and set the value to 5 ms. Migrate one or two VMs
to another datastore.
124