IBM VIOS Maintenance
IBM VIOS Maintenance
IBM VIOS Maintenance
FORSYTHE TALKS
VIOS MAINTENANCE
Jaqui Lynch
[email protected]
AGENDA
Best Practices Setup
Network
Installation
Maintenance and Upgrades
Backup and recovery
Monitoring
FBO
Other
Wrap-up/Questions
7/22/2014
STAY CURRENT
VIOS Lifecycle
Version
GA
1.5
11/07
2.1
11/08
2.2.00
9/10
2.2.1
10/11
2.2.2
10/12
2.2.3
4Q13
2.2.4
2Q15
Latest release:
2.2.3.3 FP27 SP03
EOM
2008
2010
2011
10/12
10/13
EOS/EOL
09/11
09/12
09/13
04/15
09/16
7/22/2014
GENERAL
Keep it simple
Ensure LMB is the same on all servers if you want to use LPM
Use hot pluggable adapters rather than built in ones
Easier maintenance
7/22/2014
PAGING
Clean up paging
By default VIO has a 512MB hd6 and a 1.5GB paging00 on the same LUN
Swapoff paging00 and delete it
Increase hd6 to 4GB
Reboot
Change from:
lsps -a
Page Space
paging00
hd6
TO:
# lsps -a
Page Space
hd6
Logging
vi /etc/syslog.conf
user.info
user.debug
mail.debug
*.emerg
*.alert
*.crit
*.err
auth.notice
*.info
cd /usr/local/logs
touch syslog mailog infolog messages
stopsrc -s syslogd
startsrc -s syslogd
Try not to log to /var as bad things happen if it fills up
Make /usr/local/logs a new filesystem
7/22/2014
TUNING
ADAPTER TUNING
fcs0
bus_intr_lvl
bus_io_addr
115
0xdfc00
bus_mem_addr 0xe8040000
False
False
False
init_link
al
intr_priority
lg_term_dma
3
Interrupt priority
Long term DMA
True
False
max_xfer_size
0x100000
True
(16MB DMA)
num_cmd_elems
200
pref_alpa
sw_fc_class
0x1
2
Preferred AL_PA
FC Class for Fabric
True
True
True
0x800000
True
0x200000
128MB DMA area for data I/O
num_cmd_elems 1024
10
7/22/2014
VIO SERVER
#lsattr -El fcs0
lg_term_dma
max_xfer_size
num_cmd_elems
the adapter True
0x800000
0x200000
2048
11
12
7/22/2014
MEMORY PLANNING
13
VIRTUAL ETHERNET
Link aggregation
Put vio1 aggregate on a different switch to vio2 aggregate
Provides redundancy without having to use NIB
Allows full bandwidth and less network traffic (NIB is pingy)
Basically SEA failover with full redundancy and bandwidth
14
7/22/2014
NETWORK
15
STARTER
SET OF TUNABLES
NETWORK
16
7/22/2014
# ifconfig -a
en6:
flags=1e080863,580<UP,BROADCAST,NOTRAILERS,RUNNING,SIMPLEX,MULTIC
AST,GROUPRT,64BIT,CHECKSUM_OFFLOAD(ACTIVE),CHAIN>
inet 192.168.2.5 netmask 0xffffff00 broadcast 192.168.2.255
tcp_sendspace 262144 tcp_recvspace 262144 rfc1323 1
lo0:
flags=e08084b,1c0<UP,BROADCAST,LOOPBACK,RUNNING,SIMPLEX,MULTICAST
,GROUPRT,64BIT,LARGESEND,CHAIN>
inet 127.0.0.1 netmask 0xff000000 broadcast 127.255.255.255
inet6 ::1%1/0
tcp_sendspace 131072 tcp_recvspace 131072 rfc1323 1
17
VALID ADAPTERS
FOR
P7 AND P7+
770
1768 Integrated Multifunction Card with Copper SFP+ - Dual 10Gb copper and dual 10/100/1000MB copper ethernet
1769 Integrated Multifunction Card with SR Optical - Dual 10Gb optical and dual 10/100/1000MB copper ethernet
PCIE Adapters
7/22/2014
NIM = Network Install Manager. Used for installing AIX or VIOS over the network instead of DVD drive.
19
Depends on:
20
10
7/22/2014
NOTES ON 10GBE
Using jumbo frames better allows you to use the full bandwidth coordinate with
network team first
Jumbo frames means an MTU size of 9000
Reduces CPU time needed to forward packets larger than 1500 bytes
Has no impact on packets smaller than 1500 bytes
Must be implemented end to end including virtual Ethernet, SEAs, etherchannels, physical adapters,
switches, core switches and routers and even firewalls or you will find they fragment your packets
Throughput can improve by as much as 3X on a virtual ethernet
Manage expectations
Check that the switch can handle all the ports running at 10Gb
Make sure the server actually has enough gas to deliver the data to the network
at 10Gb
21
10GBE TIPS
Use flow control everywhere this stops the need for retransmissions
If you need significant bandwidth then dedicate the adapter to the LPAR
There are ways to still make LPM work using scripts to temporarily remove the adapter
If you set large_receive on the SEA the AIX LPARs will inherit the setting
Consider increasing the MTU size (talk to the network team first) this increases the size of the
actual packets
If you use ifconfig to make the changes it does not update ODM so the change does not survive
a reboot
22
11
7/22/2014
10GBE TIPS
Low CPU entitlement or too few VPs will impact network performance
https://www.ibm.com/developerworks/wikis/download/attachments/153124943/7_PowerVM_10Gbit_Ethernet.pdf?ver
sion=1
23
OTHER NETWORK
SEA failover performance can be improved by setting:
portfast trunk on the switch ports
And by activating:
dead gateway detection with a ping time of 2 seconds
Also look at:
enabling bridge protocol data unit (BPDU) guard on switch
ports to protect against spanning tree loops
24
12
7/22/2014
25
INSTALLATION
26
13
7/22/2014
Copy the bosinst.data from the DVD and create a viosbosinst resource
You can now use bos_inst to do a mksysb install once the partition profile is defined
http://www-01.ibm.com/support/docview.wss?uid=isg3T1011386
28
14
7/22/2014
29
UPDATING VIOS
1. Normally upgrade HMC first then firmware then VIOS and then AIX
2. BUT check the readme for all of the above first to make sure there is
not a different required order
3. Download the updates and cross-check compatibility using FLRT
4. Read the readme again
5. Run errpt to check for problems,check there are no stale partitions,
missing disks or paths, etc
6. Ensure all paths on clients are redundant so LPARs will stay up when
this VIOS is rebooted
7. Run HMC Scanner or sysplan to document prior to changes
8. Backup the VIOS
9. Mount the NFS filesystem or DVD or FBO image to be used for update
30
15
7/22/2014
MIGRATION
Back the VIO up before doing anything and again when done!
1. In order to migrate to v2.* your HMC must be at v7 or later at least 7.7.4
If VIOS is lower than v2.1 then you must migrate to 2.0.0 using the migration DVD
2. Migrating from prior to v1.3
Basically this is a reinstall
Once you are on v2.1 then upgrades are all done using updateios
There are specific concerns around updates if you are running SSPs (Shared storage pools)
31
32
16
7/22/2014
17
7/22/2014
18
7/22/2014
rootvg
rootvg
active
active
bosboot a d hdisk0
bosboot a d hdisk1
bootlist m normal hdisk0 hdisk1
37
Once all checks are passed and VIO2 is back up then go do the same upgrade to VIO1
38
19
7/22/2014
20
7/22/2014
41
BACKING
UP
VIOS
42
21
7/22/2014
BACKING
UP
43
22
7/22/2014
MONITORING
45
NETSTAT -V
46
23
7/22/2014
NETSTAT
VIO
SEA
Transmit Statistics:
-------------------Packets: 83329901816
Bytes: 87482716994025
Interrupts: 0
Transmit Errors: 0
Packets Dropped: 0
Receive Statistics:
------------------Packets: 83491933633
Bytes: 87620268594031
Interrupts: 18848013287
Receive Errors: 0
Packets Dropped: 67836309
Bad Packets: 0
Max Packets on S/W Transmit Queue: 374
S/W Transmit Queue Overflow: 0
Current S/W+H/W Transmit Queue Length: 0
Elapsed Time: 0 days 0 hours 0 minutes 0 seconds
Broadcast Packets: 1077222
Multicast Packets: 3194318
No Carrier Sense: 0
DMA Underrun: 0
Lost CTS Errors: 0
Max Collision Errors: 0
47
BUFFERS
Virtual Trunk Statistics
Receive Information
Receive Buffers
Buffer Type
Min Buffers
Max Buffers
Allocated
Registered
History
Max Allocated
Lowest Registered
532
502
Tiny
512
2048
513
511
Small
512
2048
2042
506
Medium
128
256
128
128
Large
24
64
24
24
2048
354
128
128
24
24
24
24
Huge
24
64
24
24
48
24
7/22/2014
NMON
MONITORING
49
AIX v6.1 TL08, VIOS 2.2.2.0, AIX v7.1 TL02, IBM I 7.1 PTF MF56058, RHEL 7, SLES
12
Earlier systems will work but will need a reboot after running DPO
http://www.redbooks.ibm.com/redbooks/pdfs/sg247590.pdf
50
25
7/22/2014
P7 or P7+
AIX v6.1 TL08 SP1 or AIX v7.1 TL02 SP1
Cannot be using AMS (Active memory sharing)
Chargeable feature via an enablement fileset
http://www.redbooks.ibm.com/redbooks/pdfs/sg247590.pdf
Whitepaper at:
http://www-01.ibm.com/common/ssi/cgibin/ssialias?infotype=SA&subtype=WH&htmlfid=POW03093USEN
51
topas C
Most important value is app available pool processors
This represents the current number of free physical cores in the pool
nmon option p for pool monitoring
To the right of PoolCPUs there is an unused column which is the number of free
pool cores
nmon analyser LPAR Tab
lparstat
Shows the app column and poolsize
52
26
04:29
04:29
04:28
04:27
04:26
04:26
04:25
04:24
04:23
04:23
04:22
04:21
04:20
04:20
04:19
04:18
OtherLPARs
04:17
04:17
04:16
04:15
PhysicalCPU
04:14
04:14
04:13
04:12
04:11
04:11
04:10
04:09
04:08
04:08
04:07
04:06
04:05
04:05
04:04
04:03
NMON
04:02
04:02
04:01
04:00
7/22/2014
35
PoolIdle
30
25
20
15
10
53
54
27
7/22/2014
FBO
Provides ISO image access to LPARs connected to the VIO
Uses vSCSI
Add third disk to one of the VIO servers and put in its own VG
Use that VG as the repository
Rip DVDs to ISO images (or download the ISOs)
Can also put mksysb images in the repository
55
USING FBO
So using hdisk2 as my third disk I create a VG:
mkvg y datavg hdisk2
mkrep sp datavg size 10G
This can be increased later and creates:
/dev/VMLibrary 10.00
5.85 42%
7
1% /var/vio/VMLibrary
We can now load images into the repository from an ISO we have on NFS:
mkvopt -name rhelboot64 -file /usr/local/nfsmnt/rhel-64bit-basedvd.iso
This takes the ISO and copies it as follows:
$ ls -l /var/vio/VMLibrary
-rw------- 1 root staff 237981696 Jul 25 13:14 rhelboot64
To list the repository:
$ lsrep
Size(mb) Free(mb) Parent Pool
10199
Name
rhelboot64
5989
datavg
Parent Size
279552
Parent Free
157696
Access
rw
56
28
7/22/2014
USING FBO
In order to use the repository, the virtual host has to be assigned a virtual SCSI FBO adapter; do so
with:
mkvdev -fbo -vadapter vhost0
vtopt0 available
Multiple vtopt devices for a client can be created, if desired.
Now make the image available to vtopt0:
loadopt -vtd vtopt0 -disk rhelboot64
loadopt -disk p8aix71base1 -vtd vtopt0
This example would mount rhelboot64 or p8aix71base1 as if it were a local CD. To unmount it, use:
unloadopt -vtd vtopt0
Also, rmvopt removes an image; rmrep removes a repository; and lsvopt lists the state of the vtopts:
$ lsvopt
VTD
vtopt0
Media
No Media
Size(mb)
n/a
57
Parent Size
1089024
Parent Free
577024
Access
rw
rw
rw
rw
rw
rw
rw
rw
Media
No Media
No Media
No Media
No Media
Size(mb)
n/a
n/a
n/a
n/a
58
29
7/22/2014
59
OTHER
60
30
7/22/2014
USEFUL COMMANDS
Command History
$ fc -l
725 lsrep
726 backupios -file /usr/local/backups/b750viobkp
727 exit
728 lsmap -vadapter vhost0
729 fc l
Global command log
$ lsgcl | grep "Aug 9 2013"
Aug 9 2013, 08:25:35 root ioslevel
Aug 9 2013, 08:59:22 padmin license
Aug 9 2013, 09:00:29 padmin lsmap -vadapter vhost0
Aug 9 2013, 09:01:29 padmin lsgcl
Redirecting output when running as padmin
lsmap all npiv | tee npivdata.txt
61
USEFUL COMMANDS
vSCSI Commands
mkvdev -vdev hdisk2 -vadapter vhost0
mkvdev fbo vadapter vhost0
NPIV
Setup NPIV mappings
vfcmap vadapter vfchost0 fcp fcs0
lsmap npiv all
lsmap vadapter vfchost0 npiv
lsdev virtual
lsnports
lsdev slots
lscfg vpl vfchost0
62
31
7/22/2014
USEFUL COMMANDS
$ lsdev -virtual
name
status
description
ent5
Available Virtual I/O Ethernet Adapter (l-lan)
ent6
Available Virtual I/O Ethernet Adapter (l-lan)
ent7
Available Virtual I/O Ethernet Adapter (l-lan)
vasi0
Available Virtual Asynchronous Services Interface (VASI)
vbsd0
Available Virtual Block Storage Device (VBSD)
vfchost0
Available Virtual FC Server Adapter
vfchost1
Available Virtual FC Server Adapter
vfchost2
Available Virtual FC Server Adapter
vfchost3
Available Virtual FC Server Adapter
vhost0
Available Virtual SCSI Server Adapter
vhost1
Available Virtual SCSI Server Adapter
vhost2
Available Virtual SCSI Server Adapter
vhost3
Available Virtual SCSI Server Adapter
vsa0
Available LPAR Virtual Serial Adapter
b740ios1_rv1 Available Virtual Target Device - Logical Volume
b740l1_rv1
Available Virtual Target Device - Logical Volume
vtopt0
Available Virtual Target Device - File-backed Optical
vtopt1
Available Virtual Target Device - File-backed Optical
vtopt2
Available Virtual Target Device - File-backed Optical
vtopt3
Available Virtual Target Device - File-backed Optical
vtscsi0
Available Virtual Target Device - Disk
vtscsi1
Available Virtual Target Device - Disk
vtscsi2
Available Virtual Target Device - Disk
vtscsi3
Available Virtual Target Device - Disk
ent8
Available Shared Ethernet Adapter
63
USEFUL COMMANDS
b740l1_rv1
Available
0x8300000000000000
lv_b740l1
VTD
Status
LUN
Backing device
Physloc
Mirrored
vtopt0
Available
0x8200000000000000
VTD
Status
LUN
Backing device
Physloc
Mirrored
vtopt1
Available
0x8100000000000000
N/A
N/A
N/A
64
32
7/22/2014
USEFUL COMMANDS
65
USEFUL COMMANDS
$ lsnports
name
physloc
fcs0
U78AA.001.WZSG8PD-P1-C5-T1
$ lsdev -slots
# Slot
Description
Device(s)
HEA 1
Logical I/O Slot lhea0 ent0
U8205.E6B.10934CP-V1-C0
Virtual I/O Slot vsa0
U8205.E6B.10934CP-V1-C11 Virtual I/O Slot ent5
U8205.E6B.10934CP-V1-C12 Virtual I/O Slot ent6
U8205.E6B.10934CP-V1-C13 Virtual I/O Slot ent7
U8205.E6B.10934CP-V1-C21 Virtual I/O Slot vhost0
U8205.E6B.10934CP-V1-C22 Virtual I/O Slot vhost1
U8205.E6B.10934CP-V1-C23 Virtual I/O Slot vhost2
U8205.E6B.10934CP-V1-C31 Virtual I/O Slot vfchost0
U8205.E6B.10934CP-V1-C32 Virtual I/O Slot vfchost1
U8205.E6B.10934CP-V1-C33 Virtual I/O Slot vfchost2
U8205.E6B.10934CP-V1-C32769 Virtual I/O Slot vasi0
U8205.E6B.10934CP-V1-C32773 Virtual I/O Slot vasi1
U8205.E6B.10934CP-V1-C32774 Virtual I/O Slot vasi2
U8205.E6B.10934CP-V1-C32775 Virtual I/O Slot vasi3
U8205.E6B.10934CP-V1-C32776 Virtual I/O Slot vasi4
66
33
7/22/2014
monhmc r mem n 0
monhmc r proc n 0
monhmc r swap n 0
monhmc r disk n 0
chhmcfs r disk n 0
lshmcfs
hmcshutdown r t now
Reboot HMC
67
hscroot@bpichmc:~>lshmcfs
filesystem=/var,filesystem_size=8063,filesystem_avail=6390,temp_files_start_time=07/14/2014 13:11:00,temp_files_size=783
filesystem=/dump,filesystem_size=120951,filesystem_avail=114495,temp_files_start_time=07/14/2014 21:09:00,temp_files_size=0
filesystem=/extra,filesystem_size=20158,filesystem_avail=18940,temp_files_start_time=none,temp_files_size=0
filesystem=/,filesystem_size=15743,filesystem_avail=8009,temp_files_start_time=07/22/2014 23:18:00,temp_files_size=0
68
34
7/22/2014
69
70
35
7/22/2014
hscroot@bpichmc:~>lshmc -v
"vpd=*FC ????????
*VC 20.0
*N2 Wed Jul 23 04:45:57 UTC 2014
*FC ????????
*DS Hardware Management Console
*TM 7042-CR6
*SE 102EEEC
*MN IBM
*PN 0B20PT
*SZ 4140253184
*OS Embedded Operating Systems
*NA 10.250.134.20
*FC ????????
*DS Platform Firmware
*RM V8R8.1.0.0
"
71
72
36
7/22/2014
during upgrade
73
HMC SCANNER
System summary: name, serial number, cores, memory, service processor IP for each server
LPAR Summary: list of all LPAR by serve with status, environment, version, processor mode
Physical Slots: list of all slots of each system with LPAR assignment, description, physical location and drc_index
Virtual Ethernet: network configuration of each virtual switch and each LPAR
Virtual SCSI: configuration of all virtual SCSI adapters, both client and server
VSCSI Map: devices mapped by each VIOS to partitions
Virtual Fibre: virtual fibre channel configuration of client and server with identification of physical adapter assigned
SEA: SEA configuration and statistics for all VIOS
SW Cores: LPAR and virtual processor pool configuration matrix to compute the number of software licenses. Simulation of
alternative scenarios is possible.
CPU Pool Usage: monthly average history of CPU usage of each system. Based on last 12 months of lslparutil data.
Sys RAM Usage: monthly average history of physical memory assignement to each LPAR. Based on last 12 months of lslparutil
data.
LPAR CPU Usage:monthly average history of CPU usage of each LPAR. Based on last 12 months of lslparutil data.
CPU Pool Daily Usage: 1 year of CPU usage of every pool and subpools of each system. Based on daily averages.
LPAR Daily Usage: 1 year of CPU usage of every LPAR of each system. Based on daily averages.
CPU Pool HourlyUsage: 2 months of CPU usage of every pool and subpools of each system. Based on hourly averages.
LPAR Hourly Usage: 2 months of CPU usage of every LPAR of each system. Based on hourly averages.
74
37
7/22/2014
VIOS ADVISOR
https://www.ibm.com/developerworks/community/wikis/home/wiki/Power%20Systems/page/VIOS%20Advisor
76
38
7/22/2014
VIOS ADVISOR
$ part -i 10
part: Reports are successfully generated in b740vio2_140722_19_46_12.tar
$oem_setup_env
#tar tvf b740vio2_ .tar
# pwd
/home/padmin/jaqui/b740vio2_140722_19_46_12
# ls -l
total 704
-rw-r--r-- 1 root staff
249619 Jul 22 19:56 b740vio2_140722_1946.nmon
drwxr-xr-x 2 padmin staff
4096 Jul 22 19:56 images
-r--r--r-- 1 padmin staff
8122 Jul 22 19:56 popup.js
-r--r--r-- 1 padmin staff
6971 Jul 22 19:56 style.css
-r--r--r-- 1 padmin staff
16869 Jul 22 19:56 vios_advisor.xsl
-rw-r--r-- 1 padmin staff
32677 Jul 22 19:56 vios_advisor_report.xml
-r--r--r-- 1 padmin staff
29156 Jul 22 19:56 vios_advisorv2.xsl
Can also process nmon files:
$ part -f b750vio1_140721_2359.nmon
part: Reports are successfully generated in b750vio1_140721_2359.tar
# tar -tvf b750vio1_140721_2359.tar
-r--r--r-- 10 1 16869 Jul 22 19:48:10 2014 ./b750vio1_140721_2359/vios_advisor.xsl
-rw-r--r-- 10 1 36446 Jul 22 19:48:09 2014 ./b750vio1_140721_2359/vios_advisor_report.xml
-r--r--r-- 10 1 29156 Jul 22 19:48:10 2014 ./b750vio1_140721_2359/vios_advisorv2.xsl
b750vio1_140721_2359\vios_advisor_report.xml
77
VIOS ADVISOR
78
39
7/22/2014
VIOS ADVISOR
79
VIOS ADVISOR
80
40
7/22/2014
VIOS ADVISOR
81
VIOS ADVISOR
82
41
7/22/2014
VIOS ADVISOR
83
VIOS ADVISOR
84
42
7/22/2014
SSP LIMITATIONS
http://www14.software.ibm.com/webapp/set2/sas/f/vios/vios2233.readme.html
Limitations for Shared Storage Pool
Software Installation
All VIOS nodes must be at version 2.2.1.3 or later.
When installing updates for VIOS Update Release 2.2.3.3 participating in a Shared Storage Pool, the Shared Storage Pool
Services must be stopped on the node being upgraded.
SSP Configuration
Feature
Min
Max
16
1024
8192
200
10GB
16TB
10GB
512TB
1GB
4TB
512MB
1016GB
85
REFERENCES
86
43
7/22/2014
USEFUL LINKS
Charlie Cler Articles
http://www.ibmsystemsmag.com/authors/Charlie-Cler/
Movie replays
http://www.circle4.com/movies
87
USEFUL LINKS
Nigel Griffiths
AIXpert Blog
https://www.ibm.com/developerworks/mydeveloperworks/blogs/aixpert/?lang=en
Youtube channel
https://www.ibm.com/developerworks/mydeveloperworks/blogs/aixpert/entry/size_with_rperf_if_you_must_but_don_t_fo
rget_the_assumptions98?lang=en
http://www.youtube.com/user/nigelargriffiths
AIX Wiki
https://www.ibm.com/developerworks/wikis/display/WikiPtype/AIX
HMC Scanner
http://www.ibm.com/developerworks/wikis/display/WikiPtype/HMC+Scanner
Workload Estimator
http://ibm.com/systems/support/tools/estimator
http://www.ibm.com/developerworks/wikis/display/WikiPtype/Performance+Monitoring+Tools
Performance Monitoring
https://www.ibm.com/developerworks/wikis/display/WikiPtype/Performance+Monitoring+Documentation
https://www.ibm.com/developerworks/wikis/display/WikiPtype/Other+Performance+Tools
Includes new advisors for Java, VIOS, Virtualization
VIOS Advisor
https://www.ibm.com/developerworks/wikis/display/WikiPtype/Other+Performance+Tools#OtherPerformanceTools-VIOSPA
Capturing Debug output for padmin
http://www-01.ibm.com/support/docview.wss?uid=isg3T1012362
88
44
7/22/2014
REFERENCES
Gareth Coates Tricks of the POWER Masters
http://public.dhe.ibm.com/systems/power/community/aix/PowerVM_webinars/30_Tricks
_of_the_Power_Masters.pdf
Redbook Tip on Maximizing the Value of P7 and P7+ through Tuning and
Optimization
http://www.redbooks.ibm.com/technotes/tips0956.pdf
89
Redbook Tip on Maximizing the Value of P7 and P7+ through Tuning and
Optimization
http://www.redbooks.ibm.com/technotes/tips0956.pdf
90
45
7/22/2014
91
46