NB Troubleshooting Guide
NB Troubleshooting Guide
NB Troubleshooting Guide
Release 6.5.6
20654032
Legal Notice
Copyright 2010 Symantec Corporation. All rights reserved. Symantec and the Symantec Logo are trademarks or registered trademarks of Symantec Corporation or its affiliates in the U.S. and other countries. Other names may be trademarks of their respective owners. This Symantec product may contain third party software for which Symantec is required to provide attribution to the third party (Third Party Programs). Some of the Third Party Programs are available under open source or free software licenses. The License Agreement accompanying the Software does not alter any rights or obligations you may have under those open source or free software licenses. Please see the Third Party Legal Notice Appendix to this Documentation or TPIP ReadMe File accompanying this Symantec product for more information on the Third Party Programs. The product described in this document is distributed under licenses restricting its use, copying, distribution, and decompilation/reverse engineering. No part of this document may be reproduced in any form by any means without prior written authorization of Symantec Corporation and its licensors, if any. THE DOCUMENTATION IS PROVIDED "AS IS" AND ALL EXPRESS OR IMPLIED CONDITIONS, REPRESENTATIONS AND WARRANTIES, INCLUDING ANY IMPLIED WARRANTY OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE OR NON-INFRINGEMENT, ARE DISCLAIMED, EXCEPT TO THE EXTENT THAT SUCH DISCLAIMERS ARE HELD TO BE LEGALLY INVALID. SYMANTEC CORPORATION SHALL NOT BE LIABLE FOR INCIDENTAL OR CONSEQUENTIAL DAMAGES IN CONNECTION WITH THE FURNISHING, PERFORMANCE, OR USE OF THIS DOCUMENTATION. THE INFORMATION CONTAINED IN THIS DOCUMENTATION IS SUBJECT TO CHANGE WITHOUT NOTICE. The Licensed Software and Documentation are deemed to be commercial computer software as defined in FAR 12.212 and subject to restricted rights as defined in FAR Section 52.227-19 "Commercial Computer Software - Restricted Rights" and DFARS 227.7202, "Rights in Commercial Computer Software or Commercial Computer Software Documentation", as applicable, and any successor regulations. Any use, modification, reproduction release, performance, display or disclosure of the Licensed Software and Documentation by the U.S. Government shall be solely in accordance with the terms of this Agreement.
Symantec Corporation 350 Ellis Street Mountain View, CA 94043 http://www.symantec.com Printed in the United States of America. 10 9 8 7 6 5 4 3 2 1
Technical Support
Symantec Technical Support maintains support centers globally. Technical Supports primary role is to respond to specific queries about product features and functionality. The Technical Support group also creates content for our online Knowledge Base. The Technical Support group works collaboratively with the other functional areas within Symantec to answer your questions in a timely fashion. For example, the Technical Support group works with Product Engineering and Symantec Security Response to provide alerting services and virus definition updates. Symantecs support offerings include the following:
A range of support options that give you the flexibility to select the right amount of service for any size organization Telephone and/or web-based support that provides rapid response and up-to-the-minute information Upgrade assurance that delivers automatic software upgrades protection Global support purchased on a regional business hours or 24 hours a day, 7 days a week basis Premium service offerings that include Account Management Services
For information about Symantecs support offerings, you can visit our web site at the following URL: www.symantec.com/business/support/ All support services will be delivered in accordance with your support agreement and the then-current enterprise technical support policy.
Hardware information Available memory, disk space, and NIC information Operating system Version and patch level Network topology Router, gateway, and IP address information Problem description:
Error messages and log files Troubleshooting that was performed before contacting Symantec Recent software configuration changes and network changes
Customer service
Customer service information is available at the following URL: www.symantec.com/business/support/ Customer Service is available to assist with non-technical questions, such as the following types of issues:
Questions regarding product licensing or serialization Product registration updates, such as address or name changes General product information (features, language availability, local dealers) Latest information about product updates and upgrades Information about upgrade assurance and support contracts Information about the Symantec Buying Programs Advice about Symantec's technical support options Nontechnical presales questions Issues that are related to CD-ROMs or manuals
Consulting Services
Education Services
To access more information about enterprise services, please visit our web site at the following URL: www.symantec.com/business/services/ Select your country or language from the site index.
Contents
Chapter 2
Contents
Required NetBackup daemon or service not available .................... New network interface cards .......................................................... Backup performance and NIC cards ................................................. SERVER entries in bp.conf file ........................................................ Resolving unavailable storage units problems ................................... Troubleshooting NetBackup in a SAN environment ............................ NetBackup enterprise lifecycle: best practices ............................. Using CommandCentral Storage to troubleshoot NetBackup ...................................................................... Common NetBackup troubleshooting use cases ............................
71 72 72 73 74 74 75 76 77
Chapter 3
Contents
Query string overview ................................................................. Data types for fields .............................................................. String constants ................................................................... Query string examples ...........................................................
Chapter 4
Chapter 5
Chapter 6
Media and device management status codes and messages ....................................................................... 423
About media and device management status codes and messages ............................................................................ Using debug logs .................................................................. Media Manager status codes ................................................... Device configuration status codes ............................................ 423 423 424 482
10
Contents
Device management status codes ............................................ Robotic status codes .............................................................. Robotic error codes ............................................................... Media and device management messages ........................................
Chapter 7
Contents
11
Replacing a failed node on a UNIX or Linux cluster ..................... Recovering the shared disk on a UNIX or Linux cluster ................ Recovering the entire UNIX or Linux cluster .............................. Clustered NBU server recovery for Windows ................................... Replacing a failed node on a Windows VCS cluster ...................... Recovering the shared disk on a Windows VCS cluster ................ Recovering the entire Windows VCS cluster ..............................
Appendix A
Appendix B
12
Contents
Appendix C Appendix D
Chapter
Introduction
This chapter includes the following topics:
About Troubleshooting Define the problem Record all information Troubleshooting the problem About gathering information for problem reports
About Troubleshooting
This chapter explains the steps to take if you encounter a problem while you use NetBackup. Other chapters provide more specific information. Note: The term "media server", as distinct from "master server" or "server", may or may not apply to the NetBackup server product. It depends on the context. When you troubleshoot a server installation, be aware that only one host exists: the master and the media server are one and the same. You can ignore references to a media server on a different host.
14
Error messages are usually the vehicle for telling you something went wrong. So the first thing to do is to look for an error message. If you dont see an error message in an interface, but still suspect a problem, check the reports and logs. NetBackup provides extensive reporting and logging facilities. These can provide an error message that points you directly to a solution. The logs also show you what went right and the NetBackup operation that was ongoing when the problem occurred. For example, a restore operation needs media to be mounted, but the required media is currently in use for another backup. The log information that NetBackup provides is available: See About using logs and reports on page 81. The following chapters provide interpretations of NetBackup (and Media Manager) status codes and messages.
What operation was tried? What method did you use? For example, more than one way exists to install software on a client. Also more than one possible interface exists to use for many operations. Some operations can be performed with a script. What type of server platform and operating system was involved? If your site uses both the master and the media servers, was it a master or a media server? If a client was involved, what type of client was it? Have you performed the operation successfully in the past? If so, what is different now? What is the service pack level? Do you use operating system software with the latest fixes supplied, especially those required for use with NetBackup? Is your device firmware at a level, or higher than the level, at which it has been tested according to the posted device compatibility lists?
15
NetBackup progress logs NetBackup Reports NetBackup Utility Reports NetBackup debug logs Media and Device Management debug logs On UNIX NetBackup servers, check for error or status messages in the system log or standard output. Error or status messages in dialog boxes On Windows, NetBackup servers, check for error or status information in the Event Viewer Application and System log.
Record this information for each try. A benefit of this approach is that you can compare the results of multiple tries. A record of tries is also useful for others at your site and for customer support in the event that you cannot solve the problem. Explanations of the various logs are available. See About using logs and reports on page 81. On UNIX systems, the /usr/openv/netbackup/bin/goodies/support script creates a file containing data necessary for customer support to debug any problems you encounter. For more details, consult the usage information of the script by using support -h. If your troubleshooting try is unsuccessful, customer support can provide further assistance. Before you call, have the following information ready:
Product and its release level. Server hardware type and operating system level. Client hardware type and operating system level, if a client is involved. Storage units being used, if it is possible that storage units are involved. If it looks like a device problem, be ready to supply the following device information: The types of robots and drives and their version levels along with Media and Device Management and system configuration information. Software patches to the products that were installed.
16
What is the definition of the problem as described earlier in this chapter? Copies of logs or core dumps (if any) can also be required. Have you had this problem before? If so, was there a successful resolution and what did you try that time? Has the configuration recently changed? If so, what changed? If necessary, can you communicate with technical support through ftp, email, or fax? These are useful for when you send information such as copies of logs.
The following topic contains the information you need and also provides methods for gathering information. See About gathering information for problem reports on page 16.
When you have a status code or message, the recommended corrective action is available. See About NetBackup status codes and messages on page 155. See About media and device management status codes and messages on page 423. If no status code or message exists, or the actions in the status code chapters do not solve the problem, check the following topic: See About troubleshooting procedures on page 21. Those procedures describe an effective approach for isolating common problems.
If you dont find the solution, contact customer support. The Symantec Technical Support site has a wealth of information that can help you solve NetBackup problems. For comprehensive troubleshooting details, refer to the following URL: http://entsupport.symantec.com
17
General information
Date: _________________________ Table 1-1 Servers (master_and_media) OS Levels Product version and patch levels
Table 1-2
Table 1-3
Devices Firmware levels Tested firmware level that is listed in the NetBackup hardware compatibility list
What were you doing when the problem occurred? (for example, a backup on a Windows client) ______________________________________________________________________ ______________________________________________________________________ What were the error indications? (for example, status code, error dialog box) ______________________________________________________________________ ______________________________________________________________________ ______________________________________________________________________ Did this problem occur during or shortly after any of the following: _____ Initial installation _____ Configuration change (explain)
18
_____ System change or problem (explain) _____ Have you observed the problem before? (If so, what did you do that time?) Logs or other failure data you have saved: _____ All log entries report _____ Media and Device Management debug logs _____ NetBackup debug logs _____ System logs (UNIX) _____ NetBackup Configuration Validation Utility Output (UNIX) _____ Event Viewer Application and System logs (Windows) Can you communicate with us through any of the following: _____ ftp _____ telnet _____ email _____ fax
The NetBackup-Java administration application startup script, jnbSA, logs data to a log file in /usr/openv/netbackup/logs/user_ops/nbjlogs. At startup, the script tells you which file in this directory it logs to. Normally, this file does not become very large (usually less than 2 KB). Consult the file /usr/openv/java/Debug.properties for the options that can affect the contents of this log file. The NetBackup-Java administration application on Windows logs data to a log file if NetBackup is installed on the computer where the application was started. It logs on install_path\NetBackup\logs\user_ops\nbjlogs. If NetBackup was not installed on this computer, then no log file is created. To produce a log file, modify the last "java.exe" line in the following to redirect output to a file: install_path\java\nbjava.bat.
The /usr/openv/java/get_trace script provides a Java virtual machine stack trace for support to analyze. This stack trace is written to the log file that is associated with the instance of execution (see previous bullet).
19
The /usr/openv/netbackup/bin/goodies/support script creates a file containing data necessary for customer support to debug any problems you encounter. For more details, consult the usage information of the script by using support -h.
If the application does not respond for a long time, it may be hung. However, some operations can take quite a while to complete, especially Activity Monitor and Reports applications. Wait for several minutes before you assume that the operation is hung. If there is no response within several minutes, run /usr/openv/java/get_trace under the account where you started the Java application. This script causes a stack trace to write to the log file. For example, if you started jnbSA from the root account, start /usr/openv/java/get_trace as root. Otherwise, the command runs without error, but fails to add the stack trace to the debug log. This failure occurs because root is the only account that has permission to run the command that dumps the stack trace.
Run /usr/openv/netbackup/bin/goodies/support to get data about your configuration. Run this script after you complete the NetBackup installation and every time you change the NetBackup configuration. Provide the support-script output and log file to Symantec support.
20
Chapter
Troubleshooting procedures
This chapter includes the following topics:
About troubleshooting procedures Preliminary troubleshooting Troubleshooting installation and configuration problems General test and troubleshooting procedures Using the Host Properties window Resolving full disk problems Troubleshooting frozen media Resolving PBX problems New network interface cards Backup performance and NIC cards SERVER entries in bp.conf file Resolving unavailable storage units problems Troubleshooting NetBackup in a SAN environment
22
Note: The Symantec Technical Support site has a wealth of information that can help you solve NetBackup problems. See http://entsupport.symantec.com for comprehensive troubleshooting details. When you perform these procedures, try each step in sequence. If you already performed the action or it does not apply, skip to the next step. If it branches you to another topic, use the solutions that are suggested there. If you still have a problem, go to the next step in the procedure. Also, alter your approach according to your configuration and what you have already tried. Preliminary troubleshooting explains what to check first. It branches off to other procedures as appropriate. Troubleshooting installation and configuration problems apply specifically to installation problems and configuration problems. General test and troubleshooting procedures define general methods for finding server and client problems and should be used last. Note: The term "media server", as distinct from "master server" or "server", does not apply to the NetBackup server product. When you troubleshoot a NetBackup server installation, ignore any references to media server. (This note does not apply to NetBackup Enterprise Server.)
Preliminary troubleshooting
Preliminary troubleshooting explains what to check first and branches to other procedures as appropriate.
23
Ensure that your servers and clients are running supported operating system versions and that any peripherals you use are supported. See the NetBackup release notes and the NetBackup device compatibility lists on www.veritas.com for this information. Use the All Log Entries report and check for NetBackup errors for the appropriate time period. This report can show the context in which the error occurred. Often it provides specific information, which is useful when the status code can result from a variety of problems. If the problem involved a backup or archive, check the Backup Status report. This report gives you the status code. If you find a status code or message in either of these reports, perform the recommended corrective actions. See About NetBackup status codes and messages on page 155. See About media and device management status codes and messages on page 423.
Check the system log on UNIX or the Event Viewer Application and System log on Windows if the following is true: the problem pertains to media or device management and one of the following is true:
NetBackup does not provide a status code. You cannot correct the problem by following the instructions in NetBackup status codes and messages. You cannot correct the problem by following the instructions in media and device management status codes and messages.
These logs can show the context in which the error occurred. The error messages are usually descriptive enough to point you to a problem area.
Read the applicable enabled debug logs and correct any problems you detect. If these logs are not enabled, enable them before you retry the failed operation. See About using logs and reports on page 81.
5 6
If you performed corrective actions, retry the operation. If you did not perform corrective actions or the problem persists, continue with the next step. If you see the problem during a new installation, during an upgrade installation, or after you make changes to an existing configuration, see the following: See Troubleshooting installation and configuration problems on page 27.
24
Ensure that the server and client are operational. If you experienced a server or a client disk crash, procedures are available on how to recover the files that are critical to NetBackup operation. See About disaster recovery on page 571. Verify that you have enough space available in the disk partitions that NetBackup uses. If one or more of these partitions is full, NetBackup processes that access the full partition fail. The resulting error message depends on the process. Possible error messages: "unable to access" or "unable to create or open a file." On UNIX systems, use the df command to view disk partition information. On Windows systems, use Disk Manager or Explorer. Check the following disk partitions:
The partition where NetBackup software is installed. On the NetBackup master or media server, the partition where the NetBackup databases reside. The partition where the NetBackup processes write temporary files. The partition where NetBackup logs are stored. The partition where the operating system is installed.
Enable verbose logging either for everything or only for areas you think are related to the problem. See About using logs and reports on page 81.
Determine which daemons or processes are running. Follow the procedures for UNIX or Windows NetBackup servers.
If the master server is also the EMM server, ensure that the nbemm and the nbrb services are running. If these services are not running, start them by entering the following:
/usr/openv/netbackup/bin/nbemm /usr/openv/netbackup/bin/nbrb
If both nbemm and nbrb are not running, they must be started in this sequence. If only one is not running, start it by using the appropriate command.
25
The nbpem and the nbjm services must be running on the master server. If these services are not running, start them by entering the following:
/usr/openv/netbackup/bin/nbjm /usr/openv/netbackup/bin/nbpem
If both nbjm and nbpem are not running, they must be started in this sequence. If only one is not running, start it by using the appropriate command.
If either the NetBackup request daemon (bprd) or database manager daemon (bpdbm) is not running, run this command to start them:
/usr/openv/netbackup/bin/initbprd
If any of the following media and device management processes are not running:
ltid (ltid only needs to be running if drives are configured on the server) vmd (volume) avrd (automatic volume recognition), only if drives are configured on the
server
To verify that the ltid, avrd, and robotic control daemons are stopped, run:
/usr/openv/volmgr/bin/vmps
If you use ACS robotic control, the acsssi and the acssel processes continue to run when ltid is stopped. For more information about how to stop these daemons, refer to the following: See the Automated Cartridge System (ACS) chapter in the NetBackup Device Configuration Guide.
Stop any robot control daemons that continue to run when ltid is terminated. Then, start all daemons by running:
/usr/openv/volmgr/bin/ltid
26
The following services must be running. If these services are not running, start them by using the NetBackup Activity Monitor or the Services application in the Windows Control Panel: To start all of them, run install_path\NetBackup\bin\bpup.exe. The following services must be running on NetBackup master servers:
NetBackup Request Manager service NetBackup Policy Execution Manager service NetBackup Job Manager service NetBackup Database Manager service NetBackup Device Manager service (if the system has configured devices) NetBackup Volume Manager service NetBackup Client service
If the master server is also the EMM server, the following services must be running:
NetBackup Device Manager service (if the system has configured devices) NetBackup Volume Manager service NetBackup Client service
The following service must be running on NetBackup clients (including NetBackup Remote Administration Consoles):
Use the NetBackup Activity Monitor to see if the following processes are running:
server
Processes for all configured robots. See the NetBackup Administrators Guide for Windows, Volume I.
27
If these processes are not running, stop and restart the NetBackup Device Manager service. Use the NetBackup Activity Monitor or the Services application in the Windows Control Panel.
If you had to start any of the processes or services in the previous steps, retry the operation. If they are running or the problem persists, refer to the following procedure: See General test and troubleshooting procedures on page 33. If you cannot start any of these processes or services, check the appropriate debug logs for NetBackup problems. See About using logs and reports on page 81. When these processes and services start, they continue to run unless you stop them manually or a problem occurs on the system. On Windows systems, we recommend that you add commands for starting them to your startup scripts, so they restart in case you have to reboot.
Can you install the software on the master and the media servers by using the release media? Some reasons for failure can be as follows:
Not logged on as an administrator on a Windows system (you must have permission to install services on the system) Permission denied (ensure that you have permission to use the device and to write the directories and files being installed) Bad media (contact customer support) Defective drive (replace the drive or refer to vendors hardware documentation)
28
Improperly configured drive (refer to the system and the vendor documentation)
Can you install NetBackup client software on the clients? Note: You cannot install PC client software from a UNIX NetBackup server. Do the following:
For an install to a trusting UNIX client, verify the following: The correct client name is in your policy configuration and the correct server name is in the client /.rhosts file. If the install hangs, check for problems with the shell or the environment variables for the root user on the client. The files to check depend on the platform, operating system, and shell you use. For example, your .login on a Sun system runs an stty (such as stty ^erase) before it defines your terminal type. If this action causes the install process to hang, you can modify the .login file to define the terminal before you run the stty. Or, move the client .login to another file until the install is complete.
For an install to a secure UNIX client, check your ftp configuration. For example, you must use a user name and password that the client considers valid.
Is the problem related to general network communications? See Resolving network communication problems on page 40.
Configuration for robotic drive does not specify the robot. Drive is configured as wrong type or density. Incorrect Robotic Drive Number. SCSI ID for the robotic control is specified instead of the logical Robot Number that is assigned to the robot. The same robot number is used for different robots. SCSI ID for the drive is specified instead of a unique Drive Index number.
29
A platform does not support a device or was not configured to recognize it. Robotic device is not configured to use LUN 1, which some robot hardware requires. On UNIX, drive no-rewind device path is specified as a rewind path. On UNIX, tape devices are not configured with "Berkeley style close." This feature is configurable on some platforms. NetBackup requires it. See the NetBackup Device Configuration Guide for more information. On UNIX, tape devices (other than QIC) are not configured as "variable mode." This feature is configurable on some platforms. NetBackup requires it. When this condition exists, you can frequently perform backups but not restores. Further explanation is available. See NetBackup status code: 174 on page 257. Also see the NetBackup Device Configuration Guide. On UNIX, pass-through paths to the tape drives have not been established. Also see the NetBackup Device Configuration Guide.
Daemons or services do not start during reboot (configure system so they start). Wrong daemons or services are started (problems with media server start up scripts). Configuration was changed while daemons or services were running. On Windows, the %SystemRoot%\System32\drivers\etc\services file does not have an entry for vmd, bprd, bpdbm, and bpcd. Also, ensure that the processes have entries for configured robots. A list of these processes is available. See the NetBackup Administrators Guide, Volume I. On UNIX, the /etc/services file (or NIS or DNS) does not have an entry for vmd, bprd, bpdbm, or robotic daemons.
If you found and corrected any configuration problems, retry the operation and check for NetBackup status codes or messages in the following:
Check the All Log Entries report for NetBackup errors for the appropriate time period. This report can show the context in which the error occurred. Often it provides specific information, which is useful when the error can result from a variety of problems. If the problem involved a backup or archive, check the Backup Status report. This report gives you the status code.
30
If you find a status code or message in either of these reports, perform the recommended corrective actions. See About NetBackup status codes and messages on page 155. See About media and device management status codes and messages on page 423.
Check the system log on UNIX or the Event Viewer Application and System log on Windows if the problem pertains to media or device management. One of the following is true:
NetBackup does not provide a status code You cannot correct the problem by following the instructions in the status codes chapters
Check the appropriate enabled debug logs. Correct any problems you detect. If these logs are not enabled, enable them before your next try. See About using logs and reports on page 81.
If you performed corrective actions, retry the operation. If you did not perform corrective actions or the problem persists, go to the next section. See General test and troubleshooting procedures on page 33.
Not licensed for NetBackup server Exceeds a license restriction Has some inherent qualities that make it difficult to auto-configure
The following messages relate to device configuration, along with their explanations and recommended actions: Message: Drive does not support serialization Explanation: The drive does not return its serial number. Note that some manufacturers do not support serial numbers. Although automatic device configuration does not function optimally, the drive can be manually configured and operated without its serial number. Recommended action: Ask the manufacturer for a newer firmware version that returns serial numbers (if available), or manually configure and operate the drive without a serial number. Message: Robot does not support serialization
31
Explanation: The robot does not return its serial number or the serial numbers of the drives that are contained within it. Note that some manufacturers do not support serial numbers. Although automatic device configuration does not function optimally, the robot and drives can be manually configured and operated without serial numbers. Recommended action: Ask the manufacturer for a newer firmware version that returns serial numbers (if available). Or manually configure and operate the robot and drives without serial numbers. Message: Too many drives in robot Explanation: The robotic library has more than two installed drivesthe maximum that is allowed with a NetBackup server license. Recommended action: Remove all but two drives. Message: Too many slots in robot Explanation: The robotic library has more than 30 installed slotsthe maximum that is allowed with a NetBackup server license. Recommended action: If possible, configure the robotic library to have 30 or fewer slots. Only use the robotic libraries that are supported with NetBackup server . Message: No license for this robot type Explanation: NetBackup server does not support the robotic type that is defined for this robot. Recommended action: Define a different robot. Only use the robotic libraries that NetBackup server supports. Message: No license for this drive type Explanation: The drive type that is defined for this drive that the NetBackup server does not support. Recommended action: Define a different drive. Only use the drives that NetBackup supports. Message: Unable to determine robot type Explanation: NetBackup does not recognize the robotic library. The robotic library cannot be auto-configured. Recommended action:
Download a new device_mapping file from the Veritas support Web site, and try again. Configure the robotic library manually.
32
Message: Drive is stand-alone or in unknown robot Explanation: Either the drive is stand-alone, or the drive or robot does not return a serial number. Note that some manufacturers do not support serial numbers. Although automatic device configuration does not function optimally, the drive or robot can be manually configured and operated without a serial number. Recommended action: Ask the manufacturer for a newer firmware version that returns serial numbers (if available), or manually configure and operate the drive robot without serial numbers. Message: Robot drive number is unknown Explanation: Either the drive or robot does not return a serial number. Note that some manufacturers do not support serial numbers. Although automatic device configuration does not function optimally, the drive or robot can be manually configured and operated without a serial number. Recommended action: Ask the manufacturer for a newer firmware version that returns serial numbers (if available). Or manually configure and operate the drive and robot without serial numbers. Message: Drive exceeds drive limit Explanation: The NetBackup server license allows a maximum of two drives and two drives have already been configured. Recommended action: To use this drive, a previously configured drive must be disabled (deleted from the device configuration). Message: Robot exceeds robot limit Explanation: A robotic library has already been configured. Recommended action: To use this robot, a previously configured robot must be disabled (deleted from the device configuration). Message: Drive is in an unlicensed robot Explanation: The drive is in a robotic library that cannot be licensed for NetBackup server . Since the robot cannot be licensed for NetBackup server , any drives that were configured in that robot are unusable. Recommended action: Configure a drive that does not reside in the unlicensed robot. Message: Drive's SCSI adapter does not support pass-thru (or pass-thru path does not exist) Explanation: A drive was found that does not have a SCSI pass-through path configured. Two possible causes for this message are as follows:
33
The drive is connected to an adapter that does not support SCSI pass-through. The pass-through path for this drive has not been defined.
Recommended action: Change the drives adapter or define a pass-through path for the drive. SCSI adapter pass-through information is available. See the NetBackup Device Configuration Guide. Message: No configuration device file exists Explanation: A device has been detected without the corresponding device file necessary to configure that device. Recommended action: Refer to the chapter for your system type in the NetBackup Device Configuration Guide for information on how to create device files. Message: Unable to determine drive type Explanation: The NetBackup server does not recognize the drive. The drive cannot be auto-configured. Recommended action:
Download a new device_mapping file from the Veritas support Web site, and try again. Configure the drive manually. Use only the drives that NetBackup supports.
Message: Unable to determine compression device file Explanation: A drive was detected without the expected compression device file used to configure that device. Automatic device configuration tries to use a device file that supports hardware data compression. When multiple compression device files exist for a drive, automatic device configuration cannot determine which compression device file is best. It uses a non-compression device file instead. Recommended action: If you do not need hardware data compression, no action is necessary. The drive can be operated without hardware data compression. Hardware data compression and tape drive configuration help are available. Refer to the chapter for your system type in the NetBackup Device Configuration Guide.
34
The procedures assume that the software was successfully installed, but not necessarily configured correctly. If NetBackup never worked properly, you probably have configuration problems. Repeat the checks that are mentioned in the Troubleshooting installation and configuration problems procedure when you encounter errors. In particular, look for device configuration problems. You may also want to perform each backup and restore twice. On UNIX, perform them first as a root user and then as a nonroot user. On Windows, perform them first as a user that is a member of the Administrators group. Then perform them as a user that is not a member of the Administrator group. In all cases, ensure that you have read and write permissions on the test files. The explanations in these procedures assume that you are familiar with the functional overview information. See About backup and restore functional overview on page 633.
Enable appropriate debug logs on the master server. See About using logs and reports on page 81. If you do not know which logs apply, enable them all until you solve the problem. Delete the debug log directories when you have resolved the problem.
Configure a test policy (set backup window to be open while you test). Name the master server as the client and a storage unit that is on the master server (preferably a nonrobotic drive). Also, configure a volume in the NetBackup volume pool and insert the volume in the drive. If you dont label the volume by using the bplabel command, NetBackup automatically assigns a previously unused media ID. To verify that the NetBackup daemons or services are running on the master server, do the following:
To check the services on a Windows system, use the NetBackup Activity Monitor or the Services application in the Windows Control Panel.
35
Start a manual backup of a policy by using the manual backup option in the NetBackup administration interface. Then, restore the backup. These actions verify the following:
NetBackup server software is functional, which includes all daemons or services, programs, and databases. NetBackup can mount the media and use the drive you configured.
If a failure occurs, first check the NetBackup All Log Entries report. For the failures that relate to drives or media, verify that the drive is in an UP state and that the hardware functions. To isolate the problem further, use the debug logs. A functional overview sequence of events is available. See About backup and restore functional overview on page 633. If the debug logs do not reveal the problem, check the following:
Systems Logs or Event Viewer System logs Event Viewer Application and System logs on Windows systems
vmd debug logs on the EMM database host for the device bptm debug logs
See the vendor manuals for information on hardware failures. If you use a robot and the configuration is an initial configuration, verify that the robotic drive is configured correctly. In particular, verify the following:
The same robot number is used both in the Media and Device Management and storage unit configurations. Each robot has a unique robot number.
On a UNIX NetBackup server, you can verify only the Media and Device Management part of the configuration. To verify, you use the tpreq command to request a media mount. Verify that the mount completes and check the drive on which the media was mounted. Repeat the process until the media is mounted and unmounted on each drive from the host where the problem occurred. If this works, the problem is probably with the policy or the storage unit configuration. When you are done, tpunmount the media.
36
If you previously configured a nonrobotic drive and your system includes a robot, change your test policy now to specify a robot. Add a volume to the robot. The volume must be in the NetBackup volume pool on the EMM database host for the robot. Start with step 3 to repeat this procedure for the robot. This procedure verifies that NetBackup can find the volume, mount it, and use the robotic drive. If you have difficulties with the robot, try the test utilities. See About robotic test utilities on page 709. Do not use the Robotic Test Utilities when backups or restores are active. These utilities prevent the corresponding robotic processes from performing robotic actions, such as loading and unloading media. The result is that it can cause media mount timeouts and prevent other robotic operations like robotic inventory and inject or eject from working.
Add a user schedule to your test policy (the backup window must be open while you test). Use a storage unit and media that was verified in previous steps. Start a user backup and restore of a file by using the client-user interface on the master server. Monitor the status and the progress log for the operation. If successful, this operation verifies that the client software is functional on the master server. If a failure occurs, check the NetBackup All Log Entries report. To isolate the problem further, check the appropriate debug logs from the following list. Explanations about which logs apply to specific client software are available. See About using logs and reports on page 81. These logs exist only if you enabled debug logging in step 1. On a UNIX system, the debug logs are in the /usr/openv/netbackup/logs/ directory. On a Windows system, the debug logs are in the install_path\NetBackup\logs\ directory. Debug log directories exist for the following processes:
bparchive (UNIX only) bpbackup (UNIX only) bpbkar bpcd bplist bprd
37
Reconfigure your test policy to name a client that is located elsewhere in the network. Use a storage unit and media that has been verified in previous steps. If necessary, install the NetBackup client software. Create debug log directories for the following processes:
bprd on the server bpcd on the client bpbkar on the client nbwin on the client (Windows only) bpbackup on the client (except Windows clients) bpinetd (Windows only)
Explanations about which logs apply to specific client types are available. See About using logs and reports on page 81.
10 Perform a user backup and then a restore from the client that is specified in
step 8. These actions verify the following:
Communications between the client and the master server NetBackup software on the client
All Log Entries report The debug logs that you created in the previous step
A likely cause for errors is a communications problem between the server and the client.
11 When the test policy operates satisfactorily, repeat specific steps as necessary
to verify other clients and storage units.
12 When all clients and storage units are functional, test the remaining policies
and schedules that use storage units on the master server. If a scheduled backup fails, check the All Log Entries report for errors. Then follow the suggested actions in the status codes chapters.
38
Enable appropriate legacy debug logs on the servers See About using logs and reports on page 81. If you are uncertain which logs apply, enable them all until you solve the problem. Delete the legacy debug log directories when you have resolved the problem.
Configure a test policy with a user schedule (set the backup window to be open while you test) by doing the following:
Name the media server as the client and a storage unit that is on the media server (preferably a nonrobotic drive). Add a volume on the EMM database host for the devices in the storage unit. Ensure that the volume is in the NetBackup volume pool. Insert the volume in the drive. If you do not pre-label the volume by using the bplabel command, NetBackup automatically assigns a previously unused media ID.
Verify the following: all NetBackup daemons or services are running on the master server and Media and Device Management daemons or services are running on the media server. Do one of the following:
To perform this check on a Windows system, use the Services application in the Windows Control Panel.
Perform a user backup and then a restore of a file. Perform these operations from a client that has been verified to work with the master server. This test verifies the following:
NetBackup media server software NetBackup on the media server can mount the media and use the drive that you configured
39
Communications between the master server processes nbpem, nbjm, nbrb, EMM server process nbemm, and media server processes bpcd and bpbrm Communications between media server process bpbrm and client processes bpcd and bpbkar
For the failures that relate to drives or media, ensure that the drive is in an UP state and the hardware functions. If you suspect a communications problem between the master and the media servers, check the debug logs for the involved processes. If the debug logs dont help you, check the following:
On a UNIX server, the System log On a Windows server, the Event Viewer Application and System log
vmd debug logs
See the vendor manuals for information on hardware failures. If you use a robot and the configuration is an initial configuration, verify that the robotic drive is configured correctly. In particular, verify the following:
The same robot number is used both in the Media and Device Management and storage unit configurations. Each robot has a unique robot number.
On a UNIX server, you can verify only the Media and Device Management part of the configuration. To verify, use the tpreq command to request a media mount. Verify that the mount completes and check the drive on which the media was mounted. Repeat the process until the media is mounted and unmounted on each drive from the host where the problem occurred. Perform these steps from the media server. If this works, the problem is probably with the policy or the storage unit configuration on the media server. When you are done, use tpunmount to unmount the media.
If you previously configured a nonrobotic drive and a robot was attached to your media server, change the test policy to name the robot. Also, add a volume for the robot to the EMM server. Verify that the volume is in the NetBackup volume pool and in the robot. Start with step 3 to repeat this procedure for a robot. This procedure verifies that NetBackup can find the volume, mount it, and use the robotic drive. If a failure occurs, check the NetBackup All Log Entries report. Look for any errors that relate to devices or media. If the All Log Entries report doesnt help, check the following:
40
In an initial configuration, verify that the robotic drive is configured correctly. Do not use a robot number that is already configured on another server. Try the test utilities. See About robotic test utilities on page 709. Do not use the Robotic Test Utilities when backups or restores are active. These utilities prevent the corresponding robotic processes from performing robotic actions, such as loading and unloading media. The result is that it can cause media mount timeouts and prevent other robotic operations like robotic inventory and inject or eject from working.
6 7
When the test policy operates satisfactorily, repeat specific steps as necessary to verify other clients and storage units. When all clients and storage units are in operation, test the remaining policies and schedules that use storage units on the media server. If a scheduled backup fails, check the All Log Entries report for errors. Then follow the suggested actions in the status codes chapters.
UNIX clients
For UNIX clients, perform the following steps. Before you start this procedure, add the VERBOSE option to the /usr/openv/netbackup/bp.conf file. Also, create a bpcd debug log directory on your server and clients and a bprd log directory on the server. During subsequent retries, the debug logs provide detailed debug information, which can help you analyze the problem. To resolve network communication problems with UNIX clients
41
Check any recent modifications to ensure that they did not introduce the problem. Ensure that the client software was installed. Ensure that the client operating system is one of those supported by the client software. Check the client names, server names, and service entries in your NetBackup configuration as explained in the following topic: See Verifying host name and service entries on page 49. Two other checks that you can make on host names are as follows:
Use the hostname command on the client to determine the host name that the client sends with requests to the server. Check the bprd debug log (verbose) on the server to determine what occurred when the server received the request.
42
Pay special attention to NIS or the DNS updates that are required. Failure to update these services properly is a common source of network problems with NetBackup.
Verify network connectivity between client and server by trying to ping the client from the server.
ping clientname
Where clientname is the name of the client as configured in the NetBackup policy configuration, /etc/hosts, and also in NIS and DNS (if applicable). For example, to ping a client that is named ant:
ping ant ant.nul.nul.com: 64 byte packets 64 bytes from 199.199.199.24: icmp_seq=0. time=1. ms ----ant.nul.nul.com PING Statistics---2 packets transmitted, 2 packets received, 0% packet loss round-trip (ms) min/avg/max = 1/1/1
Also, try ping from the client to the server. If ping succeeds in both instances, it verifies connectivity between the server and client. If ping fails, you have a network problem outside of NetBackup that must be resolved before you proceed. Note that some forms of the ping command let you ping the bpcd port on the client as in:
ping ant 13782
or
ping ant bpcd
Check that the client listens on the correct port for bpcd connections by running one of the following commands (depending on platform and operating system).
netstat -a | grep bpcd netstat -a | grep 13782 (or the value that is specified during the install) rpcinfo -p | grep 13782 (or the value that is specified during the install)
For example, assume that the client is a Solaris system and you run:
netstat -a | grep 13782
43
If there is no problem with the port, the results are similar to:
tcp 0 0 *.13782 *.* LISTEN
The LISTEN indicates that the client listens for connections on this port. If there is a problem, this line does not appear and one of the following three conditions exists:
/etc/services (or applicable NIS file) does not have the correct bpcd
13782/tcp
bpcd
/etc/inetd.conf (or applicable NIS or DNS file) does not have the correct bpcd entry. The correct /etc/inetd.conf entry is: bpcd stream tcp nowait root /usr/openv/netbackup/bin/bpcd bpcd
/etc/inetd.conf was changed but was not re-read. Correct this condition
Or
/bin/ps -aux | grep inetd kill -HUP the_inetd_pid
On a Hewlett-Packard platform, use inetd -c to send a SIGHUP to inetd. If the problem is with an AIX client, do the following: use SMIT to verify that the InetServ object policy was updated with information about the bpcd process (/etc/inetd.conf and /etc/services information). If you modify the InetServ object policy by using SMIT, the inetexp command automatically runs. If you edit the InetServ object policy by using an ODM editor, do the following: run the inetexp command to export the InetServ object policy to the /etc/inetd.conf and /etc/services files. This command keeps these files in sync with the InetServ object policy. If you change the /etc/inetd.conf or /etc/services file by using SMIT, the inetimp command automatically updates the InetServ object policy. If you change either file, run the following command to inform the inetd daemon of the changes to its configuration file: refresh -s inetd or kill -1
InetdPID
44
telnet to bpcd on the client. If it succeeds, keep the connection until after
Where clientname is the name of the client as configured in the NetBackup policy configuration, /etc/hosts, and also in NIS and DNS (if applicable). For example,
telnet ant bpcd Trying 199.999.999.24 ... Connected to ant.nul.nul.com. Escape character is ^].
If the telnet succeeds, then inetd on the client is configured correctly. It can pass its connection to bpcd and NetBackup should also be able to establish a connection. If telnet doesnt work, ensure that the inetd.conf file and /etc/services files on both the server and client are correct and match. By default, these are as follows: In /etc/services:
bpcd 13782/tcp bpcd
In /etc/inetd.conf:
bpcd stream tcp nowait root /usr/openv/netbackup/bin/bpcd bpcd
Then, run kill -HUP to reread the /etc/inetd.conf file as explained in step 3. Also, update the applicable NIS or DNS files. If these files are correct and you cannot connect to the client, you may have network routing problems or a problem with the port assignment. (See the next step.)
Check that the client listens on the correct port for the telnet connection to bpcd. To check, run one of the following commands (depending on platform and operating system).
netstat -a | grep bpcd netstat -a | grep 13782 (or the value that is specified during install) rpcinfo -p | grep 13782 (or the value that is specified during install)
45
For example, assume the client in step 4 is a SunOS system that is named ant. The telnet is from a NetBackup server that is named whale:
netstat -a | grep 13782
In the first line of the result, ESTABLISHED indicates that the telnet connection was established to bpcd through port 13782 on the client. The LISTEN in the second line indicates that the client listens for further connections on this port. We suggest that you not change the port number for bpcd or other NetBackup services. Do so only if there is no alternative. Then, remember that all NetBackup servers and clients in the configuration must use this new port assignment.
If there is a process other than bpcd that uses the port, try to reboot the client to clear the problem. If the problem is still not fixed, it may be necessary to change one of the service numbers (preferably for the other service). To change a service number, modify the /etc/services files Then send SIGHUP signals to the inetd processes on your clients.
/bin/ps -ef | grep inetd kill -HUP the_inetd_pid
Or
/bin/ps -aux | grep inetd kill -HUP the_inetd_pid
On a Hewlett-Packard platform, use inetd -c to send a SIGHUP to inetd. Also make applicable NIS or DNS updates.
46
If the problem is with an AIX client and you make changes to the /etc/inetd.conf and /etc/services information, do the following: Use SMIT to verify that the InetServ object policy was updated. See step 4.
To verify client to master server communications, use the bpclntcmd utility. When -pn and -sv run on a NetBackup client, they initiate inquiries to the NetBackup master server (as configured in the client bp.conf file). The master server then returns information to the requesting client. More information is available about bpclntcmd. See Using bpclntcmd on page 53.
PC clients
This procedure helps you resolve network communication problems with PC clients. To resolve network problems
Increase the logging level on the client (see the clients user guide). On the NetBackup server, create a bprd debug log directory and on the clients create a bpcd debug log. On the NetBackup server, set the Verbose level to 1 on the TroubleShooting tab in the NetBackup Client Properties dialog box. To display this dialog box, start the Backup, Archive, and Restore interface. Then click NetBackup Client Properties on the File menu. See Using the Host Properties window on page 61.
If this client is new, verify the client and the server names in your NetBackup configuration. See Verifying host name and service entries on page 49.
Verify basic network connectivity between client and server by pinging from the server to the client and vice versa. Use the following command:
ping hostname
47
If ping succeeds in all instances, it verifies basic connectivity between the server and client. If ping fails, you have a network problem outside of NetBackup that must be resolved before you proceed. As a first step, verify that the workstation is turned on. Not being turned on is a common source of connection problems with PC workstations.
On Microsoft Windows or NetWare clients, check the NetBackup Client service. Do one of the following tasks:
Ensure that the service is active by checking the logs or by doing one of the following:
On Windows XP or Windows Server 2003 clients, use the Services application in the Control Panel to verify that the NetBackup Client service is running. Start it if necessary. On NetWare clients, enter modules bpcd.nlm from the NetWare server console to verify that the NetBackup client daemon is running. If necessary, type bpstart.ncf from the NetWare server console to start the NetBackup client daemon.
Check the bpcd debug logs for problems or errors. Instructions are available on how to enable and use these logs. See About using logs and reports on page 81. Verify that the same NetBackup client Service (bpcd) port number is specified on both the NetBackup client and server (by default, 13782). Do one of the following:
On Microsoft Windows, check the NetBackup Client Service Port number: Start the Backup, Archive, and Restore interface on the client. On the File menu, click NetBackup Client Properties. In the NetBackup Client Properties dialog box on the Network tab, check the NetBackup Client Service Port number. Verify that the setting on the Network tab matches the one in the services file. The services file is located in:
%SystemRoot%\system32\drivers\etc\services (Windows)
The values on the Network tab are written to the services file when the NetBackup Client service starts.
48
On UNIX NetBackup servers, the bpcd port number is in the /etc/services file. On Windows NetBackup servers, see the Client Properties dialog box in the Host Properties window. See Using the Host Properties window on page 61. Correct the port number if necessary. Then, on Windows clients and servers, stop and restart the NetBackup Client service. On NetWare clients, stop and restart the NetBackup client daemon (bpcd).
Do not change NetBackup port assignments unless it is necessary to resolve conflicts with other applications. If you do change them, do so on all NetBackup clients and servers. These numbers must be the same throughout your NetBackup configuration.
Verify that the NetBackup Request Service (bprd) Port number on Microsoft Windows and NetWare clients is the same as on the server (by default, 13720).
On Microsoft Windows clients (use the same method as in step 4). On NetWare clients, see the BPRD setting in the SYS:VERITAS\NBUCLT\NetBack\BP.INI file. Or, instead of the first bullet: On UNIX NetBackup servers, the bprd port number is in the /etc/services file. On Windows NetBackup servers, set these numbers in the Client Properties dialog box in the Host Properties window. See Using the Host Properties window on page 61.
Verify that the hosts file or its equivalent contains the NetBackup server name. The hosts files are the following:
7 8 9
Verify client-to-server connectability by using ping or its equivalent from the client (step 3 verified the server-to-client connection). If the clients TCP/IP transport allows telnet and ftp from the server, try these as additional connectivity checks. For a NetWare client, ensure that the server does not try to connect when a backup or restore is already in progress on the client. If you try more than one job at a time on these clients, it results in a "cant connect" or similar error.
49
12 Verify that the client operating system is one of those supported by the client
software.
See Background for troubleshooting on page 703. The "Rules for using host names in NetBackup" section in the NetBackup Administrators Guide, Volume II
Verify that the correct client and server host names are configured in NetBackup.
On Windows servers, Windows clients, and NetWare nontarget clients, check the following:
The Server to use for backups and restores drop-down list in the Specify NetBackup Machines and Policy Type dialog box. To display these dialog boxes, start the Backup, Archive, and Restore interface on the client. For the General tab, click NetBackup Client Properties on the File menu. For the Server to use for backups and restores drop-down, click Specify NetBackup Machines and Policy Type on the File menu. Do the following:
On the Server to use for backups and restores drop-down list, ensure that a server entry exists for the master server and each media server.
50
On Windows systems, the correct server must be designated as the current master server in the list. If you add or modify server entries on the master server, stop and restart the NetBackup Request service and NetBackup database manager services. On UNIX systems, if you add or modify SERVER entries on the master server, stop and restart bprd and bpdbm.
On the General tab, verify that the client name setting is correct and matches what is in the policy client list on the master server. On a master or a media server, ensure that a server entry exists for each Windows administrative client to use to administer that server. Ensure that host names are spelled correctly in the bp.conf file (UNIX) or in the servers list (Windows) on the master server. If a host name is misspelled or cannot be resolved by using gethostbyname, the following error messages are logged on the NetBackup error log:
Gethostbyname failed for <host_name>:<h_errno_string> (<h_errno>) One or more servers was excluded from the server list because gethostby name() failed.
You can also make these changes on the appropriate tabs in the properties dialog boxes on a Windows NetBackup server See Using the Host Properties window on page 61.
On UNIX NetBackup servers and clients and Macintosh clients, check the server and the client name entries in the bp.conf file by doing the following:
Ensure that a SERVER entry exists for the master server and each media server in the configuration. The master server must be the first name in the list. If you add or modify SERVER entries on the master server, stop and restart bprd and bpdbm before the changes take effect.
The bp.conf of the master server does not require the addition of other clients, other than the master server as CLIENT_NAME = master server name. The name is added by default. The bp.conf file is in the /usr/openv/netbackup directory on UNIX clients and it is in the Preferences:NetBackup folder on Macintosh clients.
Users on UNIX clients can also have a personal bp.conf file in their home directory. A CLIENT_NAME option in $HOME/bp.conf overrides the option in /usr/openv/netbackup/bp.conf.
51
A SERVER entry exists for the master server and each media server in the configuration. The master server must be the first name in the list. The ClientName entry and the entries in the [clients] section are correct and match what is in the policy client list on the master server.
On the master server, verify that you have created any of the following required files:
Verify that each server and client have the required entries for NetBackup reserved port numbers. The examples following this procedure show the default port numbers. Do not change NetBackup port assignments unless it is necessary to resolve conflicts with other applications. If you do change them, do so on all NetBackup clients and servers. These numbers must be the same throughout your NetBackup configuration.
On NetBackup servers, check the services files to ensure that they have entries for the following:
See the NetBackup Administrators Guide, Volume I for a list of these processes. On UNIX, the services file is /etc/services. On Windows, the services file is %SystemRoot%\system32\drivers\etc\services.
On UNIX, Windows, and NetWare clients, verify the NetBackup client daemon or service number, and the request daemon or service port number.
On UNIX clients, check the bprd and the bpcd entries in the /etc/services file.
52
On Microsoft Windows clients, verify that the NetBackup Client Service Port number and NetBackup Request Service Port number match settings in the services file: Start the Backup, Archive, and Restore interface on the client. On the File menu, click NetBackup Client Properties. In the NetBackup Client Properties dialog box on the Network tab, select the following: The NetBackup Client Service Port number and NetBackup Request Service Port number. The values on the Network tab are written to the services file when the NetBackup Client service starts. The services file is in the following location:
%SystemRoot%\system32\drivers\etc\services (Windows)
On NetWare clients, check the BPCD and the BPRD entries in the SYS:VERITAS\NBUCLT\NetBack\BP.INI file.
On UNIX servers and clients, check the /etc/inetd.conf file to ensure that it has the following entry:
bpcd stream tcp nowait root /usr/openv/netbackup/bin/bpcd bpcd
4 5 6
On Windows servers and clients, verify that the NetBackup Client service is running. If you use NIS in your network, update those services to include the NetBackup information that is added to the /etc/services file. NIS, WINS, or DNS host name information must correspond to what is in the policy configuration and the name entries in the following:
On Windows NetBackup servers, Microsoft Windows clients, and NetWare nontarget clients:
Check the General tab: Start the Backup, Archive, and Restore interface on the client. On the File menu, click NetBackup Client Properties. In the NetBackup Client Properties dialog box, click the General tab. Check the Server to use for backups and restores drop-down list: On the File menu, click Specify NetBackup Machines and Policy Type. In the Specify NetBackup Machines and Policy Type dialog box, click the Server to use for backups and restores drop-down list. The bp.conf file on UNIX servers and clients and Macintosh clients. The \veritas\nbuclt\netback\bp.ini file on NetWare clients.
53
To confirm the setup of the following, use the NetBackup bpclntcmd utility: the IP addresses and hostnames in DNS, NIS, and (or) local hosts files on each NetBackup node.
Using bpclntcmd
The bpclntcmd utility resolves IP addresses into host names and host names into IP addresses. It uses the same system calls as the NetBackup application software. The following directory contains the command that starts the utility:
install_path\NetBackup\bin (Windows) /usr/openv/netbackup/bin (UNIX)
On Windows, run this command in an MS-DOS command window so you can see the results. The Bpclntcmd options that are useful for testing the functionality of the host name and IP address resolution are -ip, -hn, -sv, and -pn. The following topics explain each of these options:
bpclntcmd -ip IP_Address
The -ip option lets you specify an IP address. bpclntcmd uses gethostbyaddr() on the NetBackup node and gethostbyaddr() returns the host name with the IP address as defined in the following: the nodes DNS, WINS, NIS, or local hosts file entries. No connection is established with the NetBackup server.
bpclntcmd -hn Hostname
The -hn option specifies a host name. bpclntcmd uses gethostbyname() on the NetBackup node to obtain the IP address that is associated with the host name defined in the following: the nodes DNS, WINS, NIS, or local hosts file entries. No connection is established with the NetBackup server. Use -ip and -hn to verify the ability of a NetBackup node to resolve the IP addresses and host names of other NetBackup nodes. For example, to verify that a NetBackup server can connect to a client, do the following:
On the NetBackup server, use bpclntcmd -hn to verify the following: The operating system can resolve the host name of the NetBackup client (as configured in the client list for the policy) to an IP address. The IP address is then used in the nodes routing tables to route a network message from the NetBackup server.
54
On the NetBackup client, use bpclntcmd -ip to verify the following: The operating system can resolve the IP address of the NetBackup server. (The IP address is in the message that arrives at the clients network interface.)
bpclntcmd -pn
When the -pn option is run on a NetBackup client, it initiates an inquiry to the NetBackup master server. The server then returns information to the requesting client. First, the server is the Current Server in the server list). Then it displays the information that the server returns. The following is an example of the use of bpclntcmd:
bpclntcmd -pn expecting response from server rabbit.friendlyanimals.com dove.friendlyanimals.com dove 123.145.167.3 57141
the master server. The master server obtained this name through gethostbyaddress().
dove is the client name configured in the NetBackup policy client list. 123.145.167.3 is the IP address of the client connection at the master server. 57141 is the port number of the connection on the client.
bpclntcmd -sv
The -sv option displays the NetBackup version number on the master server.
55
Figure 2-1
jupiter
Ethernet Policy Client List jupiter mars usr/openv/netbackup/bp.conf SERVER=jupiter CLIENT_NAME=jupiter usr/openv/netbackup/bp.conf /etc/inetd.conf bpcd ... (see note 1) /etc/services bpcd ... (see note 1) # NetBackup services bpcd 13782/tcp bpcd bprd 13720/tcp bprd bpdbm 13721/tcp bpdbm # Volume Manager services # vmd 13701/tcp vmd tl8cd 13705/tcp tl8cd . . Notes: 1. The complete inetd.conf entry is: bpcd stream tcp nowait root /usr/openv/netbackup/bin/bpcd bpcd 2. All other applicable network configuration must also be updated to reflect the NetBackup information. For example, this information could include the /etc/hosts file and NIS, and DNS (if used). /etc/services # NetBackup services bpcd 13782/tcp bpcd bprd 13720/tcp bprd SERVER=jupiter CLIENT_NAME=mars /etc/inetd.conf
mars
UNIX Client
56
Figure 2-2
UNIX Master Server jupiter
Ethernet Policy Client List jupiter mars saturn usr/openv/netbackup/bp.conf SERVER=jupiter SERVER=saturn CLIENT_NAME=jupiter /etc/inetd.conf bpcd ... (see note 1) /etc/services # NetBackup services bpcd 13782/tcp bpcd bprd 13720/tcp bprd bpdbm 13721/tcp bpdbm # Volume Manager services # vmd 13701/tcp vmd tl8cd 13705/tcp tl8cd odld 13706/tcp odld . . Notes: 1. The complete inetd.conf entry is: bpcd stream tcp nowait root /usr/openv/netbackup/bin/bpcd bpcd 2. All other applicable network configuration must also be updated to reflect the NetBackup information. For example, this information could include the /etc/hosts file and NIS, and DNS (if used). usr/openv/netbackup/bp.conf SERVER=jupiter SERVER=saturn CLIENT_NAME=mars /etc/inetd.conf bpcd ... bpcd (see note 1) /etc/services # NetBackup services bpcd 13782/tcp bpcd bprd 13720/tcp bprd usr/openv/netbackup/bp.conf SERVER=jupiter SERVER=saturn CLIENT_NAME=saturn /etc/inetd.conf bpcd ... bpcd (see note 1) /etc/services # NetBackup services bpcd 13782/tcp bpcd bprd 13720/tcp bprd bpdbm 13721/tcp bpdbm # Volume Manager services # vmd 13701/tcp vmd tl8cd 13705/tcp tl8cd odld 13706/tcp odld . . mars UNIX Client
57
Figure 2-3
UNIX Master Server jupiter
Ethernet
Policy Client List jupiter mars saturn pluto bp.ini usr/openv/netbackup/bp.conf SERVER=jupiter CLIENT_NAME=jupiter /etc/inetd.conf bpcd ... (see note 1) /etc/services # NetBackup services bpcd 13782/tcp bpcd bprd 13720/tcp bprd bpdbm 13721/tcp bpdbm # Volume Manager services # vmd 13701/tcp vmd tl8cd 13705/tcp tl8cd odld 13706/tcp odld . . Notes:
mars
saturn
Windows Client
[bp] ClientName=mars [servers] master=jupiter [clients] browser=jupiter [tcpip] bpcd=13782 bprd=13720 NetBackup Client Properties dialog box Servers Server List: jupiter General Client Name: saturn Network NetBackup Client Service Port 13782 NetBackup Request Service Port 13720
1. The complete inetd.conf entry is: bpcd stream tcp nowait root /usr/openv/netbackup/bin/bpcd bpcd 2. All other applicable network configuration must also be updated to reflect the NetBackup information. For example, this information could include the /etc/hosts file and NIS, and DNS (if used).
58
Figure 2-4
UNIX Master Server jupiter
mars Policy Client List jupiter mars saturn pluto usr/openv/netbackup/bp.conf SERVER=jupiter SERVER=saturn CLIENT_NAME=jupiter usr/openv/netbackup/bp.conf /etc/inetd.conf bpcd ... bpcd (see note 1) /etc/services # NetBackup services bpcd 13782/tcp bpcd bprd 13720/tcp bprd bpdbm 13721/tcp bpdbm # Volume Manager services # vmd 13701/tcp vmd tl8cd 13705/tcp tl8cd odld 13706/tcp odld . . Notes: 1. The complete inetd.conf entry is: bpcd stream tcp nowait root /usr/openv/netbackup/bin/bpcd bpcd 2. All other applicable network configuration must also be updated to reflect the NetBackup information. For example, this information could include the /etc/hosts file and NIS, and DNS (if used). SERVER=jupiter SERVER=saturn CLIENT_NAME=mars /etc/inetd.conf bpcd ... bpcd (see note 1) /etc/services # NetBackup services bpcd 13782/tcp bpcd bprd 13720/tcp bprd usr/openv/netbackup/bp.conf SERVER=jupiter SERVER=saturn CLIENT_NAME=pluto /etc/inetd.conf bpcd ... bpcd (see note 1) /etc/services # NetBackup services bpcd 13782/tcp bpcd bprd 13720/tcp bprd pluto UNIX Client meteor Ethernet UNIX Client
In example 4, we first examine the configuration of the router system. The NetBackup policy client list shows this system as mars because that is the name of the interface to the master server. Other than the client name setting, this setup has no special configuration to note. This name must be set to mars, because mars is the name that the master server recognizes.
59
The second client, pluto, is also configured no differently than if it were in the same network as the master server. If all the standard networking files (hosts, NIS, DNS, WINS, and routing tables) are set up correctly, all the required network connections can be made. However, to restore files from pluto would be a problem in the following situation: the mars, meteor system is a type of router that hides the name of the originating host when it routes requests between the two networks. For example, a router between an Ethernet and a token ring network exhibits this behavior. To illustrate what occurs, assume that pluto is on FDDI (token ring) and the server is on Ethernet. Then a user on pluto starts a restore. The router can use the name of its network interface to pluto (meteor) as the peer name when it forwards the request to the server. The server interprets the request as coming from a host that is named meteor. It does not allow the restore because meteor is not in the client list. To resolve this problem, the administrator creates an altnames directory on the master server and adds a file for meteor to that directory. On a Windows NetBackup server, the file path is:
install_path\netbackup\db\altnames\meteor
The master server now recognizes as legitimate any of the restore requests with a peer name of meteor and client name of pluto. Refer to the NetBackup Administrators Guide, Volume I, for more information on altnames configuration. Regardless of the type of router, the configuration for the media server, saturn, is the same as in example 2. If a media server is involved in a backup or restore for pluto, the master server provides the following: the correct peer name and client name for the media server to use to establish connections.
60
Figure 2-5
UNIX Client mars
jupiter meteor
Policy Client List jupiter mars saturn pluto usr/openv/netbackup/bp.conf usr/openv/netbackup/bp.conf SERVER=jupiter SERVER=meteor SERVER=saturn CLIENT_NAME=mars /etc/inetd.conf bpcd ... bpcd (see note 1) /etc/services # NetBackup services bpcd 13782/tcp bpcd bprd 13720/tcp bprd SERVER=jupiter SERVER=meteor SERVER=saturn CLIENT_NAME=jupiter /etc/inetd.conf bpcd ... (see note 1) /etc/services # NetBackup services bpcd 13782/tcp bpcd bprd 13720/tcp bprd bpdbm 13721/tcp bpdbm # Volume Manager services # vmd 13701/tcp vmd tl8cd 13705/tcp tl8cd odld 13706/tcp odld . . /etc/inetd.conf bpcd ... bpcd (see note 1) /etc/services # NetBackup services bpcd 13782/tcp bpcd bprd 13720/tcp bprd usr/openv/netbackup/bp.conf SERVER=jupiter SERVER=meteor SERVER=saturn CLIENT_NAME=pluto pluto UNIX Client
Notes: 1. The complete inetd.conf entry is: bpcd stream tcp nowait root /usr/openv/netbackup/bin/bpcd bpcd 2. All other applicable network configuration must also be updated to reflect the NetBackup information. For example, this information could include the /etc/hosts file and NIS, and DNS (if used).
61
The NetBackup policy client list specifies jupiter as the client name for the master server. The list can show either jupiter or meteor but not both. Another important item to note is the configuration of the NetBackup server list. The NetBackup server list on the master server has entries for both jupiter and meteor. The reason for both is that when the server does a backup, it uses the name that is associated with the client it backs up. For example, it uses the meteor interface when it backs up pluto and the jupiter interface when it backs up mars. The first server entry (master server name) is jupiter because that is the name used to back up the client on the master server. The NetBackup server list for the other systems also has entries for both the jupiter and the meteor interfaces. This setup is recommended to keep the server entries the same on all clients and servers in the configuration. It would be adequate to list only the master-server name for the local network interface to the client system or media server. (For example, list meteor for pluto.) For the network that is shown, the only configurations that are required are the differences for the policy client list and the server list. If all the standard networking files (hosts, WINS, NIS, DNS, and routing tables) are set up correctly, all required network connections can be made. As in example 4, there would be a problem to restore the files in the following situation: the master server system is a router that hides the originating host name when it routes requests between networks. For example, if pluto were on FDDI (token ring), the master server would use meteor as the peer name when it forwards the request to NetBackup. NetBackup would then interpret the request as coming from a host that is named meteor, which was not in the client list. The restore would fail. The solution, in this case, is also identical to the solution that is discussed in the following: See Figure 2-4 on page 58.
62
for the local system where you are running the interface. Most settings in the NetBackup Client Properties dialog box are also available in the Host Properties window. For more information, see the online Help or the NetBackup Administrators Guide, Volume I. To access configuration settings through Host Properties
1 2 3 4 5
Start the NetBackup Administration Console. Click Host Properties. Select the servers or clients where you want to make the change. On the Actions menu, select Properties. In the properties dialog box, select the appropriate tab and make your change.
The NetBackup Resource Broker (nbrb) log may have database connection errors in it. These errors indicate failed tries to establish connections to the nbemm database. The following is an example of such errors in the nbrb log:
7/20/2005 12:33:47.239 [RBDatabase::connectDatabase()] ODBC connection failed. ErrMsg: [Sybase][ODBC Driver][Adaptive Server Anywhere]Disk write failure 'Fatal error: disk write failure C:\Program Files\VERITAS\NetBackupDB\data\NBDB.log' -- transaction rolled back ErrCode: -1Sqlstate: HY000
The nbrb log (originator ID 118) is written in /usr/openv/logs (UNIX) or install_path\NetBackup\logs (Windows). More information is available about unified logging. See About using logs and reports on page 81.
To correct the situation, clear up disk space in the directory where NetBackup is installed by doing the following:
63
You may need to delete log files manually, reduce logging levels, and adjust log retention to have log files automatically deleted sooner. More information is available about logging levels, log file retention, and how to configure unified logging. See About using logs and reports on page 81. Consider moving the NetBackup unified logging files to a different file system. See Changing log file locations on page 85.
Use the Activity Monitor to verify that the NetBackup relational database service is running. This service is the NB_dbsrv daemon on UNIX and the "Adaptive Server Anywhere - Veritas_NB" service on Windows. If the NetBackup relational database service is stopped, note the following:
Do NOT stop the nbrb service. If you stop the nbrb service while the NetBackup relational database service is down, it can result in errors. Restart the NetBackup relational database service.
Verify that the NetBackup relational database service is running. If it is not and you remove files to free up disk space, you may not fix the problem. The relational database service must be restarted to allow the Resource Broker (nbrb) to allocate job resources.
The media server that freezes the media stores the actual FROZEN status of that media in its media database (MediaDB). Every media server including the master server has its own unique media database. Use the bpmedialist command to access the MediaDB information, including the media status (Frozen, Full, or Active). To unfreeze the media, use the bpmedia command. Specify the media server that contains that frozen record in the command syntax. Unfreeze the media one at a time. Frozen media does not necessarily mean that the media is defective. NetBackup may freeze media as a safety measure to prevent further errors, drive damage, or data loss.
64
Investigate any patterns to the media IDs, tape drives, or media servers involved when media is frozen.
The following logs are useful when troubleshooting frozen media: UNIX and Linux:
The bptm log from the media servers that froze the media:
/usr/openv/netbackup/logs/bptm
Windows:
The bptm log from the media servers that froze the media:
install_dir\VERITAS\NetBackup\logs\bptm
The Windows Event Viewer System Log The Windows Event Viewer Application Log
The verbosity of the bptm process log should be set to 5 to troubleshoot any media and drive-related issues. This log does not use excessive drive space or resources even at an elevated verbosity. When media is frozen, the bptm logs may contain more detailed information that the Activity Monitor or Problems Report. Set the btpm verbosity for bptm on individual media servers by changing their logging levels under Host Properties on the NetBackup Administration Console. The following conditions can cause media to freeze:
The same media has excessive errors during backup. The log entry is as follows:
FREEZING media id E00109, it has had at least 3 errors in the last 12 hour(s)
Dirty drives. Clean the drives that are freezing media. One of the first symptoms of a dirty drive is frozen media. Clean the drive according to the manufacturer's suggestions. The drive itself. Check for tape device errors reported by the operating system logs or by the device driver. If any are found, follow the hardware manufacturer's recommendations for this type of error. Communication issues at the SCSI or Host Bus Adapter (HBA) level. Check for SCSI or HBA device errors reported by the operating system logs or by their driver. If any are found, follow the hardware manufacturer's recommendations for this type of error.
65
Ensure that the tape drives appear on the hardware compatibility list as supported for NetBackup. This list is located on the Symantec Technical Support Web site:
http://support.veritas.com/menu_ddProduct_NBUESVR_view_CL.htm
Ensure that the media is supported for use with the tape drive by the tape drive vendor
NetBackup requests a media ID to be mounted in a drive. If the media ID that is physically recorded on the tape is different than the NetBackup media ID, the media freezes. This error occurs if the robot needs to be inventoried, or if barcodes have been physically changed on the media. The error can also occur if another NetBackup installation previously wrote to the media with different barcode rules. The drives in the robot are not configured in order within NetBackup, or are configured with the wrong tape paths. Drive configuration that use the correct robot drive number is important to the proper mounting and utilization of media. The robot drive number is normally set based on the relationship of the drive serial number with the drive serial number information from the robotic library. This number must be determined and validated before you consider the device configuration complete.
These tapes have usually been written outside of NetBackup, and have found their way into the library. By default, NetBackup will only write to a blank media or other NetBackup media. Other media types (DBR, TAR, CPIO, ANSI, MTF1 and recycled Backup Exec BE-MTF1 media) will be frozen as a safety measure. Change this behavior by using the following procedure: On UNIX and Linux:
66
To allow NetBackup to overwrite foreign media, add the following to the bp.conf file located at /usr/openv/netbackup/bp.conf for the related media server:
ALLOW_MEDIA_OVERWRITE ALLOW_MEDIA_OVERWRITE ALLOW_MEDIA_OVERWRITE ALLOW_MEDIA_OVERWRITE ALLOW_MEDIA_OVERWRITE ALLOW_MEDIA_OVERWRITE = = = = = = DBR TAR CPIO ANSI MTF1 BE-MTF1
Stop and restart the NetBackup daemons for the changes to take effect. On Windows: On the Administration Console, proceed to Host Properties | Media Server Open the properties for the media server in question. Select the Media tab. The Allow Media Overwrite property overrides the NetBackup overwrite protection for specific media types. To disable overwrite protection, select one or more of the listed media formats. Stop and restart the NetBackup services for the changes to take effect. Do not select a foreign media type for overwriting unless it is certain that this media type should be overwritten. For more details on what each media type is, see the NetBackup System Administrator's Guide.
The media is a tape formerly used for the NetBackup catalog backup.
FREEZING media id 000067: it contains VERITAS NetBackup (tm) database backup data and cannot be used for backups.
This media is frozen because it is an old catalog backup tape which NetBackup does not overwrite by default. The bplabel command must label the media to reset the media header.
The media is intentionally frozen. You can use the bpmedia command to manually freeze media for a variety of administrative reasons. If no record exists of a specific job freezing the media, the media may have been frozen manually. The media is physically write protected. If the media has a write-protect notch that is set for write protection, NetBackup freezes the media.
67
The media_server variable is the one that froze the media. If this item is unknown, run the bpmedialist command and note the "Server Host:" listed in the output. The following example shows that media server denton froze media div008:
# bpmedialist -m div008 Server Host = denton ID rl images allocated last updated density kbytes restores vimages expiration last read <------- STATUS -------> -----------------------------------------------------------------------DIV08 1 1 1 04/22/2005 10:12 04/22/2005 10:12 05/06/2005 10:12 04/22/2005 10:25 hcart FROZEN 35 5
On UNIX: /opt/VRTSpbx/bin/pbxcfg -v
68
On Windows: install_path\VxPBX\bin\pbxcfg -v
On Windows, make sure the "Veritas Private Branch Exchange" service is started. (Go to Start > Run and enter services.msc.)
Example output:
Auth User:0 : root Secure Mode: false Debug Level: 10 Port Number: 1556 PBX service is not cluster configured Auth User must be root and Secure Mode must be false.
Example output:
69
Auth User:0 : localsystem Secure Mode: false Debug Level: 10 Port Number: 1556 PBX service is not cluster configured Auth User must be localsystem and Secure Mode must be false.
To add the correct user to the authenticated user list (UNIX example):
/opt/VRTSpbx/bin/pbxcfg -a -u root
For more information on the pbxcfg command, refer to the pbxcfg man page.
PBX logging
PBX uses unified logging. PBX logs are written to the following:
The unified logging originator number for PBX is 103. More information is available about unified logging. See Unified logging on page 85. Error messages regarding PBX may appear in the PBX log or in the unified logging logs for nbemm, nbpem, nbrb, or nbjm. The following is an example of an error that is related to PBX:
05/11/05 10:36:37.368 [Critical] V-137-6 failed to initialize ORB: check to see if PBX is running or if service has permissions to connect to PBX. Check PBX logs for details
Use the vxlogview command to view PBX and other unified logs. The originator id for PBX is 103. For more information, see the vxlogview man page. You can also refer to the following topic: See Configuring and using unified logging on page 94.
70
where debug_level is a number from 0 to 10. 10 is the most verbose. PBX may log messages by default to the UNIX system logs (/var/adm/messages or/var/adm/syslog) or to the Windows Event Log. As a result, the system logs may fill up with unnecessary PBX log messages, since the messages are also written to the PBX logs (/opt/VRTSpbx/log on UNIX and <install_path>\VxPBX\log on Windows). To disable PBX logging to the system or event logs, enter the following command:
# vxlogcfg -a -p 50936 -o 103 -s LogToOslog=false
You do not have to restart PBX for this setting to take effect.
PBX security
The PBX Secure Mode must be set to false. If Secure Mode is true, NetBackup commands such as bplabel and vmoprcmd do not work. PBX messages similar to the following appear in /opt/VRTSpbx/log (UNIX) or install_path\VxPBX\log (Windows).
5/12/2008 16:32:17.477 [Error] V-103-11 User MINOV\Administrator not authorized to register servers 5/12/2008 16:32:17.477 [Error] Unauthorized Server
On UNIX:
/opt/VRTSpbx/bin/pbxcfg -d -m
On Windows:
install_path\VxPBX\bin\pbxcfg -d -m
Stop NetBackup:
On UNIX:
71
/usr/openv/netbackup/bin/bp.kill_all
On Windows:
install_path\NetBackup\bin\bpdown
Stop PBX:
On UNIX:
/opt/VRTSpbx/bin/vxpbx_exchanged stop
On Windows: Go to Start > Run, enter services.msc, and stop the "Veritas Private Branch Exchange" service.
Start PBX:
On UNIX:
/opt/VRTSpbx/bin/vxpbx_exchanged start
On Windows: Go to Start > Run, enter services.msc, and start the "Veritas Private Branch Exchange" service.
Start NetBackup:
On UNIX:
/usr/openv/netbackup/bin/bp.start_all
On Windows:
install_path\NetBackup\bin\bpup
72
[Info] PBX_Client_Proxy::parse_line, line = [Info] hand_off looking for proxy for = EMM [Error] No proxy found. [Info] PBX_Client_Proxy::handle_close
Start the needed service. In this example, the missing NetBackup service is EMM. To start the needed service, enter the nbemm command (UNIX) or start the NetBackup Enterprise Media Manager service (Windows; Start > Run, enter services.msc).
On UNIX:
/usr/openv/netbackup/bin/bp.kill_all /usr/openv/netbackup/bin/bp.start_all
On Windows:
install_path\NetBackup\bin\bpdown install_path\NetBackup\bin\bpup
73
To view and reset duplex mode for a host or device (if the manufacturers documentation does not help)
1 2
Log in to the host that contains the network interface card(s). Enter the following command to view the current duplex setting.
ifconfig -a
On some operating systems, this command is ipconfig. Example output from a NAS filer:
e0: flags=1948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 inet 10.80.90.91 netmask 0xfffff800 broadcast 10.80.95.255 ether 00:a0:98:01:3c:61 (100tx-fd-up) flowcontrol full e9a: flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 ether 00:07:e9:3e:ca:b4 (auto-unknown-cfg_down) flowcontrol full e9b: flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 ether 00:07:e9:3e:ca:b5 (auto-unknown-cfg_down) flowcontrol full
In this example, the network interface that shows "100tx-fd-up" is running in full duplex. Only interface e0, the first in the list, is at full duplex. A setting of "auto" is not recommended, because devices can auto negotiate to half duplex.
The duplex mode can be reset by using the ifconfig (or ipconfig) command. For example:
ifconfig e0 mediatype 100tx-fd
For most hosts, you can set full-duplex mode permanently, such as in the hosts /etc/rc files. Refer to the hosts documentation for more information.
74
nbftclnt unified logging file (OID 200) for errors. You may see the following in the nbftclnt log:
The license is expired or this is not a NBU server. Please check your configuration. Note: unless NBU server, the host name can't be listed as server in NBU configuration.
Remove or correct the SERVER entry in the bp.conf file, restart nbftclnt on the client, and retry the operation. Note: The nbftclnt process on the client must be running before you start a SAN client backup or restore over Fibre Channel.
The resources that the job requests The granted (allocated) resources.
If a job is queued awaiting resources, the Job Details dialog lists the resources for which the job waits. The three types of messages begin with the following headers:
requesting resource ... awaiting resource ... granted resource ...
Intermittent backup failures Connectivity issues (drives that are down) SAN configuration changes
If the SAN administrator rezones the network or masks an array in use by NetBackup, the following can occur: some of the machines or devices that
75
NetBackup needs may not be available. Either action causes backups to fail and drives to go down. The only information available to the NetBackup administrator is an error 83 (media open error) or error 84 (media write error) status code. You can use Veritas CommandCentral Storage (or the earlier SANPoint Control) to check elements of the SAN configuration. For example, you can check whether a particular device is connected as well as the zoning and masking on the SAN. Sometimes a switch or a Windows box is interrupted and sends out a reset command. Since NetBackup doesnt automatically maintain persistent bindings, the reset command can cause drives to be mapped differently. CommandCentral Storage can help find the problem by showing the changes in the drive mappings, although it cannot automatically fix the problem. For information on SharedDisk, refer to the "SharedDisk troubleshooting checklist" in the NetBackup Shared Storage Guide. For information on how to implement persistent bindings, refer to the NetBackup Device Configuration Guide. NetBackup lets you launch CommandCentral Storage in-context. The CommandCentral Storage Web GUI precisely displays the area of the SAN configuration you plan to troubleshoot.
Operators who have limited access to hosts and to the fabric of the SAN System administrators who have administrator privileges, but no access to the fabric
The SAN administrator generally operates outside the NetBackup domain entirely. Troubleshooting NetBackup is difficult when it involves the SAN because administrative responsibility tends to be spread out. No one person has a clear picture of the overall backup structure. CommandCentral Storage provides a consistent view of the entire SAN against which to measure performance. It gives NetBackup administrators the data they need to request changes of and collaborate with the SAN administrators. It helps NetBackup administrators when they design, configure, implement, or modify solutions in response to changes in backup environments (hardware, applications, demand). CommandCentral Storage can help those responsible for managing a backup system in a SAN environment by integrating SAN management and backup operation information.
76
CommandCentral Storage can provide support during the following backup lifecycle stages:
Design Use CommandCentral Storage during the design phase to determine the following:
Where to deploy a backup system on the SAN If SAN redesign is required to meet backup windows at minimum hardware cost and application impact For example, a backup design may not require the purchase of additional switches if it takes into account the following: the performance trending reports that CommandCentral Storage keeps to determine the pattern of fabric utilization. Or perhaps if you re-zone the fabric through CommandCentral Storage, it may provide sufficient bandwidth for meeting backup window requirements. In addition, CommandCentral Storage can provide visibility into recovery designs and fabric performance in the event of large restores that critical business operations require.
Configuration, testing Generally, backup systems are tested before implementation to obtain benchmarks and adjust (tune) the system for maximum efficiency. CommandCentral Storage can provide the performance metrics for end-to-end I/O capabilities for all elements in the backup path. Additionally, CommandCentral Storage can provide valuable environmental information for qualifying the backup environment as well as a baseline for future troubleshooting configuration management. Implementation, reconfiguration, production CommandCentral Storage can help to determine whether a host can see through the entire I/O path to the target backup device by pinpointing connectivity issues.
In-context launch
The ability to launch CommandCentral Storage and access an overview of the SAN from NetBackup in context is valuable for determining root cause problems quickly. In addition, because NetBackup administrators and SAN administrators
77
are often in different groups, the fragmented operations that lead to resolution delays may be avoided. With CommandCentral Storage, the NetBackup administrator has a view of the overall health of the SAN as part of the initial troubleshooting process.
Launch CommandCentral Storage in context from NetBackup to check fabric health. Check reports for fabric events occurring around the time NetBackup generated the error log.
1 2
Check the NetBackup device monitor to see whether a device is down. If a device is down, try to bring it back up. If the drive is still down, check the following for status 219 (the required storage unit is unavailable) and 213 (no storage units available for use) on the media server:
78
3 4
Check the NetBackup logs for status 83, 84, 85, and 86. These codes relate to write, read, open, position failures to access the drive. Try a robtest to determine connectivity. If there is no connectivity, the likely problem is with hardware.
5 6 7
From the master server, select the robot or device the storage unit is associated with. Launch CommandCentral Storage for a view of the media server and devices. Check the fabric connectivity (whether any I/O path devices are down).
79
1 2 3
Run device discovery again. If the new device is still not seen, the likely problem is with hardware. Launch CommandCentral Storage. If the new device does not appear in the CommandCentral Storage topology, check SAN hardware connections to determine whether or not the device is connected. If the new device shows up as disconnected or offline, contact the SAN administrator and check switch configuration. Compare this troubleshooting procedure to a similar problem without the benefit of CommandCentral Storage, such as Robotic status code: 214, robot number does not exist. See Robotic status code: 214 on page 534.
1 2
Select a drive inside the NetBackup Device Monitor. Launch CommandCentral Storage in the drive context to see whether the drive is connected to the SAN. Check CommandCentral Storage alert reports to see whether a SAN problem existed that would have affected the drive during the time the backup job failed.
80
Chapter
About using logs and reports Where is the log and report information? Reports Status for user operations UNIX system logs Debug logs on servers Debug logs on UNIX clients Debug logs on PC clients Windows Event Viewer logging option Troubleshooting the Administration Console for UNIX Query string overview
82
Using logs and reports Where is the log and report information?
Note: The term "media server", as distinct from "master server" or "server", does not apply to the NetBackup server product. When you troubleshoot a NetBackup server installation, ignore any references to media server. (This note does not apply to NetBackup Enterprise Server.)
83
Figure 3-1
SERVER
Error Catalog
File Catalog
Status Logs
Media Catalog
More information is available on the programs and daemons that are mentioned in this figure. See About backup and restore functional overview on page 633.
Reports
NetBackup provides the standard reports that show most of the status information and error information. To run these reports, use the NetBackup Administration Console.
84
See the NetBackup Administrators Guide, Volume I, for instructions and detailed descriptions about these reports. Table 3-1 provides a summary of the reports. Table 3-1 Report
Status of Backups
Client Backups
Problems
Images on Media
Media Logs
Tape Reports
Disk Reports
85
Unified logging
Unified logging creates log file names and messages in a standardized format. Certain NetBackup processes use unified logging. A list of the processes that use unified logging is available. See Table 3-2 on page 88.
Log locations
All unified logs are written to the /usr/openv/logs directory (UNIX) and the install_path\NetBackup\logs folder (Windows). Unlike legacy logging, you do not need to create logging subdirectories. Log files for originator IDs are written to a subdirectory with the name specified in the log configuration file.
On UNIX:
/usr/openv/netbackup/bin/vxlogcfg -a -p NB -o Default -s LogDirectory=new_log_path
86
On Windows:
install_path\NetBackup\bin\vxlogcfg -a -p NB -o Default -s LogDirectory=new_log_path
Message types
The following three kinds of messages can appear in unified logging files:
Application log messages include informational, warning, and error messages. Application messages are always logged and cannot be disabled. These messages are localized. An example of an application message follows:
05/02/09 11:02:01.717 [Warning] V-116-18 failed to connect to nbjm, will retry
Diagnostic log messages are the unified logging equivalent of the legacy debug log messages. They can be issued at various levels of detail (similar to verbose levels in legacy logging). These messages are localized. An example of a diagnostic message follows:
05/05/09 14:14:30.347 V-116-71 [JobScheduler::doCatIncr] no configured session based incremental catalog schedules
Debug log messages are intended primarily for Symantec engineering. Like diagnostic messages, they can be issued at various levels of detail. These messages are not localized. Note: Like diagnostic messages, debug messages can be disabled with the vxlogcfg command. An example of a debug message follows:
10/29/09 13:11:28.065 [taolog] TAO (12066|1) Transport_Cache_Manager::bind_i, 0xffbfc194 -> 0x7179d0 Transport[12]
87
productID-originatorID-hostID-date-rotation.log
The following shows the meaning of each of the variables in the log file name:
product ID originatorID Identifies the product. The NetBackup product ID is 51216. Identifies the log writing entity, such as a process, service, script, or other software. Identifies the host that created the log file. Unless the file was moved, this ID is the host where the log resides. Shows when the log was written, in YYMMDD format. A numbered instance of a log file for a given originator. This numbering is used for log file rotation. See Unified logging file rollover on page 93.
hostID
date rotation
Where:
51216 116 The product ID (entity ID) for NetBackup. The originator ID of the nbpem process (the NetBackup policy execution manager). The host ID for the host that created this log. The date in YYMMDD format. The rollover number that indicates the instance of this log file. By default, log files roll over based on file size. If the file reaches maximum size and a new log file is created for this originator, the new file is designated 0000000001. See Unified logging file rollover on page 93.
88
All logs with the following IDs are written to the /usr/openv/logs subdirectory on UNIX or to the install_path\NetBackup\logs subdirectory on Windows (except as noted in this table). Table 3-2 lists the NetBackup server processes that use unified logging. Table 3-2 Originator ID
103
Unified logging: originator IDs and NetBackup processes Processes that use the originator ID
Private Branch Exchange service (PBX). Writes logs to /opt/VRTSpbx/log (UNIX) and install_path\VxPBX\log (Windows). nbemm: Enterprise Media Manager (EMM). This process runs only on the EMM server. nbpem: NetBackup Policy Execution Manager. This process runs only on the master server. nbjm: NetBackup Job Manager. This process runs only on the master server. nbrb: NetBackup Resource Broker. This process runs only on the EMM server. bmrd and bmrbd: Bare Metal Restore (BMR) master (or boot) server daemons. bmrbd runs on the BMR boot server. bmrsavecfg: Bare Metal Restore data collection utility. bmrsavecfg runs on the NetBackup client, not server. bmrc: The Bare Metal Restore utility that UNIX clients use to communicate to the BMR master server during a restore. bmrc originates on the BMR boot server and runs on the restoring client. bmrs: the Bare Metal Restore commands and database interface. bmrcreatefloppy.exe: (Windows only) used by the Bare Metal Restore commands that create floppy disks. bmrcreatefloppy.exe runs on the BMR boot server. bmrsrtadm: The Bare Metal Restore utility that creates a shared resource tree and bootable CDs, and runs on the BMR boot server. bmrprep: The Bare Metal Restore utility that prepares BMR servers for a client restoration. bmrsetupmaster and bmrsetupboot: Bare Metal Restore installation, configuration, and upgrade processes.
111
116
117
118
119
121
122
123 124
125
126
127
89
Unified logging: originator IDs and NetBackup processes (continued) Processes that use the originator ID
Bare Metal Restore libraries get their log messages from this catalog. bmrconfig: The Bare Metal Restore utility that modifies a client's configuration. bmrpkg and bmrcreatepkg: Bare Metal Restore utilities to add the following: Windows drivers, service packs, and hotfixes to the BMR master server so they can be used in a restore. bmrrst.exe and bmrmap.exe (Windows systems only). Utilities that restore Windows Bare Metal Restore clients. They run on the restoring client. nbsl: NetBackup Service Layer. ndmpagent: NDMP Agent daemon that manages NDMP backup and restore. Controls the logging level in the NetBackup libraries. The application and diagnostic messages are for customer use; debug messages are intended for Symantec engineering. Media server user interface for the Enterprise Media Manager (EMM). bmrepadm: a utility that manages the Bare Metal Restore external procedures that are used during a restore. mds: the media selection component and device selection component of Enterprise Media Manager (EMM). Device Allocator, for shared drives. The Symantec OpsCenter reporting service, part of Symantec OpsCenter. The Symantec OpsCenter Client, part of Symantec OpsCenter. The Symantec OpsCenter Server, part of Symantec OpsCenter. NDMP protocol messages, avrd, and robotic processes. bmrovradm: a utility that manages custom override functions for Bare Metal Restore.
130
131
132 134
137
140 142
143
144 146
90
Unified logging: originator IDs and NetBackup processes (continued) Processes that use the originator ID
Controls the logging level in the (ACE/TAO) CORBA components for any process that uses a CORBA interface. The default level is 0 (only important messages are logged). This logging is intended for Symantec engineering.
166 178
210 219
220 221
222
91
Unified logging: originator IDs and NetBackup processes (continued) Processes that use the originator ID
The storage lifecycle manager (libssmgr), which controls lifecycle image duplication operations. The Remote Disk Service Manager interface (RDSM) that runs within the Remote Manager and Monitor Service. RMMS runs on media servers. Event Manager Service (nbevtmgr). nbevtmgr provides asynchronous event Management Services for cooperating participants. BMR launcher (bmrlauncher). A utility in the Windows BMR Fast Restore image that configures the BMR environment. Recovery assistant for Sharepoint Portal Server for NetBackup clients. Artifact generator generated source. NetBackup Administration Console for Windows (nbconsole). Legacy error codes. The Expiration Manager (libexpmgr), which handles capacity management and image expiration for storage lifecycle operations. Encryption Key Management service NetBackup Client Oracle utility nbars: Database Agent Request server process call
230
231
248
1 2 3
In the NetBackup Administration Console, expand NetBackup Management > Host Properties. Select Master Servers, Media Servers, or Clients. In the Details pane, click the server or client to view the version and platform. Then, double-click to view the properties.
92
4 5
In the left pane, click Logging. Note the Global logging level setting. If you make changes with Global logging level, it affects the logging level that both unified logging and legacy logging use.
PBX logging Media and device management logging (vmd, ltid, avrd, robotic daemons, media manager commands) Any unified logging processes whose debug level has been changed from the default setting
Set logging levels for PBX. See PBX logging on page 69. For logging information on media manager, see the following: See Media and device management legacy debug logs on page 107. To set verbosity levels for legacy logging without affecting unified logging, use the bp.conf and vm.conf files. See How to set legacy logging levels on page 112. To set the logging level for unified logging without affecting legacy logging, use the vxlogcfg command. See Configuring and using unified logging on page 94. The NetBackup Administration Console Global logging level field allows values of 0 to 5. Table 3-3 lists the kind of detail each level includes. Table 3-3 Logging level
0 1 2 3 4 5
93
In the Global logging level field of the Administration Console, a zero (0) level specifies the minimum level of logging for both legacy and unified logging. However, for diagnostic and debug messages in unified logging, the logging level can be turned off completely (no diagnostic messages or debug messages are logged). This level cannot be set with the Global logging level field in the NetBackup Administration Console. You can set it with the vxlogcfg command. See Configuring and using unified logging on page 94. A change to Global logging level affects the logging level of all NetBackup and Enterprise Media Manager (EMM) processes on the server or client. (The exceptions are PBX and media and device management logging.) This setting overrides any previous settings. If you make a change to the VERBOSE level in the bp.conf file, it only affects the legacy logging level. It you make a change with the vxlogcfg command, it only affects the unified logging level.
File size (this option is the default), as defined by the MaxLogFileSizeKB option. Local time, as defined by the RolloverAtLocalTime option. Periodic (elapsed time), as defined by the RolloverPeriodInSeconds option. File size or Local time, whichever limit is encountered first File size or Periodic, whichever limit is encountered first
To set these options, use the vxlogcfg command with the RolloverMode option. An example that uses vxlogcfg is available: See vxlogcfg command on page 97. By default, log-file rollover is based on file size (5120 KB). When a log file reaches 5120 KB in size, the file is closed and a new one is created. The following example file names show log file rollover, with rotation ID incremented:
94
Logs for the processes that are listed in Table 3-2 can use rotation. Certain legacy logs can also use rotation. See Legacy logging file rotation (robust logging) on page 109.
Unified logging files can be automatically deleted when the number of log files exceeds a certain number. The NumberOfLogFiles option on the vxlogcfg command defines that number. An example of vxlogcfg usage is available: See vxlogcfg command on page 97. The Keep logs For setting can delete both unified logs and legacy logs. This parameter is located in the Clean-up dialog box under Host Properties in the NetBackup Administration Console.
Logs that age beyond the number of days that the Keep logs For setting specifies are deleted for unified and legacy logging. Unified logging log files can also be deleted explicitly using the vxlogmgr command. If vxlogmgr does not manually delete or move files, the Keep logs For setting removes the old logs for both unified and legacy logging.
If the vxlogcfg LogRecycle option is ON (true), the Keep logs For setting is disabled for unified logs. In this case, unified logging files are deleted when their number (for a particular originator) exceeds the number that is specified by NumberOfLogFiles on the vxlogcfg command.
95
vxlogview command
Use this command to view the logs that unified logging creates. These logs are stored in subdirectories in /usr/openv/logs on UNIX or install_path\logs on Windows. Unlike the files that are written in legacy logging, you cannot view unified logging files with a text editor. The unified logging files are in binary format, and some of the information is contained in an associated resource file. Only the vxlogview command can assemble and display the log information correctly. You can use vxlogview to view NetBackup log files as well as PBX log files. To view PBX logs using the vxlogview command, do the following:
Ensure that you are an authorized user. For UNIX and Linux, you must have root privileges. For Windows, you must have Administrator privileges. Specify the PBX product ID by entering -p 50936 as a parameter on the vxlogview command line.
vxlogview examples
Example 1 Display the log messages for NetBackup that show only the date, time, message type, and message text:
vxlogview --prodid 51216 --display D,T,m,x
Example 2 Display the log messages for originator 116 (nbpem) that were issued during the last 20 minutes:
vxlogview -o 116 -t 00:20:00
Note that you can specify -o nbpem instead of -o 116. Example 3 Display the log messages for nbpem that were issued during a particular time period:
vxlogview -o nbpem -b "05/03/05 06:51:48 AM" -e "05/03/05 06:52:48 AM"
Example 4 You can use the -i option instead of -o, to specify an originator:
96
vxlogview -i nbpem
The vxlogview -i option searches only the log files that the specified process (nbpem) creates. The originator ID of these files in the log file name is 116. By limiting the log files that it has to search, vxlogview returns a result faster. By comparison, the vxlogview -o option searches all unified log files for messages that are logged by the specified process. Note: If you use the -i option with a process that is not a service (no originator ID in the file name), vxlogview returns the message "No log files found." In this case, use the -o option instead of the -i option. Typically, the NetBackup process is a service if it appears in the Activity Monitor of the NetBackup Administration Console, under the Daemons tab (UNIX) or Services tab (Windows). Example 5 You can search the logs for a particular job ID:
vxlogview -i nbpem | grep "jobid=job_ID"
Note: The jobid= search key should contain no spaces, and jobid= must be lowercase. When searching for a job ID, you can use any vxlogview command options. This example uses the -i option with the name of the process (nbpem).
vxlogmgr command
Use this command to manage unified logging files, such as to move or delete logs. Example 1 List all unified log files for the nbrb service:
vxlogmgr -s -o nbrb
Example output:
/usr/openv/logs/nbrb/51216-118-1342895976-050503-0000000000.log /usr/openv/logs/nbrb/51216-118-1342895976-050504-0000000000.log /usr/openv/logs/nbrb/51216-118-1342895976-050505-0000000000.log Total 3 file(s)
If the vxlogcfg NumberOfLogFiles option is set to 1, the following deletes the two oldest log files for the nbrb service:
97
vxlogmgr -d -o nbrb -a
Example output:
Following are the files that were found: /usr/openv/logs/nbrb/51216-118-1342895976-050504-0000000000.log /usr/openv/logs/nbrb/51216-118-1342895976-050503-0000000000.log Total 2 file(s) Are you sure you want to delete the file(s)? (Y/N): Y Deleting /usr/openv/logs/nbrb/51216-118-1342895976-050504-0000000000.log ... Deleting /usr/openv/logs/nbrb/51216-118-1342895976-050503-0000000000.log ...
Example 2 Delete the unified log files that NetBackup created in the last 15 days:
vxlogmgr -d --prodid 51216 -n 15
vxlogcfg command
Use this command to configure unified logging settings. For instance, use vxlogcfg to change logging levels and rollover settings. Note the following items:
vxlogcfg is the only way to turn off diagnostic and debug messages in unified
logging. In legacy logging, the writing of messages cannot be turned off, only minimized.
The vxlogcfg settings for robust file logging (MaxLogFileSizeKB and NumberOfLogFiles) also affect certain legacy logs. See Legacy logging file rotation (robust logging) on page 109. Absolute paths must be specified, not relative ones.
98
Note: For MaxLogFileSizeKB to be effective, the vxlogcfg commands RolloverMode option must be set to FileSize. The following sets the default NetBackup rollover mode:
vxlogcfg -a --prodid 51216 --orgid Default -s RolloverMode=FileSize MaxLogFileSizeKB can be set per originator. If it is not configured for a given
originator, that originator uses the default value. The following example overrides the default value for service nbrb (originator 118).
vxlogcfg -a -p 51216 -o nbrb -s MaxLogFileSizeKB=1024
This example sets rollover mode for nbpem (originator ID 116) to periodic. It also sets the interval until the next nbpem log file rollover to 24 hours (86400 seconds).
99
This example sets nbemm rollover mode to file size, and turns on log recycling. When the number of log files exceeds 999999, the oldest log file is deleted. See Controlling the number of log files with vxlogmgr on page 100.
For further details on these commands, refer to the NetBackup Commands manual or to the man pages.
Output:
Configuration settings for originator 118, of product 51,216... LogDirectory = /usr/openv/logs/ DebugLevel = 5 DiagnosticLevel = 5 LogToStdout = False LogToStderr = False LogToOslog = False RolloverMode = FileSize MaxLogFileSizeKB = 5120 RolloverPeriodInSeconds = 43200 RolloverAtLocalTime = 0:00 NumberOfLogFiles = 4 OIDNames = nbrb L10nLib = /usr/openv/lib/libvxexticu.so L10nResource = nbrb L10nResourceDir = /usr/openv/resources SyslogIdent = VRTS-NB
100
The following deletes log files for the nbrb service only:
vxlogmgr -a -d -o nbrb
The total disk space NetBackup unified logs consume is the sum of the disk space that each originator consumes. If none of the originators overrides the NumberOfFiles and MaxLogFileSizeKB settings, then the total disk space that unified logging consumes is as follows:
Number of NetBackup originators * default MaxLogFileSizeKB * default NumberOfFiles
To see the current unified logging settings, use the vxlogcfg command as shown in the following topic: See Listing unified logging settings on page 99.
101
No NetBackup originators override default settings for MaxLogFileSizeKB or NumberOfFiles. The number of active NetBackup originators on the host is 10. (This total may be typical of a NetBackup master server that is not running BMR or NDMP.) The default NumberOfFiles is equal to 3. The default MaxLogFileSizeKB is equal to 5120.
Which is 10 * 5120 * 3 KB, or 15360 kilobytes of disk space at the end of each hour.
102
Create a directory:
mkdir /upload
Example output:
Following are the files that were found: /usr/openv/logs/bmrsetup/51216-157-2202872032-050125-0000000000.log /usr/openv/logs/nbemm/51216-111-2202872032-050125-0000000000.log /usr/openv/logs/nbrb/51216-118-2202872032-050125-0000000000.log /usr/openv/logs/nbjm/51216-117-2202872032-050125-0000000000.log /usr/openv/logs/nbpem/51216-116-2202872032-050125-0000000000.log /usr/openv/logs/nbsl/51216-132-2202872032-050125-0000000000.log Total 6 file(s) Copying /usr/openv/logs/bmrsetup/51216-157-2202872032-050125-0000000000.log ... Copying /usr/openv/logs/nbemm/51216-111-2202872032-050125-0000000000.log ... Copying /usr/openv/logs/nbrb/51216-118-2202872032-050125-0000000000.log ... Copying /usr/openv/logs/nbjm/51216-117-2202872032-050125-0000000000.log ... Copying /usr/openv/logs/nbpem/51216-116-2202872032-050125-0000000000.log ... Copying /usr/openv/logs/nbsl/51216-132-2202872032-050125-0000000000.log ...
103
Output:
51216-111-2202872032-050125-0000000000.log 51216-116-2202872032-050125-0000000000.log 51216-117-2202872032-050125-0000000000.log 51216-118-2202872032-050125-0000000000.log 51216-132-2202872032-050125-0000000000.log 51216-157-2202872032-050125-0000000000.log
On UNIX:
/usr/openv/netbackup/logs /usr/openv/volmgr/debug
On Windows:
install_path\NetBackup\logs install_path\Volmgr\debug
Information is available about how to enable the Status Collector Daemon: See Enabling the status collector daemon on page 109. Tables are available that list the log directories that you must create.
104
See Table 3-4 on page 104. See Table 3-5 on page 107. Note: You must create these directories before logging can take place. If these directories exist, NetBackup creates log files in the directory for the associated process. A debug log file is created when the process begins.
Note: On a Windows server, you can create the debug log directories at once, under install_path\NetBackup\logs, by running the following batch file: install_path\NetBackup\Logs\mklogdir.bat. Media servers have only the bpbrm, bpcd, bpdm, and bptm debug logs.
NetBackup client daemon or manager. The NetBackup Client service starts this process NetBackup jobs database manager program.
NetBackup database manager. This process runs only on master servers. On Windows systems, it is the NetBackup database manager service.
105
Table 3-4
NetBackup legacy logs (not media and device management) (continued) Associated process
The NetBackup-Java application server authentication service that is started when the NetBackup Java interface applications start. On UNIX servers, inetd starts it. On Windows servers, the Client Services service starts it. This program authenticates the user that started the application.
bpjava-susvc
The NetBackup program that bpjava-msvc starts upon successful logon through the logon dialog box that is presented when a NetBackup-Java interface starts. This program services all requests from the Java user interfaces on the NetBackup master or media server host where bpjava-msvc is running (all Windows platforms). NetBackup request daemon or manager. On Windows systems, this process is called the NetBackup Request Manager service. The NetBackup process for synthetic backup. nbjm starts bpsynth. bpsynth runs on the master server. NetBackup tape management process.
bprd
bpsynth
bptm
syslogs
You must enable system logging to troubleshoot ltid or robotic software. See the syslogd man page.
System log.
106
Table 3-4
NetBackup legacy logs (not media and device management) (continued) Associated process
The user_ops directory is created during the install of NetBackup on all servers and clients. NetBackup Java interface programs use it for the following: temporary files and for job and progress log files that the user Backup, Archive, and Restore program (jbpSA) generates. This directory must exist for successful operation of any of the Java programs and must have public read, write, and execute permissions. user_ops contains a directory for every user that uses the Java programs. In addition, on NetBackup-Java capable platforms, the NetBackup Java interface log files are written in the nbjlogs subdirectory. All files in the user_ops directory hierarchy are removed according to the setting of the KEEP_LOGS_DAYS configuration option.
vnetd
The Veritas network daemon, used to create "firewall friendly" socket connections. Started by the inetd(1M) process.
More information is available on the programs and daemons that write the logs. See About backup and restore functional overview on page 633. On UNIX systems, also refer to the README file in the /usr/openv/netbackup/logs directory.
107
NetBackup creates one log per day in each of the debug directories. Information is available about the format of the log file names. See Legacy logging file name format without log file rotation on page 111. On UNIX: To disable vmd debug logging, either delete or rename the /usr/openv/volmgr/debug/daemon directory. On Windows, to disable debug logging for the NetBackup Volume Manager service, either delete or rename the install_path\Volmgr\debug\daemon folder. NetBackup retains debug logs for the number of days you specify with the DAYS_TO_KEEP_LOGS = entry in the vm.conf file. (The default is infinite retention.) For instructions on how to use this entry, see the NetBackup Administrators Guide, Volume II.
Table 3-5 describes the media management legacy debug log and device management legacy debug logs. Table 3-5 Media and device management legacy debug logs Associated process
Debug information on transactions between NetBackup and the Storage Tek ACSLS server. Debug information for vmd (NetBackup Volume Manager service, Windows) and its associated processes (oprd and rdevmi). Stop and restart vmd after creating the directory. Debug information on ltid, the Media Manager device daemon (UNIX), or on the NetBackup Device Manager service (Windows), and on avrd. Stop and restart ltid after creating the directory.
daemon
ltid
108
Table 3-5
Media and device management legacy debug logs (continued) Associated process
Debug information on the processes that request media management services from vmd or EMM. Stop and restart vmd after creating the directory. Debug information on all robotic daemons, which includes tldcd, tl8cd, and tl4d daemons. Stop and restart robotic daemons. Debug information for device configuration, including the tpconfig and the tpautoconf commands and the NetBackup Administration Console.
robots
tpcommand
vmscd
/usr/openv/volmgr/debug/ (UNIX)
Debug information for the NetBackup Status Collection daemon. Stop and restart vmscd install_path\Volmgr\debug\ (Windows) after creating the directory.
The ltid command that started the device management processes. If the -v option is included on the ltid command, all daemons that were started as a result also have the -v option in effect. Or A command to start a specific daemon (for example, acsd -v).
To troubleshoot ltid or robotic software, you must enable system logging. See the syslogd(8) man page for information on setting up system logs. Errors are logged with LOG_ERR, warnings with LOG_WARNING, and debug information with LOG_NOTICE. The facility type is daemon.
109
Note: On HP-UX, the sysdiag tool may provide additional information on hardware errors.
110
When a log file grows to its maximum size, it closes and a new file is opens. If the new file exceeds the number of log files that is allowed for the directory, the oldest file is deleted. Logs created by the following NetBackup processes can use log rotation (robust logging):
bpbkar (client process only) bpbrm bpcd bpdbm bpdm bprd bptm
For the legacy logs created by other NetBackup processes (but not media and device management), use the Keep Logs For setting as follows:
Windows: In the Host Properties > Properties > Clean-up dialog box UNIX: In the Host Properties > Properties > Global Attributes dialog box
The Keep Logs For setting ultimately applies to all legacy logs. If this setting is 10 and robust file logging settings allow logs to exist more than 10 days, the logs are deleted on day 11.
111
Go to Host Properties > Master Servers > Properties > Logging and click the Enable robust logging box to enable legacy log rotation. This item applies to legacy logs only. By default, the maximum file size is 5120 KB and the number of files that are kept per log directory is 3. If Enable robust logging is disabled, the standard log-file behavior remains in effect: one log file is created per log directory per day, and log deletion is based on the Keep Logs For setting.
To change the maximum file size, or number of log files per directory, use the MaxLogFileSizeKB and NumberOfLogFiles options on the vxlogcfg command. The vxlogcfg command is located in /usr/openv/netbackup/bin (on UNIX) and install_path\NetBackup\bin (on Windows). For example:
vxlogcfg -a -p 51216 --orgid Default -s MaxLogFileSizeKB=2048,NumberOfLogFiles=10
This example sets the maximum file size to 2048 KB and sets the maximum number of log files per log directory to 10. It sets the default values for all unified logging processes and for the legacy processes for NetBackup (product ID 51216). A list of these processes is available: See Legacy logging file rotation (robust logging) on page 109. More examples of the vxlogcfg command are available. See Configuring and using unified logging on page 94. You can also refer to the vxlogcfg man page or to the NetBackup Commands manual.
On UNIX: log.mmddyy
112
For example: 040105_00001.log Where nnnnn is a counter or a rotation number for the log file. When the counter exceeds the setting for number of log files, the oldest log file is deleted. The NumberOfLogFiles option on the vxlogcfg command sets the number of log files. For compatibility with existing scripts, the debug log file naming format does not change. If log files are already created with the standard legacy naming format and then robust file logging is enabled, the following occurs: only the new log files for the processes that robust logging governs use the new file rotation naming format. Any mixture of new and old log file names in a legacy debug log directory is managed according to the following: the Keep Logs For setting and the robust logging settings, when applicable. See Legacy logging file rotation (robust logging) on page 109.
On Windows or UNIX systems, set the Global Logging Level to a higher level, in the Logging dialog box. (Click Host Properties > Master Server Properties > Logging tab > Logging dialog box.) Note: This setting affects legacy logging (but not media and device management logging) and unified logging. More information is available for synthetic backup. See Logs to accompany problem reports for synthetic backup on page 113.
113
On UNIX, add a VERBOSE entry in the /usr/openv/netbackup/bp.conf file. VERBOSE by itself sets the verbose value to 1. For more log detail, enter VERBOSE = 2 or a higher value. This setting affects legacy logging only. Warning: High verbose values can cause debug logs to become very large. Use the Logging dialog box to set the logging level for individual processes. (Click Host Properties > Master Server Properties > Logging tab > Logging dialog box.) Or, specify the verbose flag (if available) when you start the program or daemon. See the NetBackup Administrators Guide, Volume I.
In media and device management legacy logging, only two levels exist: not verbose (the default), and verbose. To set the verbose (higher) level, add the word VERBOSE to the vm.conf file. Create the vm.conf file if necessary and restart ltid and vmd after you add the VERBOSE entry. On UNIX, add VERBOSE to the /usr/openv/volmgr/vm.conf file. On Windows, add VERBOSE to the install_path\Volmgr\vm.conf file. This entry affects logging levels in the Event Viewer Application and System log.
Log files that unified logging creates For information about how to gather unified logging files relevant to the problem, see the following: See Submitting unified logging files to Symantec support on page 101. Log files that legacy logging creates If the following legacy log directories have not been created, you must create the directories. Set the debug level to 5 and then rerun the job.
114
Set the logging level as follows: Use the Global logging level option on the Logging tab in the Master Server Properties dialog box. See Using the Host Properties window on page 61. Rerun the job and then gather the logs from the directories that you created. The bptm logs are required only if the images are read from or written to a tape device. The bpdm logs are needed only if the images are read from or written to disk. If the images are read from multiple media servers, the debug logs for bptm or bpdm must be collected from each media server.
Try file
Include the try file for the job ID from the following directory:
install_path/netbackup/db/jobs/trylogs/jobid.t
For instance, if the job ID of the synthetic backup job was 110, then the try file is named 110.t.
Policy attributes
Capture the output from the following command and send it to Support with the rest of the information:
install_path/netbackup/bin/admincmd/bppllist policy_name -L
where policy_name is the name of the policy for which the synthetic backup job was run.
115
nbftclnt: originator ID 200. bmrsavecfg: originator ID 121. bmrc: originator ID 122. bmrc originates from the BMR boot server, which may
nbbrowse: originator ID 359. NetBackup Client Browser. nbcs: originator ID 366. Client service.
A description is available of unified logging file name format and other details: See Unified logging on page 85. Unified logging is enabled by default.
The following table lists the legacy debug log directories that apply to UNIX clients. Additional information on legacy logging is available: See Legacy NetBackup logging on page 103. Note: Create the directories with access modes of 777 so that user processes can write to the log files. Table 3-6 describes the UNIX client debug logs for legacy logging.
116
bpjava-msvc
bpjava-usvc
bplist
bpmount
bporaexp
bporaexp64
bporaimp
bporaimp64
bprestore
117
mtfrd
tar user_ops
118
See Unified logging file name format on page 86. See Legacy logging file name format without log file rotation on page 111.
nbftclnt: originator ID 200 bmrsavecfg: originator ID 121. bmrc: originator ID 122. bmrc originates from the BMR boot server, which may
bmrrst.exe and bmrmap.exe: originator ID 131. These originate from the BMR boot server, which may or may not be a NetBackup server, and run on the restoring client.
nblbc: originator ID 351. NetBackup Live Browse Client. nbgre: originator ID 352. NetBackup Granular Restore. nbbrowse: originator ID 359. NetBackup Client Browser. nbcs: originator ID 366. Client service.
A discussion of file name format and other unified logging details is available. See Unified logging on page 85. Unified logging is enabled by default.
119
Note: These are the default locations in which to place these directories. You can specify another location during client installation.
Table 3-7 lists the legacy debug log directories that apply to these clients: Table 3-7 PC client debug logs: Legacy logging Associated process
Client-user interface program for NetWare. Client service logs. These logs have information on the bpinetd32 process. Archive program that is run from the command line. The backup program that is run from the command line. Backup and archive manager. These logs have information on the bpbkar32 process. NetBackup client daemon or manager. These logs have information on communications between the server and client. On NetWare clients, these logs also contain the log information for the backup and restore processes.
bpinetd
Windows2003
bparchive
Windows 2003
bpbackup
Windows 2003
bpbkar
Windows 2003
bpcd
bpjava-msvc
The NetBackup-Java bpjava-msvc application server authentication service that the Client Services service starts during startup of the NetBackup Java interface applications. This program authenticates the user that started the application. (On all Windows platforms.)
120
Table 3-7
NetBackup program that bpjava-usvc bpjava-msvc starts upon successful login through the Login dialog box that is presented when a NetBackup-Java interface is started. This program services all requests from the Java administration and user interfaces on the NetBackup host where bpjava-msvc is running.(On all Windows platforms.) Windows 2003 List program that is run from the command line. The program that is used to collect drive names on the client for multistreaming clients. The restore program that is run from the command line. NetBackup service utility. This program allows the system with the user interface to communicate with the NetBackup for NetWare client. tar process. These logs have information about the tar32 process.
bplist
bpmount
Windows 2003
bprestore
Windows 2003
bpsrv
NetWare nontarget
tar
Windows 2003
121
Table 3-7
For Windows clients, logs are kept for the number of days that are specified in the Backup, Archive, and Restore interface. On the File menu, click NetBackup Client Properties > General tab: Keep status of user-directed backups, archives, and restores for.
122
For NetWare clients, logs are kept the number of days that are specified in file \veritas\nbuclt\netback\bp.ini (under Keep_Logs_Days).
On Windows clients, set the debug level with the Verbose field on the TroubleShooting tab of the NetBackup Client Properties dialog box. On the Backup, Archive, and Restore interface, click File > NetBackup Client Properties. For the unified logging files that the Bare Metal Restore process bmrsavecfg creates, you also can control logging level with the vxlogcfg command. See Configuring and using unified logging on page 94. On NetWare clients, change the value of the level and the tcp parameters in the debug section of the bp.ini file. For instructions, see the NetBackup user guide for the client. An increase in the log level can cause the logs to grow very large; increase the logging level only if unexplained problems exist.
123
Add an entry (optional) to the eventlog file that specifies the severity and type of NetBackup messages that are written. The following is an example:
56 255
The first parameter controls the messages that NetBackup writes to the Application Log, which are based on severity level. The second parameter controls the type of messages that NetBackup writes to the Application Log.
Both parameters are specified as decimal numbers and equate to a bitmap that expresses these values.
Severity: 1 = Unknown 2 = Debug 4 = Info 8 = Warning 16 = Error 32 = Critical Type: 1 = Unknown 2 = General 4 = Backup
124
If the file is empty, the default severity is Error (16) and the default type is Backup Status (64). If the file has only one parameter, it is used for the severity level. The default value of Backup Status (64) is used for the type.
Example
Assume you want to include all types of messages that have severity levels of warning, error, and critical. In this instance, the entry is: 56 255 Where: 56 = severity= the sum of warning, error, and critical (8 + 16 + 32) 255 = type = the sum of all types (1 + 2 + 4 + 8 + 16 + 32 + 64 +128) The following is an example of a message that is written in the Windows Event Viewer Application Log:
16 4 10797 -1 cacao bush nbpem backup of client bush exited with status 71
The meaning of each field is as follows (left to right): severity = 16 (Error) type = 4 (Backup) jobid = 10797 job group ID = 1 server = cacao client = bush process = nbpem text = backup of client bush, which exited with status 71
Using logs and reports Troubleshooting the Administration Console for UNIX
125
NetBackup status codes and messages. See About NetBackup status codes and messages on page 155. See About media and device management status codes and messages on page 423. Operations performed in the Administration Console can result in errors that are recognized in other parts of NetBackup. These errors usually appear exactly as documented in the NetBackup status codes and messages. Note: A status code does not always accompany the error message. To find the status code, look up the message in the alphabetical listing of the following topic See NetBackup Messages on page 397. Then use the status code to find the full description of the message in the first half of that chapter.
NetBackup Administration Console: application server status codes and messages. See About NetBackup status codes and messages on page 155. These messages have status codes in the 500 range. Messages with status codes 500, 501, 502, 503 and 504 begin with "Unable to login, status:". Messages with status codes 511 and 512 may or may not begin with "Unable to login, status:". A status code does not always accompany the message (see note). Java exceptions Either the Java APIs or NetBackup Administration APIs generate these. These messages begin with the name of the exception. For example:
java.lang.ClassCastException
126
Using logs and reports Troubleshooting the Administration Console for UNIX
or
vrts.nbu.NBUCommandExecutionException
In the status line (bottom) of the NetBackup Administration window In the log file generated by the jnbSA or jbpSA commands When it is set up, in the output file of the Windows Display Console .bat file. More information is available about detailed debug logging. See Enabling detailed debug logging on page 127.
Operating system errors If messages appear that do not match those documented in this manual, they probably are errors in the operating system.
On the host that is specified in the login dialog box In /usr/openv/netbackup/logs/user_ops On the host where the Console was started In /usr/openv/netbackup/logs/user_ops/nbjlogs
If space is not available in the respective file systems, you may experience the following: long waits for application response, incomplete data, reduced functionality, and unexpected error messages. The following are some of the results you may receive:
No response during login "Cannot connect" socket errors during login to the NBJava application server Reduced functionality in the NetBackup interface, for example, only the Backup, Archive, and Restore and Files System Analyzer nodes appear in the tree An error dialog box with the "Unable to login, status: 35 cannot make required directory" message An error dialog box with "/bin/sh: null: not found (1) "message Empty warning dialog boxes
Using logs and reports Troubleshooting the Administration Console for UNIX
127
An error dialog box with the message "An exception occurred: vrts.nbu.admin.bpmgmt.CommandOutputException: Invalid or unexpected class configuration data: <the rest of the message will vary>"
On the NetBackup client or server that is specified in the login dialog box, create the following: bpjava-msvc, bpjava-susvc (if a NetBackup server), and bpjava-usvc (if a NetBackup client) debug log directories in the /usr/openv/netbackup/logs directory (UNIX) or in install_path\NetBackup\logs (Windows). For more information on debug logs, see the following: See Debug logs on servers on page 85.
On the UNIX machine where you run the jnbSA or jbpSA commands, add the following line to the Debug.properties file in the /usr/openv/java directory.
debugMask=2
The log file name is displayed in the xterm window where you ran the jnbSA or jbpSA commands.
128
If you use the NetBackup Java Windows Display Console, add the following line to the Debug.properties file in the NetBackup Java installed folder (for example, C:install_path\VERITAS\java):
debugMask=2
If you use the Windows Display Console on a host where NetBackup is not installed, you have to do the following: edit the nbjava.bat file located in the NetBackup Java-installed folder to redirect output to a file. See the nbjava.bat file for details.
129
Example
PRODID = 100 or PRODID = 'NBU'
ORGID or orgid
Integer or string
Provide the start date in seconds or STDATE = 98736352 or STDATE = in the locale specific short date and '4/26/04 11:01:00 AM' time format. For example, a locale may have format 'mm/dd/yy hh:mm:ss AM/PM' Provide the end date in seconds or ENDATE = 99736352 or ENDATE = in the locale specific short date and '04/27/04 10:01:00 AM' time format. For example, a locale may have format 'mm/dd/yy hh:mm:ss AM/PM' Provide the hours in 'hh:mm:ss' PREVTIME = '2:34:00' format. This field should be used only with operators =, <, >, >= and <= Provide one of the following possible severity types:
PREVTIME or prevtime
String
SEV or sev
Integer
INFO or info WARNING or warning ERR or err CRIT or crit EMERG or emerg
130
Type
Integer
Example
MSGTYPE = 1 or MSGTYPE = DIAG
CTX or ctx
Integer or string
Provide the context token as string CTX = 78 or CTX = 'ALL' identifier or 'ALL' to get all the context instances to be displayed. This field should be used only with the operators = and !=.
String constants
String constants should be given in single quotes. For example, prodid = 'NBU' Start and end date can be provided as follows: either as a string constant of the regional display short date format or a long value of number of seconds that elapsed since midnight January 1, 1970.
Chapter
About using NetBackup utilities Analysis utilities for NetBackup debug logs Network troubleshooting utilities NetBackup support utility (nbsu) NetBackup consistency check (NBCC) NetBackup consistency check repair (NBCCR) The nbcplogs utility
132
To trace a NetBackup job requires that you view and correlate messages in multiple log files on multiple hosts. The log analysis utilities provide a consolidated view of the job debug logs. The utilities scan the logs for all processes that are traversed or run for the job. The utilities can consolidate job information by client, job ID, start time for the job, and policy that is associated with the job. The available utilities are as follows:
backuptrace copies to standard output the debug log lines relevant to the
restoretrace copies to standard output the debug log lines relevant to the
bpgetdebuglog is a helper program for backuptrace and restoretrace duplicatetrace consolidates the debug logs for the specified NetBackup
importtrace consolidates the debug log messages for the specified NetBackup
verifytrace consolidates the debug log messages for the specified verify job(s)
Installation requirements
The log analysis utilities are available for all platforms that are supported for NetBackup servers. Note: Though the utilities must be initiated on supported platforms, they can analyze debug log files from most NetBackup UNIX and Windows client and server platforms.
Output format
The format of an output line is as follows:
daystamp.millisecs.program.sequence machine log_line
The following is a short description of each part of the outline line format:
daystamp millisecs program The day of the log in yyyymmdd format. The number of milliseconds since midnight on the local machine. The name of program (BPCD, BPRD, etc.) being logged.
133
Line number within the debug log file. The name of the NetBackup server or client. The line that appears in the debug log file.
Limitations
While the log analysis utilities cover a variety of logs, the following exceptions occur:
Media and device management logs are not analyzed. The legacy debug log files must be in standard locations on the servers and clients. /usr/openv/netbackup/logs/<PROGRAM_NAME>/log.mmddyy on UNIX and <install_path>/NetBackup/Logs/<PROGRAM_NAME>/mmddyy.log on Windows. An option may be added later that allows the analyzed log files to reside on alternate paths. Note: For the processes that use unified logging, no log directories must be created. The consolidated debug log may contain messages from unrelated processes. You can ignore messages with time stamps outside the duration of the job from the following: bprd, nbpem, nbjm, nbrb, bpdbm, bpbrm, bptm, bpdm, and bpcd.
134
Table 4-1
backuptrace
restoretrace
restoretrace consolidates the debug logs for specified NetBackup restore jobs. The utility writes debug log messages relevant to the specified jobs to standard output and sorts the messages by time. restoretrace attempts to compensate for time zone changes and clock drift between remote servers and clients. The format of the output makes it relatively easy to sort or grep by time stamp, program name, and server or client name. At a minimum, you must enable debug logging for bprd on the master server. Enable debug logging for bpbrm and bptm or bpdm on the media server and tar on the client. For best results, set the verbose logging level to 5. Enable debug logging for bpdbm on the master server and for bpcd on all servers and clients.
bpgetdebuglog
bpgetdebuglog is a helper program for backuptrace and restoretrace. It can also be useful as a stand-alone program and is available for all NetBackup server platforms. bpgetdebuglog prints to standard output the contents of a specified debug log file. If only the remote machine parameter is specified, bpgetdebuglog prints the following to standard output: the number of seconds of clock drift between the local machine and the remote machine.
135
Table 4-1
backupdbtrace
duplicatetrace
duplicatetrace consolidates the debug logs for the specified NetBackup duplicate jobs and writes them to standard output. It sorts the messages by time. duplicatetrace attempts to compensate for time zone changes and clock drift between remote servers and clients. At a minimum, you must enable debug logging for admin on the master server and for bptm or bpdm on the media server. For best results, set the verbose logging level to 5 and enable debug logging for the following: bpdbm on the master server and bpcd on all servers and clients in addition to the processes already identified.
importtrace
importtrace consolidates the debug log messages for the specified NetBackup import jobs and writes them to standard output. It sorts the messages by time. importtrace attempts to compensate for time zone changes and clock drift between remote servers and clients. At a minimum, you must enable debug logging for admin on the master server, and for bpbrm, bptm and tar on the media server. For best results, set the verbose logging level to 5 and enable debug logging for the following: bpdbm on the master server and bpcd on all servers and clients in addition to the processes already identified.
136
Table 4-1
verifytrace
Hardware, operating system, and NetBackup level settings. Examples include correct DNS lookups, firewall port openings, and network routes and connections. The NetBackup Domain Network Analyzer (nbdna) verifies this configuration. A set of utilities including bptestcd and bptestnetconn verifies the NetBackup level settings. These settings include CONNECT_OPTIONS and CORBA endpoint selection. Network configuration utilities
The bptestnetconn command performs several tasks that aid in the analysis of DNS and connectivity problems with any specified list of hosts, including the server list in the NetBackup configuration. To help troubleshoot connectivity problems between services that use CORBA communications, bptestnetconn can perform and report on CORBA connections to named services. The bptestbpcd command tries to establish a connection from a NetBackup server to the bpcd daemon on another NetBackup system. If successful, it reports information about the sockets that are established.
Table 4-2
bptestnetconn
bptestbpcd
137
Table 4-2
nbdna
For a description of the nbsu command options, refer to the nbsu man page. More information is available. See the NetBackup Commands manual.
To obtain baseline data on your NetBackup installation. If you encounter problems later, this data can be useful. To document changes in your NetBackup or operating system environment. Run nbsu periodically to keep your baseline data up to date. To help isolate a NetBackup or operating system issue. To report issues to Symantec support.
138
139
nbsu output
nbsu writes the information it gathers to text files in the following directory: On UNIX:
/usr/openv/netbackup/bin/support/output/nbsu/hostname_timestamp
On Windows: install_path\NetBackup\bin\support\output\nbsu\hostname_timestamp The NetBackup environment where nbsu runs determines the particular files that nbsu creates. nbsu runs only those diagnostic commands that are appropriate to the operating system and the NetBackup version and configuration. For each diagnostic command that it runs, nbsu writes the command output to a separate file. As a rule, the name of each output file reflects the command that nbsu ran to obtain the output. For example, nbsu created the NBU_bpplclients.txt by running the NetBackup bpplclients command and created the OS_set.txt file by running the operating systems set command.
--------------------- Network ipconfig information report ------------------------------------------- Command used ---------------------------> "C:\WINDOWS\system32\ipconfig" /all Windows IP Configuration Host Name . . . . . . . Primary Dns Suffix . . Node Type . . . . . . . IP Routing Enabled. . . WINS Proxy Enabled. . . DNS Suffix Search List. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . : : : : : : host1 Hybrid No No company.com
140
Figure 4-2 shows an example of part of the nbsu output file for the bpgetconfig command. Figure 4-2 Example nbsu output file: bpgetconfig command (excerpt)
------------------ NetBackup bpgetconfig information report ---------------------------- nbsu diagnostic name and internal procedure used ------------NBU_bpgetconfig - NBU_get_bpgetconfig_info ------------------------------- Command Used ------------------------------> "C:\Program Files\VERITAS\netbackup\bin\admincmd\bpgetconfig" -g host1 -L Client/Master = Master NetBackup Client Platform = PC, Windows2000 NetBackup Client Protocol Level = 6.5.0 Product = NetBackup Version Name = 6.5Alpha Version Number = 650000 NetBackup Installation Path = C:\Program Files\VERITAS\NetBackup\bin Client OS/Release = Windows2003 5 ------------------------------- Command Used ------------------------------> "C:\Program Files\VERITAS\netbackup\bin\admincmd\bpgetconfig" SERVER = host1 SERVER = host2 SERVER = host3 SERVER = host4 SERVER = host5 SERVER = host6 SERVER = host7
If the executed command returned a non-zero status, an EXIT STATUS header indicates the status. For example:
----------------------- EXIT STATUS = 227 -------------------------
As part of the internal processing of each command that a diagnostic command runs, nbsu redirects each commands STDERR to an internal file. If the command writes information to STDERR, nbsu captures this information and includes a STDERR header along with the information. For example:
----------------------------- STDERR -----------------------------bpclient: no entity was found (227)
141
/usr/openv/netbackup/bin/support/output/nbsu/host1_master_20060814_ 164443/host1_master_20060814_164443.tar.gz
where host1 is the name of the host on which nbsu ran. master indicates that the host is a NetBackup master server. nbsu supports tar for archive and gzip for compression. Symantec may add support for other archive and compression utilities in the future. For an up-to-date list of supported archive and compression utilities, run the nbsu -H command on your installed version of NetBackup. Note: Archiving and compression utilities are usually available on UNIX and Linux systems. On Windows, it may be necessary to install these programs. Note that the archiving utility must be referenced in the system PATH environment variable.
This command runs only the diagnostic commands that are related to NetBackup status code 25. Since fewer commands are run, the result may be a smaller set of output files. A description of the output files that nbsu generates is available. See Output files: format on page 139.
142
To determine what information nbsu can collect for a particular status code, run nbsu with the -l option:
nbsu -l -nbu_e 25
Note: You can also use a NetBackup exit script to call nbsu. The script passes the NetBackup status code to nbsu to gather associated diagnostics for a job.
For an nbsu description, examples, and how to gather diagnostic information to send to Symantec support, refer to the nbsu command. See the NetBackup Commands manual. For troubleshooting, run nbsu when the system is in the same state as when the problem occurred. For example, do not stop and restart the NetBackup processes after the error occurs or make a change to the server or network. nbsu may not be able to gather key information about the problem. If a NetBackup component is not operational (for example, bpgetconfig does not return information), nbsu may be unable to properly report on the system. For these cases, use the -nbu_down command line option to bypass the need for NetBackup to be operational. For a full description of the nbu_down command line option, see the NetBackup Commands manual.
By default, nbsu sends error messages to standard error (STDERR) and also includes the messages in its output files under the header STDERR. Note the following alternate ways to view nbsu error messages:
To redirect the nbsu error messages to standard output (STDOUT), enter the following:
To send all nbsu screen output including error messages to a file, enter:
nbsu 2>&1 > file_name
143
where 2>&1 directs standard error into standard output, and file_name directs standard output into the designated file.
The messages are written to the nbsu_info.txt file. Information is available about the location of this file. See nbsu output on page 139.
nbsu_info.txt file
The nbsu_info.txt file provides an overview of the environment where nbsu is run, and contains the following:
General operating system and NetBackup information on the environment that nbsu detects A list of diagnostics that were run A list of diagnostics that returned a non-zero status
This information may indicate why nbsu returned particular values, or why it did not run certain commands. If nbsu does not produce adequate information or if it seems to perform incorrectly, do the following: run nbsu with the -debug option to include additional debug messages in the nbsu_info.txt file.
Queries the operating system for configured Host Services and creates/reads the nbcc-server-aliases file for hostname normalization Through examination of the NetBackup configuration, identifies potentially tape based NetBackup media servers Gathers database/catalog information Analyzes the consistency of the gathered configuration and database/catalog information
144
For a description of the NBCC command options, refer to the NBCC man page. More information is available. See the NetBackup Commands manual.
To check the consistency of the NetBackup configuration and catalog/database information from a tape media perspective To gather and create a package bundle when directed to do so by Symantec Technical Support
2.0
145
Would you like to verify each addition prior to the update? [Y/y,N/n] N If NBCC is unable to determine the NetBackup version for ANY detected media server, is there a SINGLE version of NetBackup you would like associated to these media servers? [Y/y,N/n] Y Please enter the NetBackup primary version number [3,4,5,6] 6 2.1 Reading the initial list of IP addresses and associated hostnames from the D:\Program Files\VERITAS\NetBackup\bin\support\config\ nbcc-server-aliases file Verifying the IP addresses and hostnames contained in the D:\Program Files\VERITAS\NetBackup\bin\support\config\ nbcc-server-aliases file Pinging primary hostnames... Building NetBackup retention level list... Building NetBackup storage unit list... Obtaining EMM server names... Obtaining tpconfig NDMP configuration information... The following NDMP Attach hosts were detected in EMM, but are not active in EMM: Obtaining NetBackup unrestricted media sharing status... Configuration state = NO Obtaining NetBackup Media Server Groups... No Server Groups configured Gathering required NetBackup catalog information Start time = 2009-01-14 12:20:59 Gathering NetBackup EMM conflict table list Found 0 EMM conflict records Building NetBackup Image database contents list ... Found 5890 images in the Image database Obtaining NetBackup version from media servers
2.2
3.1 3.2
3.3
146
3.4
3.5 3.6
Building EMM database Media and Device configuration attribute lists ... Found 212 media records in the EMM database Building EMM database Unrestricted Sharing Media attribute lists Found 0 Unrestricted Sharing media records in the EMM database Building the EMM database Volume attribute list... Getting the EMM database Volume attributes from EMM server master1 ... Found 586 Volume attribute records in the EMM database Building NetBackup volume pool configuration list Server master1 Building NetBackup scratch pool configuration list Server master1 Gathering NetBackup EMM merge table list Found 0 EMM merge table records Summary of gathered NetBackup catalog information ... Catalog data gathering took 499 seconds to complete ***WARNING*** ** It took more than 60 seconds to collect all of the DB data. ** ** If backups are running, there is a risk that the data collected ** might not be consistent, because the DBs may have been updated ** while the data was being collected. ** ** If backups are running and it is possible, please stop all ** backups and run NBCC again. ** ** If it is not possible to stop all backups, just let ** NBCC complete. ...
4.0 5.0
Verifying required catalog components were gathered Beginning NetBackup catalog consistency check Start time = 2009-01-14 12:29:19 Processing EMM database Volume attribute records, pass 1 (of 2), 586 records to be processed Processed 586 EMM database Volume attribute records.
5.2
147
5.3
Checking for duplicate EMM server host names in Volume attribute data 5.4 Processing Image DB, pass 1 (of 2), 5890 to be processed 5890 images processed on pass 1 5.5 Processing EMM database Media attribute records, pass 1 (of 3), 212 records to be processed Processed 212 EMM database Media attribute records. 5.7 Check for duplicate media server names in the EMM database Media attribute data 5.8 Processing EMM database Media attribute records, pass 2 (of 3), 212 records to be processed CONSISTENCY_ERROR Oper_16_4 5.9 NetBackup catalog consistency check completed End time = 2009-01-14 12:29:21
6.1
Processing detected NetBackup server entries Summary of NBCC server processing ... Report complete, closing the .\output\nbcc\master2_NBCC_20090114_121931\nbcc-info.txt output file.
On Windows: install_path\NetBackup\bin\support\config\ The following example shows a sample server alias file listing.
# # # # # # # This file is used by NBCC as a basis for determining primary and associated alias hostnames as described in the NBCC -help output. List the primary IP address, all secondary IP addresses, the primary hostname and all aliases used for the same server by NetBackup in bpgetconfig SERVER related entries, storage units and Volume database hosts (eg, fully-qualified hostnames, hostnames associated with other
148
# network interfaces on the server, cluster node names) on a single line. # # # Note: Any hostnames that contain upper-case characters are converted # to lower-case characters by NBCC for processing. # # Accurate representation of both primary and secondary IP addresses and # primary and associated hostnames are important for both the gathering # and consistency checking of the NetBackup catalog data. # # Format of entries in the "nbcc-server-aliases" file: # # primary_IP [ secondary_IP ... ] primary_name alias [ alias ... ] # # Examples: # # ##.##.###.### ##.##.###.## nbserver NBserver nbserver.lab.symantec.com # ##.##.###.### ##.##.###.## nbmaster-bkup nbmaster zeus zeus-bkup # # Any entries that are added to this file must NOT start with a '#'. # # All lines starting with a '#' are ignored by NBCC. # # Empty lines are ignored by NBCC. # 10.10.100.10 nbu_server.com nbu_server 10.10.111.10 nbu_server1.com nbu_server1 10.10.112.10 nbu_server2.com nbu_server2
NBCC output
NBCC writes the information it gathers to packaged files in the following directory. On UNIX and Linux:
/usr/openv/netbackup/bin/support/output/nbcc/hostname_NBCC_timestamp
On Windows: install_path\NetBackup\bin\support\output\nbcc\hostname_NBCC_timestamp
149
available, NBCC compresses the archive file. Otherwise, the individual output files remain unarchived and uncompressed. An example of a compressed (UNIX) archive file that NBCC created is as follows:
/usr/openv/netbackup/bin/support/output/NBCC/host1_NBCC_20060814_ 164443/host1_NBCC_20060814_164443.tar.gz
where host1 is the name of the host where NBCC had been run. On UNIX platforms, NBCC supports the tar , compress and gzip utilities for UNIX file archiving and compression. On Windows platforms, NBCC supports the tar, Makecab and gzip utilities for Windows File archiving and compression.
For an NBCC description, examples, and how to gather NetBackup catalog/database information to send to Symantec support, refer to the NBCC -help command, or do the following: See the NetBackup Commands manual. NBCC is designed to be run on NetBackup master servers. In some cases, a non-functioning operating system or NetBackup process or service can prevent NBCC from running properly or completing. As NBCC progresses through the interrogation of various operating system or NetBackup components, it outputs what it is going to process to STDOUT. As NBCC processes catalog/database components, it displays how many records have been processed. The number of records processed is in direct relationship to the size of the catalog/database being processed. If NBCC detects a failure, related information will be output to STDERR. Information to STDOUT or STDERR are also output to the nbcc-info.txt file (if available).
Use a text editor to look for error notices in the nbcc-into.txt file. By default, NBCC sends error messages to standard error (STDERR) and also includes the messages in its output files under the header STDERR. If NBCC does not produce adequate information or if it seems to perform incorrectly, run NBCC with the -debug option to include additional debug messages in the nbcc-info.txt file.
150
For troubleshooting, run NBCC when the system is in the same state as when the problem occurred. For example, do not stop and restart the NetBackup processes after the error occurs or make a change to the server or network. NBCC may not be able to gather key information about the problem.
nbcc-info.txt file
The nbcc-info.txt file provides an overview of the environment where NBCC is run, and contains the following:
General operating system and NetBackup configuration information on the environment that NBCC detects A copy of the NBCC processing information that was displayed to STDOUT or STDERR.
This information would indicate the processing that NBCC had done. The Processing detected NetBackup server entries section of the nbcc-info.txt contains a Summary of NBCC server processing. This information summarizes the results of the processing of detected server entries. The following example show a successful summary of server processing from the nbcc-info.txt file.
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ + Primary and associated alias hostnames: + + server_1 + + Primary and associated IP addresses: + + 100.100.10.1 + + Sources: + + config_file bpstulist nbemmcmd + + MediaDB host = up VolDB host = no Disk STU host = no + + Member of Server Group = no + + Media server NetBackup version = 6.5 + + Primary hostname ping results: + + Successful + ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
The following example shows a failed summary of server processing from the nbcc-info.txt file.
-----------------------------------------------------------------+ - Primary and associated alias hostnames: + - server_2 + - Primary and associated IP addresses: + - 100.100.10.100 +
151
- Sources: + - config_file nbemmcmd + - MediaDB host = down VolDB host = no Disk STU host = no + - Member of Server Group = no + - Media server NetBackup version = 6.X + - Primary hostname ping results: + - Failed - No IP address detected + -----------------------------------------------------------------+
Data collection. NBCCR first collects information required to perform a repair. Repair qualification. Immediately before the suggested repair is applied, NBCCR verifies that the current status of the tape still qualifies for the requested repair. It recognizes that time has passed and the environment may have changed since the data was collected. If so, it reports in a history file that the repair is not qualified. Repair. Finally, NBCCR performs up to three steps of repair for every repair entry in the SRA file. An element may be modified to enable the repair and steps may be necessary after the repair. It implements the repairs with proven syntax, so ad hoc scripting occurs. If the repair fails during the repair operation, NBCCR tries to roll back the repair so that the corrective action does not introduce any new errors.
For a description of the NBCCR command options, refer to the NBCCR man page. More information is available. See the NetBackup Commands manual.
152
NBCCR files
NBCCR accepts one input file and creates two output files.
NBCCR accepts as input the Suggested Repair Action (SRA) file named NBCCA.SRA.txt. The file resides in the same directory as the utility. NBCCR creates NBCCR.History.txt, a history file of all the repair actions attempted, and places it in the same directory as the utility. It also creates an output file, NBCCR.output.txt. See the two sample output files that follow.
While the NBCCR utility runs, the following file appears in the same location as the NBCCR utility:
KeepOnTruckin.txt
To terminate NBCCR while it processes repairs, delete this file. This action causes NBCCR to complete the current repair, then shut down. Any other interruption causes undetermined results.
The following sample show an unsuccessful repair action. NBCCR did not find one or more images on the tape, so the repair action is not performed.
MContents for ULT000 MediaServerExpireImagesNotOnTapeFlag ExpireImagesNotOnTape flag not set Did NOT find Backup ID winmaster_1234315163 Copy 1 AssignTime 2009-02-11 \ 01:19:13 (1234315153) on ULT000 Leaving winmaster_1234315163 Copy 1 on ULT000 in ImageDB ULT000 MContents - One or more images from images catalog NOT found on tape MContents ULT000 status: ActionFailed
153
A time frame for the logs. The log types that you want to collect. Bundling and in-transit data compression.
In addition, you can preview the amount of log data to be copied. More information on the nbcplogs utility is available in the Commands manual.
154
Chapter
The first section lists the NetBackup status codes in numerical order and includes an explanation of what occurred along with recommended actions. See NetBackup status codes on page 156. The second section lists the same status codes but sorts them alphabetically according to the message. See NetBackup Messages on page 397.
If you see a status code without its associated message text, use the bperror command to determine the message, its explanation, and recommended action. On UNIX and Linux systems:
/usr/openv/netbackup/bin/admincmd/bperror -statuscode nstatuscode [-recommendation]
On Windows systems:
156
NetBackup status codes and messages About NetBackup status codes and messages
where statuscode is the number of the message. The following example shows how you use bperror to access error code information: On UNIX and Linux: /usr/openv/netbackup/bin/admincmd/bperror -statuscode
150
Note: The Symantec Technical Support site has a wealth of information that can help you solve NetBackup problems. Visit http://entsupport.symantec.com for comprehensive troubleshooting details.
NetBackup status codes and messages About NetBackup status codes and messages
157
A file or a directory path is more than 1023 characters long. For NetBackup Snapshot Client, the maximum path name length is 1000 characters for snapshot backups, not 1023. When the snapshot is created, a new mount point is added to the beginning of the file path. If the new mount point plus the original file path exceeds 1023 characters, the backup fails with status code 1. The progress log includes the entry "ERR-Skipping long dir path." You cannot open a file. The file may have been locked for some reason. The following information applies only to UNIX and Linux systems: NetBackup cannot get the link name of a file. The following information applies only to UNIX and Linux systems: NetBackup cannot process a sparse file. A read error that was encountered in a file. File is of an unknown type, or may be hidden. The following information applies only to UNIX and Linux systems: The lstat system call fails on a file that is eligible to be backed up. This error may be a permission problem. The following information applies only to UNIX and Linux systems: A file cannot be locked that has mandatory locking enabled. A synthetic backup job may terminate with a status code 1 under the following conditions:
- No images were found to synthesize (status code = 607). - TIR info has been pruned from component images (status code = 136). - Image format is unsupported (status code = 79).
158
NetBackup status codes and messages About NetBackup status codes and messages
The synthetic backup job logs the actual status code in the NetBackup error log. Refer to the documentation for the corresponding NetBackup error code for the corrective action to take.
A BMR job may terminate with status code 1 in the following situation: You save the BMR configuration and it returns an error even though the child jobs completed successfully. For information, examine the Detailed Status tab of the Job Details dialog box, or the nbjm unified log (originator ID 117). A policy that contains multiple backup scripts starts a scheduled backup of a UNIX database extension client. If it fails with a status code 1, some of the backup scripts returned a failure status. On clients using Windows Open File Backups (WOFB) to back up open or active files, the following may occur: Volume snapshots were not enabled successfully for the backup. The following logging messages should appear in the bpbkar32 logs if volume snapshots were not successfully enabled. If multistreamed backup jobs are enabled, log messages similar to the following appear that indicate volume snapshots were not enabled for the multistreamed backup job:
11:05:44.601 AM: [1536.724] <4> tar_backup::V_AddToFI_XBSAObj: INF - Volume snapshots not enabled for: D:\Directory1
If multistreamed backups were not enabled, log messages similar to the following appear, which indicate volume snapshots were not enabled for the non-streamed backup job:
1:59:41.229 PM: [2076.2088] <4> V_Snapshot::V_Snapshot_CreateSnapshot: INF =============================== 1:59:41.229 PM: [2076.2088] <4> V_Snapshot::V_Snapshot_CreateSnapshot: INF create snapshots for D:\Directory1 1:59:41.229 PM: [2076.2088] <4> V_Snapshot::V_Snapshot_CreateSnapshot: INF C:\Program Files\VERITAS\NetBackup\bin\bpfis "D:\ Directory1" 1:59:41.799 PM: [2076.2088] <4> V_Snapshot::V_Snapshot_ParseBpfisOutput: INF creation, FIS_ID: 1058813981 1:59:41.799 PM: [2076.2088] <4> V_Snapshot::V_Snapshot_ParseBpfisOutput: INF EXIT STATUS 11: system call failed
Attempting to
- Snapshot
- Snapshot creation
NetBackup status codes and messages About NetBackup status codes and messages
159
1:59:41.799 PM: [2076.2088] <4> V_Snapshot::V_Snapshot_CreateSnapshot: INF - Snapshot creation was not successful 1:59:41.799 PM: [2076.2088] <4> V_Snapshot::V_Snapshot_CreateSnapshot: INF ===============================
In this case, examine the bpfis logs for error messages regarding snapshot creation failures. More details are available on the bpfis logs. See the NetBackup Snapshot Client Administrators Guide. In the bpfis logs, the following messages may appear when snapshot creation fails for Windows Open File Backup: First message:
04:01:14.168 [376.2364] <32> onlfi_fi_split: VfMS error 11; see following messages: 04:01:14.168 [376.2364] <32> onlfi_fi_split: Fatal method error was reported 04:01:14.168 [376.2364] <32> onlfi_fi_split: vfm_freeze_commit: method: VSP, type: FIM, function: VSP_make 04:01:14.168 [376.2364] <32> onlfi_fi_split: VfMS method error 3; see following message: 04:01:14.168 [376.2364] <32> onlfi_fi_split: snapshot services: snapshot creation failed: invalid argument(s).
Cause: VSP was not enabled because the VSP snapshot for the backup did not meet the specified minimum time in the Busy File Wait VSP setting. Either increase the Busy File Timeout VSP setting (recommended setting: 300 seconds or more) or submit the backup job when the volume has less activity. Second message:
04:17:55.571 [1636.3224] <2> onlfi_vfms_logf: snapshot services: (null): There was an unexpected error while preparing the VSP snapshot transaction. Dumping the parameter array to provide more information: Error 112 from VSP_Prepare
Cause: VSP was not enabled for the backup because the client for the VSP Snapshot Cache files does not have enough free disk space. Free up disk space on the volumes being backed up. Third message: If Microsoft Volume Shadow Copy Service (VSS) is used as the Windows Open File Backup snapshot provider and snapshot creation fails, refer to the following: Event Viewers Application and System Logs for error information.
160
NetBackup status codes and messages About NetBackup status codes and messages
A snapshot error may have occurred. Also, you may have clients that use the Windows Open File Backup option to back up open or active files. In this case, a log message in the bpbkar32 debug log appears, which indicates that a snapshot error occurred. The following is an example of a snapshot error:
8:51:14.569 AM: [1924.2304] <2> tar_base::V_vTarMsgW: ERR Snapshot Error while reading test.file
The granular processing operation of an Exchange Granular backup job has failed to complete successfully. Check the bpbkar debug log for more information. A legacy (non-VSS) backup still truncates transaction logs if the job ends with status 1 because a granular processing operation failed. In this situation, the backup image is suitable for database recovery.
The following information applies only to Lotus Notes: This error occurs when archive style logging is not enabled for the Lotus Domino server on UNIX. It also occurs when another backup of the transaction logs is in progress. For troubleshooting guidance, such as a list of logs to gather, and for details on particular issues, refer to the Symantec support document 276903.pdf. See the troubleshooting topic of the appropriate guide for a description of troubleshooting tools:
NetBackup for Microsoft SQL Server Administrators Guide NetBackup for Lotus Notes Administrators Guide NetBackup for Enterprise Vault Agent Administrators Guide
NetBackup status codes and messages About NetBackup status codes and messages
161
Ensure that the clients server list contains entries for the master server and any media servers that can be used during a backup or restore. Examine the status or the progress log on the client for messages on why the restore failed. Also, check the All Log Entries report on the server. Check ownership and permission on directories where files are restored. Correct the problems that you find and retry the restore. If you tried to restore files from a FlashBackup backup after a NetBackup patch was installed, the patch may not have been installed properly. Follow the
162
NetBackup status codes and messages About NetBackup status codes and messages
installation instructions in the patch readme file and make sure the libsfr.so file is copied as instructed.
Verify that you have read access to the files. Check the status or the progress log on the client for messages on why the backup failed. Correct problems and retry the backup. The following information applies only to Windows clients: Verify that the account used to start the NetBackup Client service has read access to the files. The following information applies only to Macintosh clients: This code can be due to multiple backups being tried simultaneously on the same client. Some possible solutions are as follows:
Adjust the backup schedules. If the client is only in one policy, set the policy attribute, Limit jobs per policy, to 1. Set the NetBackup global attribute, Maximum jobs per client, to 1 (note that this limits all clients in all policies).
For a UNIX or Linux database extension client (for example, NetBackup for Oracle), this status code can indicate a problem with the script that controls the backup. Check the progress report on the client for a message such as Script exited with status code = number (the number varies). The progress log usually names the script. Check the script for problems. Also, check that the troubleshooting logs created by the database extension. See the NetBackup guide that came with the database extension for information on the scripts and the troubleshooting logs.
NetBackup status codes and messages About NetBackup status codes and messages
163
Recommended Action: Verify that you have read access to the files. Check the progress log or the status on the client for messages on why the archive failed. Correct problems and retry the archive. On Windows clients, verify that the account used to start the NetBackup services has read access to the files.
Check the All Log Entries and Problems reports to determine the system call that failed and other information about the error.
164
NetBackup status codes and messages About NetBackup status codes and messages
nbjm and nbproxy return status code 11 when an exception is processed, such
as when nbproxy obtains policy or configuration information. Examine the nbjm unified log (originator ID 117) or the nbproxy legacy log for more detail on the cause of the error.
A frequent cause is that the servers file system is full. For example, you may see a message similar to the following in the Problems report or bpdbm debug log:
06/27/95 01:04:00 romb romb failed (11) 06/27/95 01:04:01 romb romb process db_FLISTsend failed: system call media manager terminated by parent
06/27/95 01:05:15 romb romb backup of client romb that exited with status 11 (system call failed) On UNIX and Linux systems, run a df command on the /usr/openv/netbackup/db directory. If the df command does not reveal the problem, check the bpdbm debug logs or do a grep for the message
system call failed
In relevant files under the directory /usr/openv/netbackup/db/error/ On Windows systems, verify that the disk partition where NetBackup is installed has enough room.
Verify that the system is not running out of virtual memory. If virtual memory is the problem, turn off unused applications or increase the amount of virtual memory. To increase virtual memory on Windows, do the following in the order presented:
Display the Control Panel. Double-click System. On the Performance tab, set Virtual Memory to a higher value.
The following information applies only to UNIX and Linux clients: Check for a semaphore problem. This error may occur because the system does not have enough allocated semaphores. It is most commonly seen on Solaris servers when an RDBMS is also running. The symptoms of the problem vary. In some cases, error messages in the NetBackup log indicate a backup failure due to an error in semaphore operation.
NetBackup status codes and messages About NetBackup status codes and messages
165
Another symptom is the inability of the NetBackup device manager daemon, ltid, to acquire a needed semaphore. System requirements vary; thus, no definite recommendations can be made. One customer running NetBackup and ORACLE on a Solaris server made the following changes to the /etc/system file and then rebooted the system (boot -r). The changes were adequate.
set set set set semsys:seminfo_semmni=300 semsys:seminfo_semmns=300 semsys:seminfo_semmsl=300 semsys:seminfo_semmnu=600
Set these attributes to a value great enough to provide resources to all applications on your system.
Examine other debug logs or the progress log or status on the client. Examine the nbjm unified log (originator ID 117) for more detail on the cause of the error.
If you want the path for the disk storage unit to reside in the root file system: Open the Change Storage Unit dialog box in the Administration Console and select the check box: "This directory can exist on the root file system or system disk." If the specified path for the disk storage unit is not in the root file system or system device, do the following: Verify that the path is in a mounted file system. If the specified path for the disk storage unit is in the root file system or system device but does not need to be: Use the Change Storage Unit dialog box to specify a different (non-root) path in a mounted file system. Check the NetBackup Problems report. Try to determine the file and why the error occurred. A possible cause is a permission problem with the file. For
166
NetBackup status codes and messages About NetBackup status codes and messages
detailed troubleshooting information, create a debug log directory for the process that returned this status code. Then, retry the operation and check the resulting debug log.
For NetBackup Lotus Notes, point-in-time restore jobs may fail with a status 12. These jobs are initiated from the master server by using either the NetBackup Administration Console or the Backup, Archive, and Restore interface. Their failure is reported in the NetBackup tar log file. (For Windows, this file is located in the install_path\NetBackup\logs\tar folder. For UNIX and Linux, it is located in the /usr/openv/netbackup/logs/tar folder.) If the install path of the NetBackup master server is different from the install path of the NetBackup client, the following occurs: The automatic restore of Lotus transaction log extents during recovery of the Lotus database fail. Note that the Activity Monitor shows a status 0 (successful). The tar log on the client, however, shows success for the restore but a failure (status 12) for the Lotus database recovery. Perform the restore job from the Backup, Archive, and Restore interface on the NetBackup client. For NetBackup Snapshot Client, status code 12 may appear in the /usr/openv/netbackup/logs/bptm or bpdm log with the following:
tpc_read_config failed: cannot open file /usr/openv/volmgr/database/3pc.conf
This status code may indicate that the policy is configured with either of the following: The NetBackup media server or Third-Party Copy Device as the off-host backup method, but the 3pc. file does not exist or is in the wrong location. Instructions are available on how to create the 3pc. file. See the NetBackup Snapshot Client Administrators Guide.
For a FlashBackup policy, if the CACHE= entry follows the source data entry, the backup fails with status code 12. Messages such as the following appear in the /usr/openv/netbackup/logs/bpbkar logs on the client:
09:55:33.941 [6092] <16> bpfsmap: ERR - open_snapdisk: NBU snapshot enable failed error 3 09:55:33.942 [6092] <32> bpfsmap: FTL - bpfsmap: can't open snapshot disk /dev/rdsk/c4t1d0s3 errno 0 09:55:33.950 [6092] <16> bpbkar Exit: ERR - bpbkar FATAL exit status = 12: file open failed 09:55:33.956 [6092] <4> bpbkar Exit: INF - EXIT STATUS 12: file open failed
NetBackup status codes and messages About NetBackup status codes and messages
167
Change the order of the backup selections list so that the CACHE entry precedes the source data entry. (The source data entry specifies the raw partition that contains the file system to be backed up.)
A network communication problem has occurred on the master server, media server, or one of the clients. An I/O error that occurred during a read from the file system. Read of an incomplete file or a corrupt file. A socket read failure that is caused by a network problem or a problem with the process that writes to the socket. A problem specific to NetBackup Snapshot Client (see recommended actions). The first EV-SQL backup after a NetBackup installation failed. See the Troubleshooting section of the NetBackup for Enterprise Vault Agent Administrators Guide.
Check the NetBackup Problems report for clues on where and why the problem occurred. Check that network communication works properly. See Resolving network communication problems on page 40. For a FlashBackup client, check the /var/adm/messages log for errors like the following:
Mar 24 01:35:58 bison unix: WARNING: sn_alloccache: cache /dev/rdsk/c0t2d0s3 full - all snaps using this cache are now unusable
This error indicates that the cache partition is not large enough. If possible, increase the size of the cache partition. Or, if multiple backups use the same cache, reduce the number of concurrent backups. To reduce the number, reschedule some of them or reschedule the entire backup to a time when the file system is less active.
168
NetBackup status codes and messages About NetBackup status codes and messages
For detailed troubleshooting information, create a debug log directory for the process that returned this status code. Then retry the operation and check the resulting debug log. Ensure that the latest service packs for all products and components (SQL, Exchange, Notes, etc.) have been installed. Ensure that all the network hardware (NICs, hubs, switches, routers, etc.) throughout the environment are set to full duplex, not half duplex. Check the following items regarding the NICs in your system:
Upgrade to the latest NIC drivers throughout the system. Ensure that all NICs are set to full duplex, not half duplex. See Backup performance and NIC cards on page 72. Increase the timeout settings on the NIC. If NIC teaming is implemented, deactivate for testing purposes. Replace the NIC itself on the affected client or server.
For NetBackup Snapshot Client, status code 13 may appear in the /usr/openv/netbackup/logs/bpbkar log. The log can indicate the following:
The files to back up reside on an IDE drive as opposed to SCSI. The off-host backup method was set to either NetBackup media server or Third-Party Copy Device. If you use off-host backup, the disk that contains the client files must be a SCSI or Fibre Channel device. If the disk is an IDE drive, you may see the following in the /usr/openv/ netbackup/logs/bpfis log:
get_disk_info: FTL - /var/tmp/caa026fEU disk_inquiry failed. Errno = 25: Inappropriate ioctl for device
Processing /var get_disk_info() failed, status 13 tpc_get_disk_info() failed: err 13 bpbkar FATAL exit status = 13: file read failed EXIT STATUS 13: file read failed
The files to back up exist on a file system that is not mounted. The file system that is specified as the snapshot source must be mounted. If the snapshot source is not mounted but the mount point is present, NetBackup
NetBackup status codes and messages About NetBackup status codes and messages
169
may do the following: Try to take a snapshot of the directory preceding the directory that was specified as the snapshot source.
For the NetBackup media server method, you may need to increase the client read timeout value. In some environments, NetBackup may require more read time than the default value allows. If the client read timeout is insufficient, the backup may fail, which causes this error. To increase the client read timeout for all clients, do the following: In the NetBackup Administration Console, go to Host Properties > Master Servers > double-click the master server, then go to Properties > Timeouts. Then increase the client read timeout.
An I/O error that occurred during a write to the file system. Write to a socket failed. Cause of this failure: A network problem or a problem with the process that reads from the socket. Writing to a full disk partition.
Check the NetBackup Problems report for clues on where and why the problem occurred. For detailed troubleshooting information, create a debug log directory for the process that returned this status code. Then retry the operation and check the resulting debug log. Make sure that routers, bridges, and other network devices are all at "full" duplex. See Backup performance and NIC cards on page 72. Use a "sniffer" program to determine the number of packets being rejected or re-requested. On Windows systems, the client bpbkar log may contain a 10054 "Connection Reset Error"error (usually indicates a hardware error). Somewhere between the NetBackup client and server, the connection was reset. When NetBackup receives this error, it cannot continue the backup. This error has been attributed to the following:
170
NetBackup status codes and messages About NetBackup status codes and messages
A bad network interface card on a NetBackup client. A bad network interface card on the NetBackup server. Faulty routers. Any other applications that interfere with NetBackup connections.
On Novell systems, status code 14 has also been attributed to network issues. Try a "sniffer" program. The error occurs while you use the NetBackup-Java interface: The application server (bpjava processes) for the NetBackup-Java interface probably ran out of disk space in the file system containing /usr/openv/netbackup/logs/user_ops. The application server writes temporary files into directories in the /user_ops directory. Try clearing up disk space in the file system.
NetBackup status codes and messages About NetBackup status codes and messages
171
Recommended Action: Check the NetBackup Problems report for clues on why the failure occurred. For detailed troubleshooting information, create a debug log directory for the process that returned this status code. Then retry the operation and check the resulting debug log.
Check the NetBackup Problems report for clues on why the failure occurred. On a UNIX or Linux system, check that /etc/services and NIS services map (if applicable) have entries for the NetBackup services: bpcd, bpdbm, and bprd. On a Windows system, verify that the
%SystemRoot%\system32\drivers\etc\services file shows the correct entries
for the NetBackup Internet processes: bpcd, bpdbm, and bprd. Ensure that the following numbers match the settings in the services file: The NetBackup Client Service Port number and NetBackup Request Service Port number on the Network tab in the NetBackup Client Properties dialog box. To display this dialog box, start the Backup, Archive, and Restore interface and click NetBackup Client Properties on the File menu. The values on the Network tab are written to the services file when the NetBackup Client service starts. See Verifying host name and service entries on page 49.
Check the level of network activity. An overloaded network can cause this error. If these actions do not reveal the problem, create a debug log directory for the process that returned this status code. Then retry the operation and check the resulting debug log.
172
NetBackup status codes and messages About NetBackup status codes and messages
This error can also occur if the wrong parameters are used when you run a command line. Recommended Action: Do the following, as appropriate:
Check the NetBackup Problems report for clues. If the error occurs when you run a command on the command line, verify that the parameters are valid. This status code may occur if nbjm passes parameters but does not have a required parameter. Check the nbjm unified logs (originator ID 117) for the list of parameters that were passed. The following information pertains to NetBackup Snapshot Client.
If the following appears in the /usr/openv/netbackup/logs/bptm log as enabled on a third-party copy backup, multiplexing was enabled on a third-party copy backup.:
bptm: cannot perform Third-Party-Copy for multiplexed backups send_brm_msg: ERROR 20
bptm: EXITING with status 20 The Third-Party Copy Device off-host backup method is incompatible with multiplexing (the writing of two or more concurrent backup jobs to the same storage device). You must disable multiplexing for any third-party copy backups. If multiplexing is enabled, the backup fails.
The media server may not have the correct 3pc. file entry for the client disk that is needed for the backup. The following appears in the /usr/openv/netbackup/logs/bpbkar log:
14:45:00.983 [15773] <4> bpmap_mm_get_devid: GET_DEVICE_INDEX 1 EMC:SYMMETRIX:601092014000 14:45:00.986 [15773] <4> bpbkar child_send_keepalives: keepalive child started, pid = 15822 14:47:02.029 [15773] <4> bpmap_mm_get_devid: keepalive child: 15822 killed 14:47:02.030 [15773] <4> bpmap_mm_get_devid: DEVICE_INDEX -1 14:47:02.031 [15773] <16> bpmap_send_extend: ERR - can't obtain device id string EMC:SYMMETRIX:601092014000 14:47:33.167 [15773] <16> bpbkar Exit: ERR - bpbkar FATAL exit status = 227: no entity was found 14:47:33.167 [15773] <4> bpbkar Exit: INF - EXIT STATUS 227: no entity was found 14:47:33.168 [15773] <2> bpbkar Exit: INF - Close of stdout complete
NetBackup status codes and messages About NetBackup status codes and messages
173
This shows that a particular device cannot be found in the 3pc. file on the media server (14:47:02.031 [15773] <16> bpmap_send_extend: ERR can't obtain device id string EMC:SYMMETRIX:601092014000). The problem is one of the following:
The 3pc. file on the media server is outdated. Recreate the 3pc. file.
The media server is not on the same Fibre Channel network as the third-party copy device and client disk. As a result, the 3pc. file does not have a correct entry for the client disk. Run the bptpcinfo command with the -x client_name option; this option adds the client disk to the 3pc. file. For each disk that is added to the file by means of bptpcinfo -x client_name, you may need to add the devices worldwide name (wwn=). See the NetBackup Snapshot Client Configuration online document. See "Snapshot Client Assistance" in the NetBackup Snapshot Client Administrators Guide.
The HP VxFS snapshot mechanism requires a dedicated cache partition for each snapshot. A check is made in the mount table to make sure that the cache partition is not already in use. If the cache partition is already in use, status code 20 occurs. Check the /usr/openv/netbackup/logs/bpbkar log for a message similar to the following:
bpfsmap: FTL - bpfsmap: snapshot cache already in use, /dev/arrayvg/vol4c bpbkar Exit: ERR - bpbkar FATAL exit status = 20: invalid command parameter bpbkar Exit: INF - EXIT STATUS 20: invalid command parameter
If the snapshot cache partition is already in use, do one of the following: Set up your policy schedules to run at different times or use different cache partitions for each backup. If the Allow multiple data streams option is enabled, each stream must have its own dedicated cache partition.
Compare the NetBackup version level on the server to the version level on the clients by doing the following:
On UNIX or Linux NetBackup servers and clients, check the /usr/openv/netbackup/bin/version file. On Windows NetBackup servers, check the install_path\Netbackup\version.txt file or the About NetBackup item on the Help menu.
174
NetBackup status codes and messages About NetBackup status codes and messages
On Microsoft Windows clients, check the About NetBackup item on the Help menu. On NetWare target clients, check the Version entry in the bp.ini file. On Macintosh clients, check the version file in the bin folder in the NetBackup folder in the Preferences folder. If a Java interface displays the error, tell them how to enable the debug print manager in the Java startup file. Retry and compare the parameters that were logged on the Java log with the parameters listed in the commands usage statement.
If these actions do not reveal the problem, do the following: Create a debug log directory for the process that returned this status code (if the process uses legacy logging). Then retry the operation and check the resulting log.
Check the NetBackup Problems report for clues on where and why the failure occurred. If you cannot determine the cause from the Problems report, create debug log directories for the processes that returned this status code. Then, retry the operation and check the resulting debug logs. The following information applies only to Sun Solaris: Verify that all operating system patches are installed. See the Operating Notes section of the NetBackup Release Notes. The following information applies only to Windows systems: Verify that the recommended service packs are installed.
Check the NetBackup Problems report for clues on where and why the failure occurred. If you cannot determine the cause from the Problems report, create debug log directories for the processes that could have returned this status code. Then, retry the operation and check the resulting debug logs. The following information applies only to Sun Solaris:
NetBackup status codes and messages About NetBackup status codes and messages
175
Verify that all operating system patches are installed. See the Operating Notes section of the NetBackup Release Notes.
The following information applies only to Windows systems: Verify that the recommended service packs are installed.
Check the NetBackup Problems report for clues on where and why the failure occurred. If you cannot determine the cause from the Problems report, create debug log directories for the processes that could have returned this status code. Then, retry the operation and check the resulting debug logs. Corrupt binaries are one possible cause for this error. Load a fresh bptm from the install media to try to resolve the problem. The following information applies only to Sun Solaris: Verify that all operating system patches are installed. See the Operating Notes section of the NetBackup Release Notes. The following information applies only to Windows systems: Verify that the recommended service packs are installed. This error may occur during a restore to a Novell client. Note the following possible actions:
By default, the value for Novell "Maximum Concurrent Disk Cache Writes" may be too low (for example, 50); Novell recommends setting it to 100. A value of 100 increases the speed and efficiency of the disk cache writes. It increases the number of write requests to be run at one time. Change to or add the following settings in the Novell sys:system\autoexec.ncf file:
SET SET SET SET SET Maximum Maximum Maximum Maximum Maximum Packet Receive Buffers = 4000 Directory Cache Buffers = 4000 Concurrent Disk Cache Writes = 2000 Concurrent Directory Cache Writes = 2000 Physical Receive Packet Size = 1514
On Windows master servers, check the LIST_FILES_TIMEOUT value and ensure that this value is at least 1800.
176
NetBackup status codes and messages About NetBackup status codes and messages
Check the NetBackup Problems report for clues on where and why the failure occurred. If you cannot determine the cause from the Problems report, create debug log directories for the processes that could have returned this status code. Then retry the operation and check the resulting debug logs. A possible cause is a high network load. For example, this problem occurs with Cannot write to STDOUT when a Windows system that monitors network load detects a high load. It then sends an ICMP packet to other systems to inform them that the route those systems use was disconnected. The log messages were similar to the following:
01/31/96 14:05:23 ruble crabtree.null.com from client crabtree.null.com: ERR - Cannot write to STDOUT. Err no= 242: No route to host 01/31/96 14:05:48 ruble crabtree.null.com successfully wrote backup id crabtree.null.com_0823125016, copy 1, fragment 1, 440864 Kbytes at 628.538 Kbytes/sec 01/31/96 14:05:51 netbackup crabtree.null.com CLIENT crabtree.null.com POLICY Remote3SysFullW SCHED Sirius EXIT STATUS 24 (socket write failed)
The following information applies only to Sun Solaris: Verify that all operating system patches are installed. See the Operating Notes section of the NetBackup Release Notes. The following information applies only to Windows systems: Verify that the recommended service packs are installed. This error may occur during a restore to a Novell client. Note the following possible actions:
By default, the value for Novell "Maximum Packet Receive Buffers" may be too low (such as 100). To improve the restore performance, change this value to 2000. To change it, issue "SET Maximum Packet Receive Buffers=<value>" at the console, or enter the value in either of the following Novell files: sys:system\startup.ncf or sys:system\autoexec.ncf. Change to or add the following settings in the Novell sys:system\autoexec.ncf file:
NetBackup status codes and messages About NetBackup status codes and messages
177
Packet Receive Buffers = 4000 Directory Cache Buffers = 4000 Concurrent Disk Cache Writes = 2000 Concurrent Directory Cache Writes = 2000 Physical Receive Packet Size = 1514
nbjm is unable to connect to bpcd on the media server nbpem is unable to connect to nbproxy bptm on the media server is unable to connect to nbjm on the master server.
Verify that bpcompatd, vnetd, and Private Branch Exchange (PBX) are running. Information on how to start PBX is available. See Resolving PBX problems on page 67. If necessary, stop and restart NetBackup.
178
NetBackup status codes and messages About NetBackup status codes and messages
install_path\NetBackup\bin\bpdown install_path\NetBackup\bin\bpup
The following information applies only to a UNIX or Linux NetBackup master server: Verify that the bprd and the bpdbm processes are running. If these processes are not running, start them. On a Windows master server, verify that the NetBackup Request Manager and NetBackup database manager services are running. If these services are not running, start them. If these processes or services are running, examine the All Log Entries report for the time of the failure to determine where the failure occurred. Do one of the following:
If you cannot view the report or you get a cannot connect on socket error when you try to view it, do the following: Verify again that the NetBackup database manager service or daemon is running. Then, create a debug log directory for bpdbm, retry the operation, and check the resulting debug log. If you can view the report and have not found an entry that is related to this problem: Create debug log directories for the related processes that were running when the error first appeared. (This process frequently is bpbrm.) Then, retry the operation and check the resulting debug logs.
Verify that the server list specifies the correct master server.
The following information applies only to Windows systems: The master server is designated in the Server to use for backups and restores drop-down in the Specify NetBackup Machines and Policy Type dialog box. To display this dialog box, start the Backup, Archive, and Restore interface and click Specify NetBackup Machines and Policy Type on the File menu. The following information applies only to UNIX, Linux, and Macintosh systems: The master server is the first SERVER entry in the bp.conf file. On NetWare target clients: The master server name is the first SERVER entry in the bp.ini file. Make sure all recommended NetBackup patches were installed. Check the Symantec support Web site for current patch information. Go to www.support.symantec.com. Then select "NetBackup" followed by "files and updates."
NetBackup status codes and messages About NetBackup status codes and messages
179
If failure occurs when you run a user-directed backup from a client, make sure that a user-directed backup schedule exists at the master server. With NetBackup database extensions: Make sure that the applicable database product has the correct permissions allowing NetBackup to write to the progress log on the client. The following information applies only to UNIX and Linux systems: If bpdbm has quit when the shutdown script runs on a media server, carefully read the K77netbackup script. It contains details on how to prevent this problem. The script is in /usr/openv/netbackup/bin/goodies.
If you change the server list on a master server, stop, and restart the following: The NetBackup database manager and request daemons (UNIX and Linux) or the NetBackup database manager and NetBackup Request Manager services (Windows).
The following information applies only to UNIX and Linux systems: Verify that the /etc/services file (and NIS services if NIS is used) has entries for the NetBackup services: bpcd, bpdbm, and bprd.
On Windows, verify that the %SystemRoot%\system32\drivers\etc\services file has the correct entries for bpcd, bpdbm, and bprd. Also, verify that the following numbers match the settings in the services file: The NetBackup Client Service Port and the NetBackup Request Service Port on the Network tab in the NetBackup Client Properties dialog box. To display this dialog box, start the Backup, Archive, and Restore interface and click NetBackup Client Properties on the File menu. The values on the Network tab are written to the services file when the NetBackup Client service starts. See Verifying host name and service entries on page 49.
On Sun Solaris, verify that all operating system patches are installed See the Operating Notes section of the NetBackup Release Notes. On Windows, verify that the recommended service packs are installed. When the base NetBackup license key expires, daemons (such as bprd and bpdbm) terminate on the NetBackup server. If these daemons are not running, you are likely to encounter status code 25 errors in the Administration console. Install a valid base NetBackup license key, restart the daemons, and restart the console. For NetBackup Snapshot Client, the following applies: When many devices are configured on a media server, it may take a long time for the bptpcinfo command to generate the file 3pc. When the backup is run for the first time,
180
NetBackup status codes and messages About NetBackup status codes and messages
the backup may fail with status 25. Make sure that the /usr/openv/volmgr/database/3pc.conf file exists. If it does, rerun the backup. If the backup fails again, run the bptpcinfo manually to generate the file 3pc., then try the backup again.
NetBackup status codes and messages About NetBackup status codes and messages
181
Recommended Action: Check the NetBackup All Log Entries report for clues on where and why the failure occurred. For detailed troubleshooting information, create debug log directories for the processes that think may have returned this status code. Then, retry the operation and check the resulting debug logs.
Check the NetBackup All Log Entries report for clues on where and why the failure occurred. Check the permissions on the command to be run. For detailed troubleshooting information, create a debug log directory for the process that returned this status code. Then retry the operation and check the resulting debug log.
182
NetBackup status codes and messages About NetBackup status codes and messages
A process does not have permission to create the directory The path to the directory is not valid An IO error occurs
NetBackup status codes and messages About NetBackup status codes and messages
183
Check the NetBackup All Log Entries report to determine which directory was not created and why it was not created. In particular, check for a full disk partition. Check the permissions on the parent directory. Verify that NetBackup services are started with a "Logon as" account that has permission to create the directory. For detailed troubleshooting information, create a debug log directory for the process that returned this status code. Then retry the operation and check the resulting debug log.
184
NetBackup status codes and messages About NetBackup status codes and messages
See Using the Host Properties window on page 61. If a server or Windows NetBackup Remote Administration Console has more than one host name (for example, it has multiple network interfaces), do the following: verify that the master server has a server list entry for each of them. If you change the server list on a UNIX or Linux master server, for the changes to take effect do the following: stop and restart the NetBackup Request daemon (bprd) and NetBackup database manager daemon (bpdbm). If you change the server list on a Windows master server, stop, and restart the NetBackup Request Manager and NetBackup database manager services.
Try pinging the client from the server. If pinging is not possible, check for loose connections or other network problems. Verify that the server list settings are correct on both the client and the server. If the backup involves a media server, verify that these entries are correct on
NetBackup status codes and messages About NetBackup status codes and messages
185
both the master and the media server. For example, if a media server does not have a server list entry for the master, it does not accept connections from the master.
On Windows, the master server is designated on the Servers tab in the Master Server Properties dialog box. See Using the Host Properties window on page 61. On UNIX, Linux, and Macintosh systems the master server is the first SERVER entry in the bp.conf file. On NetWare target clients the master server name is the first SERVER entry in the bp.ini file.
If you change the server list on a UNIX or Linux master server, for the changes to take effect you must do the following: Stop and restart the NetBackup Request daemon (bprd) and NetBackup database manager daemon (bpdbm). On Windows, stop and restart the NetBackup Request Manager and NetBackup database manager services.
Status code 40 can also be due to denial of a mount request by the operator. This status code may occur if nbjm was unable to connect to bpbrm or to bpmount. Examine the nbjm unified log (originator ID 117) or the bpbrm or the bpmount legacy logs for more detail on the cause of the error.
On UNIX, Linux, or Windows clients, check for the following problems with the bpbkar client process. On Windows clients: The bpbkar client process is not hung. Due to the files and directories it scans, it has not replied to the server within the Client read timeout or Client connect timeout period. This error occurs during incremental backups when directories have thousands of unmodified files. For this case, use Host Properties on the NetBackup server to change Client connect timeout or Client read timeout. These settings are on the Timeouts and Universal Settings tabs, respectively, in the Master Server Properties dialog box. The default for these timeouts is 300 seconds. See Using the Host Properties window on page 61. You can also monitor CPU utilization to determine if this condition exists.
186
NetBackup status codes and messages About NetBackup status codes and messages
The bpbkar client process is hung on a file that has a mandatory locking set. For this case, add the following to the clients bp.conf file:
VERBOSE
Then retry the operation. The names of the files are logged on the debug log file in the /usr/openv/netbackup/logs/bpbkar directory before bpbkar processes them. The last file in the log is the file that causes problems. Note: Also, use these procedures for other "unknown" bpbkar hangs. If the problem is due to mandatory file locking, have NetBackup skip the locked files. Set LOCKED_FILE_ACTION to SKIP in the /usr/openv/netbackup/bp.conf file on the client. The bpbkar client process is not hung. Due to the files and directories it scans, it has not replied to the server within CLIENT_READ_TIMEOUT or CLIENT_CONNECT_TIMEOUT. This error occurs during backups when directories have thousands of unmodified files or during restores of the sparse files that have thousands of holes. For this case, try to add or modify the CLIENT_READ_TIMEOUT and CLIENT_CONNECT_TIMEOUT values in the servers /usr/openv/netbackup/bp.conf file. The default for the CLIENT_READ_TIMEOUT and CLIENT_CONNECT_TIMEOUT is 300 seconds if it is not specified. Use your systems ps command and monitor CPU utilization to help decide which of these conditions exist. When you finish the investigation of the problem, delete the /usr/openv/netbackup/logs/bpbkar directory, since the log files can become quite large and are not deleted automatically. Also delete /usr/openv/netbackup/bpbkar_path_tr so you do not generate larger log files than needed the next time you create directory /usr/openv/netbackup/logs/bpbkar.
NetBackup status codes and messages About NetBackup status codes and messages
187
Repair hard drive fragmentation. Try an application that is called Diskeeper Lite, which is part of the Windows Resource Kit. Make sure that enough space is available in \temp.
If the server cannot connect to the client, do the following: Create bpcd or bpbkar (UNIX, Linux, and Windows only) debug log directories on the client. Then retry the operation and check the resulting logs. If these logs do not provide a clue, create a bpbrm debug log on the server. Then retry the operation and check the resulting debug log. If the bpbrm log has entries similar to the following, the problem is in the routing configuration on the server:
bpbrm hookup_timeout: timed out waiting during the client hookup bpbrm Exit: client backup EXIT STATUS 41: network connection timed out
Verify that the client IP address is correct in the name service that is used. On UNIX and Linux clients, if both the NIS and the DNS files are used, verify that they match. See Resolving network communication problems on page 40.
If you use an AIX token ring adapter and the routed daemon is running, the timeout occurs because the token ring adapter creates dynamic routes. It then causes the routed daemon to crash. For a FlashBackup client, this error occurs if the file system being backed up is very large and has a very large number of files. It can also occur if a large number of concurrent data streams are active at the same time. To correct it, add CLIENT_READ_TIMEOUT to the /usr/openv/netbackup/bp.conf file and set it to increase the timeout interval. Make sure all recommended NetBackup patches are installed. Check the Symantec support Web site for current patch information. Go to www.support.symantec.com. Then select "NetBackup" followed by "files and updates". Add the CLIENT_READ_TIMEOUT values to the master server, media server, and client when a NetBackup database extension product is installed. The values should all be the same for each server. The value set is dependent on the size of the database being backed up. More information on CLIENT_READ_TIMEOUT is available. See the NetBackup Administrators Guide, Volume II. Make sure that enhanced authentication is configured correctly. For example, the following may result in status code 41: Host A is configured to use enhanced authentication with host B, but host B is not configured to use enhanced
188
NetBackup status codes and messages About NetBackup status codes and messages
authentication with host A. In this case, connections from host B to host A are likely to fail with status code 41. Connections from host A to B are likely to fail with authentication errors (status code 160).
Verify that both the client and the server are operational. Resolve any network communication problems. See Resolving network communication problems on page 40. Check the Problems report for clues.
Verify that the correct version of software is running on the client and the server. To enable detailed debug logging, do the following:
On the server, create a bpbrm debug log directory. On clients, create a bpcd debug log directory (created automatically on Macintosh clients). Increase the amount of debug information to include in the logs. See About using logs and reports on page 81.
Retry the operation and examine the logs. If you use bpstart_notify scripts on UNIX, Linux, or Windows clients, verify that messages are not written to stdout or stderr.
NetBackup status codes and messages About NetBackup status codes and messages
189
Check the Problems report for information about the error. Verify that the client and servers are operational and connected to the network. Create a debug log directory for the process that reported the problem and the operation. Examine the resulting debug log file for detailed troubleshooting information. See Resolving network communication problems on page 40.
On UNIX and Linux NetBackup servers and clients, check the /usr/openv/netbackup/bin/version file. On Windows NetBackup servers, check the install_path\netbackup\version.txt file or the About NetBackup item on the Help menu. On Microsoft Windows clients, check the About NetBackup item on the Help menu. On NetWare target clients, check the Version entry in the bp.ini file. On Macintosh clients, check the version file in the bin folder in the NetBackup folder in the Preferences folder.
The server is not listed on the client as a valid server. The client was configured to require encrypted backups, but the encryption attribute for the backup policy on the server was not selected. The evaluation license for the NetBackup Encryption product has expired on the server, but the NetBackup client was configured to require encrypted backups. As a result, the server tries to make a non-encrypted backup of the client. Since the client is configured to require encryption, the backup failed.
190
NetBackup status codes and messages About NetBackup status codes and messages
If the server is a valid server but is not listed on the client, add its name to the clients server list:
On Windows clients in the Specify NetBackup Machines and Policy Type dialog box, do the following: Add the server in the Server to use for backups and restores drop-down list. To display this dialog box, start the Backup, Archive, and Restore interface on the client. Then click Specify NetBackup Machines and Policy Type on the File menu. On UNIX, Linux, and Macintosh clients, add a SERVER entry in the bp.conf file. On NetWare target clients, add a SERVER entry in the bp.ini file. If you continue to have problems, more information is available: See Resolving network communication problems on page 40. See Verifying host name and service entries on page 49.
To make non-encrypted backups of the client, set CRYPT_OPTION on the client to allowed or denied. Refer to the NetBackup Encryption Administrators Guide. If the NetBackup encryption evaluation license has expired on the server and you want to continue encrypting backups of the client, do the following: Purchase a permanent encryption license key and add it to the server. After you add the permanent encryption license key, check the attributes of the backup policy to make sure that encryption is selected. To check the validity of an evaluation license key, do the following: On Windows, go to the Help menu on the NetBackup Administration window on the NetBackup server and select License Keys. If the evaluation key is not listed in the NetBackup License Keys window, the key has expired. Use this window to add the new permanent encryption key. On UNIX and Linux, use the following command on the server:
/usr/openv/netbackup/bin/admincmd/get_license_key
Select option f to list the active license keys and features. If the evaluation key is not listed, the key has expired. Use this command to add the new permanent encryption key.
Verify that the name service (or services) used by the client are configured to resolve the host names of the NetBackup server correctly.
NetBackup status codes and messages About NetBackup status codes and messages
191
Verify that the name service (or services) used by the server are configured to resolve the host name of the NetBackup client correctly. Try to ping the client from the server and the server from the client. If you continue to have problems, do the following: See Resolving network communication problems on page 40.
The NetBackup policy configuration on the master server. The General tab in the following dialog boxes: NetBackup Client Properties and Specify NetBackup Machines and Policy Type (on Microsoft Windows and NetWare nontarget clients). To display these dialog boxes, start the Backup, Archive, and Restore interface on the client. For the General tab, click NetBackup Client Properties on the File menu; click Specify NetBackup Machines and Policy Type on the File menu. The bp.conf file on UNIX, Linux, and Macintosh clients. The bp.ini file on NetWare target clients.
On clients and servers, verify that the name service is set up to resolve the NetBackup client names correctly. On UNIX and Linux clients, verify that the clients host name is in the /etc/hosts file or the YP hosts file or NIS maps.
Make sure that software is installed on the client and it is the correct version. If necessary, reinstall the client software. Check for full file systems on the client. Enable detailed debug logging on the client by doing one of the following:
Create bpcd and bpbkar (UNIX, Linux, and Windows only) debug log directories.
192
NetBackup status codes and messages About NetBackup status codes and messages
On a UNIX or Linux client, add the VERBOSE option to the /usr/openv/netbackup/bp.conf file.
On PC clients, increase the debug or log level. See How to set logging levels on UNIX clients on page 118. See How to set logging levels on PC clients on page 122. Retry the operation and examine the resulting logs.
On UNIX or Linux systems, use the UNIX sum command to check for corrupt binaries.
Create a bpbkar debug log directory (UNIX, Linux, and Windows clients only). Create a bpcd debug log directory. (This log is created automatically on Macintosh clients.) On UNIX and Linux clients, add the VERBOSE option to the /usr/openv/netbackup/bp.conf file.
On PC clients, increase the debug or log level. See How to set logging levels on UNIX clients on page 118. See How to set logging levels on PC clients on page 122. Retry the operation and examine the resulting logs.
This error may occur if nbjm terminated while a backup job was running. Examine the unified logging files on the NetBackup server for nbjm (117) for more detail on the error. All unified logging is written to /usr/openv/logs (UNIX and Linux) or install_path\NetBackup\logs (Windows). On UNIX and Linux clients, check for core files in the / directory. On UNIX and Linux clients, check the system log (/usr/adm/messages on Solaris) for system problems. This problem can sometimes be due to a corrupt binary.
NetBackup status codes and messages About NetBackup status codes and messages
193
On UNIX and Linux clients, use the UNIX sum command to check the bpcd, bpbkar, and tar binaries, which are located in /usr/openv/netbackup/bin on the client. Reinstall them if they are not the same as in the client directory under /usr/openv/netbackup/client on the server. On a Windows client, check the bpinetd.exe, bpcd.exe, bpbkar32.exe, and tar32.exe files, which are located in the install_path\NetBackup\bin folder on the client. Reinstall the client if these files are as follows:
Not the same size as on other Windows clients Not at the same release level Do not have the same NetBackup patches as other Windows clients
Verify that the NetBackup database manager service or daemon is running. Verify that the file system that contains the NetBackup catalogs has enough space. Create bpbrm and bpdbm debug log directories on the server and retry the operation. Look in the debug log files to find more information on the problem.
Verify that the requested volume is available and an appropriate drive is ready and in the UP state.
194
NetBackup status codes and messages About NetBackup status codes and messages
If this error occurs during a read operation (restore, duplicate, verify), the drives could be busy. Increase the timeout for the media mount that the NetBackup global attribute specifies, to allow more time to mount and position the media. Verify that the tape is not a cleaning tape that is configured as a regular volume. When an Automated Cartridge System controls the robot, verify that the ACSLS system is up. If it is an initial installation, a procedure is available. See Resolving common configuration problems on page 28. On Windows, check the Event Viewer Application log for the error messages that indicate why the tape mount did not complete. On UNIX and Linux, check the system log.
Create bpbrm debug log directories on the server. On a UNIX or Linux NetBackup server, add the VERBOSE option to the bp.conf file. On a Windows NetBackup server, set the Global logging level option on the Logging tab in the Master Server Properties dialog box. To display this dialog box, refer to the following topic: See Using the Host Properties window on page 61. Increase the unified logging levels by using the vxlogcfg command as explained in the following procedure: See Configuring and using unified logging on page 94. Retry the operation and check the resulting debug logs for detailed troubleshooting information.
NetBackup status codes and messages About NetBackup status codes and messages
195
Some third-party software packages (for example, a personal firewall product) can affect the TCP/IP stack in Windows. This action can cause a loss of connection between the NetBackup server and the bpcd process on the client. NetBackup tries to set SO_REUSEADDR (allow local address reuse) on the inbound socket connection so that the port can be handed off from bpinetd.exe (the NetBackup Client Service) to bpcd.exe. Some products may not allow this functionality due to the various methods that can be used to violate system security. Recommended Action: Do the following, as appropriate:
For a Macintosh or NetWare target client: Verify that the server does not try to connect when a backup or restore is already in progress on the client. These clients can handle only one NetBackup job at a time. On a Macintosh, check for activity by examining the NetBackupListen file in the following folder on the startup disk of the Macintosh client:
:System Folder:Preferences:NetBackup:logs:inetd:log.mmddyy
Perform the following procedure: See Resolving network communication problems on page 40. On UNIX and Linux clients, verify that the /usr/openv/netbackup/bin/bpcd binary exists and that it is the correct size. Check the /etc/inetd.conf file to make sure the bpcd path is correct in the following entry:
bpcd stream tcp nowait root /usr/openv/netbackup/bin/bpcd bpcd
On the systems that include the following, make sure that the client name is in the masters /etc/hosts file: NetBackup master, media, and clients (with NetBackup database extension products installed on one or more clients). Completely uninstall the third-party software package on the client that causes the failure. Or, contact the software manufacturer to investigate if other configuration options or workarounds are possible.
196
NetBackup status codes and messages About NetBackup status codes and messages
No process listening activity occurs on the bpcd port The number of connections to the bpcd port is more than the network subsystem can handle with the listen() call
Make sure the NetBackup client software is installed. Verify that the bpcd and bprd port numbers in the %SystemRoot%\system32\drivers\etc\services file on the server matches the setting on the client. Verify that the NetBackup Client Service Port number and NetBackup Request Service Port number on the Network tab in the NetBackup Client Properties dialog match the bpcd and bprd settings in the services file. To display this dialog, start the Backup, Archive, and Restore interface on the server and click NetBackup Client Properties on the File menu. The values on the Network tab are written to the services file when the NetBackup Client service starts. Verify that the NetBackup client service is running. Use the following command to see if the master server returns correct information for the client:
install_path\VERITAS\NetBackup\bin\bpclntcmd -pn
NetBackup status codes and messages About NetBackup status codes and messages
197
Make sure the NetBackup client software is installed. Verify that the bpcd port number on the server (either NIS services map or in /etc/services) matches the number in the clients services file.
For a Macintosh or NetWare target client, verify that the server is not trying to connect when a backup or restore is already in progress on the client. These clients can handle only one NetBackup job at a time. Additional help is available. See Resolving network communication problems on page 40.
If the server is a valid server, verify that it is in the server list on the client. If necessary add it as follows:
On Windows clients: Add the server on the Server to use for backups and restores drop-down in the Specify NetBackup Machines and Policy Type dialog box. To display this dialog box, start the Backup, Archive, and Restore interface on the client. Then click Specify NetBackup Machines and Policy Type on the File menu. On UNIX and Linux, and Macintosh clients: add a SERVER entry in the bp.conf file. On NetWare target clients: add a SERVER entry in the bp.ini file.
If you change the server list on a UNIX or Linux master server, do the following for the changes to take effect: stop and then restart the NetBackup Request daemon (bprd) and NetBackup database manager daemon (bpdbm). On Windows,
198
NetBackup status codes and messages About NetBackup status codes and messages
stop and restart the NetBackup Request Manager and NetBackup Database Manager services.
Create a bpinetd debug log directory on the client. Increase the debug or log level. See How to set logging levels on UNIX clients on page 118. See How to set logging levels on PC clients on page 122. Retry the backup and examine the resulting logs to determine the cause of the failure.
Create a bpcd debug log directory on the client. On a UNIX or Linux client, add the VERBOSE option to the /usr/openv/netbackup/bp.conf file. On PC clients, increase the debug or log level as explained in the debug log topics in Chapter 3. Retry the backup and examine the resulting logs to determine the cause of the failure.
Check the bpcd debug log to determine the servers peer name and what comparisons are made. The bpcd process compares NetBackup server list entries to the peer name of the server that tries the connection. It rejects the connection if the names are different. If necessary, change the server list entry on the client to match the peer name. On Windows clients, check the following:
Verify that NetBackup for Windows software was installed under a Windows administrator account. If NetBackup is under another type of account, reinstall it under an administrator account. The installation completes successfully under a non-administrator account except for the following: the NetBackup Client service is not added to Windows and the NetBackup server cannot access the client. Verify that the Windows TCP/IP service specifies the domain server that resolves names for the subnet that contains the NetBackup servers. UNIX, Linux, and Windows clients are frequently not on the same subnet and use different domain servers. When this condition exists, NetBackup
NetBackup status codes and messages About NetBackup status codes and messages
199
servers and Windows clients may be able to ping one another, but the server still cannot access the Windows client.
The preceding items may not resolve this problem. See Resolving network communication problems on page 40. If NetBackup use multiple network interfaces with media servers, make sure the interface names appear in the clients /usr/openv/netbackup/bp.conf file. For the Enterprise Vault Agent: See the Troubleshooting section of the NetBackup for Enterprise Vault Agent Administrators Guide.
Run a df to see if the system can read the mount table. On an SCO system, code 60 can occur because the mount-point path name exceeds 31 characters (the maximum number on an SCO system). The bpbkar debug log on the client shows a message similar to the following:
bpbkar build_nfs_list: FTL - cannot statfs net Errno: 42406
To eliminate these errors for future backups, create a mount point with a shorter name and symbolically link the long name to the short name.
For detailed troubleshooting information, create a bpbkar debug log directory. Then retry the operation and check the resulting log.
200
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: The client did not send a ready message to the server within the allotted time. Recommended Action: Do the following, as appropriate:
Create a bpcd debug log directory on the client. On a UNIX or Linux client, add the VERBOSE option to the /usr/openv/netbackup/bp.conf file. On PC clients, increase the debug or log level. See How to set logging levels on PC clients on page 122.
On a UNIX, Linux, or Windows client, create the bpbkar debug log directory on the client. On Windows clients, verify that the NetBackup Client service is running. On a UNIX or Linux client, use the ps command to check for a client process that uses too much CPU time. Retry the backup and examine the debug logs for clues on the cause of the failure.
Create a bptm debug log directory on the server. On a UNIX or Linux NetBackup server, add the VERBOSE option to the bp.conf file. On a Windows NetBackup server, set the Verbose logging level option on the Logging tab in the Master Server Properties dialog box. See Using the Host Properties window on page 61. Retry the operation and check the bptm debug log file for information on the drive, robot, and tape that causes the timeout. On a Windows NetBackup server (master or media): check the Event Viewer Application log for the error messages that indicate why the tape mount did not complete.
NetBackup status codes and messages About NetBackup status codes and messages
201
Verify that the server did not crash. On UNIX, Linux, and Windows clients, enable bpbkar debug logging.
Create a bpbkar debug log directory. On a UNIX or Linux client, add the VERBOSE option to the bp.conf file. On a Windows client, set Verbose on the TroubleShooting tab in the NetBackup Client Properties dialog box. To display this dialog box, start the Backup, Archive, and Restore interface on the client. Then select NetBackup Client Properties from the File menu.
On other PC clients except Macintosh, create a debug log directory for bpcd (the bpcd log is created automatically on Macintosh). Increase the amount of information that appears in the logs. See How to set logging levels on UNIX clients on page 118. See How to set logging levels on PC clients on page 122.
On the master server, create bpbrm debug log directories. Increase the logging level for the diagnostic and debug logs for nbpem, nbjm, and nbrb. Use the vxlogcfg command as described in the following topic: See Configuring and using unified logging on page 94. Retry the operation and check the resulting debug logs.
Verify that the server did not crash. Set up debug logging.
On the server, create a bpbrm debug log directory. On UNIX, Linux, and Windows clients, create a bpbkar debug log directory. On other PC clients except Macintosh, create a debug log directory for bpcd (the bpcd log is created automatically on Macintosh).
202
NetBackup status codes and messages About NetBackup status codes and messages
Increase the amount of information that appears in the logs. See How to set logging levels on UNIX clients on page 118. See How to set logging levels on PC clients on page 122.
Verify that the server did not crash. Set up debug logging.
On the server, create a bpbrm debug log directory. On UNIX, Linux, and Windows clients, create a bpbkar debug log directory. On other PC clients except Macintosh, create a debug log directory for bpcd (the bpcd log is created automatically on Macintosh).
Increase the amount of information that appears in the logs. See How to set logging levels on UNIX clients on page 118. See How to set logging levels on PC clients on page 122.
NetBackup status codes and messages About NetBackup status codes and messages
203
In an off-host backup (NetBackup Media Server or Third-Party Copy Device), code 69 may indicate that the file list contains the ALL_LOCAL_DRIVES entry. NetBackup does not support the ALL_LOCAL_DRIVES entry for off-host backup. Remove the ALL_LOCAL_DRIVES entry from the file list. For the Enterprise Vault Agent only: See the Troubleshooting section of the NetBackup for Enterprise Vault Agent Administrators Guide.
Verify that the correct file list is specified for this client. On Windows clients, verify that the account used to start the NetBackup Client service has read access to the files. If you back up a network drive or a UNC (universal naming convention) path, do the following: use the Services application in the Windows Control Panel to verify that the NetBackup Client service does not start under the SYSTEM account. The SYSTEM account cannot access network drives. To back up network drives or UNC paths: change the NetBackup Client service startup to log in as a user that has permission to access network drives. Check the All Log Entries report for clues. To set up debug logging, do one of the following:
On UNIX, Linux, and Windows clients, create a debug log directory for bpbkar. On other PC clients except Macintosh, create a debug log directory for bpcd (the bpcd log is created automatically on Macintosh).
204
NetBackup status codes and messages About NetBackup status codes and messages
Increase the amount of information that appears in the logs. See How to set logging levels on UNIX clients on page 118. See How to set logging levels on PC clients on page 122. Retry the operation and check the resulting debug logs. On Novell systems, do one of the following:
For the nontarget version of NetBackup for NetWare, the backup policy type must be "NetWare," and the files list should include a forward slash (/) only. There should be nothing else in the files list. To check the policy type and files list, start Backup Policy Management and right-click the name of a policy. Click the Attributes tab to check the policy type; click the Files tab to check the contents of the files list. For the target version, the backup policy type must be "Standard," and the policy files list must be formatted as follows: /target_name where a forward slash precedes the variable target_name. To check the policy type and files list, start Backup Policy Management and right-click the name of a policy. Click the Attributes tab to check the policy type; click the Files tab to check the contents of the files list. For the target version, the following NetWare message may be another indicator of incorrect policy type (this message appears in the Novell clients bpcd log):
unable to connect to service, scheduled access not specified
For either the target or the nontarget version of NetBackup for NetWare, make sure that the following are all at the same version: the NetWare loadable modules (NLMs) SMDR and TSAxxx (such as TSAFS and TSANDS). If they are not at the same version, status 71 may occur.
NetBackup status codes and messages About NetBackup status codes and messages
205
Explanation: The bpstart_notify script returned a nonzero exit code. Recommended Action: Check the bpstart_notify script on the client to see if it performs as expected.
206
NetBackup status codes and messages About NetBackup status codes and messages
Verify that the command is specified correctly. For NetBackup Snapshot Client only, do the following: The policy file list may contain the files that do not reside within a file system that was designated as the snapshot source. To apply a snapshot method to the backup of individual files, the snapshot source must be a file system. (It cannot be a raw partition or Volume Manager volume.) The files in the policy file list must reside within that file system. Run the command manually to see if the wanted result is produced. For detailed troubleshooting information, set up debug logging, as follows:
On UNIX, Linux, and Windows clients, create a debug log directory for bpbkar. On other PC clients except Macintosh, create a debug log directory for bpcd (the bpcd log is created automatically on Macintosh). Increase the amount of information that appears in the logs. See How to set logging levels on UNIX clients on page 118. See How to set logging levels on PC clients on page 122. Retry the operation and check the resulting debug log.
Check the NetBackup Problems Report for additional information on why the command failed. The bpbkar debug log shows the command that was run. Create a debug log directory for bpbkar. Retry the operation and retry the resulting debug log. Try running the vos command manually to duplicate the problem.
NetBackup status codes and messages About NetBackup status codes and messages
207
On Windows, use the Activity Monitor or the Services application in the Windows Control Panel to see if the NetBackup Device Manager service is running. If it is not running, start it. To enable verbose logging, place VERBOSE on a line by itself in the install_path\Volmgr\vm.conf file before you start the service. On UNIX and Linux, use vmps to see if ltid is running and if necessary start ltid in verbose mode with the following command:
/usr/openv/volmgr/bin/ltid -v
Or, add a VERBOSE entry to the /usr/openv/volmgr/vm.conf file. Create the vm.conf file if necessary.
On UNIX and Linux, check the system logs to verify that ltid starts.
Note: On UNIX and Linux systems, ltid, and on Windows systems, the NetBackup Device Manager service, is used only if devices are attached to the system.
208
NetBackup status codes and messages About NetBackup status codes and messages
To resolve the problem, increase the CLIENT_READ_TIMEOUT value. In this instance, set the value to 900.
NetBackup Problems report Event Viewer Application log (Windows) System log (UNIX and Linux) Typically, this status code indicates a drive configuration problem that allows more than one process at a time to open the device. On UNIX and Linux, the problem may be due one or more of the following:
Two (or more) devices were configured that are the same physical device (for different densities perhaps). Verify that none of the /dev files that were used for these devices have the same major or minor numbers. Links exist in the file system that allow users access to the drives. The configuration for the drives was modified (in the administrator interface or vm.conf) and the Media Manager device daemon, ltid, was not restarted. Verify the configuration and then start ltid.
On Windows, the problem may be that the Media and Device Management device configuration was modified but the NetBackup Device Manager service
NetBackup status codes and messages About NetBackup status codes and messages
209
was not restarted. Verify the configuration and restart the NetBackup Device Manager service.
On Windows, make sure the tapes are not write protected. For detailed troubleshooting information:
Create a debug log directory for bpdm (if the device is disk) or bptm (if the device is tape). On UNIX and Linux, restartltid in the verbose mode by running the following:
/usr/openv/volmgr/bin/ltid -v
Or, add a VERBOSE entry to the /usr/openv/volmgr/vm.conf file. Create the vm.conf file if necessary.
On Windows, enable verbose logging by adding VERBOSE on a line by itself in the install_path\Volmgr\vm.conf file. Then, stop and restart the NetBackup Device Manager service. Retry the operation and check the resulting debug log files. On Windows systems, look at the
install_path\VERITAS\NetBackup\db\media\errors log for a drive that
the /usr/openv/netbackup/bin/goodies/support/support script output) for a drive that frequently produces errors.
For NetBackup Snapshot Client only: If the following message appears in the /usr/openv/netbackup/bptm log, and the values for key, asc, and ascq are all zero (0x0) as shown in this example message:
tape error occurred on extended copy command, key = 0x0, asc = 0x0, ascq = 0x0
210
NetBackup status codes and messages About NetBackup status codes and messages
your host-bus adapter and its driver are probably not supported by NetBackup Snapshot Client. The host-bus adapters supported in the release are listed in the NetBackup Release Notes.
NetBackup Problems report to determine the device or media that caused the error System and error logs for the system (UNIX and Linux) Event Viewer Application and System logs (Windows)
If NetBackup writes backups to a disk file, verify the following: the fragment size that is configured for the disk storage unit is not greater than the maximum file size that the operating system specifies. On Windows, make sure the tapes are not write protected. If bpbackupdb was used to back up the NetBackup catalog to a disk path on a UNIX or Linux system, do the following: The image you try to write may be greater than the maximum file size that the operating system specifies. Tape files do not have this limit. If the media is tape, check for the following:
A defective or a dirty drive. Clean it or have it repaired (refer to the tpclean command for robotic drives). The wrong media type. Verify that the media matches the drive type you use. Defective media. If it is defective, use the bpmedia command to set the volume to the FROZEN state so it is not used for future backups. Incorrect drive configuration. Verify the Media and Device Management and system configuration for the drive. For example, on UNIX and Linux the drive may be configured for fixed mode when it must be variable mode. See the NetBackup Device Configuration Guide for more information. This configuration often results in the media being frozen with the message "too many data blocks written, check tape and drive block size configuration." See Troubleshooting frozen media on page 63.
NetBackup status codes and messages About NetBackup status codes and messages
211
Explanation: The system device driver returned an I/O error while NetBackup reads from tape or a disk file. Recommended Action: For additional information, check the following:
NetBackup Problems report to determine the device or media that caused the error System and error logs for the system (UNIX and Linux) Event Viewer Application and System logs (Windows) Check for the following:
A defective or a dirty drive. Clean it or have it repaired (see the tpclean command for cleaning). Incorrect drive configuration. Verify the Media and Device Management and system configuration for the drive. For example, on UNIX and Linux, the drive may be configured for fixed mode when it must be variable mode. More information is available. See the NetBackup Device Configuration Guide. Defective media. In this case, you may not be able to recover all the data on the media. Use the bpmedia command to set the volume to the FROZEN state so it is not used for future backups. See Troubleshooting frozen media on page 63. The wrong media type. Verify that the media matches the drive type you use.
NetBackup Problems report to determine the device or media that caused the error System and error logs for the system (UNIX and Linux) Event Viewer Application and System logs (Windows) Check for the following:
A defective or a dirty drive. Clean it or have it repaired (see the tpclean command for cleaning).
212
NetBackup status codes and messages About NetBackup status codes and messages
Incorrect drive configuration. Verify the Media and Device Management and system configuration for the drive. For example, on UNIX and Linux, the drive may be configured for fixed mode when it must be variable mode. See the NetBackup Device Configuration Guide for more information. Defective media. In this case, some data may be lost. Use the bpmedia command to set the volume to the FROZEN state so it is not used for future backups. See Troubleshooting frozen media on page 63. The wrong media type. Verify that the media matches the drive type you use.
NetBackup Problems report to determine the device or media that caused the error System and error logs for the system (UNIX and Linux) Event Viewer Application and System logs (Windows) Check for the following:
A defective or a dirty drive. Clean it or have it repaired (see the tpclean command for cleaning). Defective media. In this case, some data may be lost. Use the bpmedia command to set the volume to the FROZEN state so it is not used for future backups. See Troubleshooting frozen media on page 63.
NetBackup status codes and messages About NetBackup status codes and messages
213
multiplexing, which increases the amount of shared memory required for the operation. An entry similar to the following may be seen in a NetBackup log or report:
system cannot allocate enough shared memory
If you see this type of message, refer to your platform vendor documentation for instructions on how to increase shared memory on your system. For older levels of Solaris: you may need to change one or more default System V Shared Memory settings to prevent jobs failing with "system cannot allocate enough shared memory," as follows.
For Solaris 9, the default shminfo_shmmax value is 8 megabytes. You can place the following line in your /etc/system file to increase this setting. A value of 32 megabytes has been used in this example. Your system may require a greater value under some circumstances such as a high value for the NetBackup multiplexing parameter. According to Sun Microsystems documentation, setting this parameter to its maximum possible value has no side effects. (This parameter is not applicable to Solaris 10).
set shmsys:shminfo_shmmax=33554432
For Solaris 9, the default shminfo_shmmni value is 100. You can place the following line in your /etc/system file to increase this setting. The default value is usually sufficient for NetBackup. In some circumstances, such as installing a NetBackup media server on a large database server, this setting may need to be increased. A value of 220 has been used in this example. (This parameter is not applicable to Solaris 10).
set shmsys:shminfo_shmmni=220
Note: If you modify any of these values in the /etc/system file, you must reboot the system with boot -r for the new settings to take effect. Refer to your vendor documentation for detailed instructions on how to modify these values. Note that these shminfo parameters are not applicable to Solaris 10.
214
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: The tape manager (bptm) or disk manager (bpdm) received no data when it performed a backup, archive, or duplication. This error can occur for incremental backups where no data was backed up because no files have changed. Recommended Action: Do the following, as appropriate:
Check the All Log Entries report. For detailed debug information, create bpdm or bptm debug log directories on the server. If the client is Windows, also create a bpbkar debug log directory on the client. Retry the operation and check the resulting debug logs. For additional information, check the following:
NetBackup Problems report to determine the device or media that caused the error System and error logs for the system (UNIX and Linux) Event Viewer Application log (Windows)
Verify the Media and Device Management and system configuration for the drive. For example, on UNIX and Linux, the drive may not be set for variable mode in a case where NetBackup requires that mode. Check the NetBackup Device Configuration Guide for drive configuration information. Verify that the Media and Device Management configuration for the backup device matches what is specified for the storage unit in the NetBackup policy. Verify that you use the correct media in the drive. For detailed debug information, create a bpdm or bptm debug log directory (whichever applies) on the server. If the client is Windows, also create a bpbkar debug log directory on the client. Retry the operation and check the resulting debug logs. If the error occurred during duplication or a Vault session that uses an Alternate Read Server to perform duplication, do the following: verify that the Alternate Read Server has access to the source media.
NetBackup status codes and messages About NetBackup status codes and messages
215
Check the All Log Entries report for more information. Check the NetBackup Media Lists report to see if the catalog is intact. If the catalog is not intact, you may want to reload it from the latest NetBackup catalog backup volume. Verify that the disk partition on which the catalog resides has enough space. If these actions do not explain the problem, check the NetBackup Problems report. For detailed troubleshooting information, create a bptm debug log directory on the server and retry the operation. Check the resulting debug log file. Contact customer support and send appropriate problem and debug log sections.
Perform a bpverify of the affected image to determine if it is written correctly. Check the NetBackup Problems report for additional information about the error. Verify the Media and Device Management and system configuration for the drive. For example, on some UNIX and Linux systems if you do not configure the drive for variable-mode block size writes, the following occurs: the backup images that write to the media produce this error when you attempt to restore the image. The following sequence of events occurs:
216
NetBackup status codes and messages About NetBackup status codes and messages
In this case, configure the drive for variable-mode block sizes and suspend the media that writes on that device. See the NetBackup Device Configuration Guide. The images that were written to those media may be restorable (platform dependent), but single file restores are almost guaranteed to fail. You can expire these media and regenerate the backups. Or you can attempt to duplicate the images on these media to another device and then expire the original copy.
This error has occurred on re-labeled and value-added 8-mm tape drives where the drives micro code incorrectly processes a "forward space record" SCSI command. If the problem is not one of those discussed, create a debug log directory for either bpdm or bptm and retry the operation. Check the resulting debug log file.
If the volume is in a robot and the robot supports bar codes, do the following: perform a Compare Contents with Volume Configuration (on Windows) or Compare robot contents with volume configuration (on UNIX and Linux). The resulting report shows the media ID that was found and validates its slot number with what is in the volume configuration. Then, either change the physical location in the robot or change the volume configuration to show the correct slot. If the volume was mounted on a nonrobotic drive, verify that the correct volume was mounted and assigned.
NetBackup status codes and messages About NetBackup status codes and messages
217
NetBackup Problems report to determine the device or volume that caused the error System and error logs for the system (UNIX and Linux) Event Viewer Application and System logs (Windows)
For detailed troubleshooting information, create a debug log directory for bptm and retry the operation. Check the resulting debug log files.
If the storage unit is a robot with empty slots, add more volumes (remember to specify the correct volume pool).
If there are no empty slots, move some media to nonrobotic and then add new volumes. If you have difficulty keeping track of your available volumes, try the available_media script: On UNIX and Linux, this script is in:
/usr/openv/netbackup/bin/goodies/available_media
218
NetBackup status codes and messages About NetBackup status codes and messages
install_path\NetBackup\bin\goodies\available_media.cm d
This script lists all volumes in the volume configuration, and augments that list with information on the volumes currently assigned to NetBackup.
Set up a scratch volume pool as a reserve of unassigned tapes. If NetBackup needs a new tape and none are available in the current volume pool, it does the following: moves a tape from the scratch pool into the volume pool that the backup uses. If the storage unit and volume pool appear to have media, verify the following:
Volume is not FROZEN or SUSPENDED. Check for this condition by using the NetBackup Media List report. If the volume is frozen or suspended, use the bpmedia command to unfreeze or unsuspend it (if that is wanted). See Troubleshooting frozen media on page 63. Volume has not expired or exceeded its maximum number of mounts. The EMM database host name for the device is correct. If you change the EMM database host name, stop and restart the following: the Media Manager device daemon, ltid, (if the server is UNIX or Linux) or the NetBackup Device Manager service (if the server is a Windows system). The correct host is specified for the storage unit in the NetBackup configuration. The host connection should be the server (master or media) with drives connected to it. The Media and Device Management volume configuration has media in the correct volume pool. Unassigned or active media is available at the required retention level. Use the NetBackup Media List report to show the retention levels, volume pools, and status (active and so on) for all volumes. Use the NetBackup Media Summary report to check for active volumes at the correct retention levels.
The NetBackup bptm process is rejected when it requests media from the vmd process (UNIX and Linux) or the NetBackup Volume Manager service (Windows). The cause of this problem is that the process or service cannot determine the name of the host that makes the request. This error can be due to an incorrect network configuration that involves the following:
NetBackup status codes and messages About NetBackup status codes and messages
219
Create bptm and vmd debug log directories and retry the operation. Examine the bptm debug log to verify that bptm connects to the correct system. If an error is logged, examine the vmd log. On UNIX and Linux, the vmd log is:
/usr/openv/volmgr/debug/daemon/log.xxxxxx
If this storage unit is new and this attempt to use it is the first, stop and restart NetBackup on the master server. Note: The mds unified logging files (OID 143) usually show the NetBackup media selection process.
Check the NetBackup Problems report to determine the reason for the failure. The most common cause is that the NetBackup Device Manager service (on
220
NetBackup status codes and messages About NetBackup status codes and messages
Windows) or the Media Manager device daemon (ltid) (on UNIX and Linux) is not running. Start it if necessary.
If you duplicate backups or use Vault to duplicate backups, this error could indicate the following: the Alternate Read Server does not have access to the tape where the original backup resides.
Enable debug logging for bparchive, bpbackup, bplist, or bprestore (as appropriate) by creating debug log directories for them. On UNIX and Linux, if a nonroot user has problems, verify that the directory that was created has mode 666. Look for and correct any reported errors. Retry the operation and check the resulting logs. If the logs do not reveal the problem, use the command line version of the command and correct any problems that are reported on stderr.
NetBackup status codes and messages About NetBackup status codes and messages
221
Explanation: The process that sends mail could not close the pipe to the server. Recommended Action: Make sure that mail is configured on the client. For detailed troubleshooting information, create a bpcd debug log directory and retry the operation. Check the resulting bpcd debug log.
222
NetBackup status codes and messages About NetBackup status codes and messages
If the error occurred on a command line, examine the standard error output from the command for an explanatory message. Refer to the format for the date options in the usage statement for the command. Look up the locale of the master server. Compare the date format of that locale with the date format on the usage statement for the command. Check the NetBackup Problems report for clues. If the error appears in a Java interface, enable the debug print manager in the Java startup file. Retry and compare the parameters that are logged in the Java log with the parameters listed in the commands usage statement. If these actions do not reveal the problem, create a debug log directory for the process that returned this status code. Then retry the operation and check the resulting debug log.
On a UNIX and Linux client, add the following line to the top of the /usr/openv/netbackup/bp.conf file:
SERVER = server_name
On a Microsoft Windows or nontarget NetWare client, do the following: add the server name on the Server to use for backups and restores drop-down in the Specify NetBackup Machines and Policy Type dialog box. To display this
NetBackup status codes and messages About NetBackup status codes and messages
223
dialog box, start the Backup, Archive, and Restore interface on the client. Then click Specify NetBackup Machines and Policy Type on the File menu.
On an NetWare target client, add the server name to the bp.ini file. On a Macintosh client, add the SERVER = server_name line to the bp.conf file in the NetBackup folder in the Preferences folder.
Specify at least one file to be restored. This status code may occur if nbjm is running and a stream discovery fails to find all stream files. Examine the nbjm unified log (originator ID 117) for more details on the cause of the error.
Ensure that the Veritas Security Services is installed and configured. For complete installation instructions, see the Veritas Security Services Installation Guide. Check that both parties have a valid certificate. Examine the expiry date that is listed from a bpnbat -WhoAmI. For example:
bpnbat -WhoAmI Name: JDOG Domain: MYCOMPANY
224
NetBackup status codes and messages About NetBackup status codes and messages
Issued by: /CN=broker/[email protected]/O=vx Expiry Date: Sep 19 12:51:55 2007 GMT Authentication method: Microsoft Windows
Operation completed successfully. Shows an expiry date of September 19th, 2007. After 12:51:55 GMT this credential is no longer valid and a new credential is required.
If you run from the NetBackup Administration console, close and reopen the console. The console automatically obtains a credential for the currently logged in identity, if possible. By default these certificates are valid for 24 hours. To set a longer default time, consult the Veritas Security Services Administrators Guide. Ensure the following: that the certificates for both sides use the same broker or are children of the same root broker and that trusts were established between them. See the Veritas Security Services Administrators Guide for more information on broker hierarchies and how to establish trust relationships between brokers. Ensure that connectivity between the physical systems in question is possible. If general sockets cannot connect between the machines (such as ping and telnet), issues within the network unrelated to NetBackup can cause this problem. Ensure that the system has sufficient swap space and the following directories are not full:
If you use the default groups, make certain that the user attempts to perform an operation appropriate for that group. For example, a member of NBU_Operators is unable to modify policy information, which is a permission reserved for administrator roles.
NetBackup status codes and messages About NetBackup status codes and messages
225
Ensure that the system has sufficient swap space and the following directories are not full:
If you use your own defined groups and permissions, first determine the object with which the operation is associated. Then add the permissions relative to the action. For example, a user is required to up and down drives but currently does not have permission to do so. Verify that the user belongs to the correct authorization group. If necessary, verify that the group has Up and Down permissions on the Drive object within the Group Permission tab. If necessary, increase the verbosity level of NetBackup to locate what object and what permissions are required for the failing request. The pertinent lines in the debug logs look similar to the following:
17:19:27.653 [904.872] <2> GetAzinfo: Peer Cred Info. Name: JMIZZLE Domain: MYCOMPANY Expiry: Sep 24 21:45:32 2003 GMT Issued by: /CN=broker/[email protected]/O=vx AuthType: 1 17:19:37.077 [904.872] <2> VssAzAuthorize: vss_az.cpp.5082: Function: VssAzAuthorize. Object NBU_RES_Drives 17:19:37.077 [904.872] <2> VssAzAuthorize: vss_az.cpp.5083: Function: VssAzAuthorize. Permissions Up 17:19:40.171 [904.872] <2> VssAzAuthorize: vss_az.cpp.5166: Function: VssAzAuthorize. 20 Permission denied.
In this example, the user JMIZZLE attempts to perform an operation that requires the Up permission on the Drives object. To diagnose the problem, examine the group(s) to which the user belongs to ensure that the appropriate group includes the Up permission. (Up is a member of the Operate permission set for Drives.)
226
NetBackup status codes and messages About NetBackup status codes and messages
Ensure that the Authorization Service or Daemon is running. Refer to the Veritas Security Services Administrators Guide for more information on authentication and authorization daemons. Ensure that you are in communication with the correct master server. Within the bp.conf files on the local server, verify that the entry AUTHORIZATION_SERVICES specifies the proper host name (fully qualified) of the Authorization service. For example, AUTHORIZATION_SERVICE = machine2.mycompany.com 0 specifies that the server contacts machine2 to perform Authorization checks. Also ensure that this entry matches that of the master server. Ensure that the system has sufficient swap space and the following directories are not full:
Ensure that the server that contacts the master has a valid certificate. The machine certificate can be examined as follows: For UNIX and Linux:
# bpnbat -WhoAmI -cf /usr/openv/var/vxss/credentials/machine3.mycompany.com
For Windows:
Bpnbat WhoAmI -cf "c:\Program Files\VERITAS\NetBackup\var\vxss\credentials\machine3.my company.com"
If the expiry date was exceeded, use bpnbat -LoginMachine to obtain a new credential for the machine. See the Netbackup Commands manual for more information on bpnbat.
NetBackup status codes and messages About NetBackup status codes and messages
227
The server that attempts the check is not authorized to examine the Authorization database. Ensure that bpnbaz -ShowAuthorizers re-tuned the machines identity. Ensure that the machine has a machine credential under the directory as follows: Program Files\VERITAS\var\vxss\credentials (Windows) /usr/openv/var/vxss/credentials (UNIX and Linux) This credential should have the full name of the machine as in the following example: machine1.company.com.
Check that the maximum number of open sockets to the Authorization database was not exhausted. Use netstat to determine the number of sockets that are opened to port 4032 on the Authorization server and that refer to the following configurations: Windows:
HKLM\SOFTWARE\VERITAS\Security\Authorization\Communication\ClientMaxConnections
UNIX and Linux: /etc/vx/vss/VRTSaz.conf entry "ClientMaxConnections" If the maximum number of open connections was reached, you may need to increase the number of maximum open connections. An increase in the number of open connections increases the memory footprint of the Authorization service or daemon. Note that extreme increases in the maximum number of connections can cause performance degradation.
Check the NetBackup Problems report for additional information about the error. For detailed troubleshooting information, create admin and bpdbm debug log directories and retry the operation. Check the resulting debug logs. Contact customer support and send the appropriate problem and debug log sections that detail the error.
228
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: When NetBackup attempted to back up its internal catalogs, no media IDs were defined in the catalog backup configuration. Recommended Action: Add the media IDs to the catalog backup configuration. Verify that the media IDs are in the NetBackup volume pool.
Check the NetBackup Problems report for additional information about the error. The following are some possible causes:
The path does not exist. On a UNIX or Linux system, one of the paths contains a symbolic link.
After you determine which path cannot be accessed, correct the path names in the catalog backup configuration.
NetBackup status codes and messages About NetBackup status codes and messages
229
Explanation: Only one NetBackup catalog backup can be active at any given time. Certain operations are not allowed during an online catalog backup. (These include catalog archiving, catalog compression, and expiration of the last copy of an image.) Recommended Action: Retry the operation after the catalog backup completes.
Fix the problem that was reported in the error message in the bprecover output. Refer to one of the following topics to identify which NetBackup services to shut down before a NetBackup database recovery attempt: See Catalog recovery from an online backup on page 591. The NetBackup services should be shut down except for the NetBackup Client Service, which must be running for the database recovery to succeed. Check the NetBackup Problems report for additional information about the error.
230
NetBackup status codes and messages About NetBackup status codes and messages
A tape storage unit in the storage unit group has preference over any disk storage units since tape storage units usually have more capacity. If the storage unit with the most unused capacity is busy, NetBackup skips it. NetBackup then selects an available storage unit with the next largest, unused capacity. If the storage unit with the greatest unused capacity is the one that lacked capacity when the job first failed, the scheduler tries it again. That storage unit may have more unused capacity now than it did when the job failed.
Either free sufficient space or add more space to the file system for this storage unit. Lower the high capacity mark for this disk storage unit. Configure the policies to access it through a storage unit group that provides alternative storage to use when this storage unit fills up. Ideally, if an image exceeds the file systems high capacity mark, it also completes successfully. This image leaves the storage unit in a "full" state (over the high capacity mark). The storage unit then is not assigned to other jobs until its capacity falls under its high capacity mark. If the Staging attribute is set on the disk storage unit that did not have enough capacity, it may be unable to create free space. It cannot create space because the backups that are staged to the disk are not relocated (eligible to be deleted from the staging storage unit). Ensure that stagings relocation (duplication)
NetBackup status codes and messages About NetBackup status codes and messages
231
jobs successfully copy enough images to provide sufficient free space for new backups.
Check the NetBackup Problems report for additional information about the error. Check the system log for reported problems. For detailed troubleshooting information, create bpdbm, bptm, and bprd debug log directories on the master server. Increase the unified logging level by using the vxlogcfg command. See Configuring and using unified logging on page 94. Retry the operation and check the resulting debug logs.
Examine the NetBackup Problems report. Create a debug log directory for bprd and retry the operation. Check the resulting debug log to determine the connection and the client names. Depending on the request type (restore, backup, and so on), you may need or want to do the following:
Change the clients configured name. Modify the routing tables on the client. On the master server, set up an altnames directory and file for this client See the NetBackup Administrators Guide, Volume I.
232
NetBackup status codes and messages About NetBackup status codes and messages
or
On a UNIX or Linux master server, create a soft link in the NetBackup image catalog.
A request was made that is not recognized. This usually results from different versions of NetBackup software being used together. If a client receives this error in response to a list or restore request, then the DISALLOW_CLIENT_LIST_RESTORE or DISALLOW_CLIENT_RESTORE. option exists in the bp.conf file on the following: a UNIX or Linux NetBackup server or in the registry on a Windows NetBackup server. These options deny list and restore requests from all NetBackup clients.
If you suspect that the software versions are the problem, verify that all NetBackup software is at the same version level.
On UNIX and Linux NetBackup servers and clients, check the /usr/openv/netbackup/bin/version file. On Windows NetBackup servers, check the install_path\netbackup\version.txt file or the About NetBackup item on the Help menu. On Microsoft Windows clients, check the About NetBackup item on the Help menu. On NetWare target clients, check the Version entry in the bp.ini file. On Macintosh clients, check the version file in the bin folder in the NetBackup folder in the Preferences folder.
If the server denies list and restore requests, remove the DISALLOW_CLIENT_LIST_RESTORE and DISALLOW_CLIENT_RESTORE options from
NetBackup status codes and messages About NetBackup status codes and messages
233
the bp.conf file on the following: a UNIX or Linux NetBackup server or from the registry on a Windows NetBackup server. Then, stop and restart the NetBackup request daemon (UNIX and Linux) or NetBackup Request Manager service (Windows).
For detailed troubleshooting information, create bpdbm, bprd, and admin debug log directories. Retry the operation and check the resulting debug logs.
The most recent backup image for the client is a synthetic full or cumulative backup
234
NetBackup status codes and messages About NetBackup status codes and messages
The TIR information from one or more of the component images before the synthetic full (or cumulative) backup is pruned
The TIR information is automatically restored to the image catalog if you do the following: expire the synthetic backup (full or cumulative) image and try to rerun the synthetic backup job for the client. However, the synthetic backup job fails with this error if the TIR restore fails due to bad, missing, or vaulted media or a bad drive. Recommended Action: Reimport the TIR information into the catalog of each component image (from which the TIR information was pruned). Then rerun the synthetic backup job. The TIR information can be imported into the image catalog by initiating a true image restore of any file from that component image. The restore process also restores the TIR information in the image catalog.
For most snapshot backups, the following message appears in the /usr/openv/netbackup/logs/bpfis log on the client:
09:36:48.299 [527] <32> fs_dev_rt_check: FTL - snapshot method: nbu_snap abort - required VxFS dynamic linked libraries for
NetBackup status codes and messages About NetBackup status codes and messages
235
NetBackup are not installed. Please visit the Symantec support web site, and refer to Technote number 262225 for further information.
For the backups that run from a FlashBackup policy, the following appears in the /usr/openv/netbackup/logs/bpbkar log on the client:
10:09:56.566 [1146] <32> bpfsmap: FTL - bpfsmap: FlashBackup abort - required VxFS dynamic linked libraries for NetBackup are not installed. Please visit the Symantec support web site, and refer to Technote number 262225 for further information. 10:09:56.571 [1146] <16> bpbkar Exit: ERR - bpbkar FATAL exit status = 142: file does not exist 10:09:56.573 [1146] <4> bpbkar Exit: INF - EXIT STATUS 142: file does not exist
Recommended Action: Install the VxFS dynamic libraries on the NetBackup client as described in Technote 262225 and try the backup again.
236
NetBackup status codes and messages About NetBackup status codes and messages
Examine the NetBackup Problems and All Log Entries reports. Create bprd and bpdbm debug log directories and retry the operation. Check the resulting logs to see the system error message that resulted from the attempt. If another process has the port, use other system commands to determine the process. Based on this research, either change the port number in your services file or map or terminate the process that acquired the port. On UNIX and Linux, another possible cause for this error is the use of the kill command to terminate bprd or bpdbm. If you have to stop bprd, use the Terminate Request Daemon option on the Special Actions menu in bpadm. To stop bpdbm, use the /usr/openv/netbackup/bin/bpdbm -terminate command. Use of the kill command to stop these processes can leave them unable to bind to their assigned ports the next time they are started. To identify a bprd or a bpdbm problem, look for lines similar to the following in the debug log for the respective process:
<16> getsockbound: bind() failed, Address already in use (114) <32> listen_loop: cannot get bound socket. errno = 114 <4> terminate: termination begun...error code = 146
If the problem persists longer than ten minutes, it may be necessary to restart the server.
NetBackup status codes and messages About NetBackup status codes and messages
237
Restart the service at a later time and investigate the system problems that limit the number of processes. On Windows systems, check the Event Viewer Application and System logs.
238
NetBackup status codes and messages About NetBackup status codes and messages
Verify that all software is at the same version level. For detailed troubleshooting information, create bpdbm and admin debug log directories and retry the operation. Check the resulting debug logs.
For a NetBackup Snapshot Client, the policy storage unit was set to Any_available and the off-host backup method was set to Third-Party Copy Device or NetBackup Media Server. Do not choose Any_available. A particular storage unit (such as nut-4mm-robot-tl4-0) must be specified when Third-Party Copy Device or NetBackup Media Server is specified as the off-host backup method. For an NDMP policy type, verify the following: a storage unit of type NDMP is defined and the NDMP host value matches the host name of the client. For example, if the NDMP policy specifies toaster as the client, the configuration for the storage unit must specify toaster as the NDMP host. For a policy type other than NDMP, verify that the policy specifies a Media Manager or Disk type storage unit.
NetBackup status codes and messages About NetBackup status codes and messages
239
An Enterprise Vault Agent related error. The following errors can result in a status code 156:
A snapshot-backup related error regarding Windows Open File Backup or Snapshot Client.
For the Enterprise Vault Agent: See the Troubleshooting section of the NetBackup for Enterprise Vault Agent Administrators Guide. For a Windows Open File Backup Snapshot Provider that uses VSS: See the Troubleshooting section of one of the following guides:
NetBackup for VMware Administrators Guide NetBackup for Hyper-V Administrators Guide NetBackup Snapshot Client Administrators Guide
The VSS cache files may be too small for the number of files being backed up using VSS. If bpbkar debug logs are turned on, a message similar to the following appears in the bpbkar debug log for the backup.
8:51:14.569 AM: [1924.2304] <2> tar_base::V_vTarMsgW: ERR failure reading file: D:\ test.file (WIN32 5: Access is denied. ) 8:51:14.569 AM: [1924.2304] <4> tar_base::V_vTarMsgW: INF - tar message received from dos_backup::tfs_readdata 8:51:14.569 AM: [1924.2304] <2> tar_base::V_vTarMsgW: ERR -
240
NetBackup status codes and messages About NetBackup status codes and messages
Snapshot Error while reading test.file 8:51:14.569 AM: [1924.2304] <4> tar_base::V_vTarMsgW: INF message received from tar_backup::nextfile_state_switch 8:51:14.569 AM: [1924.2304] <2> tar_base::V_vTarMsgW: FTL Backup operation aborted! 8:51:14.569 AM: [1924.2304] <2> tar_base::V_vTarMsgW: INF Client completed sending data for backup 8:51:14.569 AM: [1924.2304] <2> tar_base::V_vTarMsgW: INF STATUS 156: snapshot error encountered
- tar - EXIT
To increase the VSS cache size by using the Shadow Copy configuration in Windows 2003, do the following in the order listed:
In Windows, right-click My Computer and select Manage. In the console tree, right-click Shared Folders, select All Tasks, and select Configure Shadow Copies. Select the volume where you want to make changes, and then select Settings. In the Settings dialog box, change the Maximum Size setting to either of the following: No Limit or a size large enough to suit the requirements of your installation and your usage of VSS.
For backups using Snapshot Client and the NAS_Snapshot method, with or without SnapVault:
If the backup fails with status code 156, consult the bpfis legacy log, in /usr/openv/netbackup/logs (UNIX and Linux) or install_path\NetBackup\logs (Windows). If the bpfis directory does not already exist, you must create it and rerun the job. If necessary, increase the logging level and retry the job. See About using logs and reports on page 81. On Windows clients, when restoring files from a backup that is made with the NAS_Snapshot method, log into the "NetBackup Client Service" as the Administrator account, not as the local system account. Otherwise, the backup fails with status 156.
In Windows Services, double-click the NetBackup Client Service. Then check the Log On tab: if the service is not logged in as Administrator, stop the service. Change the login to the Administrator account and restart the service. Retry the restore.
NetBackup status codes and messages About NetBackup status codes and messages
241
The file system that is specified as a snapshot source is not mounted. In this case, you may see the following in the /usr/openv/netbackup/logs/bpfis log:
17:12:51 bpfis: FTL - snapshot creation failed, status 156 17:12:51 bpfis: INF - EXIT STATUS 156: snapshot error encountered
Make sure that the file system that is specified for the snapshot source has been mounted.
The file system that is specified as the snapshot source does not correspond to the file system that contains the actual files (as opposed to symbolic links to the files). The mounted file system for the snapshot source must contain the actual files, not symbolic links. If items in the file list, such as /oracle, is a symbolic link to /export/home/oracle, the snapshot source must specify /export, or /export/home, not /oracle. vxvm is selected as the snapshot method but the snapshot source is not configured over a Veritas Volume Manager (VxVM) volume. In this case, you may see the following in the /usr/openv/netbackup/logs/bpfis log:
17:12:51 bpfis: FTL - snapshot creation failed, status 156 17:12:51 bpfis: INF - EXIT STATUS 156: snapshot error encountered
242
NetBackup status codes and messages About NetBackup status codes and messages
17:12:51 onlfi_vfms_logf: INF - vxvm_freeze: Snapshot source /cockpit1 on device /dev/dsk/c1t0d0s6 is not on a VxVM volume 17:12:51 delete_mount_point: INF - Deleted mount point /tmp/_cockpit1_coc_group1:3518 17:12:51 onlfi_freeze: FTL - VfMS error 11; see following messages: 17:12:51 onlfi_freeze: FTL - Fatal method error 17:12:51 onlfi_freeze: FTL - vfm_freeze: method: vxvm, type: FIM, function: vxvm_freeze 17:12:51 onlfi_freeze: FTL - VfMS method error 9; see following message: 17:12:51 onlfi_freeze: FTL - vxvm_freeze: Snapshot source /cockpit1 on device /dev/dsk/c1t0d0s6 is not on a VxVM volume 17:12:51 onlfi_thaw: INF - fim=vxvm 17:12:51 onlfi_thaw: WRN - /cockpit1 is not frozen
Make sure that the snapshot source is configured over a Veritas Volume Manager (VxVM) volume.
vxvm was selected as the snapshot method, but a Veritas Volume Manager snapshot mirror of the snapshot source volume had not been created before running the backup, or another backup is currently running that uses the snapshot mirror. In either case, you may see the following in the /usr/openv/netbackup/logs/bpfis log:
17:12:51 onlfi_freeze: FTL messages: 17:12:51 onlfi_freeze: FTL 17:12:51 onlfi_freeze: FTL FIM, function: vxvm_freeze 17:12:51 onlfi_freeze: FTL message: 17:12:51 onlfi_freeze: FTL available snapshot mirror - VfMS error 11; see following - Fatal method error - vfm_freeze: method: vxvm, type: - VfMS method error 3; see following - find_ready_snapshot: Cannot find
Refer to the NetBackup Snapshot Client Administrators Guide for information on how to create a snapshot mirror on the client before you run the backup.
vxvm was selected as the snapshot method, and a Veritas Volume Manager snapshot mirror of the snapshot source volume has been created. However, two different backup jobs (A and B) try to back up the same volume (for example, vol01), but job A starts before job B. After an available snapshot mirror is found, a brief pause occurs before the snapshot is formed. Job B that runs slightly behind job A may try to create a snapshot of the snapshot
NetBackup status codes and messages About NetBackup status codes and messages
243
mirror immediately before job A creates the snapshot and gets the lock on it. In this case, you may see the following in the /usr/openv/netbackup/logs/bpfis log:
17:12:51 onlfi_freeze: FTL - VfMS error 11; see following messages: 17:12:51 onlfi_freeze: FTL - Fatal method error 17:12:51 onlfi_freeze: FTL - vfm_freeze: method: vxvm, type: FIM, function: vxvm_freeze 17:12:51 onlfi_freeze: FTL - VfMS method error 3; see following message: 17:12:51 onlfi_freeze: FTL - vxvm_freeze: Command failed with status=11: /usr/sbin/vxassist -g rootdg snapshot vol01 VfMSCAAu7a4Uw </dev/null>/var/tmp/VfMSAAAs7a4Uw 2>/var/tmp/VfMSBAAt7a4Uw
The job that was unable to get a lock (job B in the preceding example) fails, and must be run again.
When using nbu_snap as a snapshot method, you may have stale snapshots if status code 156 occurs with the following messages in the /usr/openv/netbackup/logs/bpfis log. (Stale snapshots are those that nbu_snap did not automatically delete.)
17:12:51 onlfi_freeze: FTL - VfMS error 11; see following messages: 17:12:51 onlfi_freeze: FTL - Fatal method error 17:12:51 onlfi_freeze: FTL - vfm_freeze: method: nbu_snap, type: FIM, function: nbu_snap_freeze 17:12:51 onlfi_freeze: FTL - VfMS method error 5; see following message: 17:12:51 onlfi_freeze: FTL - nbu_snap_freeze: Cannot turn on snapshot; snapshot source=/oracle/ufs_r, cache=/dev/rdsk/c4t1d11s4,snap error=11
no active backups running. If the snaplist command shows cache entries, there are stale snapshots. Nothing is displayed if there are no stale snapshots. Example snaplist output:
244
NetBackup status codes and messages About NetBackup status codes and messages
minblk 0 0
where id is the ID from the snaplist output (such as 43 in the preceding example).
If a backup using the VxFS_Checkpoint snapshot method failed, the NetBackup bpbkar process should automatically remove the clone. Sometimes, however, bpbkar is unable to remove the clone. In this case, you may see messages such as the following in the /usr/openv/netbackup/logs/bpfis log:
15:21:45.716 [4236] <4> create_mount_point: INF - Created mount point /tmp/_vtrax_test:4236 15:21:45.869 [4236] <2> onlfi_vfms_logf: INF - vxfs clone handle : 9600344 15:21:45.870 [4236] <2> onlfi_vfms_logf: INF VxFS_Checkpoint_freeze: Cannot create checkpoint; status=17 15:21:45.872 [4236] <4> delete_mount_point: INF - Deleted mount point /tmp/_vtrax_test:4236 15:21:45.873 [4236] <32> onlfi_freeze: FTL - VfMS error 11; see following messages: 15:21:45.873 [4236] <32> onlfi_freeze: FTL - Fatal method error was reported 15:21:45.873 [4236] <32> onlfi_freeze: FTL - vfm_freeze: method: VxFS_Checkpoint, type: FIM, function: VxFS_Checkpoint_freeze 15:21:45.873 [4236] <32> onlfi_freeze: FTL - VfMS method error 17; see following message: 15:21:45.874 [4236] <32> onlfi_freeze: FTL VxFS_Checkpoint_freeze: Cannot create checkpoint; status=17
Remove the clone as follows. Note: If the checkpoint is not removed, you cannot use VxFS_Checkpoint to back up any data in the file system where the checkpoint is mounted. List the name of the checkpoint by entering the following VxFS command:
NetBackup status codes and messages About NetBackup status codes and messages
245
where file_system is the name of the file system where the checkpoint is mounted. A sample output follows. In this example, /vtrax_test is the file system and fi_ckpt is the name of the checkpoint.
/vtrax_test fi_ckpt: ctime = Mon Nov 12 10:08:13 2001 mtime = Mon Nov 12 10:08:13 2001 flags = largefiles
If the checkpoint cannot be removed, unmount the checkpoint and retry the first step in this procedure. If a snapshot backup fails using TimeFinder, ShadowImage, or BusinessCopy method, there may be a VxVM clone left over from a previous backup. You may see messages similar to the following in the /usr/openv/netbackup/logs/bpfis log:
19:13:07.686 [14981] <2> onlfi_vfms_logf: INF - do_cmd: Command failed with status=20: /usr/openv/netbackup/bin/bpdgclone -g wil_test -n vol01 -f /var/tmp/HDSTFCAAs7aOqD </dev/null >/var/tmp/VfMSAAAq7aOqD 2>/var/tmp/VfMSBAAr7aOqD 19:13:07.687 [14981] <2> onlfi_vfms_logf: INF - --- Dumping file /var/tmp/VfMSAAAq7aOqD (stdout): 19:13:07.687 [14981] <2> onlfi_vfms_logf: INF - --- End of file /var/tmp/VfMSAAAq7aOqD 19:13:07.687 [14981] <2> onlfi_vfms_logf: INF - --- Dumping file /var/tmp/VfMSBAAr7aOqD (stderr): 19:13:07.687 [14981] <2> onlfi_vfms_logf: INF clone group and volume already exists 19:13:07.688 [14981] <2> onlfi_vfms_logf: INF - --- End of file /var/tmp/VfMSBAAr7aOqD
NetBackup automatically creates VxVM clones for TimeFinder, ShadowImage, or BusinessCopy backups of data that is configured over volumes. After the backup has completed, NetBackup removes the VxVM clone. In this case, a system crash or restart may have prevented the removal. Remove the clone as follows.
246
NetBackup status codes and messages About NetBackup status codes and messages
(Do the following on the client or alternate client, depending on the type of backup.)
When no backups are running, use the following VxVM command to list any clones: vxdg list The clone name is of the form clone_disk_group_clone.
For example:
/usr/openv/netbackup/bin/bpdgclone -g wil_test -n vol01 -c
where wil_test is the name of the disk group and volo1 is the name of the VxVM volume. For more information on now to remove a VxVM clone, refer to the NetBackup Snapshot Client Administrators Guide. For vxdg, refer to the Veritas Volume Manager Administrators Guide.
Before running the backup again, resynchronize the primary disk with the secondary disk. For assistance, refer to the NetBackup Snapshot Client Administrators Guide. If a snapshot backup fails using the FlashSnap or VVR snapshot method, a VxVM snapshot may be left over from a previous backup. You may see messages similar to the following in the /usr/openv/netbackup/logs/bpfis log:
14:41:15.345 [22493] <32> onlfi_freeze: FTL - VfMS error 11; see following messages: 14:41:15.345 [22493] <32> onlfi_freeze: FTL - Fatal method error was reported 14:41:15.345 [22493] <32> onlfi_freeze: FTL - vfm_freeze_commit: method: FlashSnap, type: FIM, function: FlashSnap_freeze_commit 14:41:15.345 [22493] <32> onlfi_freeze: FTL - VfMS method error 8; see following message: 14:41:15.345 [22493] <32> onlfi_freeze: FTL vxvm__find_ready_snapshot: Cannot find available snapshot mirror
NetBackup automatically creates VxVM snapshots for backups of data that is configured over volumes. After the backup completes, NetBackup removes the VxVM snapshot. In this case, a system crash or restart may have prevented the removal. Remove the snapshot as follows. For FlashSnap:
NetBackup status codes and messages About NetBackup status codes and messages
247
(Do the following on the client or alternate client, depending on the type of backup.)
If vxdg list does not show the disk group, the group might have been deported. You can discover all the disk groups, that includes deported ones, by entering:
vxdisk -o alldgs list
The disk groups that are listed in parentheses are not imported on the local system.
248
NetBackup status codes and messages About NetBackup status codes and messages
When you use a snapshot method such as VxFS_Checkpoint to back up a Veritas File System (VxFS), the backup fails if the VxFS license has expired. Messages such as the following appear in the /usr/openv/netbackup/logs/bpfis log:
11:37:42.279 [24194] <2> onlfi_vfms_logf: INF VxFS_Checkpoint_freeze: Cannot open checkpoint; status=100 11:37:42.283 [24194] <4> delete_mount_point: INF - Deleted mount point /tmp/_vrts_frzn_img__test1_24194 11:37:42.283 [24194] <32> onlfi_freeze_fim_fs: FTL - VfMS error 11; see following messages: 11:37:42.283 [24194] <32> onlfi_freeze_fim_fs: FTL - Fatal method error was reported 11:37:42.284 [24194] <32> onlfi_freeze_fim_fs: FTL - vfm_freeze: method: VxFS_Checkpoint, type: FIM, function: VxFS_Checkpoint_freeze 11:37:42.284 [24194] <32> onlfi_freeze_fim_fs: FTL - VfMS method error 100; see following message: 11:37:42.284 [24194] <32> onlfi_freeze_fim_fs: FTL VxFS_Checkpoint_freeze: Cannot open checkpoint; status=100
If the backup is enabled for instant recovery with either the vxvm or VVR snapshot method, your VxVM mirrors may not be properly configured. In this case, you may see the following in the /usr/openv/netbackup/logs/bppfi log on the client (when verbose mode is set high).
13:43:39.095 [16375] <2> onlfi_vfms_logf: INF - Executing command: 13:43:39.095 [16375] <2> onlfi_vfms_logf: INF /usr/sbin/vxprint -g rootdg -q -t -e 'assoc="pfi_concat"' </dev/null >/var/tmp/VfMSAA Arja4.F 2>/var/tmp/VfMSBAAsja4.F 13:43:39.215 [16375] <2> onlfi_vfms_logf: INF pfi_find_snapdone: 0 SNAPDONE plexes found 13:43:39.215 [16375] <2> onlfi_vfms_logf: INF - Executing command: 13:43:39.215 [16375] <2> onlfi_vfms_logf: INF /usr/sbin/vxassist -g rootdg snapprint pfi_concat </dev/null >/var/tmp/VfMSAAArja4.F 2>/var/tmp/VfMSBAAsja4.F 13:43:39.512 [16375] <2> onlfi_vfms_logf: INF - 0 active plexes for /rootdg/pfi_concat: 0 are PFI 0 non-PFI 13:43:39.512 [16375] <2> onlfi_vfms_logf: INF -
NetBackup status codes and messages About NetBackup status codes and messages
249
pfi_find_active.3309: exiting with VXVM_E_SYS = 3 13:43:39.512 [16375] <2> onlfi_vfms_logf: INF pfi_snapshot.3866: No PFI snapshot. err= 3
Configure the VxVM mirrors as described in the Instant Recovery chapter of the NetBackup Snapshot Client Administrators Guide.
When you use the VxFS_Checkpoint snapshot method, the backup fails if the clients file system does not support mountable checkpoints using the Storage Checkpoint feature. Messages such as the following appear in the /usr/openv/netbackup/logs/bpfis log:
14:54:27.530 [23563] <32> onlfi_freeze_fim_fs: FTL - VfMS error 11; see following messages: 14:54:27.530 [23563] <32> onlfi_freeze_fim_fs: FTL - Fatal method error was reported 14:54:27.530 [23563] <32> onlfi_freeze_fim_fs: FTL - vfm_freeze: method: VxFS_Checkpoint, type: FIM, function: VxFS_Checkpoint_freeze 14:54:27.531 [23563] <32> onlfi_freeze_fim_fs: FTL - VfMS method error 2; see following message: 14:54:27.531 [23563] <32> onlfi_freeze_fim_fs: FTL - open_ckpt: Cannot open checkpoint on /mnt_vxvm/2G_concat : fsckpt_get_api_version returns 1; mountable checkpoints not supported with this version
Upgrade the client file system to a version that supports mountable VxFS Storage Checkpoints Configure the policy with a snapshot method that supports the clients current file system.
250
NetBackup status codes and messages About NetBackup status codes and messages
Examine the NetBackup error log to determine why the system call failed. Then correct the problem. It may be a permission problem. If the error log does not show the error, create a debug log directory for bprd or bpdbm (depending on which process encountered the error). Increase the unified logging level if nbpem, nbjm, or nbrb encountered the error. Use the vxlogcfg command as explained in the following topic: See Configuring and using unified logging on page 94. Retry the operation and check the resulting debug log.
A storage unit that is on a SAN media server A client that is not the SAN media server itself
SAN media servers can only back up themselves. This status code is used when the creation of a storage unit on a SAN media server fails because "On demand only" is not selected. "On demand only" is required for storage units on a SAN media server. Recommended Action: To determine the cause of the error, examine the NetBackup All Log Entries report for the command that was ran. See also the Activity Monitor details for informative messages. If the job fails on a SAN media server storage unit, ensure that only the local client is specified in the policy. If remote clients are specified in the policy, do one of the following: remove them and place them in a policy that specifies a different storage unit or change the storage unit for that policy. If you want to back up remote clients by using the SAN media server, you can purchase a regular NetBackup media server license.
NetBackup status codes and messages About NetBackup status codes and messages
251
On Macintosh:
:System Folder:Extensions:libvopie.dll :System Folder:Extensions:libvnoauth.dll
Check the methods_allow.txt files on the systems that have problems to ensure that authentication is enabled. The files are in the following locations: Windows: install_path\NetBackup\var\auth UNIX and Linux: /usr/openv/var/auth Macintosh: :System Folder:Preferences:NetBackup:: One system may report authentication failure (status code 160) while the other system reports that a network connection timed out (status code 41). In this case, authentication may be enabled in the methods_allow.txt file on the first system but not on the second system.
On the systems with the authentication problem, remove the remote host that is not authenticated from the methods_allow.txt file. For example, if host A and host B have the problem, remove host A from the file on host B and vice versa. Retry the operation. If the problem still exists, it indicates that connection problems are not related to authentication. If connections are now successful, proceed to the next step.
252
NetBackup status codes and messages About NetBackup status codes and messages
Run bpauthsync -vopie on the master server to synchronize the key files again on the systems. On Windows:
install_path\NetBackup\bin\admincmd\bpauthsync -vopie -servers -clients
Add back the names that were removed and retry the operation. Create debug log directories for the processes that are involved in authentication between NetBackup systems. These include:
On the server, create debug log directories for bprd, bpdbm, bpcd.
On the client, create debug log directories for bpbackup, bprestore, bpbkar (Windows only). Retry the operation and check the logs.
NetBackup status codes and messages About NetBackup status codes and messages
253
resumed. Since the media block size must be consistent, the job was restarted from the beginning. Recommended Action: Check the Activity Monitor job details for the job ID of the restarted job.
If volume is in a DOWN drive, remove it and place it in its designated slot. Then, retry the restore. If the volume is in the wrong slot, use a robot inventory option to reconcile the contents of the robot with the volume configuration.
254
NetBackup status codes and messages About NetBackup status codes and messages
NetBackup status codes and messages About NetBackup status codes and messages
255
If a non third-party copy device is listed in 3pc. file, correct it or remove the non third-party copy device entry. If an incorrect lun is specified in the 3pc. file or the device does not exist, correct the 3pc. file as appropriate. If an appropriate mover.conf file (with or without file name extension) cannot be found, the /usr/openv/netbackup/logs/bptm log may show the following:
09:51:04 [22281] <2> setup_mover_tpc: no mover.conf.vertex_std_tpc or mover.conf file exists, cannot perform TPC backup 09:51:04 [22281] <16> bptm: unable to find or communicate with Third-Party-Copy mover for policy vertex_std_tpc
Make sure that an appropriate mover.conf file exists in /usr/openv/netbackup on the media server. This file can be any of the following:
the name of the storage in the Backup Policy Management Policy attributes dialog box (such as nut-4mm-robot-tl4-0).
mover.conf file (no extension) for the configurations that have only one
third-party copy device. Note that NetBackup looks for an appropriate mover.conf file in the order.
If the SCSI pass-through path of the third-party copy device, as entered in the mover.conf file, does not exist, the bptm log may show the following:
09:50:12 [22159] <16> setup_mover_tpc: open of passthru path /dev/sg/cXtXlX failed, No such file or directory 09:50:12 [22159] <16> bptm: unable to find or communicate with Third-Party-Copy mover for policy vertex_std_tpc
Correct the SCSI pass-through path of the third-party copy device that is entered in the mover.conf file.
If the third-party copy device returned an error, you may see either of the following messages in /usr/openv/netbackup/logs/bptm log:
cannot process extended copy error due to truncated sense data, may be HBA problem
256
NetBackup status codes and messages About NetBackup status codes and messages
disk error occurred on extended copy command, key = 0x0, asc = 0x0, ascq = 0x0
(where key, asc, and ascq are all zero) Your host-bus adapter (HBA) and its driver may need to be updated, or NetBackup Snapshot Client may not support them. The supported host-bus adapters are listed. See the NetBackup Release Notes.
If the volume is in a robot that supports bar codes, verify the robot contents by using a robot inventory option. If the volume was mounted on a nonrobotic drive, verify that the correct volume was mounted and assigned. Check the NetBackup Problems report. If it shows a fatal read error, try the operation again with another drive, if possible. If your configuration has multiple servers or HBAs with access to your tape services, make sure the SCSI Reserve or Release is configured correctly. (Most likely, the tape services configuration is an SSO configuration.) For more information, refer to the NetBackup Shared Storage Guide.
NetBackup status codes and messages About NetBackup status codes and messages
257
Explanation: When the tape manager (bptm) searches for a backup image to restore, the following occurs: it cannot find the correct backup ID at the position on the media where NetBackup expected it. This status code can indicate a drive hardware problem. Recommended Action: Do the following, as appropriate:
Check the NetBackup Problems report for clues as to what caused the error. Try the restore on another drive if possible. For detailed troubleshooting information, create a debug log directory for bptm and retry the operation. Check the resulting debug log.
Check the NetBackup Problems report to see if it shows the cause of the problem. If you see a Problems report message similar to the following, save all logs and call Symantec customer support:
"attempted to write 32767 bytes, not a multiple of 512"
On UNIX and Linux, if this error occurs during a restore, the tape drive may be incorrectly configured. It may be configured to write in fixed length mode when it should write in variable length mode. Verify your drives configuration by comparing it to what is recommended in the NetBackup Device Configuration Guide. If your configuration incorrectly specifies fixed length mode, change it to variable length mode and suspend the media that were written on that device. The images that were written to those media may be restorable (platform dependent), but single file restores are likely to fail. If the problem occurs with a particular client only, verify that the client binaries are correct, especially for bpcd. If you can read or write any other images on this media, check the following reports for clues:
258
NetBackup status codes and messages About NetBackup status codes and messages
The media, by using the NetBackup image verify option. That you used the correct media type for the device.
Check the system or the console log for errors (on UNIX and Linux) or the Event Viewer Application log (on Windows). For detailed debug information, create a debug log directory for either bptm or bpdm (whichever applies) and retry the operation. Check the resulting debug log. On UNIX and Linux, if the bptm debug log shows an error similar to the following, the tape drive is configured to write in fixed length mode rather than variable length mode:
00:58:54 [2304] <16> write_data: write of 32768 bytes indicated only 29696 bytes were written, errno = 0
If the backup was configured for an OpenStorage disk storage unit, the OpenStorage vendor's plug-in may not be installed on all media servers in the storage unit's media server list. Either install the vendor plug-in on all of the media servers or remove from the list the servers that do not have the plug-in installed.
Check the NetBackup Problems report and the status lot or the progress log on the client for additional information about the error For detailed troubleshooting information, create a debug log directory for either bptm or bpdm (whichever applies) and retry the operation. Check the resulting debug log.
NetBackup status codes and messages About NetBackup status codes and messages
259
Media ID is already active in the NetBackup media catalog on this server Media ID is not in the volume configuration Fatal tape manager (bptm) error occurred Total image was not obtained from Phase 1 of import The barcode and media ID do not match in the volume database. One possible reason for a mismatch is when using tapes previously written by a different barcode, labels are placed into an ACSLS library using a new barcode label. Another possible reason is when the administrator attempts to change the media ID field (using vmchange) to match the RVSN. When the inventory is updated, the barcode and media ID are synchronized.
Recommended Action: Check the NetBackup Problems report to find the exact cause of the failure and try the following:
If the media ID is already active, duplicate all images on the original media ID to another volume. Then, manually expire the original media and redo the import. If the media ID is not present in the volume configuration, add it. If you received a fatal bptm error, verify that the following are active: the NetBackup Volume Manager (vmd) on UNIX and Linux or the NetBackup Volume Manager on Windows. If the entire image is not present, perform import phase 1 on the media IDs that have the remainder of the image. If the barcode and media ID are a mismatch, use a barcode label that matches the recorded media ID. Tapes can then be mounted in a drive with AVRD running so that the recorded media ID is displayed. Then, a barcode with that label needs to be placed on the tape.
Check the NetBackup Problems report for the cause of the problem. Verify that the NetBackup Volume Manager (vmd) is active on UNIX and Linux or the NetBackup Volume Manager service is active on Windows.
260
NetBackup status codes and messages About NetBackup status codes and messages
For detailed troubleshooting information, create a debug log directory for bptm and retry the operation. Check the resulting debug log.
Ensure that the tar command in /usr/openv/netbackup/bin is the one provided by NetBackup. If you are in doubt, reinstall it. Check /usr/openv/netbackup/bin/version on the client to verify that the client is running the correct level software. If the software is not at the
NetBackup status codes and messages About NetBackup status codes and messages
261
correct level, update the software per the directions in the NetBackup release notes.
On a Windows client, create a tar debug log directory, retry the operation, and check the log. On a Macintosh client, check the version file that is in the bin folder in the NetBackup folder in the Preferences folder. If the software is not at the correct level, install the correct software. See the NetBackup Installation Guide for UNIX.
Create a bpcd debug log directory on the client (on a Macintosh, NetBackup creates the log automatically). On a Windows client, create a tar debug log directory. Increase the logging level on the client:
On a UNIX or Linux client, add the VERBOSE option to the /usr/openv/netbackup/bp.conf file. On PC clients, increase the debug or log level. See How to set logging levels on PC clients on page 122.
Rerun the operation, check the resulting debug logs for the parameters that were passed to tar and call customer support.
If the problem is with a UNIX or Linux client, create a /usr/openv/netbackup/logs/tar debug log directory on the client and rerun the operation.
Check the tar debug log file for any error messages that explain the problem. Restart the client to see if it clears the problem.
262
NetBackup status codes and messages About NetBackup status codes and messages
When you finish with your investigation of the problem, delete the /usr/openv/netbackup/logs/tar directory on the client.
If the problem is with a Microsoft Windows, NetWare, or Macintosh client, do the following in the order presented:
Create a bpcd debug log directory on the client (on a Macintosh NetBackup creates the log automatically). On a Windows client, create a tar debug log directory. Increase the debug or log level. See How to set logging levels on PC clients on page 122. Rerun the operation and check the resulting debug logs. Restart the client to see if it clears the problem.
If the problem is with a UNIX or Linux client, create a /usr/openv/netbackup/logs/tar debug log directory on the client and rerun the operation.
Check the tar debug log file for any error messages that explain the problem. Restart the client to see if it clears the problem. When you finish your investigation of the problem, delete the /usr/openv/netbackup/logs/tar directory on the client.
Create a bpcd debug log directory on the client (on a Macintosh NetBackup creates the log automatically). Increase the debug or log level. See How to set logging levels on PC clients on page 122. On a Windows client, create a tar debug log directory. Retry the operation and check the resulting debug logs. Restart the client to see if it clears the problem.
NetBackup status codes and messages About NetBackup status codes and messages
263
Enable bpcd debug logging by creating the /usr/openv/netbackup/logs/bpcd directory on the client. Rerun the operation, check the resulting bpcd log file for the parameters that were passed to tar, and call customer support.
Create a bpcd debug log directory on the client (on a Macintosh NetBackup creates the log automatically). Increase the debug or log level. See How to set logging levels on PC clients on page 122. On a Windows client, create a tar debug log directory. Retry the operation. Check the resulting debug logs for the parameters that were passed to tar and call customer support.
Retry the operation and check the status or the progress log on the client for any error messages that reveal the problem. Verify that the tape is available and readable. Verify that the drive is in an UP state. Use the Device Monitor. For detailed troubleshooting information:
Create a bptm debug log on the server. On a Windows client, create a tar debug log. Retry the operation and check the resulting debug logs.
264
NetBackup status codes and messages About NetBackup status codes and messages
On a UNIX or Linux client, delete DISALLOW_SERVER_FILE_WRITES from the /usr/openv/netbackup/bp.conf file. On a Microsoft Windows or NetWare nontarget client, select Allow server-directed restores on the General tab in the NetBackup Client Properties dialog box. To display this dialog box, start the Backup, Archive, and Restore interface on the client and select NetBackup Client Properties from the File menu. On a Macintosh client, delete DISALLOW_SERVER_FILE_WRITES from the bp.conf file in the NetBackup folder in the Preferences folder. On a NetWare target client, set ALLOW_SERVER_WRITE to yes in the bp.ini file.
Check the NetBackup Problems report for the cause of the error. To obtain detailed troubleshooting information, create an admin debug log directory and retry the operation. Check the resulting debug log. If the error was encountered during duplication of backups, check the duplication progress log to help determine the root cause of the problem. If a Vault job encountered the error responsible for the duplication, check the duplicate.log files in your sidxxx directories to determine the root cause:
NetBackup status codes and messages About NetBackup status codes and messages
265
266
NetBackup status codes and messages About NetBackup status codes and messages
See the NetBackup Administrators Guide, Volume I, for the following information: how to set the Access Control related host properties how to configure a system to use Access Control.
Retry the backup either immediately with a manual backup or allow the normal scheduler retries. For additional information, check the All Log Entries report. For detailed troubleshooting information, increase the logging level for the diagnostic and debug logs for nbpem, nbjm, and nbrb. Use the vxlogcfg command as explained in the following topic: See Configuring and using unified logging on page 94. After the next backup try, check the logs.
Verify that the vmd and the ltid daemons (UNIX and Linux) or the NetBackup Volume Manager and NetBackup Device Manager services (Windows) are running. Look for a problem in an earlier backup that made the media or the storage unit unavailable.
NetBackup status codes and messages About NetBackup status codes and messages
267
If possible, change the schedule to extend the backup window for this combination of policy and schedule so it does not occur again. If the backup must be run, use the Manual Backup command on the Policy menu in the Backup Policy Management window to perform the backup. Manual backups ignore the backup window.
On Microsoft Windows and NetWare nontarget clients, you can specify a policy or schedule on the Backups tab in the NetBackup Client Properties dialog box. To display this dialog box, start the Backup, Archive, and Restore interface on the client and select NetBackup Client Properties on the File menu. On UNIX, Linux, and Macintosh clients, you can specify a policy or schedule by using the bp.conf options, BPBACKUP_POLICY or BPBACKUP_SCHED. On NetWare target clients, you can specify a policy or schedule in the bp.ini file.
Check the client progress log (if available) to determine the policy and schedule that were specified. Check the configuration on the master server to determine if the schedule is valid for the policy. If the schedule is not valid, either add the schedule to the policy configuration or specify a valid schedule on the client.
If the client is in such a policy, check the general policy attributes to verify that the policy is set to active. If the client is not in such a policy, do either of the following:
268
NetBackup status codes and messages About NetBackup status codes and messages
Add a schedule of the appropriate type to an existing policy that has this client Create a new policy that has this client and a schedule of the appropriate type
If possible, retry the operation when the backup window is open. If the backup window is not open during appropriate time periods, adjust a backup window for a schedule in one of the policies.
No backup time windows are open (applies only to full and to incremental schedules). Policies are set to inactive. The clients were recently backed up and are not due for another backup (based on Frequency setting for the schedules). Policies do not have any clients.
Recommended Action: Usually, this message can be considered informational and does not indicate a problem. However, if you suspect a problem, do the following:
Examine the NetBackup All Log Entries report for any messages in addition to the one that indicates the scheduler found nothing to do. Examine the policy configuration for all policies or the specific policy in question to determine if any of the reasons in the Explanation section apply.
NetBackup status codes and messages About NetBackup status codes and messages
269
To obtain detailed troubleshooting information, increase the unified logging level for the diagnostic and debug logs. Use the vxlogcfg command as explained in the following procedure: See Configuring and using unified logging on page 94. Retry the operation and check the resulting logs.
Determine the activity that encountered the handshake failure by examining the NetBackup All Log Entries report for the appropriate time period. If there are media servers, determine if:
The handshake failure was encountered between the master and a media server. or Only the master server was involved.
If necessary, create the following debug log directories and increase the logging level:
bpcd on the NetBackup media host (can be either the master or a media
server).
If the error was encountered during a backup operation, increase the logging level for the diagnostic and debug logs for nbpem, nbjm, and nbrb. Use the vxlogcfg command as explained in the following procedure: See Configuring and using unified logging on page 94. If the error was encountered during a restore operation, bprd on the master server. If the error was encountered during a media list operation, admin in the NetBackup logs/admin directory on the master server.
Status code 201 may occur if nbjm fails after connecting to bpbrm or bpmount but before the policy file list is sent. Examine the nbjm unified log (originator
270
NetBackup status codes and messages About NetBackup status codes and messages
ID 117) or the bpbrm or the bpmount legacy logs for more detail on the cause of the error.
Retry the operation and examine the resulting debug logs for information on why the error occurred.
Verify that the schedule specifies the correct storage unit. Run the ping command from one host to another by using the following combinations:
From the master server, ping the master and all media servers by using the host names that are found in the storage unit configuration. From each of the media servers, ping the master server by using the host name that is specified in the NetBackup server list. On a UNIX or Linux server, the master is the first SERVER entry in the bp.conf file. On a Windows server, the master is designated on the Servers tab in the Master Server Properties dialog box. To access this dialog box, see the following: See Using the Host Properties window on page 61.
Verify that the master server can communicate with bpcd on the host that has the storage unit. Perform the following procedures: See Testing media server and clients on page 38. See Resolving network communication problems on page 40. If necessary, create debug log directories for the following processes and retry the operation. Then, check the resulting debug logs on the master server:
If the error occurred during a backup operation, increase the logging level for the diagnostic and debug logs for nbpem, nbjm, and nbrb. Use the vxlogcfg command as explained in the following procedure:
NetBackup status codes and messages About NetBackup status codes and messages
271
See Configuring and using unified logging on page 94. Also, check the bpcd legacy debug logs.
If the error occurs during a restore operation, check the bprd debug logs.
From the master server, ping the master and all media servers by using the host names in the storage unit configuration. From each of the media servers, ping the master server host by using the host name that is specified in the NetBackup server list. On a UNIX or Linux server, the master is the first SERVER entry in the bp.conf file. On a Windows server, the master is designated on the Servers tab in the Master Server Properties dialog. To access this dialog box, see the following topic: See Using the Host Properties window on page 61. Perform the following procedures: See Testing media server and clients on page 38. See Resolving network communication problems on page 40. If necessary, create debug log directories for bprd and retry the operation. Then, check the resulting debug logs on the master server. If the error occurred during a restore, check the bprd debug logs.
272
NetBackup status codes and messages About NetBackup status codes and messages
Recommended Action: Run the ping command from one host to another by using the following combinations:
From the master server, ping the master and all media servers by using the host names in the storage unit configuration. From each of the media servers, ping the master server by using the name that was specified in the NetBackup server list. On a UNIX or Linux server, this master is the first SERVER entry in the bp.conf file. On a Windows server, the master is designated on the Servers tab in the Master Server Properties dialog box. The following topic shows how to access this dialog box: See Using the Host Properties window on page 61. On UNIX and Linux servers, verify that the bpcd entries in /etc/services or NIS on all the servers are identical. Verify that the media host listens on the correct port for connections to bpcd. To verify, run one of the following commands (depending on platform and operating system):
netstat -a | grep bpcd netstat -a | grep 13782 (or the value that was specified during the install) rpcinfo -p | grep 13782 (or the value that was specified during the install)
On UNIX and Linux servers, it may be necessary to change the service number for the following: bpcd in /etc/services and the NIS services map and send SIGHUP signals to the inetd processes on the clients.
/bin/ps -ef | grep inetd kill -HUP the_inetd_pid
or
/bin/ps -aux | grep inetd kill -HUP the_inetd_pid
Verify that the following numbers match the settings in the services file: NetBackup Client Service Port number and NetBackup Request Service
NetBackup status codes and messages About NetBackup status codes and messages
273
Port number on the Network tab in the NetBackup Client Properties dialog box. To display this dialog box, start the Backup, Archive, and Restore interface and select NetBackup Client Properties on the File menu. The values on the Network tab are written to the services file when the NetBackup Client service starts.
Perform the following procedures: See Testing media server and clients on page 38. See Resolving network communication problems on page 40. If necessary, create debug log directories for bprd and retry the operation. Then, check the resulting debug logs on the master server:
If the error occurred during a backup operation, check the nbpem, nbjm, and nbrb logs by using the vxlogview command. If the error occurred during a restore operation, check the bprd debug logs.
Also, see the following topic: See Resolving network communication problems on page 40.
From the master server, ping the master and all media servers by using the host names in the storage unit configuration. From each of the media servers, ping the master server by using the name that is specified in the NetBackup server list. On a UNIX or Linux server, this master is the first SERVER entry in the bp.conf file. On a Windows server, the master is designated on the Servers tab in the Master Server Properties dialog box. To access this dialog box, see the following topic: See Using the Host Properties window on page 61. On a UNIX or Linux server, verify that the bpcd entry in /etc/services or NIS on all the servers are identical. Verify that the media host listens on the
274
NetBackup status codes and messages About NetBackup status codes and messages
correct port for connections to bpcd. To verify, run one of the following commands (depending on platform and operating system):
netstat -a | grep bpcd netstat -a | grep 13782 (or the value that is specified during the install) rpcinfo -p | grep 13782 (or the value that is specified during the install)
Verify that the bpcd entries are correct in the services file:
%SystemRoot%\system32\drivers\etc\services
Verify that the following numbers match the settings in the services file: NetBackup Client Service Port number and NetBackup Request Service Port number on the Network tab in the NetBackup Client Properties dialog box. To display this dialog box, start the Backup, Archive, and Restore interface and select NetBackup Client Properties on the File menu. The values on the Network tab are written to the services file when the NetBackup Client service starts. Stop and restart the NetBackup services.
Perform the following procedures: See Testing media server and clients on page 38. See Resolving network communication problems on page 40. Create a bpcd debug log directory on the server that has the storage unit and retry the operation. Then, check for additional information in the debug log.
Also, see the following topic: See Resolving network communication problems on page 40.
NetBackup status codes and messages About NetBackup status codes and messages
275
Verify that the master server appears as a server in its own server list as well as being listed on all media servers. If you change the server list on a master server, stop and restart the NetBackup database manager and request the following: daemons (UNIX and Linux) or the NetBackup Database Manager and NetBackup Request Manager services (Windows). If necessary, create debug log directories for bprd and retry the operation. Then, check the resulting debug logs on the master server:
If the error occurred during a backup operation, check the nbpem, nbjm, and nbrb logs by using the vxlogview command. If the error occurred during a restore operation, check the bprd debug logs.
Verify that the NetBackup database manager (bpdbm) process (on UNIX and Linux) or the NetBackup Database Manager service (on Windows) is running. Examine the All Log Entries report for the appropriate time frame to gather more information about the failure. For detailed troubleshooting information, create a bpdbm log directory on the master server. Increase the logging level for the diagnostic and debug logs for nbpem. Use the vxlogcfg command as explained in the following topic: See Configuring and using unified logging on page 94. Retry the operation, then check the resulting logs.
276
NetBackup status codes and messages About NetBackup status codes and messages
Recommended Action: Create debug log directories on the master server and retry the operation. Then, determine the type of system failure by examining the logs. Start with the bprd debug log. On UNIX and Linux servers, also gather the output of the ipcs -a command to see what system resources are currently in use.
NetBackup status codes and messages About NetBackup status codes and messages
277
configured for On demand only. In addition, the policy and schedule does not require a specific storage unit. Recommended Action: Do the following, as appropriate:
Examine the Backup Status and All Log Entries report for the appropriate time period to determine the policy or schedule that received the error. Verify that the storage units drives are not down. Verify the following attribute settings for all storage units:
For disk storage units, the Maximum concurrent jobs is not set to 0 For Media Manager storage units, the Maximum concurrent write drives is not set to 0
Verify that the robot number and host name in the storage unit configuration matches the Media and Device Management device configuration. Determine if all storage units are set to On demand only for a combined policy and schedule that does not require a specific storage unit. In this case, either specify a storage unit for the policy and the schedule combination or turn off On demand only for a storage unit. If the storage unit is on a UNIX or Linux NetBackup media server, it may indicate a problem with bpcd. Check /etc/inetd.conf on the media server to verify that the bpcd entry is correct. If the storage unit is on a Windows NetBackup media server, verify that the NetBackup Client service was started on the Windows NetBackup media server.
For detailed troubleshooting information, increase the logging levels of nbrb and mds on the master server. Use the vxlogcfg command as explained in the following topic: See Configuring and using unified logging on page 94. Retry the operation and check the resulting debug logs.
On UNIX and Linux master servers, verify that the NetBackup database manager (bpdbm) process is running. On a Windows master server, verify that the NetBackup Database Manager service is running.
278
NetBackup status codes and messages About NetBackup status codes and messages
Try to view the global configuration settings by using the NetBackup administration interface (on UNIX and Linux systems), or by using Host Properties (on Windows systems). For detailed troubleshooting information, create debug log directories for nbproxy and bpdbm on the master server and retry the operation. Check the resulting debug logs for these processes. Also check the nbpem logs by using the vxlogview command.
On a UNIX or Linux master server, verify that the NetBackup database manager (bpdbm) process is running. On a Windows master server, verify that the NetBackup Database Manager service is running. For detailed troubleshooting information, create a debug log directory for bpdbm on the master server. Increase the logging level for nbpem by using the vxlogcfg command as described in the following topic: See Configuring and using unified logging on page 94. Retry the operation and check the resulting logs.
On a UNIX or Linux server, verify that the NetBackup database manager (bpdbm) process is running. On a Windows server, verify that the NetBackup Database Manager service is running. Try to view the storage unit configuration by using the NetBackup administration interface. For detailed troubleshooting information, create debug logs for nbproxy and bpdbm on the master server and retry the operation. Check the resulting debug logs. Also check the nbpem logs by using the vxlogview command.
NetBackup status codes and messages About NetBackup status codes and messages
279
Ensure that the correct master server is specified for the connection.
On a UNIX or Linux server, verify that the NetBackup Database Manager (bpdbm) process is running. On a Windows server, verify that the NetBackup Database Manager service is running. Try to view the policy configuration by using the NetBackup administration interface. For detailed troubleshooting information, create debug log directories for nbproxy and bpdbm on the master server and retry the operation. Check the resulting debug logs. Also check the nbpem logs by using the vxlogview command. Ensure that the correct master server is specified for the connection.
Verify that the schedule specifies the correct storage unit and the storage unit exists. Verify that the following devices are running: the Media Manager device daemon (ltid) (UNIX or Linux server) or the NetBackup Device Manager service (Windows server). Use bpps on UNIX and Linux and the Activity Monitor on Windows or the Services application in the Windows Control Panel. Verify the following attribute settings:
For a disk storage unit, Maximum concurrent jobs is not set to 0. For a Media Manager storage unit, the Maximum concurrent drives attribute is not set to 0.
If the storage unit is a tape, verify that at least one of the drives is in the UP state. Use the Device Monitor.
280
NetBackup status codes and messages About NetBackup status codes and messages
Verify that the robot number and host in the storage unit configuration match what is specified in the Media and Device Management device configuration. Verify that the master server can communicate with the bpcd process on the server that has the storage unit.
Verify that bpcd listens on the port for connections. On a UNIX or Linux server where the storage unit is connected, if you run netstat -a | grep bpcd, it should return something similar to the following:
*.bpcd *.* 0 0 0 0 LISTEN
On a Windows NetBackup server where the storage unit is connected, run netstat -a to print several lines of output. If bpcd listens, one of those lines is similar to the following:
TCP
myhost:bpcd
0.0.0.0:0
LISTENING
Check the nbrb and the mds logs by using the vxlogview command. If the cause of the problem is not obvious, perform some of the steps in the following procedure: See Resolving network communication problems on page 40.
NetBackup status codes and messages About NetBackup status codes and messages
281
it is associated with a subsequent error, it usually indicates a communication problem. In this case, concentrate your troubleshooting efforts on the subsequent error. Recommended Action: Determine the cause of the status code that follows this one.
282
NetBackup status codes and messages About NetBackup status codes and messages
If this status involves a media server, verify that its server list specifies the correct master server. On a UNIX or Linux server, the master server is the first SERVER entry in the bp.conf file. On a Windows server, the master is designated on the Servers tab in the Master Server Properties dialog box. To access this dialog box, see the following topic: See Using the Host Properties window on page 61.
NetBackup status codes and messages About NetBackup status codes and messages
283
For detailed troubleshooting information, create a bpdbm debug log directory and retry the operation. Then, check the resulting debug log.
284
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: This status code is an intermediate one that usually precedes another status code and is associated with a problem in network communication. Recommended Action: During a restore, this status codes means that tar (on the client) received a stream of data that was not what it expected. If the restore is a new configuration, verify that the tape drive is configured for variable mode. See the NetBackup Device Configuration Guide. If the communication failure is not due to an interrupt on a client system, save all error information and call customer support.
Verify that all NetBackup software is at the same version level. Update earlier versions of NetBackup software.
On UNIX and Linux, NetBackup servers and clients, check the /usr/openv/netbackup/bin/version file. On Windows NetBackup servers, check the install_path\NetBackup\version.txt file or the About NetBackup item on the Help menu. On Microsoft Windows clients, check the About NetBackup item on the Help menu. On NetWare target clients, check the Version entry in the bp.ini file. If the client software is earlier than 3.0, verify that the client is in a Standard type policy. On Macintosh clients, check the version file in the bin folder in the NetBackup folder in the Preferences folder.
NetBackup status codes and messages About NetBackup status codes and messages
285
If the problem persists, save all error information and call customer support.
286
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: The appropriate schedule was not found in the specified policy. For example, a user backup specified a policy name but no user backup schedule exists in that policy. Recommended Action: Specify a different policy or create a schedule of the needed type in the policy. After you make the correction, retry the operation.
NetBackup status codes and messages About NetBackup status codes and messages
287
On the server, bptm, bpbrm, and bpdbm. On UNIX, Linux, and Windows clients, bpbkar. On other clients, bpcd.
To increase the amount of information that is included in the logs, see the following: See Debug logs on PC clients on page 118.
288
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: This error is internal and should not appear to customers. Recommended Action: Obtain detailed troubleshooting information by creating debug logs for bptm or bpdbm on the server. Then, retry the operation and check the resulting debug logs.
NetBackup status codes and messages About NetBackup status codes and messages
289
Vault failed to get job data because of a broken connection with the job manager (nbjm). Vault received empty job data. This error occurs if a user-specified job ID on the vltrun -haltdups command is out of range. (That is, the job ID is not among the job IDs created by job manager.)
Recommended Action: Contact customer support and send the appropriate logs.
290
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: This error code should not occur. Recommended Action: Contact customer support and send appropriate logs.
Make sure that the file system is not full and that no one has inadvertently edited the session.last file. To correct the problem, store the highest session ID that was assigned to a session for this Vault in the session.last file. If the problem persists, contact customer support and send the appropriate logs.
Recommended Action: Ensure that the session ID that is stored in the session.last file is not corrupt. Make sure that the file system is not full and
NetBackup status codes and messages About NetBackup status codes and messages
291
that no one has inadvertently edited the file. To correct the problem, store the highest session ID that was assigned to a session for this Vault in the session.last file. If the problem persists, contact customer support and send the appropriate logs.
This directory is created when Vault is installed. Recommended Action: Make sure you are running on the master server where Vault is installed and configured. Also ensure that no one accidentally removed the sessions directory or changed permission on the directory path so it is inaccessible to the Vault job.
292
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: This error is encountered when vltopmenu cannot find a sidxxx session id directory for the specified profile. Either no Vault jobs were run for this profile or the corresponding sidxxx session id directory (or directories) were removed from the following directory:
UNIX and Linux: /usr/openv/netbackup/vault/sessions/vault_name Windows: install_path\NetBackup\vault\sessions\vault_name
Recommended Action: Either specify a different profile for the Vault jobs that were run or exit vltopmenu and run a Vault job for the specific profile. Then rerun vltopmenu and select the profile.
NetBackup status codes and messages About NetBackup status codes and messages
293
Recommended Action: Ensure that the notify script is executable and runs without errors. You must debug the script by running it manually to eliminate coding errors.
The specified job is not an active Vault job The specified active Vault job is not at the duplication step
Recommended Action: Specify the job ID of the active Vault job that is currently at the duplication step or operation.
294
NetBackup status codes and messages About NetBackup status codes and messages
The Symantec Private Branch Exchange service (VRTSpbx) or NetBackup Request Manager (bprd) is down. The NetBackup Vault Manager service is down, possibly because of the following: the Vault is not licensed, the vault.xml configuration file is corrupt, or the vault.xml configuration file upgrade failed during an upgrade installation.
NetBackup status codes and messages About NetBackup status codes and messages
295
Recommended Action: To determine the reason for failure, examine the logs for the service or services that are down and the operating system logs (EventLog on Windows). Restart the service or services that are down after resolving the problem. The following are the locations of the nbvault logs:
UNIX and Linux: /usr/openv/netbackup/logs/nbvault/ Windows: install_path\NetBackup\logs\nbvault
Reports were not emailed (possibly because of malformed email addresses in the vault.xml file). On Windows, the third-party mail client (such as blat) is not configured properly. The reports destination directory is not present or it does not have appropriate permissions. The printer is not set up correctly or the printer command in vault.xml is incorrect.
296
NetBackup status codes and messages About NetBackup status codes and messages
Some of the media is in use by NetBackup Some of the media are in a drive Not enough empty slots are available in the media access port (MAP)
Recommended Action: Ensure that the media are not loaded in a drive and in use by other processes. Ensure that empty slots are available in the media access port (MAP).
NetBackup status codes and messages About NetBackup status codes and messages
297
Also review the summary.log in each of the sidxxx directories that had problems:
UNIX and Linux: /usr/openv/netbackup/vault/sessions/vault_name/sidxxx Windows: install_path\NetBackup\vault\sessions\vault_name\sidxxx
(where xxx is the session ID) After the problem is identified and corrected, the media that were not ejected may need to be ejected manually by means of vlteject or vltopmenu. This error often indicates that the media were left in the off-site Vault volume group but physically reside in the robot or the robotic MAP. To solve this problem, do one of the following:
Manually remove any media that are in the off-site Vault volume group but are still in the robotic library. Inventory the robotic library. An inventory puts any media that were in the off-site Vault volume group back into the robotic volume group. Then rerun the Vault sessions that failed.
298
NetBackup status codes and messages About NetBackup status codes and messages
Also review the summary.log in each of the sidxxx directories that had problems:
UNIX and Linux: /usr/openv/netbackup/vault/sessions/vault_name/sidxxx Windows: install_path\NetBackup\vault\sessions\vault_name\sidxxx
(where xxx is the session ID) Use the robtest utility to ensure that you can communicate with the Vault robotic library. After the problem is resolved, rerun the Vault session, vlteject command, or vltopmenu command.
Could not open a pipe to vmchange -verify_eject call. Unexpected output from vmchange -verify_eject call. No MAP elements exist to eject media into. The robotic library had problems putting media into the MAP. The user pressed Return in interactive mode and did not first remove the media from the MAP. In this case, the media that were in the MAP are put back into their original slots in the robotic library.
Recommended Action: For detailed information about why the process was canceled, review the Vault debug log in the following directory:
NetBackup status codes and messages About NetBackup status codes and messages
299
Also review the summary.log in each of the sidxxx directories that had problems:
UNIX and Linux: /usr/openv/netbackup/vault/sessions/vault_name/sidxxx Windows: install_path\NetBackup\vault\sessions\vault_name\sidxxx
(where xxx is the session ID) This error often indicates that the media were left in the off-site Vault volume group but physically reside in the robot or the robotic MAP. To solve this problem, do one of the following:
Manually remove any media that are in the off-site Vault volume group but are still in the robotic library. Inventory the robotic library. An inventory puts any media that were in the off-site Vault volume group back into the robotic volume group. Then, rerun the Vault sessions that failed.
To find the actual problem that caused the catalog backup (bpbackupdb) to fail, review the summary.log in each of the sidxxx directories that had problems:
UNIX and Linux: /usr/openv/netbackup/vault/sessions/vault_name/sidxxx Windows: install_path\NetBackup\vault\sessions\vault_name\sidxxx
(where xxx is the session ID) Correct the problem and rerun the Vault job.
300
NetBackup status codes and messages About NetBackup status codes and messages
Also review the summary.log in each of the sidxxx directories that had problems:
UNIX and Linux: /usr/openv/netbackup/vault/sessions/vault_name/sidxxx Windows: install_path\NetBackup\vault\sessions\vault_name\sidxxx
(where xxx is the session ID) Correct the error and rerun the Vault session, vlteject command, or vltopmenu eject command.
vlteject is called with -eject but the system has no tapes to eject vlteject is called with -eject and the eject is already done vlteject is called with -report and the reports are already done vlteject is called with -eject and -report, and both the eject and the reports
are done Recommended Action: This error is an informative one and does not require any action.
NetBackup status codes and messages About NetBackup status codes and messages
301
The bad media ID was added by means of the vlt_ejectlist_notify script. The bad media ID is already in the MAP or not in the robotic library. The bad media ID is in a robotic drive. The bad media ID is in transit in the robotic library.
Recommended Action: Remove or correct the defective media ID from the vlt_ejectlist_notify script and rerun the Vault session. If the bad media ID is in the MAP or a drive or in transit, something is misconfigured.
Currently, the library ejects media The library waits to eject media Currently, the library injects media The library waits to inject media
Recommended Action: Wait until the robotic library can support the eject action and rerun the Vault session, vlteject command, or vltopmenu command.
302
NetBackup status codes and messages About NetBackup status codes and messages
Review the Vault debug log in the following directory for detailed information about why the process failed:
UNIX and Linux: /usr/openv/netbackup/logs/vault Windows: install_path\NetBackup\logs\vault
Also review the summary.log in each of the sidxxx directories that had problems:
UNIX and Linux: /usr/openv/netbackup/vault/sessions/vault_name/sidxxx Windows: install_path\NetBackup\vault\sessions\vault_name\sidxxx
(where xxx is the session ID) This error often indicates that the media were left in the off-site Vault volume group but physically reside in the robot or the robotic MAP. To solve this problem, do one of the following:
Manually remove any media that are in the off-site Vault volume group but are still in the robot. Inventory the robot. An inventory puts any media that were in the off-site Vault volume group back into the robotic volume group. Then, rerun the Vault sessions that failed.
Also review the summary.log in each of the sidxxx directories that had problems:
UNIX and Linux: /usr/openv/netbackup/vault/sessions/vault_name/sidxxx Windows: install_path\NetBackup\vault\sessions\vault_name\sidxxx
NetBackup status codes and messages About NetBackup status codes and messages
303
(where xxx is the session ID) When the problem is resolved, rerun the Vault session, vlteject command, or vltopmenu command.
Review the summary.log in each of the sidxxx directories that had problems:
UNIX and Linux: /usr/openv/netbackup/vault/sessions/vault_name/sidxxx Windows: install_path\NetBackup\vault\sessions\vault_name\sidxxx
(where xxx is the session ID) In addition, review the admin debug log in the following directory:
UNIX and Linux: /usr/openv/netbackup/logs/admin Windows: install_path\NetBackup\logs\admin
Also review the summary.log in each of the sidxxx directories that had problems:
304
NetBackup status codes and messages About NetBackup status codes and messages
(where xxx is the session ID) Try running the command (with the same arguments as in the log file) to see the actual error. Ensure that the Media and Device Management daemons are running. Also ensure that the robot is functional and you can communicate with it (for example, inventory the robot through the GUI).
NetBackup status codes and messages About NetBackup status codes and messages
305
Explanation: This error occurs when vlteject is run to eject media for a session ID for which media has already been ejected. Recommended Action: Rerun vlteject for another session ID for which media has not been ejected.
Also review the summary.log in each of the sidxxx directories that had problems:
UNIX and Linux: /usr/openv/netbackup/vault/sessions/<vault_name/>sidxxx Windows: install_path\NetBackup\vault\sessions\vault_name\sidxxx
(where vault_name is the name of the vault, and xxx is the session ID) Look for the log entry that gives the total number of images processed. A common cause of failure is a lack of resources, such as no more media available in the specified pools for duplication. Correct the problem and rerun the Vault job. Note that the NetBackup scheduler retries a Vault job that terminates with this error. Review the admin debug log for induplicate entries and the bptm debug log.
Rerun vlteject or vltopmenu to eject the media for the session before you generate the reports. Reconfigure the profile to allow the eject step to be performed when the next Vault session for this profile runs. Disable the report generation in the profile for the reports that require media to be ejected.
306
NetBackup status codes and messages About NetBackup status codes and messages
NetBackup status codes and messages About NetBackup status codes and messages
307
Recommended Action: Verify that you put some media into this container by using the vltcontainers command. Verify that you did not delete it by using the vltcontainers -delete command.
308
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: This error occurs while closing the container database. This error may occur during the reading, addition, modification, or deletion of an entry from the container database. Note that the container database exists in file cntrDB, which is located at install_path/netbackup/vault/sessions/cntrDB. Recommended Action: Read the relevant log file under the directory netbackup/logs/vault for more details. Be aware that if this directory does not already exist, a log file is not created.
NetBackup status codes and messages About NetBackup status codes and messages
309
Inject all the media it contains into a robot Clear the Vault container ID fields for these media from the EMM database by using vmchange -vltcid with a value of - .
310
NetBackup status codes and messages About NetBackup status codes and messages
NetBackup status codes and messages About NetBackup status codes and messages
311
312
NetBackup status codes and messages About NetBackup status codes and messages
On UNIX and Linux, verify that the NetBackup Volume Manager (vmd) is running. On Windows, verify that the NetBackup Volume Manager service is running. See the process-specific error log directory for more details.
UNIX and Linux: /usr/openv/netbackup/logs/process_name Windows: install_path\NetBackup\logs\process_name
For example, if you get this error while running a Vault command (such as vltcontainers or vltopmenu), look at the following logs to learn why:
/usr/openv/netbackup/logs/vault
Note: The log file cannot be created unless the appropriate log directory such as /usr/openv/netbackup/logs/vault is already created.
NetBackup status codes and messages About NetBackup status codes and messages
313
Remove the media from the MAP if it is already full. Make sure that the MAP is closed properly.
314
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: The Vault configuration file is malformed. Unless the file has been manually modified, this error is internal. Note that the Vault configuration file is located at install_path/netbackup/db/vault/vault.xml. Recommended Action: Contact customer support and send appropriate logs.
A robot or vault or profile in between the read Operations of the same robot or vault Profile by another instance of an administration interface
Recommended Action: Check the latest attributes of the robot or vault or profile. To check, refresh the view in the NetBackup Administration Console or retrieve the attributes in the Vault Administration menu user interface again. Then retry the operation.
NetBackup status codes and messages About NetBackup status codes and messages
315
316
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: An invalid Vault report was requested for generation. It is an internal error. Recommended Action: Contact customer support and send appropriate logs.
NetBackup status codes and messages About NetBackup status codes and messages
317
Select a valid server group type: MediaSharing, Symantec OpsCenter, or AltServerRestore. For detailed troubleshooting information, create the admin debug log directory and retry the operation. Check the resulting debug logs. Additional debug information can be found in the log for nbemm (originator ID 111), which uses unified logging.
Verify that the specified server group name is not in use. Create the server group by specifying a name that is not currently in use. For detailed troubleshooting information, create the admin debug log directory and retry the operation. Check the resulting debug logs. Additional debug information can be found in the log for nbemm (originator ID 111), which uses unified logging.
Verify that the specified server group name is not in use. Try to create the server group by specifying a name that is not currently in use. For detailed troubleshooting information, create the admin debug log directory and retry the operation. Check the resulting debug logs. Additional debug information can be found in the log for nbemm (originator ID 111), which uses unified logging.
318
NetBackup status codes and messages About NetBackup status codes and messages
Valid server group states are: ACTIVE and INACTIVE For detailed troubleshooting information, create the admin debug log directory and retry the operation. Check the resulting debug logs. Additional debug information can be found in the log for nbemm (originator ID 111), which uses unified logging.
Verify that the specified media is correct. Verify the media ownership. Verify that the server group exists. Verify that the server where the operation is performed is a member of the owning server group. If not, try the operation from a server that is a member of the server group. For detailed troubleshooting information, create the admin debug log directory and retry the operation. Check the resulting debug logs. Additional debug information can be found in the log for nbemm (originator IDs 111 and 143), which uses unified logging.
The Media Sharing server group can contain the following types of servers: Master, Media, NDMP, and cluster. For detailed troubleshooting information, create the admin debug log directory and retry the operation. Check the resulting debug logs. Additional debug information can be found in the log for nbemm (originator IDs 111 and 143), which uses unified logging.
NetBackup status codes and messages About NetBackup status codes and messages
319
Verify that the specified media is correct. Verify the media ownership Verify that the server where the operation is performed is a member of the owning server group. If not, try the operation from a server that is a member of the server group. For detailed troubleshooting information, create the admin debug log directory and retry the operation. Check the resulting debug logs. Additional debug information can be found in the log for nbemm (originator IDs 111 and 143), which use unified logging.
Ensure that each member server has NetBackup 6.5 or later. For detailed troubleshooting information, create the admin debug log directory and retry the operation. Check the resulting debug logs. Additional debug information can be found in the log for nbemm (originator IDs 111 and 143), which uses unified logging.
Ensure that the server group is not the owner of any media by running bpmedialist -owner group_name from the master server.
320
NetBackup status codes and messages About NetBackup status codes and messages
For detailed troubleshooting information, create the admin debug log directory and retry the operation. Check the resulting debug logs. Additional debug information can be found in the log for nbemm (originator IDs 111 and 143), which uses unified logging.
Ensure that the server group member that you add does not already exist in the group. For detailed troubleshooting information, create the admin debug log directory and retry the operation. Check the resulting debug logs. Additional debug information can be found in the log for nbemm (originator IDs 111 and 143), which uses unified logging.
NetBackup status codes and messages About NetBackup status codes and messages
321
For UNIX and Linux hosts: the user name must be a valid user name in the passwd file on the host that is specified in the logon dialog box. For Windows hosts: refer to the LogonUser function in the section titled Client/Server Access Control Functions of the Windows Platform Software Developers Kit to determine the required privileges.
Enter the correct password. On Windows hosts: The exact error can be found in the bpjava-msvc log file.
For more details, refer to the LogonUser function in the section Client/Server Access Control Functions of the Windows Platform Software Developers Kit.
322
NetBackup status codes and messages About NetBackup status codes and messages
the same port. The default port is 13722. The NetBackup Administration Console log file should contain more detail about this error. Recommended Action: Do the following, as appropriate:
On UNIX and Linux, compare the bpjava-msvc entry in the /etc/services file with the BPJAVA_PORT entry in the /usr/openv/java/nbj.conf file On Windows, compare the bpjava-msvc entry in the %systemroot%\system32\drivers\etc\services file with the install_path\java\setconf.bat file (Windows). The entries must match.
Ensure that no other application uses the port that is configured for the NetBackup-Java interface.
On UNIX and Linux: the port configuration options are specified in the /usr/openv/netbackup/bp.conf file or through Administration Console Host Properties. On Windows: from the NetBackup Administration Console, select Host Properties. Select Properties on the Actions menu. The Port Ranges tab contains the port options.
Restart the NetBackup-Java interface and log in again. If the problem persists, enable detailed debug logging. Restart the NetBackup-Java interface and examine the logs.
NetBackup status codes and messages About NetBackup status codes and messages
323
Retry the last operation. If the problem persists, restart the NetBackup-Java interface and try again. If the problem still persists, enable detailed debug logging. See Enabling detailed debug logging on page 127. Restart the NetBackup-Java interface and examine the logs. Note: You may have network or system problems unrelated to NetBackup.
The NetBackup-Java user service tries to write to a file that does not have write permissions. The solution is to enable write privileges. The NetBackup-Java user service tries to write to a temporary file whose unique name cannot be constructed. This condition is unlikely, but can result from an exhaustion of system resources (from the filling of the name space).
Recommended Action: Retrieve the specific details from the user service log files. Enable detailed debug logging as explained in the following topic: See Enabling detailed debug logging on page 127.
324
NetBackup status codes and messages About NetBackup status codes and messages
NetBackup-Java user service. When it is created and connected to, the NetBackup-Java user service creates all other child processes on behalf of requests from the NetBackup-Java interface. The cause of status code 509 can be found in the appropriate log file, either for bpjava-msvc, bpjava-susvc, or bpjava-usvc. The cause can be categorized as one of the following:
A job (started by either the NetBackup-Java authentication service or user service) no longer exists and did not report its result status. The NetBackup-Java service cannot monitor a job (started by either the NetBackup-Java authentication service or user service). The reason it cannot monitor is probably due to a lack of system resources (insufficient memory). The maximum number of non-transient activity monitor jobs (>100) have already been started.
If the problem persists, restart the NetBackup-Java interface and try again. If the problem still persists, enable detailed debug logging as explained in the following topic: See Enabling detailed debug logging on page 127. Restart the NetBackup-Java interface and examine the logs.
The error is probably the result of a system resource issue. When detailed debug logging is enabled, you can retrieve the details from the bpjava-msvc, bpjava-susvc, or bpjava-usvc log files.
NetBackup status codes and messages About NetBackup status codes and messages
325
This generic error for all non-socket IO/connection-broken related errors (status code 507) can occur when data is processed from the NetBackup-Java authentication or user services. The Java exception provides some additional detail about the error. This error usually results from system or network problems. Recommended Action: Do the following, as appropriate:
If the problem persists, restart the NetBackup-Java interface and try again. If the problem still persists, enable detailed debug logging as explained in the following topic: See Enabling detailed debug logging on page 127. Restart the NetBackup-Java interface and examine the logs.
The error is probably the result of a system resource issue. When detailed debug logging is enabled, the details can be retrieved from the bpjava-msvc, bpjava-susvc, or bpjava-usvc log files.
If the problem persists, restart the NetBackup-Java interface and try again. If the problem still persists, enable detailed debug logging as explained in the following topic: See Enabling detailed debug logging on page 127. Restart the NetBackup-Java interface and examine the logs.
The error is probably the result of a system resource issue. When detailed debug logging is enabled, the details can be retrieved from the bpjava-msvc, bpjava-susvc, or bpjava-usvc log files.
326
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: The NetBackup-Java application server (on the remote host you log in to) is not the same version as the NetBackup-Java interface on your local host. The two are therefore incompatible. Recommended Action: Do the following, as appropriate:
Log in to a different NetBackup remote host. Upgrade the NetBackup software on either of the following: the computer that is specified in the logon dialog box or the local host where you started the NetBackup Java interface.
Log in to a different NetBackup remote host. Upgrade the NetBackup software on either of the following: the specified computer in the logon dialog box or the local host where you started the NetBackup Java interface.
NetBackup status codes and messages About NetBackup status codes and messages
327
Recommended Action: On the specified host in the NB-Java logon dialog box, check the configuration file to ensure that a mapping is available for the indicated locale. For information on locale configuration and mapping, refer to the NetBackup Administrator's Guide, Volume II. If there is a mapping, try to set the mapped locale on the host that was specified in the NB-Java logon dialog box. This system may not be configured properly.
On UNIX and Linux: Compare the VNETD entry in the /etc/services file with the VNETD_PORT entry in /usr/openv/java/nbj.conf On Windows: Compare the VNETD entry with the VNETD_PORT entry in the install_path\java\setconf.bat file. These entries must match. Ensure that no other application uses the port that is configured for VNETD.
328
NetBackup status codes and messages About NetBackup status codes and messages
If the problem persists, increase the range of ports by changing the NBJAVA_CLIENT_PORT_WINDOW option in the /usr/openv/java/nbj.conf file (UNIX and Linux) or the install_path\java\setconf.bat file (Windows).
NetBackup status codes and messages About NetBackup status codes and messages
329
Explanation: In some instances, this message concludes with "Check the log file for more details." The NetBackup-Java interface received an incorrectly formatted protocol sequence from its application server. Recommended Action: Do the following, as appropriate:
If the problem persists, restart the NetBackup-Java interface and try again. If the problem still persists, enable detailed debug logging as explained in the following topic: See Enabling detailed debug logging on page 127. Restart the NetBackup-Java interface and examine the logs.
Note: The error is probably the result of a system resource issue. When detailed debug logging ID is enabled, the details can be retrieved from the bpjava-msvc, bpjava-susvc, or bpjava-usvc log files.
On UNIX and Linux: Compare the VNETD entry in the /etc/services file with the VNETD_PORT entry in /usr/openv/java/nbj.conf On Windows: Compare the VNETD entry with the VNETD_PORT entry in the install_path\java\setconf.bat file. These entries must match. Ensure that no other application uses the port that is configured for VNETD.
330
NetBackup status codes and messages About NetBackup status codes and messages
Recommended Action: Contact customer support and send appropriate debug logs. A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
NetBackup status codes and messages About NetBackup status codes and messages
331
Recommended Action: Ensure that the bpsynth binary matches the installed NetBackup version. Retry the synthetic backup job. If the problem persists, contact customer support and send appropriate logs. A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
332
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: The database query to obtain the images to synthesize for the given policy did not return any images. Recommended Action: Ensure that a synthetic full backup has one full image (real or synthetic) and one or more subsequent incremental images (differential or cumulative) to synthesize. For a cumulative synthetic backup, there must be two or more incremental (differential or cumulative) images to synthesize. Adjust your schedules so the appropriate backup jobs complete successfully before the synthetic job is run. The scheduler does not retry a synthetic backup job that fails with this error code.
NetBackup status codes and messages About NetBackup status codes and messages
333
Explanation: An error indication was received on the connection to the bptm or the bpdm process. Recommended Action: Review the errors in the NetBackup error log that the following processes logged: bpsynth and bptm or bpdm. Refer to the debug logs for these processes for more information. The connection may have broken due to the following: an error condition that the bptm or the bpdm process detects or network problems between the master and the media server. Check the network connectivity between the master and the media server. Retry the job and if the problem persists, contact customer support and send the appropriate logs. A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
334
NetBackup status codes and messages About NetBackup status codes and messages
debug logs for bpsynth on the master server and bpcd and bptm or bpdm on the media server. A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
Examine the NetBackup error log for any errors that bpsynth logged. For more information, refer to the following debug logs: for bpsynth on the master server and for bpcd and bptm or bpdm on the media server. Ensure that the bptm or the bpdm binaries on the media server are executable and are not corrupt. Try running bptm or bpdm commands locally on the media server to ensure that the binary is executable and not corrupt. For instance, you can run the following command
/bp/bin/bptm -count -rn 0 -rt 8
where robot number is 0 and the robot type is 8. The robot type that corresponds to the robot number can be taken from the command line that is logged in the debug log for bptm. This command displays the counts for the up, shared, and assigned drives in the robot. A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
NetBackup status codes and messages About NetBackup status codes and messages
335
Explanation: This error code is no longer used. Recommended Action: Submit a problem report along with appropriate logs. A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
336
NetBackup status codes and messages About NetBackup status codes and messages
NetBackup status codes and messages About NetBackup status codes and messages
337
Recommended Action: Ensure that network connectivity exists between the master and the media server. Look for additional error messages in the NetBackup error log. More detailed information is present in the debug logs for bpsynth (on master server) and bptm or bpdm on the media server. If the problem persists, contact customer support and send appropriate logs. A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
338
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: This error occurs in the following situation: partial data is read from the input socket and cannot be parsed until the remaining data that comprises the message is read. The lower layers encounter this error; it should not cause a process to be terminated. Recommended Action: If this error causes the bpsynth binary to hang or malfunction, contact customer support and send appropriate logs. A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
NetBackup status codes and messages About NetBackup status codes and messages
339
to indicate which particular media in the list was not suspended. media (ordinal) indicates the unsuspended media by means of its order in the list. For example, if the unsuspended media was the second media in the list, the failure message says media (1). Recommended Action: Examine the bptm debug log for more information on the reason for the suspend failure. The bpsynth process ignores this error and continues to process. It has the potential to fail later if the media with the images to be read gets assigned to another backup or restore job. If the synthetic backup job fails, fix the condition that lead to the suspend failure and retry the job.
340
NetBackup status codes and messages About NetBackup status codes and messages
Recommended Action: Ensure that the target storage unit that is configured for the synthetic backup schedule has an available drive to write the synthetic backup image.
NetBackup status codes and messages About NetBackup status codes and messages
341
The errno that logged to the NetBackup error log indicates the reason for the failure. For more information, refer to the following: the debug log for bpsynth (on the master server) and for the bptm or the bpdm reader or writer process (on the media server). Check the connectivity between the master and the media server. Retry the synthetic backup job. If the problem persists, contact customer support and send appropriate logs. A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
The synthetic backup job was scheduled or started by using the NetBackup Administration console (manual start). The policy exists in the bppllist command configuration. Check the logs for nbjm, which uses unified logging (OID 117).
A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
342
NetBackup status codes and messages About NetBackup status codes and messages
The command line initiated the bpsynth command The specified schedule was deleted from the policy after nbjm started bpsynth, before bpsynth issued the database query.
Recommended Action: If the command line initiated bpsynth, do the following: rerun the command with the correct synthetic schedule label defined in the policy of the synthetic backup job to be run. If the synthetic backup job was scheduled or started with the NetBackup Administration console, define a new schedule in the policy and retry the job. If the problem persists, contact customer support and send appropriate logs. A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
NetBackup status codes and messages About NetBackup status codes and messages
343
Explanation: This error code is no longer used. Recommended Action: Submit a problem report along with appropriate logs. A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
where robot number is 0 and robot type is 8. The robot type that corresponds to the robot number can be taken from the command line that is logged in the debug log for bptm. This command displays the counts for the up, shared, and assigned drives in the robot. In case the synthetic image is to be written to a disk storage unit, verify the bpdm binary by running the following command:
install_path/netbackup/bin/bpdm
344
NetBackup status codes and messages About NetBackup status codes and messages
It should print the following: "bpdm: media manager operation not specified". Retry the synthetic backup job. If the problem persists, contact customer support and send appropriate logs. A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
NetBackup status codes and messages About NetBackup status codes and messages
345
Recommended Action: This error indicates a communication problem between bpsynth and the bptm or the bpdm reader process on the media server. Ensure that the media server is accessible and that the bptm or the bpdm process is running on the media server. Examine the NetBackup error log for any errors that the following logged: bpsynth (on the master server) and the bptm or the bpdm reader process (on the media server). Examine the debug logs for bpsynth and bptm or bpdm for additional information. Rerun the synthetic backup job. If the problem persists, contact customer support and send appropriate logs. A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
346
NetBackup status codes and messages About NetBackup status codes and messages
backup job. If the problem persists, contact customer support and send appropriate logs. A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
NetBackup status codes and messages About NetBackup status codes and messages
347
348
NetBackup status codes and messages About NetBackup status codes and messages
debug logs for bpsynth on the master server and bptm or the bpdm process on the media server for more information. Ensure that network connectivity exists between the master and the media servers. If the problem persists, contact customer support and send appropriate logs. A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
NetBackup status codes and messages About NetBackup status codes and messages
349
media server. The network connection between the master and the media server may have terminated or the bptm or the bpdm reader process has terminated. Recommended Action: Check the network connectivity between the master and the media server. Examine the NetBackup error log for any errors that bpsynth and bptm or the bpdm reader process logged. Examine the debug logs for bpsynth on the master server and bptm or the bpdm reader process on the media servers for more detailed information. If the problem persists, contact customer support and provide the appropriate logs. A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
350
NetBackup status codes and messages About NetBackup status codes and messages
Recommended Action: Contact customer support and provide the appropriate logs. A complete list of required logs and configuration information is available. See Logs to accompany problem reports for synthetic backup on page 113.
NetBackup status codes and messages About NetBackup status codes and messages
351
352
NetBackup status codes and messages About NetBackup status codes and messages
Unified logging files on the NetBackup server for nbpem (originator ID 116), nbjm (117), nbrb (118), and PBX (103). All unified logging is written to /usr/openv/logs (UNIX and Linux) or install_path\NetBackup\logs (Windows). Legacy logs:
On the NetBackup master server for bpbrm, bpjobd, bpcompatd, bpdbm, and nbproxy On the media server for bpcd, bpbrm, and bptm or bpdm On the client for bpcd and bpbkar
Legacy logs are in subdirectories under /usr/openv/netbackup/logs/ (UNIX and Linux) or install_path\Netbackup\logs\ (Windows). If the directories do not exist, create directories for each of these processes and rerun the job.
Contents of /usr/openv/db/jobs/trylogs (UNIX and Linux) or install_path\NetBackup\db\jobs\trylogs (Windows). bpdbjobs output: run bpdbjobs to obtain the state and status of all jobs.
Unified logging files on the NetBackup server for nbpem (originator ID 116), nbjm (117), nbrb (118), and PBX (103). All unified logging is written to /usr/openv/logs (UNIX and Linux) or install_path\NetBackup\logs (Windows). Legacy logs:
On the NetBackup master server for bpbrm, bpjobd, bpcompatd, bpdbm, and nbproxy On the media server for bpcd, bpbrm, and bptm or bpdm On the client for bpcd and bpbkar
NetBackup status codes and messages About NetBackup status codes and messages
353
Legacy logs are in subdirectories under /usr/openv/netbackup/logs/ (UNIX and Linux) or install_path\Netbackup\logs\ (Windows). If the directories do not exist, create directories for each of these processes and rerun the job.
Contents of /usr/openv/db/jobs/trylogs (UNIX and Linux) or install_path\NetBackup\db\jobs\trylogs (Windows). bpdbjobs output: run bpdbjobs to obtain the state and status of all jobs.
Unified logging files on the NetBackup server for nbpem (originator ID 116), nbjm (117), nbrb (118), and PBX (103). All unified logging is written to /usr/openv/logs (UNIX and Linux) or install_path\NetBackup\logs (Windows). Legacy logs:
On the NetBackup master server for bpbrm, bpjobd, bpcompatd, bpdbm, and nbproxy On the media server for bpcd, bpbrm, and bptm or bpdm On the client for bpcd and bpbkar
Legacy logs are in subdirectories under /usr/openv/netbackup/logs/ (UNIX and Linux) or install_path\Netbackup\logs\ (Windows). If the directories do not exist, create directories for each of these processes and rerun the job.
Contents of /usr/openv/db/jobs/trylogs (UNIX and Linux) or install_path\NetBackup\db\jobs\trylogs (Windows). bpdbjobs output: run bpdbjobs to obtain the state and status of all jobs.
354
NetBackup status codes and messages About NetBackup status codes and messages
Recommended Action: The requested operation may refer to a job that no longer exists or is not known to nbjm. Or the job ID is invalid (less than or equal to 0). Ensure that the command used to start the job did not specify a job ID already in use by another job. If the problem persists, submit a report with the following items.
Unified logging files on the NetBackup server for nbpem (originator ID 116), nbjm (117), nbrb (118), and PBX (103). All unified logging is written to /usr/openv/logs (UNIX and Linux) or install_path\NetBackup\logs (Windows). Legacy logs:
On the NetBackup master server for bpbrm, bpjobd, bpcompatd, bpdbm, and nbproxy On the media server for bpcd, bpbrm, and bptm or bpdm On the client for bpcd and bpbkar
Legacy logs are in subdirectories under /usr/openv/netbackup/logs/ (UNIX and Linux) or install_path\Netbackup\logs\ (Windows). If the directories do not exist, you must create directories for each of these processes and rerun the job.
Contents of /usr/openv/db/jobs/trylogs (UNIX and Linux) or install_path\NetBackup\db\jobs\trylogs (Windows). bpdbjobs output: run bpdbjobs to obtain the state and status of all jobs.
NetBackup status codes and messages About NetBackup status codes and messages
355
Verify connectivity between the master and the media server. Verify Private Branch Exchange (PBX) configuration and permissions. For information on PBX is available. See Resolving PBX problems on page 67.
Verify connectivity between the master sever and the EMM server. Verify Private Branch Exchange (PBX) configuration and permissions. For information on PBX is available. See Resolving PBX problems on page 67.
356
NetBackup status codes and messages About NetBackup status codes and messages
Unified logging files on the NetBackup server for nbpem (originator ID 116), nbjm (117), nbrb (118), and PBX (103). All unified logging is written to /usr/openv/logs (UNIX and Linux) or install_path\NetBackup\logs (Windows). The following legacy logs:
On the NetBackup master server for bpbrm, bpjobd, bpcompatd, bpdbm, and nbproxy On the media server for bpcd, bpbrm, and bptm or bpdm On the client for bpcd and bpbkar
Legacy logs are in subdirectories under /usr/openv/netbackup/logs/ (UNIX and Linux) or install_path\Netbackup\logs\ (Windows). If the directories do not exist, create directories for each of these processes and rerun the job.
Contents of /usr/openv/db/jobs/trylogs (UNIX and Linux) or install_path\NetBackup\db\jobs\trylogs (Windows). bpdbjobs output: run bpdbjobs to obtain the state and status of all jobs.
No drives are configured for the media requested for the job. The job requires an NDMP drive path, but no NDMP drive paths are up or configured.
NetBackup status codes and messages About NetBackup status codes and messages
357
The job cannot use an NDMP drive path, but these are the only drive paths that are up or configured for the drive.
Make sure the drives and drive paths are up and correct for the media that you configure. Verify that ltid is running on the required media server, and that the media server is active for tape. Use the device monitor to start up the drives paths if they are down. If the drives are downed again, clean the drives.
For detailed information, examine the unified logging files on the NetBackup server for the nbrb service (originator ID 118). All unified logging is written to /usr/openv/logs (UNIX and Linux) or install_path\NetBackup\logs (Windows). If necessary, set global logging to a higher level by using Host Properties > Master Server > Properties > Logging. Retry the operation and examine the nbrb logs.
358
NetBackup status codes and messages About NetBackup status codes and messages
For detailed information, examine the unified logging files on the NetBackup server for the nbrb service (originator ID 118). All unified logging is written to /usr/openv/logs (UNIX and Linux) or install_path\NetBackup\logs (Windows). If necessary, set global logging to a higher level, by using Host Properties > Master Server > Properties > Logging. Retry the operation and examine the nbrb logs.
For detailed information, examine the unified logging files on the NetBackup server for the nbrb service (originator ID 118). All unified logging is written to /usr/openv/logs (UNIX and Linux) or install_path\NetBackup\logs (Windows). If necessary, set global logging to a higher level, by using Host Properties > Master Server > Properties > Logging. Retry the operation and examine the nbrb logs.
For detailed information, examine the unified logging files on the NetBackup server for the nbrb service (originator ID 118). All unified logging is written to /usr/openv/logs (UNIX and Linux) or install_path\NetBackup\logs (Windows). If necessary, set global logging to a higher level, by using Host Properties > Master Server > Properties > Logging. Retry the operation and examine the nbrb logs.
NetBackup status codes and messages About NetBackup status codes and messages
359
Explanation: Under some conditions, failed mounts are retried; the number of retries for the resource request has been exceeded. Recommended Action: For detailed information, examine the unified logging files on the NetBackup server for the nbrb service (originator ID 118) and for nbemm (originator ID 111). All unified logging is written to /usr/openv/logs (UNIX and Linux) or install_path\NetBackup\logs (Windows). Also examine the legacy bptm log.
Configure the backup schedule with a storage unit or storage unit groups that can be run on the same media server. For detailed information, examine the unified logging files on the NetBackup server for the nbrb service (originator ID 118). All unified logging is written to /usr/openv/logs (UNIX and Linux) or install_path\NetBackup\logs (Windows).
Determine the cause of the mount request denial and retry the job. For detailed information, examine the unified logging files on the NetBackup server for the nbrb service (originator ID 118). All unified logging is written to /usr/openv/logs (UNIX and Linux) or install_path\NetBackup\logs (Windows).
360
NetBackup status codes and messages About NetBackup status codes and messages
Determine the action that resulted in cancellation of the resource request. For detailed information, examine the unified logging files on the NetBackup server for the nbrb service (originator ID 118). All unified logging is written to /usr/openv/logs (UNIX and Linux) or install_path\NetBackup\logs (Windows).
Determine the action that reset nbrb resources and the nbemm database. For detailed information, examine the unified logging files on the NetBackup server for the nbrb service (originator ID 118). All unified logging is written to /usr/openv/logs (UNIX and Linux) or install_path\NetBackup\logs (Windows).
For detailed information, examine the unified logging files on the NetBackup server for the nbrb service (originator ID 118). Unified logging is written to /usr/openv/logs (UNIX and Linux) or install_path\NetBackup\logs (Windows). If necessary, set global logging to a higher level: Host Properties > Master Server > Properties > Logging. Retry the operation and examine the nbrb logs.
NetBackup status codes and messages About NetBackup status codes and messages
361
Broker (nbrb) receives an allocation request for a read media that was never reserved. It can result from an internal error in bpduplicate, nbjm, or nbrb. Recommended Action: Do the following, as appropriate:
Examine the unified logging files on the NetBackup server for the nbrb service (originator ID 118). Unified logging is written to /usr/openv/logs (UNIX and Linux) or to install_path\NetBackup\logs (Windows). If necessary, set global logging to a higher level: Host Properties > Master Server > Properties > Logging. Retry the operation and examine the nbrb logs.
Verify that processes are not in the disk volume mount directories, which prevents them from being unmounted. If the problem persists, restart NetBackup on the media server.
362
NetBackup status codes and messages About NetBackup status codes and messages
Verify that the specified storage unit or storage unit group exists in the NetBackup database. Verify that the specified storage unit is not of type BasicDisk, SnapVault, or Staging, because storage lifecycle policies do not support them. Verify that the storage unit is not specified for snapshot destinations in storage lifecycle policy. Snapshot destinations do not require a storage unit in the storage lifecycle policy.
Verify that the specified source refers to a valid destination in the list of destinations in storage lifecycle policy. Verify that the Backup or Snapshot destination type does not have any source specified. If you are using nbstl to configure storage lifecycle policy, use value zero (0) as the source for a backup or snapshot destination type. Verify that the destination that refers to the specified source is not a Snapshot destination type. NetBackup cannot duplicate a backup image that uses snapshot copy as a source. Verify that the specified source does not refer to the destination itself for which the source is mentioned. Verify that the specified list of destinations in a storage lifecycle has a circular dependency for the source.
Verify that the retention type specified in storage lifecycle policy is either of Fixed, Staged capacity managed, or Expire after duplication. Verify that the Expire after duplication retention type is used for a destination only if it is specified as a source to other destinations in storage lifecycle policy.
NetBackup status codes and messages About NetBackup status codes and messages
363
Verify that the Staged capacity managed retention type is used in storage lifecycle policy only for disk destinations that support Capacity management capabilities. Verify that the Snapshot destination type in storage lifecycle policy uses Fixed retention type only.
Verify that the specified volume pool exists in NetBackup database. Verify that the volume pool is not specified for the Backup destination type in storage lifecycle policy. Verify that the volume pool is not specified for the Snapshot destination type in storage lifecycle policy. Verify that the volume pool is not specified for the Duplication destination type that is using disk storage units in storage lifecycle policy.
Verify that the specified media server group exists in NetBackup database. Verify that the media server group is not specified for Backup destination types in storage lifecycle policy. Verify that the server group is not specified for Snapshot destination types in storage lifecycle policy. Verify that the media server group is not specified for Duplication destination types that are using disk storage unit in storage lifecycle policy.
364
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: The alternate read server specified for one or more destinations in storage lifecycle policy is not valid. Recommended Action: Do the following, as appropriate:
Verify that the specified alternate read server exists in the NetBackup database. Verify that the alternate read server is not specified for Backup destination types in storage lifecycle policy. Verify that the alternate read server is not specified for Snapshot destination types in storage lifecycle policy.
Verify that the number of destinations specified in storage lifecycle policy does not exceed maximum backup copies configured in NetBackup. Modify maximum backup copies to allow the expected number of destinations in storage lifecycle policy or reduce the number of destinations to not exceed maximum backup copies.
NetBackup status codes and messages About NetBackup status codes and messages
365
366
NetBackup status codes and messages About NetBackup status codes and messages
Verify that the destination type specified on each destination in storage lifecycle policy is either Backup, Duplication, or Snapshot. If you are use the nbstl or bpadm command to configure storage lifecycle policy, the following are the valid values for the destination type: 0 indicates Backup, 1 indicates Duplication, and 2 indicates Snapshot.
If you use the nbstl command to configure storage lifecycle policy, then verify that the valid values are used to indicate the preserve multiplexing flag for each destination. The value T or t indicates true (Preserve multiplexing). The value F or f indicates false (do not preserve multiplexing). Verify that destinations of type Backup are not configured to preserve multiplexing. Verify that destinations of type Snapshot are not configured to preserve multiplexing. Verify that destinations of type Duplication that are using disk storage units are not configured to preserve multiplexing.
NetBackup status codes and messages About NetBackup status codes and messages
367
Verify that if a NetBackup policy is configured to perform snapshot backups and uses storage lifecycle policy as its storage destination, then the specified storage lifecycle policy must be configured with a snapshot destination. Otherwise, backup images created by those NetBackup policies are not processed further by storage lifecycle policy for any lifecycle operations. Verify that NetBackup policies using storage lifecycle policy as a storage destination are not configured to perform snapshots-only operations and they must create backups in addition to snapshots. Storage lifecycle policy even though configured with a snapshot destination would perform lifecycle operations on such images only if they have at least one backup copy.
Recommended Action: Specify only one client in the policy, or remove the directive from the backup selection that does not support multiple client. See the Troubleshooting section of the NetBackup for Enterprise Vault Agent Administrators Guide.
368
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: The NetBackup Media and Device Selection component (MDS) did not allocate a new tape volume for backups. This error indicates that the s torage unit has no more volumes available in the volume pool for this backup. NetBackup does not change storage units during the backup. Recommended Action: Try the following:
Recommended Action: Check the NetBackup Problems report to determine the storage unit that is out of media. If the storage unit is a robot with empty slots, add more volumes and specify the correct volume pool. If no empty slots exist, move some media to nonrobotic and add new volumes. If you have difficuly keeping track of your available volumes, try the available_media script located in the following directory: On UNIX: /usr/openv/netbackup/bin/goodies/available_media On Windows: install_path\NetBackup\bin\goodies\available_media.cmd This script lists all volumes in the volume configuration, and augments that list with information on the volumes currently assigned to NetBackup.
Set up a scratch volume pool as a reserve of unassigned tapes. If NetBackup needs a new tape and none are available in the current volume pool, it moves a tape from the scratch pool into the volume pool that the backup uses. If the storage unit and volume pool appear to have media, verify the following:
Use the NetBackup Media List report to check that the volume is not FROZEN or SUSPENDED. If the volume is frozen or suspended, use the bpmedia command to unfreeze or unsuspend it, if so desired. Volume has not expired or exceeded its maximum number of mounts. The EMM database host name for the device is correct. If you change the EMM database host name, stop and restart the following: the Media Manager device daemon, ltid, (if the server is UNIX) or the NetBackup Device Manager service (if the server is a Windows system). The correct host is specified for the storage unit in the NetBackup configuration. The host connection must be the server (master or media) with drives connected to it. The Media and Device Management volume configuration has media in the correct volume pool. Unassigned or active media is available at the required retention level. Use the NetBackup Media List report to show the retention levels, volume pools, and status for all volumes. Use the NetBackup Media Summary report to check for active volumes at the correct retention levels.
Create the bptm debug log directory, and set mds VxUL logging (oid 143) to debug level 2, then retry the operation.
NetBackup status codes and messages About NetBackup status codes and messages
369
If this storage unit is new and this attempt to use it is the first, stop and restart NetBackup on the master server. The mds unified logging files (OID 143) at debug level 2 usually show the NetBackup media selection process.
Verify that the required drives and robots are configured and up. Verify that ltid is active and a drive path exists that is configured on the drive for the media server that the storage unit requires. If the job requires an NDMP drive path, verify that one exists.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2.
370
NetBackup status codes and messages About NetBackup status codes and messages
The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3. The bptm legacy log from the media server, with VERBOSE=5.
NetBackup status codes and messages About NetBackup status codes and messages
371
Recommended Action: This error can occur when a job tries to run on a media server that is not at a recent enough NetBackup revision level to run the job. The job may require a feature that is not available on the media server being requested. Verify that the storage unit type and media server called out by the policy and storage unit support the feature the job is trying to use. This error can occur when the policy being run is not compatible with the storage units requested by the policy:
SnapVault storage units are requested for multiple copy jobs. Catalog backups are directed to shared disk storage units. Multiplexed jobs are directed to storage units that do not have the multiplex feature configured. NDMP backup policies are directed to non-NDMP storage units.
372
NetBackup status codes and messages About NetBackup status codes and messages
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
NetBackup status codes and messages About NetBackup status codes and messages
373
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
374
NetBackup status codes and messages About NetBackup status codes and messages
NetBackup status codes and messages About NetBackup status codes and messages
375
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2.
376
NetBackup status codes and messages About NetBackup status codes and messages
The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
Verify that the media server required for this allocation is running. If this is a tape job, verify that ltid is running on the media server. If a media server has gone offline and returned online, a delay of several minutes may occur before a job can use that media server.
NetBackup status codes and messages About NetBackup status codes and messages
377
Use the vmoprcmd command to show the state of the media server. If the job is trying to access a disk storage unit, verify that the nbrmms process is running on the media server. If the job is trying to access a tape storage unit, verify that ltid is running on the media server.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
Run the nbrbutil -resetall command to get the NetBackup allocation database back into a consistent state. Note that this command cancels all jobs including those that are currently running.
378
NetBackup status codes and messages About NetBackup status codes and messages
If there are images on disk that can be expired, expire them, and run nbdelete to delete the images from disk. It may be that NetBackup is creating disk images more quickly than they are expiring. If so, modifications to policies may be necessary to change the rate of image creation/expiration for the disk. It may be helpful to lower the high water mark and/or low water mark for the disk group.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2.
NetBackup status codes and messages About NetBackup status codes and messages
379
The nbrb unified logging files (OID 118) from the master server at debug level 3. The verbose bptm logs on the media server(s) that access this media.
380
NetBackup status codes and messages About NetBackup status codes and messages
on. It may take some time for the drive to become ready after the tape has been inserted. If the drive never goes to a ready state after a tape has been inserted, there may be a problem with the drive.
NetBackup status codes and messages About NetBackup status codes and messages
381
382
NetBackup status codes and messages About NetBackup status codes and messages
NetBackup status codes and messages About NetBackup status codes and messages
383
For more information, refer to the section on configuring ACSLS robots in the NetBackup Device Configuration Guide.
384
NetBackup status codes and messages About NetBackup status codes and messages
Recommended Action: Specify a different file name for tpreq, or run the tpunmount command for the media which has been loaded using tpreq with the conflicting file name.
NetBackup status codes and messages About NetBackup status codes and messages
385
Recommended Action:
386
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: When a job is run on a media server installed with pre NetBackup 6.0 software, there are some situations where the job starts, but the drives designated for the job are busy or otherwise unavailable, causing the job to fail. In this case, the NetBackup master server stops using that drive for a period of time (several minutes), and returns this error for jobs requesting that drive. Recommended Action: Make sure all of the drives in the storage unit are up and available for use. Clean all drives that need cleaning.
NetBackup status codes and messages About NetBackup status codes and messages
387
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
388
NetBackup status codes and messages About NetBackup status codes and messages
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
NetBackup status codes and messages About NetBackup status codes and messages
389
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
390
NetBackup status codes and messages About NetBackup status codes and messages
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
NetBackup status codes and messages About NetBackup status codes and messages
391
Explanation: MDS expected to find a group of reservations for this media, but did not find the expected reservation group. Recommended Action: This error is internal. If the problem persists, check the following logs to help identify the problem:
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
The mds unified logging files (OID 143) from the master server at debug level 2. The nbrb unified logging files (OID 118) from the master server at debug level 3.
392
NetBackup status codes and messages About NetBackup status codes and messages
Verify that the Private Branch Exchange (PBX) service is installed and running. Run the bpps -x command to ensure that pbx_exchange is listed in the processes that are running. Verify that the client is configured as a SAN Client. Run the bpclntcmd -sanclient command to return the current state of the SAN Client. A response of 0 (zero) indicates that SAN Client is not configured. Rerun the command as bpclntcmd -sanclient 1. Verify that the client is not also a media server, master server, or EMM server. The SAN Client process runs only on a NetBackup client. Verify that a valid license for the SAN Client feature resides on the master server. SAN Client is a separately licensed feature which requires a key called "SAN Client" on the master server. Verify that no server entries for the SAN Client exist on the NetBackup master server. Remove any SERVER=clientname entry in the master server for the SAN Client. If the master server has the SAN Client listed also as a server, it may shut down the SAN Client.
NetBackup status codes and messages About NetBackup status codes and messages
393
Restart the SAN Client service. The SAN Client restarts when the server reboots, but does not automatically restart after you run the bpclntcmd command. To start the SAN Client service, run the client startup script or use the Service Manager in Windows.
For further information, see the Troubleshooting chapter of the NetBackup SAN Client and Fibre Transport Guide.
Verify that the Private Branch Exchange (PBX) service is installed and running. Run the bpps -x command to ensure that pbx_exchange is listed in the processes that are running. Verify that the client is configured as a SAN Client. Run the bpclntcmd -sanclient command to return the current state of the SAN Client. A response of 0 (zero) indicates that SAN Client is not configured. Rerun the command as bpclntcmd -sanclient 1. Verify that the client is not also a media server, master server, or EMM server. The SAN Client process runs only on a NetBackup client. Verify that a valid license for the SAN Client feature resides on the master server. SAN Client is a separately licensed feature which requires a key called "SAN Client" on the master server. Verify that no server entries for the SAN Client exist on the NetBackup master server. Remove any SERVER=clientname entry in the master server for the SAN Client. If the master server has the SAN Client listed also as a server, it may shut down the SAN Client.
394
NetBackup status codes and messages About NetBackup status codes and messages
Restart the SAN Client service. The SAN Client restarts when the server reboots, but does not automatically restart after you run the bpclntcmd command. To start the SAN Client service, run the client startup script or use the Service Manager in Windows.
For further information, see the Troubleshooting chapter of the NetBackup SAN Client and Fibre Transport Guide.
Verify that marked QLogic Target ports exist in the server The nbftsrvr and nbfdrv64 processes exit if the system has no ports available for Target Mode use. Verify that a valid license for the SAN Client feature resides on the master server. SAN Client is a separately licensed feature which requires a key called "SAN Client" on the master server. The Fibre Transport server performs a license check during startup. Determine if a reboot is required for the installation of the Fibre Transport server. Solaris installations require that you reboot the media server after you install Fibre Transport to load the Fibre Transport drivers on the marked target ports. In addition, reboot the Linux servers if you choose to not unload the existing QLogic drivers during the installation of Fibre Transport.
For further information, see the Troubleshooting chapter of the NetBackup SAN Client and Fibre Transport Guide.
Verify that marked QLogic Target ports exist in the server The nbftsrvr and nbfdrv64 processes exit if the system has no ports available for Target Mode use. Verify that a valid license for the SAN Client feature resides on the master server. SAN Client is a separately licensed feature which requires a key called
NetBackup status codes and messages About NetBackup status codes and messages
395
"SAN Client" on the master server. The Fibre Transport server performs a license check during startup.
Determine if a reboot is required for the installation of the Fibre Transport server. Solaris installations require that you reboot the media server after you install Fibre Transport to load the Fibre Transport drivers on the marked target ports. In addition, reboot the Linux servers if you choose to not unload the existing QLogic drivers during the installation of Fibre Transport.
Recommended Action: For further information, please refer to the NetBackup SAN Client and Fibre Transport Troubleshooting Guide.
396
NetBackup status codes and messages About NetBackup status codes and messages
Explanation: You cannot run this job because it is configured to use a storage unit group that does not exist in the NetBackup configuration. Recommended Action: Reconfigure the policy to use a storage unit or storage unit group that exists in the configuration.
NetBackup status codes and messages About NetBackup status codes and messages
397
NetBackup database, you may use the Force Media Server Restore option to force NetBackup to replace the missing media server with a new media server.
NetBackup Messages
This section lists the NetBackup error messages alphabetically. The status code for each message is listed in the right column of the table. Refer to the previous list of status codes for explanations and recommended actions. Table 5-1 lists the NetBackup error messages alphabetically.
398
NetBackup status codes and messages About NetBackup status codes and messages
/usr/openv/netbackup/bp.conf not found a protocol error has occurred access to server backup restore manager denied access to the client was not allowed ACS media has an unreadable external label ACS media is not in the drive's domain ACS physical drive is not available afs/dfs command failed all compatible drive paths are down all compatible drive paths are down but media is available all configured vault steps failed all volumes are not available to eject allocation identifier is not known to EMM allocation failed allocation record insert failed allocation request delete failed allocation request update failed allocation status record insert failed allocation status request delete failed an ACS Library Storage Module (LSM) is offline an entry in the filelist expanded to too many characters an exception condition occurred
an extended error status has been encountered, check detailed status 252 an extension package is needed but was not installed an invalid entry was encountered 9 223
NetBackup status codes and messages About NetBackup status codes and messages
399
another NB database backup is already in progress archive file removal failed a session is already running for this vault a synthetic backup request for media resources failed authentication failed Backup Exec operation failed backup restore manager failed to read the file list backups are not allowed to span media bpcoord startup validation failure bpjava-msvc: the client is not compatible with this server version server_version bpstart_notify failed cannot connect on socket cannot connect to nbvault server cannot connect to read media server cannot connect to server backup restore manager
Can not connect to the NB-Java authentication service on the 505 configured port - configured_port_number. Check the log file for more details. Can not connect to the NB-Java authentication service via VNETD on 525 host on port vnetd_configured_port_number. Check the log file for more details. Can not connect to the NB-Java user service on port port_number. Check the log file for more details. Can not connect to the NB-Java user service via VNETD on (host) or port (configured_port_number) 506
517
cannot consolidate reports of sessions from container and slot-based 289 vaults
400
NetBackup status codes and messages About NetBackup status codes and messages
cannot find configuration database record for requested NB database 120 backup cannot find requested volume pool in EMM database cannot find robot in vault configuration file cannot find robot, vault, or profile in the vault configuration cannot find the local host name cannot find vault in vault configuration file cannot get a bound socket cannot make required directory cannot modify - stale view cannot overwrite media, data on it is protected cannot perform specified media import operation cannot position to correct image cannot read backup header, media may be corrupted 167 323 266 267 322 146 35 342 168 176 94 173
cannot read media header, may not be NetBackup media or is corrupted 172 \cannot register handler for accepting new connections cannot send extents to bpsynth cannot set non blocking mode on the listen socket cannot start reader on the media server Can not write file cant connect to client child process killed by signal Cleaning media is not available client backup failed to read the file list 603 612 602 614 508 58 27 2088 67
NetBackup status codes and messages About NetBackup status codes and messages
401
client backup failed to receive the CONTINUE BACKUP message client backup was not attempted client backup was not attempted because backup window closed client cannot read the mount table client connection refused client did not start client hostname could not be found client is not validated to perform the requested operation client is not validated to use the server client name mismatch client process aborted client timed out reading file client timed out waiting for bpend_notify to complete client timed out waiting for bpstart_notify to complete client timed out waiting for the continue message from the media manager client timed out waiting for the file list clients network is unreachable client/server handshaking failed communication interrupted connection refused by server backup restore manager connection to the peer process does not exist container cannot hold any media from the specified robot container database close operation failed container database lock operation failed
402
NetBackup status codes and messages About NetBackup status codes and messages
container database open operation failed container database truncate operation failed container does not exist in container database container_id is not unique in container database continue could not deassign media due to Media Manager error could not get group information could not get passwd information could not set group id for process could not set user id for process daemon fork failed daemon is already running data marshalling error data un-marshalling error database system error density is incorrect for the media id density mismatch detected disk is full Disk pool is down Disk pool not found Disk storage unit is full Disk volume cannot be used for more than one copy in the same job Disk volume has no max readers count Disk volume has no max writers count Disk volume is down
NetBackup status codes and messages About NetBackup status codes and messages
403
Disk volume is not available Disk volume mount point not found Disk volume mount point record insert failed Disk volume not found done drive is already allocated drive is in a robotic library that is up drive is not allocated drive is not ready DSM has already mounted the volume
DSM has detected that an invalid filesystem is mounted on the volume 2079 DSM returned an unexpected error duplicate backup images were found duplicate MAP duplicate reference string specified EC_badop (there is no explanation for this status code) EC_end (there is no explanation for this status code) EC_error (there is no explanation for this status code) eject process could not obtain information about the robot eject process failed to start eject process has already been run for the requested vault session eject process has been aborted eject process is complete EMM database is inconsistent end point terminated with an error 2076 642 346 813 113 115 114 295 292 307 293 279 2029 610
404
NetBackup status codes and messages About NetBackup status codes and messages
error encountered attempting backup of catalog (multiple tape catalog 302 backup) error encountered executing Media Manager command error getting information from EMM database error getting information from media manager command line error obtaining date of last backup for client error occurred during initialization, check configuration file error(s) occurred during vault report distribution error receiving information on message queue error record insert failed error requesting media (tpreq) error sending information on message queue error(s) occurred during vault report generation 303 332 333 207 103 284 210 2013 98 212 283
Evaluation software has expired. See www.symantec.com for ordering 161 information events out of sequence - image inconsistency execution of a command in a forked process failed execution of a vault notify script failed 229 623 272
execution of the specified system command returned a nonzero status 77 extent directive contained an unknown media id failed accessing daemon lock file failed appending to container database failed closing mail pipe failed opening mail pipe failed reading policy database information 644 158 315 102 101 218
NetBackup status codes and messages About NetBackup status codes and messages
405
failed reading global config database information failed reading retention database information failed reading storage unit database information failed to communicate with resource broker failed to communicate with resource requester failed to get job data failed to send signal failed trying to allocate memory failed trying to exec a command failed trying to fork a process failed waiting for child process failed while trying to send mail failure occurred while suspending media for eject failure occurred while updating session information failure occurred while updating the eject.mstr file fatal NB media database error Fibre Transport resources are not available File already exists: file_name file close failed file does not exist file open failed file path specified is not absolute file pathname exceeds the maximum length allowed file read failed file write failed
406
NetBackup status codes and messages About NetBackup status codes and messages
found no images or media matching the selection criteria FT client devices are offline FT client has no devices configured FT client is not running FT server devices for client are offline getservbyname failed handshaking failed with server backup restore manager host is unreachable image does not have a fragment map inadequate buffer space incorrect catalog backup policy Incorrect password Incorrect server platform identifier incorrect vault catalog backup schedule insufficient data received insufficient disk space or high water mark would be exceeded internal error 615 internal error 616 internal error 618 internal error 619 internal error 620 internal error 654 Internal error - a bad status packet was returned by NB-Java application server that did not contain an exit status code invalid arguments specified
638
NetBackup status codes and messages About NetBackup status codes and messages
407
invalid command parameter invalid command protocol invalid command usage invalid container database entry invalid container description invalid container id invalid database host invalid data found in retention map file for duplication invalid date specified
invalid file pathname invalid file pathname found, cannot process request invalid filelist specification invalid job id invalid media type specified in the storage unit
Invalid NBJAVA_CLIENT_PORT_WINDOW configuration option value: 519 (option_value) Invalid operation on static mount point invalid recall status invalid request invalid STU identifier type invalid username invalid value for NB-Java configuration option (option_name): (option_value) Iron Mountain report is already created for this session JM internal error 2072 329 133 2002 503 520
311 801
408
NetBackup status codes and messages About NetBackup status codes and messages
JM internal protocol error JM terminating job history indicates that no drive is available job history indicates that no media is available job type is invalid licensed use has been exceeded logic error encountered
LTID needs to be restarted on media servers before the device can be 2058 used master server request failed maximum job count has been reached for the storage unit Maximum number of mounts has been exceeded for tape media MDS has received an invalid message from a media server Media affinity group record insert failed Media allocation would exceed maximum partially full media limit media block size changed prior to resume media close error media has been misplaced media has conflicts in EMM media id is either expired or will exceed maximum mounts media id is not in NetBackup volume pool media id must be 6 or less characters media id is not assigned to this host in the EMM database media is assigned to another application media is assigned to another server 149 2040 2100 2005 2084 2087 163 87 2033 2012 169 178 171 95 2044 2016
NetBackup status codes and messages About NetBackup status codes and messages
409
media is in a drive that is currently in a DOWN state media is in a drive that is not configured on local system media is in an inaccessible drive media is in use according to EMM media is in use by the ACS robotic library media is not assigned media is not defined in EMM media is reserved media is unmountable media is write protected media loaded in drive is not write-enabled Media Manager device daemon (ltid) is not active Media Manager volume daemon (vmd) is not active media manager detected image that was not in tar format media manager found wrong tape in drive media manager killed by signal media manager received no data for backup image media manager - system error occurred media needs to be rewound or unmounted from a drive media needs to be unmounted from a drive media not found in the ACS robotic library media open error Media pool is not eligible for this job media position error
410
NetBackup status codes and messages About NetBackup status codes and messages
media server is not active Media server mismatch Media server not found in EMM database media write error Member already exists in server group Members NetBackup version not compatible with Server Group Member's server type not compatible with Server Group multiple profiles exist NB database backup failed, a path was not found or is inaccessible NB database backup header is too large, too many paths specified
NB database recovery failed, a process has encountered an exceptional 128 condition NB image database contains no image fragments for requested backup 165 id/copy number NB-Java application server interface error: Java exception NB-Java application server protocol error 511 523
NB-Java: bpjava-msvc is not compatible with this application version 514 (application_version). You may try login to a different NetBackup host or exit the application. The remote NetBackup host will have to be configured with the same version of NetBackup as the host you started the application on. NB-Java Configuration file (file_name) does not exist NB-Java Configuration file (file_name) is not readable due to the following error: (message) NDMP backup failure NDMP credentials are not defined in EMM NDMP operation does not support multiple inline copies 521 522
99 2006 2096
NetBackup status codes and messages About NetBackup status codes and messages
411
network connection broken network connection timed out network read failed network write failed no active policies contain schedules of the requested type for this client no active policies in the configuration database are of the correct client type No authorization entry exists in the auth.conf file for username username. None of the NB-Java applications are available to you. no BRMComm to join no connection to reader no drives are available for this job no drives available to start the reader process no drives available to start the writer process no entity was found no files specified in the file list No FT servers for this client are running no images duplicated no images were found to synthesize no images were successfully processed no media ejected for the specified vault session no media is defined for the requested NB database backup no message was received from bptm
246
502
823 611 2001 617 634 227 112 2093 308 607 191 327 121 629
No ports available in range (port_number) through (port_number) per 518 the NBJAVA_CLIENT_PORT_WINDOW configuration option no profile was specified 274
412
NetBackup status codes and messages About NetBackup status codes and messages
no robot on which the media can be read no schedules of the correct type exist in this policy no storage units available for use no target storage unit specified for the new job no target storage unit was specified via command line no vault session id was found none of the files in the file list exist none of the requested files were backed up not all requested files were restored number of cleanings is invalid number of media has exceeded the capacity of MAP one or more errors detected during consolidated eject processing operation not allowed during this time period operation requested by an invalid server operation would cause an illegal duplication permission denied by client during rcmd pipe close failed pipe fgets call from bpcoord failed premature eof encountered problems encountered during setup of shared memory process called but nothing to do process was killed by a signal profile already exists query for list of component images failed RB communication error
NetBackup status codes and messages About NetBackup status codes and messages
413
RB disk volume mount failed RB disk volume mount must retry RB internal error RB invalid argument RB max reallocation tries exceeded RB media reservation not found RB media server mismatch RB operator denied mount request RB user cancelled resource request RB was reset read from input socket failed reader failed received an error from bptm request to suspend media received an error from bptm request to un-suspend media received error notification for the job report requested without eject being run request attempted on a non reserved port request needs to pend requested media id is in use, cannot process request requested slot is empty Required drive or drive path is not configured required or specified copy was not found required value not set resource request failed resource request timed out
414
NetBackup status codes and messages About NetBackup status codes and messages
retry nbrb request later Retry the allocation request later robot already exists robotic library is down on server schedule windows overlap scheduler found no backups due to run SCSI reservation conflict detected send buffer is full server backup restore managers network is unreachable Server Group Active State is not valid Server Group Already Exists Server Group Already Exists with a different type Server Group does not exist Server Group is in use Server Group Type is Invalid server is not licensed for the Remote Client Option server is not the master server server name not found in the bp.conf file server not allowed access SERVER was not specified in /usr/openv/netbackup/bp.conf Session id assignment failed Session id file is empty or corrupt Snapshot error encountered socket close failed
NetBackup status codes and messages About NetBackup status codes and messages
415
Socket connection to the NB-Java user service has been broken. Please 507 retry your last operation. Check the log file for more details. socket open failed socket read failed socket write failed specified device path does not exist specified disk path is not a directory specified file contains no valid entry 21 23 24 122 123 326
specified media or path does not contain a valid NB database backup 127 header specified policy does not exist specified profile not found specified report does not exist specified schedule was not found storage unit characteristics mismatched to request Storage Unit group does not exist in EMM configuration storage unit is disabled since max job count is less than 1 Storage unit is down Storage unit is not compatible with requesting job storage unit query failed Storage units are not available STU cannot run Lifecycle backups STU cannot run VMWare backup suspend requested by administrator system call failed system error occurred 639 304 348 640 154 2097 2045 2041 2007 608 2104 2094 2095 157 11 130
416
NetBackup status codes and messages About NetBackup status codes and messages
system error occurred while processing user command tar did not find all the files to be restored tar had an unexpected error tar received an invalid archive tar received an invalid argument tar received an invalid file name tar received no data tar was successful termination requested by administrator termination requested by bpcoord text exceeded allowed length the archive failed to back up the requested files the backup failed to back up the requested files the catalog image .f file has been archived the client is not in the configuration the client type is incorrect in the configuration database the database contains conflicting or erroneous entries The drive needs to be marked as available the entity already exists the file list is incomplete the file name used for the mount request already exists The host is not an active node of a cluster the image was not created with TIR information the library is not ready to eject volumes the machine specified is not a member of the server group specified
NetBackup status codes and messages About NetBackup status codes and messages
417
the maximum number of jobs per client is set to 0 The maximum number of mounts for the disk volume have been exceeded The media affinity group is not defined in EMM the media server reported a system error the requested operation was partially successful the requested operation was successfully completed the required storage unit is unavailable the restore failed to recover the requested files the robotic library is not available the scan host of the drive is not active the server is not allowed to write to the clients filesystems the specified container is not empty the specified mount path will not fit in the allocated space the specified policy does not exist in the configuration database the specified policy is not active the specified policy is not of the correct client type the specified client does not exist in an active policy within the configuration database the specified client does not exist in the specified policy the specified schedule does not exist in an active policy in the configuration database the specified schedule does not exist in the specified policy the specified schedule is the wrong type for this request the TIR information is zero length the vault session directory is either missing or inaccessible
2083 2011 1 0 219 5 2059 2057 189 320 2070 230 247 245 236
239 237
418
NetBackup status codes and messages About NetBackup status codes and messages
there are no active policies in the configuration database there are no volumes to eject there is no available MAP for ejecting there was a conflicting specification third-party copy backup failure this mpx group is unjoinable throttled job count has been reached for the storage unit timed out connecting to client timed out connecting to server backup restore manager timed out waiting for database information timed out waiting for media manager to mount volume timed out waiting for the client backup to start tir info was pruned from the image file unable to accept connection from the reader unable to accept connection from the writer unable to allocate new media for backup, storage unit has none available unable to collect pre eject information from the API unable to connect to bpcoord unable to determine the status of rbak unable to find any storage servers for the request unable to find policy/schedule for image using retention mapping unable to get the address of the local listen socket unable to issue the database query for policy unable to issue the database query for policy information
NetBackup status codes and messages About NetBackup status codes and messages
419
unable to listen and register service via vnetd unable to locate vault directory unable to mount media because its in a DOWN drive or misplaced unable to obtain process id, getpid failed unable to open listen socket unable to open pipe between bpsynth and bpcoord unable to process request unable to process request because the server resources are busy unable to receive response from robot; robot not ready unable to register handle with the reactor unable to send a message to bpcoord unable to send a message to the writer child process
unable to send a start command to a reader/writer process on media 624 server unable to send exit message to the BPXM reader unable to send extent message to bpcoord unable to send extent message to BPXM unable to send start synth message to bpcoord unable to start the writer on the media server unexpected message received unexpected message received from bpcoord unexpected message received from bpsynth unexpected message received from BPXM unexpected message was received from bptm unimplemented error code 661 650 648 656 645 43 643 627 649 630 114
420
NetBackup status codes and messages About NetBackup status codes and messages
unimplemented feature unknown image referenced in the SYNTH CONTEXT message from BPXM unsupported image format for the requested database query Updating of Media Manager database failed user id was not superuser user is not validated to use the server valid archive image produced, but no files deleted due to non-fatal problems validation of synthetic image failed vault already exists vault catalog backup failed vault configuration cache not initialized vault configuration file format error vault configuration file not found vault configuration serialization failed vault configuration tag not found vault core error vault duplication partially succeeded vault duplication was aborted by administrator request vault eject failed vault eject partially succeeded vault eject timed out vault internal error 261 vault internal error 262 vault internal error 286
647 344 294 347 339 259 341 340 281 306 258 287 288 338 261 262 286
NetBackup status codes and messages About NetBackup status codes and messages
421
vault XML version mismatch vmchange api_eject command failed vmchange eject verify not responding VxSS access denied VxSS authentication failed VxSS authorization failed VxSS authentication is requested but not allowed VxSS authentication is required but not available write on output socket failed You are not authorized to use this application zero extents in the synthetic image, cannot proceed
422
NetBackup status codes and messages About NetBackup status codes and messages
Chapter
About media and device management status codes and messages Media and device management messages
424
Media and device management status codes and messages About media and device management status codes and messages
1 2
Enable legacy debug logging by creating the necessary directories and folders. Increase the level of verbosity for media and device management processes by adding the VERBOSE option in the vm.conf file. This file is located in /usr/openv/volmgr/ (UNIX and Linux) and install_path\Volmgr\ (Windows). Restart the daemons and services or run the commands verbose option, if available. The term media server, as distinct from master server or server, does not apply to the NetBackup Server product. In this case, the media server is the master server. When you troubleshoot a Server installation, ignore any references to media server. (This does not apply to NetBackup Enterprise Server.)
Check for other error messages in the command or the interface output to determine which system call failed. See Using debug logs on page 423. Check the system application log for error and warning messages. Verify that the system is not running out of virtual memory. If virtual memory is the problem, shut down unused applications or increase the amount of virtual memory.
Media and device management status codes and messages About media and device management status codes and messages
425
Display the Control Panel. Double-click System. On the Performance tab, set Virtual Memory to a higher value.
Verify that all product binaries are properly installed. Verify that no unexpected media and device management processes are in operation by running vmps. Some processes are expected to continue running. Others that continue to run can indicate a more serious problem, such as a hung system call.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Check the usage statement for expected usage and compare with the parameters being sent to start the new process. Verify that all media and device management binaries are at a compatible version level.
426
Media and device management status codes and messages About media and device management status codes and messages
Explanation: A requested operation cannot be processed because resources were busy. Recommended Action: Check the status of any resources that the requested operation uses. On a robotic inventory request, verify that the inventory operation completes within a reasonable time.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Identify the target components (for example, vmd, nbemm, and robotic processes on local or remote hosts). Then verify that all media and device management binaries are at compatible version level.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that the media ID, where requested, is not blank.
Media and device management status codes and messages About media and device management status codes and messages
427
Ensure that the specified media IDs contain valid characters only: alphanumeric characters, the period (.), the plus sign (+), and the underscore (_). A hyphen (-) is also a valid character when not the first character in the media ID. If media are specified to be ejected from a library, ensure the following: they exist in the EMM database and are associated with the correct robot number. Ensure that the media ID is from 1 to 6 characters in length. Ensure that a valid media and seed were specified. If the operation is an inventory request for an ACS robot, use the robtest utility to verify the following: the ACS interface returns cleaning media IDs both in the query volume list and in the query cleaning volume list.
If you run a robot inventory on a robot of type ACS, TLH, or TLM, ensure the following: the installed version of NetBackup supports and recognizes the vendor media type that the robot control software returns. If using a command line interface directly, verify that a valid media type has been passed, according to vmadd(1m) command line documentation. Ensure that an operation valid only for cleaning media has not been requested on a media ID that does not correspond to cleaning tape. Ensure that the media type in all bar code rules is a valid media type or the ordinal zero (0), to represent the default media type.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that the bar code, where requested, is not blank. Ensure that the specified bar codes contain valid characters only: alphanumeric characters, and the period (.), plus sign (+), and underscore (_). A hyphen (-) is also a valid character when not the first character in the media ID.
428
Media and device management status codes and messages About media and device management status codes and messages
Ensure that the number of characters in the bar code does not exceed the maximum that is allowed for the robot type. Ensure that the bar code tag in all bar code rules is a subset of a valid, supported bar code format.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Specify a robot type that supports the volumes media type. Check the EMM database and ensure that the specified robot type agrees with the type for all volumes having the specified robot number. If a robot type is required for the requested operation, ensure that a robot type has been specified.
Specify a robot number in the range of 0 to 32767. If you run vmphyinv, the global device database may not be updated, or the specified robot number may not be configured.
Media and device management status codes and messages About media and device management status codes and messages
429
It is not specified. It is not valid for the given robot type. It is not in an acceptable format. It exceeds the allowed length of a robot control host name. Examine command output, debug logs, and system logs for a more detailed message on the error See Using debug logs on page 423. If possible, attempt the requested operation again with another user interface that supports the type of request.
Is not specified Is not in an acceptable format Exceeds the allowed length of a volume group name Examine command output, debug logs, and system logs for a more detailed message on the error See Using debug logs on page 423. Specify a volume group where one is required to ensure that it contains the following:
430
Media and device management status codes and messages About media and device management status codes and messages
Explanation: A requested operation encountered a missing or out-of-range robot slot number. Or a move by volume group residence was attempted when the volume did not originate from a valid robotic library slot.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Specify a slot number (robot coordinate 1) where required and ensure that the number is within the allowable range for the given robot type.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Specify a valid EMM database host on which the following is running: a version of nbemm (the NetBackup Enterprise Media Manager) or an operator request daemon or process that supports the requested operation.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
Media and device management status codes and messages About media and device management status codes and messages
431
Retry the operation and examine the logs. Ensure that no whitespaces are embedded in the fields that do not allow embedded whitespace.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. This error can occur if a cold catalog backup is in progress. Retry the request after this operation has completed.
432
Media and device management status codes and messages About media and device management status codes and messages
Recommended Action: Determine why the directory /usr/openv/tmp (UNIX and Linux) or install_path\tmp (Windows) cannot be created. Check to see what account nbpushdata was run under. Compare it against the security properties of the database folder.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. The EMM database may be corrupted. Restore an older EMM database from a saved version or from catalog backups.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
Media and device management status codes and messages About media and device management status codes and messages
433
The EMM database may be corrupted. Restore an older EMM database from a saved version or from catalog backups.
Examine the daemon and reqlib debug logs for a more detailed message on the error. See Using debug logs on page 423. When you add volumes to the EMM database, specify a media ID that is unique. If you run vmphyinv, there may be two or more media in the tape library with the same media ID.
Examine the daemon and reqlib debug logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that volumes are configured properly on the EMM server that matches the EMM server configured for the robot or set of stand-alone drives. Use tpconfig -d to list the configured EMM server.
434
Media and device management status codes and messages About media and device management status codes and messages
Do the following so the volume query can find a matching volume: update the volume or the device configurations, specify the correct EMM server, modify volume properties, or adjust search criteria. If you run vmphyinv, none of the media satisfy the search criterion. As such, vmphyinv cannot inventory the tape library.
Examine command output (if available) and the daemon and reqlib debug logs for a more detailed message on the error. See Using debug logs on page 423. Query or sort volume records by bar code to identify the volume entry with the duplicate bar code.
Examine command output (if available) and the daemon and reqlib debug logs for a more detailed message on the error. See Using debug logs on page 423. Query or sort volume records by slot number to identify the volume entry with the duplicate robotic coordinate. Change (update or move volume) or delete the existing volume entry if it does not reflect the following: the correct robotic coordinate corresponding to the volumes storage position in the robotic library. If a volume is currently in a drive, the EMM database should still reflect the volumes home slot.
Media and device management status codes and messages About media and device management status codes and messages
435
Examine command output (if available) and the daemon and reqlib debug logs for a more detailed message on the protocol error. See Using debug logs on page 423. Verify that the server being connected to is operational.
Verify that the correct version of software is running on all servers. Examine command output (if available) and the daemon and reqlib debug logs for a more detailed message on the protocol error. See Using debug logs on page 423. Retry the operation and examine the logs. Ensure that no embedded whitespaces exist in the fields that do not allow embedded whitespace.
Examine command output (if available) and the daemon and reqlib debug logs for a more detailed message on the error. See Using debug logs on page 423.
436
Media and device management status codes and messages About media and device management status codes and messages
Identify the robotic process: look at the robot type and at the robot host on the robotic request or the robot host field of the volume being operated on. Verify that the robotic process to use for robotic control is available See Table B-2 on page 695. If necessary, start the robotic process. Ensure that only one configured robot control host exists for each TL8, TLD, and TLH robot. Also ensure that all volumes in the volume configuration have a robot host that matches the configured robot control host. Change the volumes or reconfigure the robot in the device configuration as needed. Check the system log on the robot control host to see if the robotic process processes requests when connections to it are attempted.
Examine command output (if available) and the daemon and reqlib debug logs for a more detailed message error. See Using debug logs on page 423. Identify the robotic process: look at the robot type and at the robot host on the robotic request or the robot host field of the volume being operated on. Verify that the robotic process to use for robotic control is available and that it handles requests. Identify the robot control host by checking the device configuration. Only one configured robot control host should exist for each TL8, TLD, and TLH robot. All volumes in the volume configuration should have a robot host that matches the configured robot control host. Check the system log on the robot control host to see if the robotic process processes requests when communications with it are attempted.
Examine command output (if available) and the daemon and reqlib debug logs for a more detailed message on the error.
Media and device management status codes and messages About media and device management status codes and messages
437
Identify the targeted robotic process: look at the robot type and at the robot host on the robotic request or the robot host field of the volume being operated on. Verify that the robotic process to use for robotic control is available and that it handles requests. See Media and device management components on page 693. Identify the robot control host by checking the device configuration. Only one configured robot control host should exist for each TL8, TLD, and TLH robot. All volumes in the volume configuration should have a robot host that matches the configured robot control host. Check the system log on the robot control host to see if the robotic process handles requests when communications with it are attempted. See Resolving network communication problems on page 40.
Verify that the correct version of software is running on all servers. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Retry the operation and examine the logs. Ensure that no embedded whitespaces exist in the fields that do not allow embedded whitespace. Check the system log on the robot control host for errors that the robotic software logged.
438
Media and device management status codes and messages About media and device management status codes and messages
No action is needed if the administrator aborted the change operation. Examine command output (if available) and the daemon and reqlib debug logs for a more detailed message on the error. See Using debug logs on page 423.
No action is needed, unless the volumes that were requested to be deleted were not in fact deleted. Examine command output (if available) and the daemon and reqlib debug logs for a more detailed message on the error. See Using debug logs on page 423.
No action is needed unless the volumes that were requested to be inserted were not inserted. Examine command output (if available) and the daemon and reqlib debug logs for a more detailed message on the error. See Using debug logs on page 423.
Media and device management status codes and messages About media and device management status codes and messages
439
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Check the usage statement for expected usage and compare with the parameters being sent to start the new process.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that change volume residence requests are not sent to vmd on a system that runs an older, incompatible software version level. (The change volume residence requests are requests with eject for the robot type that is involved with a newer release version level.)
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that change volume residence requests are not sent to vmd on a system that runs an older, incompatible software version level. (The change volume residence requests are requests with inject for the robot type that is involved with a newer release version level.)
440
Media and device management status codes and messages About media and device management status codes and messages
Explanation: A robotic-related request was made specifying a media movement option that not all affected software components supports.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that the robotic request is sent to a system that runs a release version of software that supports the particular request.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that robot numbers are unique for each physical robot in the EMM database. Delete and re-add a robot. Use a unique robot number if duplicate robot numbers are in use. Use a media management interface to identify robot numbers currently in use for all volumes in the volume configuration. If you use a command line interface, specify the correct robot type for the robot number that is associated with the request.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that the specified robot number and volume group are compatible. If volumes in the volume group have a given robot number, then volumes with a different robot number cannot be added to that volume group. Volumes cannot be moved directly from one robotic volume group to another robotic
Media and device management status codes and messages About media and device management status codes and messages
441
volume group. The intermediate steps (some volume entries are changed, some are not) would cause a conflict with robot numbers. Choose a different volume group on the request, or let the volume group be selected automatically. Volume group selection depends on the specific interface being used.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. From catalog backups or another source if available, restore an earlier version of the database file: /usr/openv/volmgr/database/volDB (UNIX and Linux) or install_path\Volmgr\database\volDB (Windows). Then restart vmd.
Examine the daemon debug log for a more detailed message on the system error.
442
Media and device management status codes and messages About media and device management status codes and messages
If another process has the port, use other system commands to determine the process. Based on the result, either change the port number in your services file or map, or terminate the process that has acquired the port. UNIX and Linux only: Another possible cause for this error is the use of the kill command to terminate vmd. To stop vmd, the recommended method is to use the Terminate Media Manager Volume Daemon option on the Special actions menu in vmadm. (Or use the equivalent command line request, vmctrldbm -t). The use of the kill command to stop this process can leave it unable to bind to its assigned port the next time it restarts. When the socket problem occurs, the daemon debug log contains lines similar to the following:
unable to obtain bound socket, Address already in use (125)
Examine the daemon debug log for a more detailed message on the system error. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Obtain the specific system call failure from the debug log, and investigate the operating system functionality that is related to the failure.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Robot inventory update must be initiated only on the host where the robotic control and drives are configured.
Media and device management status codes and messages About media and device management status codes and messages
443
Explanation: A request is made to add or change volumes in the volume configuration, or to issue a robot inventory update request. A specified robot host differs from the robot host for other volumes in the same robot (defined as those volumes having the same robot number). All volumes in the EMM database that have a given robot number (for instance, 0) must have the same robot host name.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Specify the robot host in the device configuration to be the same case-sensitive host name on all hosts where the robot is configured. Re-issue the request. As an alternative, use move-by-volume group to move all volumes logically from a robotic volume group to stand-alone and then back into the robot. Specify the robot host as the host name that is used in the robot configuration. Then re-issue the request.
Examine interface output and debug logs for a more detailed message error. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
Examine interface output and debug logs for a more detailed message error. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
444
Media and device management status codes and messages About media and device management status codes and messages
Retry the operation (or try to use a different media management interface) and examine the logs.
Examine the daemon debug log for a more detailed message on the system error. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
Check for data integrity or consistency problems in the EMM database by using a media management interface. Delete or move volume entries so that the volume group issues are corrected.
Examine the daemon debug log for a more detailed message on the system error. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Obtain the specific send or write system call failure from the debug log, and investigate the operating system functionality that is related to the failure. Use the following steps to check whether the command or the application interface that sends the request aborts prematurely: enable reqlib debug logs, retry the operation, check the debug logs, and observe application interface output.
Media and device management status codes and messages About media and device management status codes and messages
445
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Check to see if volumes are defined in the EMM database. They may be defined in the EMM database, which is associated with a slot number zero that may not exist in the robot. Run a robot inventory Show contents of robot report and observe the starting slot number. If the starting slot number is one and a volume is defined in the robot at slot zero, delete the volume entry. Or move it to stand-alone so that the remaining available media slots can be used.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Check to see whether the vmd or the oprd process continues to run once it receives the connection from the requestor. Run netstat -a or an equivalent socket diagnostic utility. Check the following to see if the server process is hung up: the daemon debug log on the server-side system and the process status of vmd or oprd.
On the host where vmd is the recipient of the connection, verify that the daemon or the service is running. (The host is the Media Manager host, the Device Host, or the EMM server.) If the daemon or the service is not running, start it. On Windows, vmd is the NetBackup Volume Manager service.
446
Media and device management status codes and messages About media and device management status codes and messages
If vmd is already running, examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that the correct host names are defined in the configuration. Check the services file. On UNIX and Linux, verify that the /etc/services file (and NIS services if NIS is used) has entries for the vmd service. (Note that the vmd service always starts oprd.) On Windows, verify that the %systemroot%\system32\drivers\etc\services file has the correct entry for vmd. Also verify that the vmd port number in the services file agrees with the port number configuration. The port number is noted in the man page for vmd(1M). Verify that all operating system patches or service packs are installed. Ensure that the Media Manager configuration is not tuned so that the load on vmd exceeds its ability to service requests. Look for entries in the vm.conf file that increase the load. Consider placing the EMM database on a higher performance server and file system if performance is an issue. To reduce the number of volumes in the volume configuration, use inventory filtering for the robot types that support it Check utilities such as ipcs -a to ensure that shared memory functions properly. The oprd process may not respond because it cannot attach to shared memory.
Examine command output (if available) and the daemon and reqlib debug logs for a more detailed message on the error. See Using debug logs on page 423. Identify the system where vmd is running. The system usually is termed the Media Manager host or EMM server and defaults to the local system in some user interfaces (such as vmadm). Possible causes for the error are high network load, missing operating system patches or service packs, or unexpected vmd process failure.
Media and device management status codes and messages About media and device management status codes and messages
447
Examine command output (if available) and the daemon and reqlib debug logs for a more detailed message on the error. See Using debug logs on page 423. Identify the system where vmd is running. The system is usually termed the Media Manager host or EMM server and defaults to the local system in some user interfaces (such as vmadm). Possible causes for the error are high network load, missing operating system patches or service packs, or unexpected vmd process failure. Also, the socket read may have failed because the requested operation did not complete within a specified time period. The robotic process and vmd interactions can affect some requests to vmd; check the system log for errors on the robotic control host.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that all Media Manager and user interface binaries are at a compatible version level.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
448
Media and device management status codes and messages About media and device management status codes and messages
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that all Media Manager and user interface binaries are at a compatible version level.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Obtain the specific system call failure from the debug log, and investigate the operating system functionality that is related to the failure. Run the hostname system command to see if the command operates correctly.
Examine the daemon and reqlib debug logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that volumes are properly configured in the EMM database. Use tpconfig -d to list the configured EMM server. Select the current server (the one being administered) to be the same as the host, which is the correct EMM server. Do the following so that the volume query can find a matching volume: update the volume or the device configurations, modify volume properties, or adjust search criteria as needed. For media in their correct slot locations, run the Rescan or the update bar code request so the following occurs: the bar code
Media and device management status codes and messages About media and device management status codes and messages
449
field in the volume configuration matches the actual bar code as interpreted by the robotic library bar code reader.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that volumes are properly configured in the EMM database. Use tpconfig -d to list the configured EMM server. Select the current server (the one being administered) to be the same as the host which is the correct EMM server. Perform the following so the volume residence query can find a matching volume: update the volume or the device configurations, modify volume properties, or adjust search criteria as needed.
A Media Manager volume record belongs to a different robot with the same media ID as the media ID that the tape header read. The media type or media GUID or the volume pool of an assigned volume record needs to be changed. A bar code conflict is detected and vmphyinv needs to change the bar code of the existing volume record.
Recommended Action: vmphyinv, in such a scenario, generates a list of errors. Examine the output. You must resolve all these errors before you run the utility again.
450
Media and device management status codes and messages About media and device management status codes and messages
volume groups. All volumes in a volume group are required to have the same residence, which includes having the same robot type. A requested operation may have tried to associate the special No Volume Group name "---" with a robotic residence.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that the specified robot residence and volume group are compatible with other volumes in the volume configuration that are in the specified volume group. Do not move volumes in the special No Volume Group name "----" to a robotic residence without moving them to a new or auto-generated volume group. Choose a different volume group on the request, or let the volume group be automatically selected. Volume group selection depends on the specific interface being used.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that the specified robot residence and volume group are compatible with other volumes in the volume configuration that are in the specified volume group. Do not try to add volumes for a robot host by using a different form of the robot host name. For example, "acme" is not the same as "acme.symantec.com." Use the same host name that is used for other volumes in the volume group. If the robot host needs to be changed for volumes in a volume group, do the following: use a single move volume group request (available only in certain media management interfaces) to move the volume group to stand-alone residence. Then move the volume group back to the robotic residence. Specify the robot control host that you want to be associated with the new volume group.
Media and device management status codes and messages About media and device management status codes and messages
451
Initiate vmd on the local host only, by logging on to the host where vmd needs to run Start vmd on that host. On UNIX and Linux, run /usr/openv/volmgr/bin/vmd [-v]. On Windows, start the NetBackup Volume Manager service in Services of the system Control Panel. If more information is needed to explain the problem, examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Make sure port numbers are consistent.
Restart the service at a later time and investigate the system problems that limit the number of processes. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
452
Media and device management status codes and messages About media and device management status codes and messages
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Check the permissions on the vmcheckxxx, vmupdate, and oprd binaries, and (on Windows only) the rdevmi installed binary.
Media and device management status codes and messages About media and device management status codes and messages
453
terminated, remove the lock file. The lock file is /usr/openv/volmgr/misc/vmd.lock (UNIX and Linux) or install_path\Volmgr\misc\vmd.lock (Windows). Then restart vmd.
Volumes in scratch pools cannot be assigned until they are first moved to another pool. Volume pool numbers cannot be negative. Volume pool names must consist of from 1 to 20 printable ASCII characters with no embedded whitespace. The None volume pool is the only valid pool for the bar code rule entries that specify cleaning a media type. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that the specified volume pool does not violate the requirements noted. Use the vmpool command to display the pool information. Use the vmrule command to display the bar code rule information. Add or change volume pools and bar code rules as needed to rectify inconsistencies in cases where the databases are inconsistent or corrupted.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Assigned volumes cannot be deleted. If no worthwhile data is on the volume, unassign the media by using the appropriate application interface (which is bpexpdate for NetBackup). Then retry the delete volume request.
454
Media and device management status codes and messages About media and device management status codes and messages
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Do not try to manually assign any volumes that are already assigned, because it is not valid except for one condition: you can assign volumes for NetBackup catalog backups if the volume is already assigned for NetBackup catalog backups. Always use barcodes that are unique in the six least significant characters, across all media in all robots. Or use media ID generation rules to ensure that unique media IDs are generated in a robot inventory update.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. When you assign volumes manually, specify the volume pool that is associated with the volume. Always use barcodes that are unique in the six least significant characters, across all media in all robots. Or use media ID generation rules to ensure that unique media IDs are generated in a robot inventory update.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. When you use robot inventory update to make changes to the volume configuration, do the following: ensure that all volumes of the same physical cartridge type (for example, 3590J in TLH robots) map to a single media type
Media and device management status codes and messages About media and device management status codes and messages
455
such as HCART. This check ensures that all media in the robotic library can be mounted on drives with a compatible drive type.
When you assign volumes manually, specify the media type that is associated with the volume. Always use bar codes that are unique with respect to the six least significant characters, across all media in all robots. Or use media ID generation rules to ensure that unique media IDs are generated when you use robot inventory update.
On Windows, do the following when you auto-configure devices or initiate the NetBackup Device Manager service from a graphical or a command line interface: ensure that the service is not disabled in the system services configuration. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. In general, any device management-related errors that occur on a particular host accompany operator request daemon or process and remote device management errors. Check for errors in the following: the debug and the system or the application logs on the host where oprd was started or where it is running. The host is often a targeted device host or scan host. The kinds of failed requests that oprd services can include the following:
Down, up, or reset drives Change drive comments Deny or resubmit mount requests Assign drives Start or stop ltid Obtain ltid status Display drive status Manage pending actions Set NDMP attributes Configure devices
456
Media and device management status codes and messages About media and device management status codes and messages
Clean drives Obtain host version and device configuration information Scan shared drives
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. List the configured bar code rules in the EMM database. Adjust the bar code tag that is specified on the change or the delete request or on the targeted host. Then the bar code rule is found when the request is retried.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that the specified media type and volume group are compatible with other volumes in the volume configuration that are in the specified volume group. Choose a different volume group on the request, or let the volume group be automatically selected. Volume group selection depends on the interface being used.
Media and device management status codes and messages About media and device management status codes and messages
457
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Use vmpool to investigate the integrity of the volume pool database. The daemon debug log file should indicate the expected number of fields and the found number of fields in the pool record. Restore a saved version of the pool database if the pool database cannot be manually corrected.
Examine the daemon debug log for a more detailed message on the system error. See Using debug logs on page 423. Examine the permissions and available file system space for writing to the database: on UNIX and Linux, /usr/openv/volmgr/database/poolDB; on Windows, install_path\Volmgr\database\poolDB.
Add, change, delete, or query volume pool Add or change bar code rule Add or change volume
458
Media and device management status codes and messages About media and device management status codes and messages
Query scratch volumes Robot inventory report or update Examine the daemon and reqlib debug logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that volumes are properly configured on the EMM server. Use the tpconfig -d command to list the configured EMM server. Select the current server (the one being administered) to be the same as the host which is the correct EMM server for a targeted device. Do the following so the requested operation can find the requested volume pool: update the volume or the device configurations, modify volume properties, or adjust search criteria as needed. Investigate inconsistencies between the EMM database and the volume pool database, and restore or correct those databases from a previous state as needed.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Use a media management interface to query for the volumes that are associated with the pool specified for deletion. Ensure that all volumes in a volume pool are associated again with another pool before trying to delete the volume pool. Use change volume operations to change the volume pool for a volume.
Media and device management status codes and messages About media and device management status codes and messages
459
Explanation: A request was made to change the media expiration for one or more volumes in the volume configuration, but the date specified was not valid. Recommended Action: When you change the media expiration, provide the date in the format that the media management interface documentation specifies.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Specify a maximum mounts value within the range of 0 to 2,147,483,647.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Extend the active life of the physical media: change the volume expiration date to a future date in relation to the current system date or time. Alternatively, replace the media with other media that still contains useful life. Check the system date and time and reset it as needed.
Examine command output, debug logs, and system logs for a more detailed message on the error.
460
Media and device management status codes and messages About media and device management status codes and messages
Extend the active life of the physical media: increase the volumes maximum number of mounts or set the maximum number of mounts to infinite. Alternatively, replace the media with other media that still contains useful life.
If the volume is a cleaning cartridge, perform a valid operation such as changing the number of cleanings that remain for the cleaning cartridge. If the volumes media type cannot be determined, examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. If the targeted volume is incorrectly configured as a cleaning tape, delete the cleaning volume. Then update the volume configuration using options to define a new volume with the appropriate media type.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
Media and device management status codes and messages About media and device management status codes and messages
461
Use vmrule to investigate integrity of the bar code rule database. The daemon debug log file should indicate the number of expected fields and the number of found fields in the bar code rule record. Restore a saved version of the bar code rule database if the bar code rule database cannot be manually corrected.
See the NetBackup Security and Encryption Guide for information on vmd security. vmd security is based on NetBackup authentication or authorization, but has extensions for handling SERVER entries in the Media Manager configuration file. Examine the debug log files for a more detailed message on the authentication or the authorization problem. See Using debug logs on page 423. Correct the vmd security configuration by adjusting the authentication configuration, the AUTHORIZATION_REQUIRED entry, and SERVER entries.
462
Media and device management status codes and messages About media and device management status codes and messages
Check the methods_allow.txt files on the systems that have problems to ensure that authentication is enabled. The files are in the following locations: Windows: install_path\NetBackup\var\auth UNIX and Linux: /usr/openv/var/auth
On the systems that have the authentication problem, remove the remote host that is not authenticated from the methods_allow.txt file. For example, if Host A and Host B have the problem, remove Host A from the file on Host B, and vice versa. Retry the operation. If the problem still exists, the connection problems are not related to authentication. If connections are successful, proceed to the next step. Run bpauthsync -vopie on the master server to synchronize the key files on the systems again. On Windows:
install_path\NetBackup\bin\admincmd\bpauthsync -vopie -servers
Media and device management status codes and messages About media and device management status codes and messages
463
Examine the daemon and reqlib debug logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that the drive name is from 1 to 48 ASCII characters in length. The following special characters are allowed: period (.), plus (+), minus (-), and underscore (_). Verify that the correct version of software is running on all servers.
Independent schedulers or applications access the same pool of drives Hardware or media errors cause some drives that are allocated to jobs to become unavailable.
464
Media and device management status codes and messages About media and device management status codes and messages
Check the system log and application (bptm) debug log to determine if hardware or media errors have caused drives to become unavailable. If more information is needed on the drive reservation problem, examine the following for a more detailed message on the error: command output, debug logs, and system logs. See Using debug logs on page 423.
Media and device management status codes and messages About media and device management status codes and messages
465
Host A becomes unavailable for some time, unable to communicate with other hosts. Host B determines that the host having the reservation (Host A) is no longer available. Host B then makes a request to the EMM/DA denoting Host A as unavailable. Some other host (such as Host A or Host C) reserves the drive. The host that originally owned the drive reservation tries to release the drive.
Recommended Action: Correct the network or the process problem that led to the communications problem. Ensure that unique non-duplicate strings are used for host names and for SSO_HOST_NAME configuration file entries.
Host A reserves a shared drive. Host A becomes unavailable for some time, unable to communicate with other hosts. Host B determines that the host having the reservation (Host A) is no longer available. Host B then makes a request to the EMM/DA denoting Host A as unavailable. The host that originally owned the drive reservation tries to release the drive.
Recommended Action: Correct the network or the process problem that led to the communications problem. Ensure that unique non-duplicate strings are used for host names and for SSO_HOST_NAME configuration file entries.
466
Media and device management status codes and messages About media and device management status codes and messages
The EMM server was stopped and restarted. This situation is automatically handled, because the requesting host re-registers its drives with the EMM server when this error is encountered. A host was unregistered with the EMM server, and another host declared the host to be unavailable.
Recommended Action: If the host was declared unavailable, determine whether it should be available. Correct the underlying network problems or restart ltid (the device daemon on UNIX and Linux or NetBackup Device Manager service on Windows).
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Identify the targeted host. Verify that all Media Manager binaries on that host are at a compatible version level with other hosts that are part of the configuration. Update the software version as needed.
Media and device management status codes and messages About media and device management status codes and messages
467
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Specify a number of mounts value within the acceptable range of 0 to 2,147,483,647.
468
Media and device management status codes and messages About media and device management status codes and messages
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
Media and device management status codes and messages About media and device management status codes and messages
469
Identify the target components (for example, vmd and daemons or services, or user interfaces on local or remote hosts). Verify that all Media Manager binaries are at a compatible version level. Depending on which requests encountered the invalid version, determine whether or not the database is corrupt. Use an appropriate interface to query for the type of information that is involved in the error condition.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Specify an off-site slot value within the range of 0 to 2,147,483,647.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Specify an off-site session ID within the range of 0 to 2,147,483,647.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. List the license keys that were installed and verify that the referenced functionality is supported with the currently installed license keys. Check to see that the databases that define externalized object types are in place and not corrupted. These database files are the following, in the
470
Media and device management status codes and messages About media and device management status codes and messages
Restrict the size of the SSO configuration to no more than 255 hosts. Break up the media and the device management domain into multiple domains, with all domains having 255 or fewer hosts that register shared drives.
Coordinate changes to the device configuration so that changes come from a single source. Investigate the global device database changes on the server (database) side by examining the daemon debug log file for a more detailed error message.
Media and device management status codes and messages About media and device management status codes and messages
471
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Identify the targeted host. Verify that all Media Manager binaries on that host are at a compatible version level with other hosts that are part of the configuration. Update the software version as needed.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Identify the targeted host. Verify that all Media Manager binaries on that host are at a compatible version level with other hosts that are part of the configuration. Update the software version as needed.
472
Media and device management status codes and messages About media and device management status codes and messages
Recommended Action: This error code may appear for a number of reasons. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
Examine the systems application log, the Removable Storage system interface, and the daemon and reqlib debug logs for a more detailed message on the error. See Using debug logs on page 423. Retry the operation and examine the logs. From the daemon debug log file, determine the media ID that has the invalid RSM GUID. Make sure that the software components are compatible.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that the move request was not sent to a robotic control daemon or process on a system that runs an older, incompatible software version. Ensure that the targeted robotic control daemon or process operates normally.
Media and device management status codes and messages About media and device management status codes and messages
473
Examine command output (if available) and the daemon and reqlib debug logs for a more detailed message on the error. See Using debug logs on page 423. From the daemon debug log file, determine the volume that has an ADAMM GUID conflict with the volume entry that is added or changed.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Run the tpconfig -d or vmquery -a command to verify that the EMM server is actively running and that it accepts new connections.
474
Media and device management status codes and messages About media and device management status codes and messages
Recommended Action: Use the scratch pool already defined, or delete the current scratch pool and create a new scratch pool.
Media and device management status codes and messages About media and device management status codes and messages
475
Verify that you specified the correct EMM server (the -h option on the command line). If you did not specify the database host, the command line defaults to the local host, while the console uses the currently administered host. Contact the administrator in charge of this configuration and verify that the host was intentionally blocked from being a database host. If not, remove the NOT_DATABASE_HOST flag in the hosts vm.conf file. To do so without having to stop and restart the daemons, use: vmquery -h <hoostname> -remove_not_db_host. To add this entry to a host without having to stop and restart the daemons, use: vmquery -h <hostname> -add_not_db_host.
See the NetBackup Security and Encryption Guide for information on Media Manager security. Media Manager security is based on NetBackup authentication and authorization, but has extensions for handling SERVER entries in the Media Manager configuration file. Examine the debug log files for a more detailed message on the authentication and authorization problem. See Using debug logs on page 423. Determine whether authorization fails on vmd. Examine the debug log files for Media Manager status code 126 occurrences ("not authorized to connect to vmd"). Correct the Media Manager security configuration by adjusting the authentication configuration, the AUTHORIZATION_REQUIRED entry, the ENABLE_ROBOT_AUTH entry, and the SERVER entries.
476
Media and device management status codes and messages About media and device management status codes and messages
Check the methods_allow.txt files on the systems that have problems to ensure that authentication is enabled. The files are in the following locations: Windows: install_path\NetBackup\var\auth UNIX and Linux: /usr/openv/var/auth
On the systems that have the authentication problem, remove the remote host that is not authenticated from the methods_allow.txt file and retry the operation. For example, if Host A and Host B have the problem, remove Host A from the file on Host B, and vice versa. If the problem still exists, the error is caused by connection problems not related to authentication. If connections are successful after you remove the host, run bpauthsync -vopie on the master server to synchronize the key files on the systems again. On Windows:
install_path\NetBackup\bin\admincmd\bpauthsync -vopie -servers
Media and device management status codes and messages About media and device management status codes and messages
477
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Retry the operation with a container ID that does not contain invalid characters.
Ensure that the Veritas Security Services is installed and configured. For complete installation instructions, see the Veritas Security Services Installation Guide. Check that both parties have a valid certificate by examining the listed expiry date from a bpnbat -WhoAmI. For example:
478
Media and device management status codes and messages About media and device management status codes and messages
bpnbat -WhoAmI Name: JDOG Domain: MYCOMPANY Issued by: /CN=broker/[email protected]/O=vx Expiry Date: Sep 19 12:51:55 2003 GMT Authentication method: Microsoft Windows Operation completed successfully.
Shows an expiry date of September 19th, 2003. After 12:51:55 GMT this credential is no longer valid and a new credential is required. If you are running from the NetBackup Administration console, close and reopen the console. The console automatically obtains a credential for the currently logged in identity, if possible. By default these certificates are valid for 24 hours. To set a longer default time, consult the following: See the NetBackup Security and Encryption Guide. Ensure that the certificates for both sides either use the same broker, are children of the same root broker, or have trusts established between them. More information is available on broker hierarchies and how to establish trust relationships between brokers. See the NetBackup Security and Encryption Guide. Ensure that connectivity is possible between the physical systems in question. If general sockets cannot connect between the machines (such as ping and telnet), network issues unrelated to NetBackup may be the cause of this problem. Ensure that the system has sufficient swap space and the following directories are not full:
If you use the default groups, make certain that the user attempts to perform an operation appropriate for that group. For example, a member of NBU_Operators is unable to modify policy information, a permission reserved for administrator roles.
Media and device management status codes and messages About media and device management status codes and messages
479
Ensure that the system has sufficient swap space and the following directories are not full:
If you use your own defined groups and permissions, first determine the object with which the operation is associated. Then add the permissions relative to the action. For example, a user is required to up and down drives but does not currently have permission to do so. Verify that the user belongs to the correct authorization group. If necessary, verify that the group has Up and Down permissions on the Drive object within the Group Permission tab. If necessary, you can increase the verbosity level of NetBackup to locate what object and what permissions are required for the failing request. The pertinent lines in the debug logs look similar to the following:
17:19:27.653 [904.872] <2> GetAzinfo: Peer Cred Info. Name: JMIZZLE Domain: MYCOMPANY Expiry: Sep 24 21:45:32 2003 GMT Issued by: /CN=broker/[email protected]/O=vx AuthType: 1 17:19:37.077 [904.872] <2> VssAzAuthorize: vss_az.cpp.5082: Function: VssAzAuthorize. Object NBU_RES_Drives 17:19:37.077 [904.872] <2> VssAzAuthorize: vss_az.cpp.5083: Function: VssAzAuthorize. Permissions Up 17:19:40.171 [904.872] <2> VssAzAuthorize: vss_az.cpp.5166: Function: VssAzAuthorize. 20 Permission denied.
In the example, the user JMIZZLE attempts to perform an operation that requires the Up permission on the Drives object. To diagnose the problem, examine the group(s) to which the user belongs to ensure that the appropriate group includes the Up permission. (Up is a member of the Operate permission set for Drives.)
480
Media and device management status codes and messages About media and device management status codes and messages
Verify that pbx_exchange and nbemm are running. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Run the tpconfig -d or vmquery -a command to verify that the EMM server is actively running and that it accepts new connections.
Verify that pbx_exchange and nbemm are running. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Run the tpconfig -d or vmquery -a command to verify that the EMM server is actively running and that it accepts new connections. For a DA reserve drive request, verify that another host had not already reserved the drive. For a DA release drive request, verify the following: that the drive is DA reserved by the host requesting the DA release and has not already been released.
Media and device management status codes and messages About media and device management status codes and messages
481
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that the proper upgrade order is followed as described in the NetBackup Installation Guide.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that Symantec Private Branch Exchange (VxPBX) processes new requests. Verify that the EMM server processes new requests.
482
Media and device management status codes and messages About media and device management status codes and messages
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that the EMM server processes new requests.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Retry the operation once the media is no longer in use.
Media and device management status codes and messages About media and device management status codes and messages
483
Explanation: A user or process that did not have root privileges (on UNIX and Linux) or administrator privileges (on Windows) started the process. Or the EMM server name cannot be set.
If appropriate, give the user or the process administrator privileges (on Windows) or root privileges (on UNIX and Linux) and reissue the device configuration request. Establish a common EMM server name as follows:
Run tpautoconf -get_gdbhost on other hosts. Set the EMM server name with
tpautoconf -set_gdbhost host_name
The file does not exist. The file is for a different version of NetBackup. You can find what version it is for by reading the header in the file. The file has a corrupted licensing digest.
Recommended Action: Download the latest device mapping file from the Symantec support website at www.support.symantec.com.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that the host name is resolvable.
484
Media and device management status codes and messages About media and device management status codes and messages
Explanation: A request was made to add or change the SCSI port number for a robot, but the SCSI port number provided was not valid.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Specify the correct SCSI port number for the robot. Perform device discovery by using the Device Configuration wizard, or check the Windows registry as needed to obtain the SCSI port number.
Media and device management status codes and messages About media and device management status codes and messages
485
Explanation: One of the Media Manager device configuration commands (tpconfig or tpautoconf) was executed with improper options. Or an incompatibility exists between components or versions of the product.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Check the tpconfig or the tpautoconf usage statement for expected usage and compare with the parameters being sent to start the new process. Verify that all Media Manager binaries are at a compatible version level.
Examine the daemon debug log and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that nbemm is running. Display the device configuration to determine whether or not the database is corrupt. Restore a saved copy of the databases from catalog backups, or delete them and recreate the device configuration as needed.
Display the device configuration to obtain the list of valid drives. Avoid making device configuration changes from multiple sources simultaneously. If more information is needed, examine the daemon debug log and command or interface output for a more detailed message on the error. See Using debug logs on page 423.
486
Media and device management status codes and messages About media and device management status codes and messages
robot number did not correspond to a currently configured robot, or the robotic database is corrupted.
Specify a robot number in the range of 0 to 32767. Ensure that all device configuration changes or deletions are performed on the devices that are currently part of the device configuration. Verify that nbemm is running. Restore a saved copy of the robotic database from catalog backups, or delete it and recreate any needed robotic configuration information.
Ensure that the device is physically connected. Ensure that the operating system is configured to recognize the device and that the operating system can see the device. Ensure that no other process uses the device and that the device is not offline.
Configure all drives for this robotic library with the same drive type. If you use NetBackup Server and want a robotic library with multiple media types, contact Symantec to purchase NetBackup Enterprise Server.
Media and device management status codes and messages About media and device management status codes and messages
487
Explanation: On a request to modify the device configuration, the specified robot type was invalid. Or it did not match the robot type for the robot that is associated with the specified robot number.
Check the device configuration for configured robots, and specify the correct robot type applicable for the device configuration information being updated. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that all Media Manager binaries are at a compatible version level.
To avoid configuring invalid device paths and device names, use the Device Configuration wizard (on supported device discovery platforms). The wizard automatically configures device paths and device names. On Windows hosts, check the operating system configuration or registry for device names and refer to the tpconfig command in NetBackup online help. Information for UNIX and Linux hosts is available See the NetBackup Device Configuration Guide. Always use no-rewind device files for drives that are attached to UNIX and Linux hosts. Check to ensure that the specified device paths exist as character-special files. Check for detailed errors from the command or the user interface output.
488
Media and device management status codes and messages About media and device management status codes and messages
To avoid configuring the device paths that are already in use, use the Device Configuration wizard (on supported device discovery platforms). The wizard automatically configures device paths and device names. Display the device configuration using tpconfig -d or a device configuration interface to see the robotic information that is already configured. Windows hosts have multiple ways to configure robots (changer names or port, bus, target, LUN). Check the operating system configuration or registry for changer names and their associated SCSI paths. Check for detailed errors from the command or the user interface output.
Media and device management status codes and messages About media and device management status codes and messages
489
To avoid configuring a drive index that is already in use, use the Device Configuration wizard (on supported device discovery platforms). The wizard automatically configures the drive index. If you use a device configuration interface that allows the drive index to be specified, do the following: use tpconfig -d to determine the drive indexes already in use on the targeted device host. Then specify a drive index that is not in use.
490
Media and device management status codes and messages About media and device management status codes and messages
Check that you configured the correct robot type. Configure the device with local control by using its local device path.
Media and device management status codes and messages About media and device management status codes and messages
491
To avoid configuring any paths that are already in use, use the Device Configuration wizard (on supported device discovery platforms). The wizard automatically configures the drive paths. Before you make configuration changes, check the existing drive configuration through a device configuration interface. Or run tpconfig -d to determine the drive paths that are already in use on the targeted device host. Then specify a drive path that is not already in use.
Examine the daemon debug log and command or interface output for a more detailed message on the error. See Using debug logs on page 423. Retry the device configuration request and examine the logs.
492
Media and device management status codes and messages About media and device management status codes and messages
Media and device management status codes and messages About media and device management status codes and messages
493
Examine the daemon debug log and command or interface output for a more detailed message on the system error. See Using debug logs on page 423. Retry the operation and examine the logs. One of the following may have occurred:
Lock file problems: The device discovery process sets a lockfile in the /usr/openv/volmgr/misc (UNIX and Linux) or install_path\Volmgr\misc (Windows) directory named tpac.lock. It sets the lockfile to ensure that only one instance of discovery runs on a particular host. It then checks the lockfile before it updates the configuration.
Cannot obtain lockfile. The lockfile may be held by another discovery process. In this case the following error is displayed:
"another tpautoconf is already running"
Use standard OS process tools (ps on UNIX and Linux or Task Manager on Windows) to determine if another tpautoconf process is running. If not, delete the lockfile and re-run device discovery. If another tpautoconf process is running, wait for it to complete before retrying.
494
Media and device management status codes and messages About media and device management status codes and messages
In the case of long device-discovery cycles, the interface may timeout or the user may cancel the process. Part of the timeout or cancellation is to remove the lockfile. This action tells the device discovery process that it should not continue making modifications to the device configuration. If action occurs, run the discovery process again.
Media and device management status codes and messages About media and device management status codes and messages
495
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that an active SSO license is installed on the following: all servers that have a path configured to this drive and the server where this operation is performed.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Use tpconfig -dnh to verify that the host has been configured. Check the usage of the tpautoconf -list_snapvault_volumes command.
496
Media and device management status codes and messages About media and device management status codes and messages
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Use tpconfig -dnh to verify that the host has been configured.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Check the usage of the set_ndmp_attr -probe or tpautoconf -probe commands. An error occurs while trying to get the serial number and inquiry string for a device connected to an NDMP filer. Verify that the device is properly attached to the filer.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Check the usage of the tpautoconf -verify commands. Verify that the device is properly attached to the filer.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that the NDMP option is properly installed and licensed. Verify that NDMP is supported on the platform in question.
Media and device management status codes and messages About media and device management status codes and messages
497
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that the password is appropriate for the media server and filer pair. Verify that the password was provided correctly on the command or in the NDMP Host dialog box.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Use the -update option of tpconfig instead of -add.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Use the -add -nh option on the tpconfig command to add the NDMP host.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
498
Media and device management status codes and messages About media and device management status codes and messages
Verify NAS filer licenses, supported operating system levels, and network connectivity.
Use the tpautoconf -verify command to verify the hostname, username, and password. Use the ping command to verify network access.
Media and device management status codes and messages About media and device management status codes and messages
499
Examine debug logs for more information on the error. See Using debug logs on page 423. Use the tpautoconf -verify command to verify the hostname, username, and password. Use the ping command to verify network access.
500
Media and device management status codes and messages About media and device management status codes and messages
Examine debug logs for a more detailed message on the error. See Using debug logs on page 423. See your NAS vendor documentation for instructions on how to switch NDMP versions.
Media and device management status codes and messages About media and device management status codes and messages
501
Explanation: A request was made to the EMM server, but it either did not reach the EMM server or resulted from a communication failure.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that pbx_exchange and nbemm are running. Run the tpconfig -d or vmquery -a command to verify that the EMM server is actively running and that it accepts new connections.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that pbx_exchange and nbemm are running. Run the tpconfig -d or vmquery -a command to verify that the EMM server is actively running and that it accepts new connections.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that adequate memory is available.
Examine command output, debug logs, and system logs for a more detailed message on the error.
502
Media and device management status codes and messages About media and device management status codes and messages
Check tpconfig usage to ensure that the command is used properly and use tpconfig -update -drpath instead.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. The rule being added already exists. The specified host does not exist in the EMM database. Use the nbemmcmd -addhost command to add the host to the EMM database.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. The rule being updated does not exist. Ensure that the drive name rule is entered correctly. The specified host does not have a local drive name rule configured. Configure a drive name rule.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. A local drive name rule does not exist on the hosts specified.
Media and device management status codes and messages About media and device management status codes and messages
503
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that the hosts are known in the EMM database.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Observe the rules for drive names:
Cannot begin with a dash. Cannot exceed 48 characters. A literal field can only contain alphanumeric characters and plus (+), dash (-), period (.), or underscore (_).
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that adequate memory is available. Verify that Windows networking is properly installed.
504
Media and device management status codes and messages About media and device management status codes and messages
Explanation: An attempt was made to add a device to a host that the EMM database does not recognize.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Use nbemmcmd -addhost to add the host to the EMM database.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Choose a shorter rule.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Retry the operation after the current instance is done. A lock on the device configuration cache may have been orphaned. Use the nbemmcmd command to clear the lock, then retry the operation.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that the added drive has a unique serial number.
Media and device management status codes and messages About media and device management status codes and messages
505
Verify that the user has the correct permissions to perform this operation. Verify that the VxSS settings are correct, under Host Properties in the NetBackup Administration Console. See the NetBackup Administrators Guide Volume I for information on how to use the Veritas Security Subsystem (VxSS)
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. This error can occur if a cold catalog backup is in progress. Retry the request after this operation has completed.
506
Media and device management status codes and messages About media and device management status codes and messages
Explanation: You must first add this disk array host to the NetBackup host configuration before this operation can be performed. To view hosts, see nbemmcmd -listhosts in the NetBackup Commands Guide.
If you try to update an existing hosts credentials, this host no longer exists in NetBackup. It must be added again by using the tpconfig or the nbemmcmd command. The name you entered for the disk array host does not match any of the machine aliases in the NetBackup machine configuration. Use the nbemmcmd command to add the fully qualified array host name (or the name entered) to the machine alias list for your disk array.
Media and device management status codes and messages About media and device management status codes and messages
507
508
Media and device management status codes and messages About media and device management status codes and messages
communications. The error probably indicates a lack of system resources for message queues.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. On UNIX and Linux servers, gather output from the ipcs -a command to see what resources are currently in use.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. On UNIX and Linux servers, gather output from the ipcs -a command to see what resources are currently in use. Investigate whether ltid is tied up in communications with devices or other components.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. On UNIX and Linux servers, gather output from the ipcs -a command to see what resources are currently in use. Investigate whether the robotic daemon
Media and device management status codes and messages About media and device management status codes and messages
509
or process on the local device host is tied up in communications with devices or other components.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. On UNIX and Linux servers, gather output from the ipcs -a command to see what resources are currently in use. Check the installed software components and verify that they are all at a compatible release version.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. On UNIX and Linux servers, gather output from the ipcs -a command to see what resources are currently in use.
510
Media and device management status codes and messages About media and device management status codes and messages
Device Manager service on Windows.) An error was encountered in the message communications. The error probably indicates a lack of system resources for message queues. On Windows, this error can also occur if an internal-system-registered event cannot be opened.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. On UNIX and Linux servers, gather output from the ipcs -a command to see what resources are currently in use.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
Media and device management status codes and messages About media and device management status codes and messages
511
On UNIX and Linux servers, gather output from the ipcs -a command to see what resources are currently in use.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. On UNIX and Linux servers, gather the output of the ipcs -a command to see what resources are currently in use.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Check the application log files (such as the bptm log) to see why the drive may have been configured DOWN. Check the integrity of the drive, drive path, and media.
512
Media and device management status codes and messages About media and device management status codes and messages
Explanation: A drive was selected that has a drive type which is not compatible with the requested density.
Allow the drive selection to be determined automatically. When you select the drive manually, check the device configuration and the valid density table (available in the tpreq man page or command description). Then specify a drive that is compatible with the requested density.
Media and device management status codes and messages About media and device management status codes and messages
513
Recommended Action: Wait until the drive is ready before you manually assign a drive to a pending mount request.
If ltid is not running, start ltid and try the operation again. On UNIX and Linux, run /usr/openv/volmgr/bin/ltid, and on Windows, start the NetBackup Device Manager service. If ltid is already running, check for the existence and permissions of the lock file itself and the lock file directory, which are as follows: /usr/openv/volmgr/misc/.ltipid (UNIX and Linux) or install_path\ Volmgr\misc\.ltipid (Windows). Terminate the ltid process if it is running. Create the lock directory or folder and adjust the permissions as needed so that ltid can obtain the lock. On UNIX and Linux, check the msgget man page and look for suggestions on how to troubleshoot the system message queues.
514
Media and device management status codes and messages About media and device management status codes and messages
Explanation: An attempt was made to reserve a shared drive, but the drive reservation request failed. This status code is related to the internal implementation of the SSO feature. It is not related to SCSI Reserve or Release. Recommended Action: This condition is expected for any shared drives that are retried automatically. If problems persist, verify the following: the EMM server services requests and it does not list drives as reserved to hosts that currently do not use the drives.
Query the volume pool information for the requested volume pool on the host where the mount request was issued by running vmpool -listall -b. Check the system log to obtain the name of the host where the mount request originated. This host name is the one returned by the system hostname(1) command. Change the volume pool host name security with vmpool or another user interface that supports volume pool host attributes. Or change the volume pool that is associated with the volume (if it is not assigned). Or log in to the host that is allowed to use media in the targeted volume pool. Then, submit the mount request again.
Media and device management status codes and messages About media and device management status codes and messages
515
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. On UNIX and Linux servers, gather the output of the ipcs -a command to see what resources are currently in use. Check the installed software components and verify that they are all at a compatible release version.
516
Media and device management status codes and messages About media and device management status codes and messages
Explanation: ltid made an attempt to obtain a semaphore that was used for arbitrating access to shared memory. (ltid is the Media Manager device daemon on UNIX and Linux or the NetBackup Device Manager service on Windows) The request failed due to a system error. The error probably indicates a lack of system resources for semaphores, or mismatched software components.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. On UNIX and Linux servers, gather the output of the ipcs -a command to see what resources are currently in use. Check the installed software components and verify that they are all at a compatible release version.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. On UNIX and Linux servers, gather the output of the ipcs -a command to see what resources are currently in use.
Media and device management status codes and messages About media and device management status codes and messages
517
or request the media with read-only access if the write protection was the cause of the error. If the problem is a cleaning tape with no cleanings remaining, replace the cleaning tape.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Specify a maximum-mounts value within the range of 0 to 2,147,483,647.
To avoid configuring invalid device paths and device names, use the Device Configuration wizard (on supported device discovery platforms). The wizard automatically configures paths and device names for tape drives.
518
Media and device management status codes and messages About media and device management status codes and messages
Always use no-rewind tape device files or recognized drive name syntax (such as for NDMP) for tape drives. Make sure that the specified device paths exist as character-special files. Check for detailed errors from the command or the user interface output. See the NetBackup Device Configuration Guide.
If a cleaning operation was requested, check the tpclean usage statement and compare with the parameters that were specified. Check the installed software components and verify that they are all at a compatible release version.
Ensure that ltid was stopped and restarted after changes were last made to the device configuration. (ltid is the Media Manager device daemon on UNIX and Linux or the NetBackup Device Manager service on Windows.) Display the device configuration (use tpconfig -d or other means) to see the list of valid drives. Specify the drive name or index as appropriate for the interface being used. Check integrity of the EMM database. Display the device configuration to do the following:
Media and device management status codes and messages About media and device management status codes and messages
519
Determine if the database is corrupt. Restore a saved copy of the database file from catalog backups, or remove the devices and recreate the device configuration.
520
Media and device management status codes and messages About media and device management status codes and messages
Explanation: On a request to change a shared drives status, an attempt to establish a connection to the drives scan host failed.
Determine which host serve as the drives scan host: use vmoprcmd output or by checking the Device Monitor in the Administration Console. Ensure that vmd (the NetBackup Volume Manager daemon on UNIX and Linux or NetBackup Volume Manager service on Windows) is running on the scan host. On the scan host, examine debug logs and system logs for any messages that are related to the error. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. The detailed reason for the canceled request should be available in the daemon debug logs on the scan host. Correct the problem and submit the request again if needed.
If problems are encountered as a result of the reported error, check for communication, configuration, and system problems among the associated hosts. To check, use vmoprcmd output or check the Device Monitor in the Administration Console. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Configure scan ability priorities for assigning scan hosts by changing the Media Manager configuration. Configure so that less network connections need to be maintained, and greater system load is placed on hosts with more capability to service the load.
Media and device management status codes and messages About media and device management status codes and messages
521
Recommended Action: When you request a tape mount, ensure that the file name does not exceed 255 ASCII characters in length. If the mount requests come from an application, request an application change to use a shorter file name. Or install the product in a directory or a folder that does not cause the file name limit to be exceeded.
522
Media and device management status codes and messages About media and device management status codes and messages
for a number of reasons: missing or faulty media or the need to allow other, higher priority requests to obtain drive resources. Check with the administrator or operator for more information.
Ensure that cleaning media was added to the robotic library for each drive type capable of being cleaned with a separate cleaning cartridge. Ensure that a positive number of cleanings is available for the cleaning media in the EMM database for the robotic library. Replace the cleaning tape or increase the number of cleanings for the cleaning media before the count reaches zero. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
Media and device management status codes and messages About media and device management status codes and messages
523
524
Media and device management status codes and messages About media and device management status codes and messages
Recommended Action: Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. The detailed reason for the canceled request should be available in the system log, command output, or from a device monitor interface. Correct the problem and resubmit the request if needed.
Media and device management status codes and messages About media and device management status codes and messages
525
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that pbx_exchange is running. Run the tpconfig -d or vmquery -a command to verify that the EMM server is actively running and accepting new connections.
526
Media and device management status codes and messages About media and device management status codes and messages
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that the robotic hardware is functional.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. A tpclean operation was attempted and Windows networking was not properly configured. A malloc system call failed when trying to stop ltid.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that ltid, bptm, and the robotic daemons are at a compatible NetBackup level.
Media and device management status codes and messages About media and device management status codes and messages
527
Verify that the user is logged in with permissions adequate for this operation. Verify that the VxSS settings are correct, under Host Properties in the NetBackup Administration Console. See the NetBackup Security and Encryption Guide.
Verify that the correct EMM server name is listed in the NetBackup configuration. Verify that the media server that is encountering this error is listed in the NetBackup configuration on the EMM server. Verify that EMM is running on the EMM server. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that the NetBackup Job Manager is running on the master server.
528
Media and device management status codes and messages About media and device management status codes and messages
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that the NetBackup Job Manager is running on the master server. Verify that the arguments provided to the tpunmount command are correct.
Verify that the drive is not DOWN on the intended media server. Verify that the media server where the drive is found is ACTIVE. Retry the command with a different drive.
Verify that the requested media ID was entered correctly. Retry with a different media ID.
Media and device management status codes and messages About media and device management status codes and messages
529
The robot device, path, or library name in the device configuration may not be valid. The configured robotic device may not exist. The robotic device may be incorrect, such as a UNIX and Linux device file that is not of a character special file format. The robotic daemon/process lock file could not be opened or a lock obtained. The open operation on the device or through the API interface (such as NDMP) failed. Stop any robot test utilities that may be running, since they have the lock on the robotic device when they are active. Check the configuration of the robot against the recommended configuration as indicated in the documentation for robot configuration. Check the health of the robotic device by using a robot test utility, then close the test utility when finished. Check for the existence and permissions of the lock file itself and the lock file directory, which is /usr/openv/volmgr/misc/vmd.lock (UNIX and Linux) or install_path\Volmgr\misc\vmd.lock (Windows). Create the directory/folder and adjust the permissions as needed so that the robotic daemon/process can use the lock file. Stop and restart ltid (the device daemon on UNIX and Linux or the NetBackup Device Manager service on Windows).
1 2 3 4
530
Media and device management status codes and messages About media and device management status codes and messages
The SCSI commands mode sense, mode select, or read element status (of slot, drive, transport, i/e element) may have failed. A network API-controlled library inventory request may have failed. The robotic daemon/process could not initialize a robotic database file. Check the configuration of the robot against the recommended configuration as indicated in the documentation for robot configuration. Check the health of the robotic device by using a robot test utility, then close the test utility when finished. Check for the existence and permissions of the temporary robotic database and the temporary database directory/folder, which is /usr/openv/volmgr/misc/robotic_db (UNIX and Linux) or install_path\Volmgr\misc\robotic_db (Windows). Create the directory/folder and adjust the permissions as needed so that the robotic daemon/process can create it or use it. Stop and restart ltid (the device daemon on UNIX and Linux or the NetBackup Device Manager service on Windows).
Stop any robot test utilities, since they have the lock on the robotic device when they are active, and can block other requests. Check whether excessive hardware retries have delayed the completion of a robotic command. Check to see whether the robotic device still functions. Use a robot test utility to send commands to the device to see whether it is responsive. Execute vmps to verify that no unexpected Media Manager processes are running. Some processes should remain running, but some processes that do not go away can indicate a more serious problem, such as a hung system call.
Media and device management status codes and messages About media and device management status codes and messages
531
Explanation: The robot could not be initialized. This generic status is used for many conditions.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Investigate the system log messages that are related to the specific error leading to the robot initialization failure.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Investigate the system log messages that are related to the specific error leading to the media mount failure.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Investigate the system log messages that are related to the specific error leading to the media dismount failure.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
532
Media and device management status codes and messages About media and device management status codes and messages
Verify that all Media Manager binaries and user interfaces are at a compatible version level.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Check for improperly configured cleaning media or interference with the drive cleaning operation. Check for bad media that may have led to the drive not becoming ready after media was placed within it. To avoid configuring incorrect device paths and device names, which is a common cause of drive open problems, do the following: use the Device Configuration wizard (on supported device discovery platforms) so that device paths and device names can be automatically configured. Investigate the system log messages that are related to the specific error leading to the open failure.
Media and device management status codes and messages About media and device management status codes and messages
533
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Check for improperly configured cleaning media or interference with the drive cleaning operation. Check for bad media that may prevent unloading the drive. To avoid configuring incorrect device paths and device names, which is a common cause of drive unload problems, do the following: use the Device Configuration wizard (on supported device discovery platforms) so that device paths and device names can be automatically configured. Investigate the system log messages that are related to the specific error leading to the unload failure.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Check vendor or operating system administrative interfaces and logs to see if robotic commands are being canceled.
Stop any robot test utilities, since they have the lock on the robotic device when they are active, and can block other requests. Check to see whether the robotic device still functions. Check whether excessive hardware or communication problems have delayed the completion of a robotic command. Use a robot test utility to send commands to the device to see whether it is responsive. Execute vmps to verify that no unexpected Media Manager processes are running. Some processes should remain running, but some processes that do not go away can indicate a problem, such as a hung system call.
534
Media and device management status codes and messages About media and device management status codes and messages
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Obtain the list of drives using a method that involves a robotic library query, such as a query available from the robot test utility. Compare the list of drives against the device configuration. Ensure that ltid was stopped and restarted after changes were last made to the device configuration. ltid is the Media Manager device daemon on UNIX and Linux or the NetBackup Device Manager service on Windows.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Ensure that ltid was stopped and restarted after changes were last made to the device configuration. (ltid is the Media Manager device daemon on UNIX and Linux or the NetBackup Device Manager service on Windows.) On commands to robotic libraries, specify only robotic libraries that are actively part of the device configuration.
If the media is needed immediately, examine command output (if available), debug logs, and system logs for messages relating to the targeted media. See Using debug logs on page 423.
Media and device management status codes and messages About media and device management status codes and messages
535
Check for conflicts between multiple applications using media in the robotic library. Check integrity of the drive and drive paths, so that media is not routinely left in other drives.
See whether the robotic library has a media access port (use the robot test utility to validate). Investigate whether the administrator or operator has canceled the inject operation. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
536
Media and device management status codes and messages About media and device management status codes and messages
See whether the robotic library has a media access port (use the robot test utility to validate). Investigate whether the administrator or operator has canceled the eject operation. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Coordinate inject/eject operations between all operators and administrators.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Media Manager status codes on page 424. Coordinate inject/eject operations between all operators and administrators. Ensure that the media access port is empty of media before an eject operation.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
Media and device management status codes and messages About media and device management status codes and messages
537
Investigate the state of the physical hardware and correct the holder status for storage, drive, and transport elements as needed. Then, resubmit the request.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Investigate the system log messages that are related to the specific error leading to the media mount failure. Verify that all Media Manager binaries are at a compatible version level. Verify that robotic interfaces to vendor and operating system software have compatible versions.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Investigate the system log messages that are related to the error leading to the media mount failure.
538
Media and device management status codes and messages About media and device management status codes and messages
Verify that all Media Manager binaries are at a compatible version level. Verify that robotic interfaces to vendor and operating system hardware and software have compatible versions.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Issue a robot inventory Contents report to determine the valid slot range for the robot. Check the volume configuration to ensure that only valid slots are referenced in volume records, paying particular attention to the starting and ending slot numbers. Update the volume configuration as needed, or request only valid slot ranges for robotic operations.
Examine command output, debug logs, and system logs for a more detailed message on the error.
Media and device management status codes and messages About media and device management status codes and messages
539
Verify that all Media Manager binaries and user interfaces are at a compatible version level.
Check for other error messages in the command or interface output to indicate which system call failed. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Check the system application log for error and warning messages. Verify that the system is not running out of virtual memory. If virtual memory is the problem, shut down unused applications or increase the amount of virtual memory. To increase virtual memory on Windows: display the Control Panel, double-click System, and on the Performance tab, set Virtual Memory to a higher value. Verify that all product binaries are properly installed. Verify that no unexpected Media Manager processes are running by executing vmps. Some processes should remain running, but some processes that do not go away could indicate a problem, such as a hung system call.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Issue a robot inventory Contents report to obtain the list of media in the robotic library. See whether inventory filters have been enabled in the Media Manager configuration file. Inventory filters affect the contents of the media list returned from the robotic daemon or process. Use a robot test utility or an operating system/vendor administrative interface to verify the status of media, as needed. Update the volume configuration and search for the media if it was not in the robotic library, as needed, and resubmit the request.
540
Media and device management status codes and messages About media and device management status codes and messages
Media and device management status codes and messages About media and device management status codes and messages
541
Update the volume configuration and search for the media if it was not in the robotic library, as needed, and resubmit the request.
542
Media and device management status codes and messages About media and device management status codes and messages
Media and device management status codes and messages About media and device management status codes and messages
543
for drive requests. Also, the relationship between the number of logical drives and physical drives may change as hardware failures occur. NetBackup scheduling, drive allocation, and drive assignment algorithms can only determine logical drive availability. NetBackup attempts to fully utilize all configured and available logical drives. If the number of required logical drives exceeds the physical drives available, a NetBackup job may be started with insufficient drive resources. Instead of queueing the job in the scheduler, the job runs and encounters the resource issue when it makes an ACS tape mount request.
Install the Shared Storage Option (SSO) license for mount requests to requeue when physical drive resources are not available. The number of drives that can be in use at any one time is limited. Configure backup windows so the different storage units that are tied to the same physical drives are active only at non-overlapping times. Increase the media mount timeout to avoid job failures when the job cannot get a physical drive due to the drives all being busy.
544
Media and device management status codes and messages About media and device management status codes and messages
Media and device management status codes and messages About media and device management status codes and messages
545
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Retry the operation once the drive cleaning has completed.
Verify that the user has the necessary permissions to perform this operation. Verify that the VxSS settings are correct, under Host Properties in the NetBackup Administration Console. Information is available on using the Veritas Security Subsystem (VxSS). See the NetBackup Administrators Guide, Volume I.
546
Media and device management status codes and messages About media and device management status codes and messages
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Retry the operation when the robot diagnostic cycle is complete.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Run the tpconfig -d or vmquery -a command to verify that the EMM server is actively processing commands.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Restart ltid and the robotics on this system or on the robot control host system.
Media and device management status codes and messages About media and device management status codes and messages
547
These status codes are also logged to the system log. Usually, robotic daemons/processes are not started from the command line, but are started automatically, as needed, when ltid starts.
Start ltid so that shared memory can be initialized, allowing the robotic daemon/process to function. If problems persist, examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. On UNIX and Linux servers, gather the output of the ipcs -a command to see what resources are currently in use.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. On UNIX and Linux servers, gather the output of the ipcs -a command to see what resources are currently in use.
548
Media and device management status codes and messages About media and device management status codes and messages
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. On UNIX and Linux servers, gather the output of the ipcs -a command to see what resources are currently in use.
Media and device management status codes and messages About media and device management status codes and messages
549
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. On UNIX and Linux servers, gather the output of the ipcs -a command to see what resources are currently in use.
Restart the device daemon at a later time and investigate system problems that limit the number of processes. Examine the system logs for a more detailed message on the error. See Using debug logs on page 423. Restart the device daemon, then retry the operation and examine the system log file.
550
Media and device management status codes and messages About media and device management status codes and messages
Recommended Action: Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423. Verify that all Media Manager binaries are at a compatible version level.
Make sure nbemm is running and responding to requests. Examine command output, debug logs, and system logs for a more detailed message on the error. See Using debug logs on page 423.
Media and device management status codes and messages Media and device management messages
551
Recommended Action: Display the device configuration using tpconfig -d or a device configuration interface to see the robotic and the drive information that is already configured. Ensure that the drive addresses do not span physical domains. Drives can only be configured in the same robot if they can be used with media from a single domain. The domain must include a single physical library or multiple libraries that are connected by a cartridge exchange or pass-through mechanism.
Check product documentation for supported device configurations. Obtain an additional software license that allows robots of the associated robot type to be configured. Or, limit the configuration to robot types that current licensing allows. Check for the existence and permissions of the external_robotics.txt file in the /usr/openv/share directory (UNIX and Linux) or in the install_path\NetBackup\share folder (Windows).
Media and device management messages and status codes Status code number
Device Configuration status code 36 Robotic status code 240
A SCSI inquiry sent to the device has failed Device Configuration status code 16 ADAMM GUID does not exist in database Media Manager status code 168
ADAMM GUID is not unique in the database Media Manager status code 167
552
Media and device management status codes and messages Media and device management messages
Table 6-1
Media and device management messages and status codes (continued) Status code number
Device Configuration status code 82 Device Configuration status code 40
Error message
Add Drive Name Rule request failed Adding this device would exceed the maximum allowed Adding this drive would exceed the maximum allowed another daemon already exists Another device configuration is already in progress barcode does not exist in database barcode label is unreadable barcode not unique in database barcode tag is not unique in rule database cannot allocate requested memory
Media Manager status code 78 Robotic status code 235 Media Manager status code 36 Media Manager status code 122 Media Manager status code 18
Cannot assign a robotically controlled device Device management status code 64 Cannot assign due to media ID mismatch cannot auto-eject this robot type cannot auto-inject this robot type Cannot change terminal mode Device management status code 57 Media Manager status code 51 Media Manager status code 52 Device Configuration status code 41
cannot connect to robotic software daemon Media Manager status code 42 cannot connect to vmd [on host host name] Media Manager status code 70 Cannot create miscellaneous working repository cannot delete assigned volume cannot delete one of the default volume pools Cannot discover devices. See the Troubleshooting Guide for details. Device Configuration status code 42
Media and device management status codes and messages Media and device management messages
553
Table 6-1
Media and device management messages and status codes (continued) Status code number
Error message
Cannot execute command, permission denied Device Configuration status code 1 cannot get host name Media Manager status code 76
Cannot move from media access port to slot Robotic status code 218 Cannot move from slot to media access port Robotic status code 219 cannot obtain daemon lockfile Cannot open/create the media access port status file cannot perform operation on this host Can not retrieve Job ID from Job Manager Cannot stop device daemon with tapes assigned cannot update database due to existing errors child process killed by signal Cleaning/unknown media in drive Media Manager status code 21 Robotic status code 243
Media Manager status code 60 Device management status code 79 Device management status code 20
Configuration has changed, robot daemons Robotic status code 262 and ltid need restarting Control daemon connect or protocol error CORBA communication error Could not get hostname Credentials already exist Current version does not support remote device host current version does not support this configuration daemon cannot obtain socket daemon failed accepting connection Robotic status code 224 Media Manager status code 195 Device Configuration status code 3 Device configuration status code 99 Device Configuration status code 38
554
Media and device management status codes and messages Media and device management messages
Table 6-1
Media and device management messages and status codes (continued) Status code number
Media Manager status code 5 Media Manager status code 7 Media Manager status code 26
Error message
daemon resources are busy daemon terminated database open operation failed
database read operation read too few bytes Media Manager status code 28 database read record operation failed database server is down Media Manager status code 27 Media Manager status code 23 and Device Configuration status code 93 Media Manager status code 32 Device Configuration status code 84 Media Manager status code 153
database write record operation failed Delete Drive Name Rule request failed device entry is not unique in global device database device management error Device path is already in use device test state file does not exist Devices located in multiple domains Disk Optimization feature is not licensed Door is open on cabinet
Media Manager status code 83 Device Configuration status code 22 Media Manager status code 182 Robot Error status code 16 Device Configuration status code 94 Robotic status code 216
Drive being assigned is either not NDMP or Device management status code 37 on the wrong NDMP client Drive does not exist in robot Drive does not support pending request density Drive index is in use by another drive Drive is currently assigned Drive name does not exist Robotic status code 213 Device management status code 17
Device Configuration status code 29 Device management status code 2 Device Configuration status code 35
Drive name is already in use by another drive Device Configuration status code 34
Media and device management status codes and messages Media and device management messages
555
Table 6-1
Media and device management messages and status codes (continued) Status code number
Error message
Drive name rule has exceeded its maximum Device Configuration status code 89 length of 48 characters Drive not available in library for mount request Duplicate device path names EMM database error EMM DB record not found EMM error Device management status code 62
Device Configuration status code 20 Media Manager status code 196 Media Manager status code 194 Robotic status code 261 and Robot Error status code 13 Device management status code 72 Media Manager status code 57 Robot Error status code 4 Robot Error status code 5 Device management status code 32 Robot Error status code 3 Device management status code 14 Device management status code 13 Device management status code 28 Device management status code 6 Device management status code 4 Device management status code 8 Device management status code 33 Robotic Error status code 9 Device management status code 5 Device management status code 3 Device management status code 7
EMM library call failed error auto-generating volume group Error in attaching the shared memory Error in getting process Id Error in getting semaphore Error in getting shared memory Error in IPC SHMAT call Error in IPC SHMGET call Error in MsgGet Error in Receiving Daemon Message Error in Receiving Operator Message Error in Receiving User Message Error in SEMAPHORE operation Error in SEMAPHORE operation Error in Sending Daemon Message Error in Sending Operator Message Error in Sending User Message
556
Media and device management status codes and messages Media and device management messages
Table 6-1
Media and device management messages and status codes (continued) Status code number
Media Manager status code 165
Error message
Evaluation period expired. Go to www.symantec.com to order this product. failed appending to pool database failed appending to rule database failed changing terminal characteristics failed initiating child process failed making the database directory failed opening tmp output file Failed reading drive or robot config file failed receiving from robotic software daemon failed receiving from vmd failed redirecting input to pipe failed redirecting tmp output file failed sending request to vmd failed sending to robotic software daemon failed sending to vmd Failed to find an available slot to inject to failed to initialize a connection to the Enterprise Media Manager File name already exists File name does not exist Fork failure occurred generic EMM SQL error global device database append operation failed
Media Manager status code 104 Media Manager status code 121 Media Manager status code 45 Media Manager status code 88 Media Manager status code 25 Media Manager status code 86 Device Configuration status code 13 Media Manager status code 44
Media Manager status code 72 Media Manager status code 62 Media Manager status code 87 Media Manager status code 69 Media Manager status code 43 Media Manager status code 71 Robotic status code 246 Media Manager status code 189
Device management status code 40 Device management status code 25 Robot Error status code 10 Media Manager status code 193 Media Manager status code 155
Media and device management status codes and messages Media and device management messages
557
Table 6-1
Media and device management messages and status codes (continued) Status code number
Error message
global device database host name is invalid Device Configuration status code 49 global device database record not found Host is not the scan host for this shared drive incompatible database version Incomplete robot information Incorrect tpreq access mode internal database access failure Internal NDMP error invalid barcode invalid change type invalid change-entry request Invalid command code invalid command usage. invalid container id. invalid database host. invalid database version header invalid description Invalid device path name Invalid drive index invalid drive name Invalid Drive Name Rule Invalid Drive Number Invalid drive type for the robot Invalid Drive Type/Density Media Manager status code 152 Device management status code 48
Media Manager status code 146 Device Configuration status code 24 Device management status code 42 Media Manager status code 169 Device Configuration status code 57 Media Manager status code 10 Media Manager status code 75 Media Manager status code 50 Robotic status code 207 Media Manager status code 4 Media Manager status code 186 Media Manager status code 19 Media Manager status code 56 Media Manager status code 11 Device Configuration status code 19 Device Configuration status code 14 Media Manager status code 129 Device Configuration status code 86 Device management status code 23 Device Configuration status code 27 Device management status code 1
558
Media and device management status codes and messages Media and device management messages
Table 6-1
Media and device management messages and status codes (continued) Status code number
Media Manager status code 191 Media Manager status code 113 Device Configuration status code 88 Media Manager status code 136 Media Manager status code 114 Media Manager status code 140 Media Manager status code 41 Media Manager status code 8 Media Manager status code 9 Device Configuration status code 64 Device Configuration status code 55 Device Configuration status code 60 Media Manager status code 74 Media Manager status code 141 Media Manager status code 142 Media Manager status code 144 Media Manager status code 143 Media Manager status code 148 Media Manager status code 147 Device management status code 12 Media Manager status code 102 Media Manager status code 6 Media Manager status code 73 Media Manager status code 16
Error message
invalid EMM argument invalid expiration date Invalid host invalid host name invalid maximum mounts invalid media generation rule invalid media ID for naming mode invalid media ID invalid media type Invalid NDMP device Invalid NDMP hostname Invalid NDMP password invalid number of cleanings invalid number of mounts invalid offsite location invalid offsite return date invalid offsite sent date invalid offsite session id invalid offsite slot Invalid Operator invalid pool database entry invalid protocol request invalid query type invalid robot coord1
Media and device management status codes and messages Media and device management messages
559
Table 6-1
Media and device management messages and status codes (continued) Status code number
Media Manager status code 17
Error message
invalid robot coord2
Invalid robot drive number for the robot type Device Configuration status code 28 invalid robot host Invalid robot number invalid robot number Invalid robot type invalid robot type invalid rule database entry invalid scratch pool name Invalid SCSI bus number for the robot Invalid SCSI logical unit number for the robot Invalid SCSI port number for the robot Invalid SCSI target for the robot Invalid Usage invalid volgroup invalid volume move mode Invalid volume pool specified invalid volume pool IPC Error: Daemon may not be running IPC sequence error Job Manager returned error: see activity monitor List Drive Name Rule request failed LTI Daemon may not be running Media Manager status code 14 Device Configuration status code 15 Media Manager status code 13 Device Configuration status code 18 Media Manager status code 12 Media Manager status code 119 Media Manager status code 173 Device Configuration status code 8 Device Configuration status code 10
Device Configuration status code 7 Device Configuration status code 9 Device Configuration status code 11 Media Manager status code 15 Media Manager status code 53 Device management status code 65 Media Manager status code 90 Device management status code 22 Device management status code 10 Device management status code 80
560
Media and device management status codes and messages Media and device management messages
Table 6-1
Media and device management messages and status codes (continued) Status code number
Device management status code 75 Robotic status code 221 Robotic status code 220 Robotic status code 250 Robotic status code 253 Robotic status code 252 Robotic status code 251 Media Manager status code 166 Device management status code 83 Media Manager status code 138 Media Manager status code 139 Media Manager status code 95 Media Manager status code 34 Media Manager status code 101 Device management status code 56
Error message
LTI system error Media access port already contains media Media access port does not contain media Media access port is available Media access port is in eject mode Media access port is in inject mode Media access port is unavailable media access port not available Media does not exist in database media generation rule already exists media generation rule does not exist media ID is not the specified media type media ID not unique in database media type and volume group mismatch Mount canceled, device daemon is terminating NDMP authorization error, verify username/password NDMP config_get_connection_type failed NDMP config_get_mover_type failed NDMP failed to verify host NDMP get_host_info failed NDMP get_server_info failed NDMP host does not exist NDMP host exists, use change option
Device Configuration status code 76 Device Configuration status code 75 Device Configuration status code 58 Device Configuration status code 71 Device Configuration status code 72 Device Configuration status code 62 Device Configuration status code 61
Media and device management status codes and messages Media and device management messages
561
Table 6-1
Media and device management messages and status codes (continued) Status code number
Device Configuration status code 69 Device Configuration status code 59 Device Configuration status code 63 Device Configuration status code 66 Device Configuration status code 65 Device configuration status code 94
Error message
NDMP host not connected NDMP is not installed on platform NDMP request failed NDMP robot does not exist NDMP robot exists, use change option NetApp Disk Storage Unit feature is not licensed NetBackup Snapshot client not licensed network protocol error No action pending for given mount index no child process to wait for No cleaning tape is defined in the device's robot or 0 cleanings remaining No compatible device is registered at these SCSI coordinates No devices are configured on the robot No drive available no entries changed no entries deleted no entries inserted No Error on operation, sideband data only No media found in device or robot slot, please verify No memory available No mount pending for given mount index no pools in the pool list
Device configuration status code 100 Media Manager status code 39 Device management status code 50 Media Manager status code 64 Device management status code 59
Robot Error status code 6 Device management status code 82 Media Manager status code 47 Media Manager status code 48 Media Manager status code 49 Device management status code 84 Device management status code 61
Robot Error status code 8 Device management status code 16 Media Manager status code 112
562
Media and device management status codes and messages Media and device management messages
Table 6-1
Media and device management messages and status codes (continued) Status code number
Device management status code 60
Error message
Robot daemon and/or robotic hardware is not available No robot is defined of this type No robots are configured No valid license key for Disk Array configuration <NONE> Not authorized by VxSS not authorized to connect to vmd Only the administrative user can perform the requested operation Open Storage feature is not licensed
Device management status code 52 Robot Error status code 7 Device Configuration status code 97
Device Configuration status code 36 Robotic status code 259 Media Manager status code 126 Device management status code 19
operation not allowed on cleaning cartridge Media Manager status code 117 Operator denied mount request Device management status code 55
oprd request is not supported on the remote Media Manager status code 137 host oprd returned abnormal status Parameter is invalid Physical drive is not available pool does not exist in pool database poolname is not unique in pool database pool not defined as a catalog backup pool pool not defined as a scratch pool pool type change is not allowed for <CatalogBackup> pool Process killed by parent Media Manager status code 96 Device management status code 39 Robotic status code 245 Media Manager status code 109 Media Manager status code 105 Media Management status code 198 Media Manager status code 172 Media Manager status code 22
Media and device management status codes and messages Media and device management messages
563
Table 6-1
Media and device management messages and status codes (continued) Status code number
Robotic status code 211 Media Manager status code 20 Media Manager status code 150
Error message
Process killed by signal protocol error registering this host would exceed the maximum allowed
request can only be performed on the Media Media Manager status code 177 and Device management Domain Server request completed Request has been queued (Cancel to clear message Request terminated because host not validated for volume pool Request terminated because media id is expired Request terminated because media id will exceed maximum mount count Request terminated because media is a cleaning tape Request terminated because media is unavailable (in DOWN drive, misplaced, write protected or unmountable Request terminated because media is unmountable Media Manager status code 1 Device management status code 53
Request terminated because media is write Device management status code 70 protected Request terminated because mount requests Device management status code 63 are disabled Request terminated because of volume pool Device management status code 66 mismatch Request terminated by tpunmount call from Device management status code 36 another process
564
Media and device management status codes and messages Media and device management messages
Table 6-1
Media and device management messages and status codes (continued) Status code number
Device management status code 24
Error message
Requested drive could not be reserved
requested drive is already reserved by host Media Manager status code 145 requested drive is already reserved Requested drive is in an offline domain requested drive is not currently registered requested drive is not currently reserved requested drive is not registered for host requested drive is not reserved by host requested host is not currently registered Media Manager status code 130 Robotic status code 238 Media Manager status code 132 Media Manager status code 134 Media Manager status code 131 Media Manager status code 133 Media Manager status code 135
Requested operation is not supported by the Robotic status code 229 robot Requested slot already has cartridge Requested slot contains the wrong tape Requested slot does not exist in robot Requested slot is empty Requested tape in other or non-configured drive Requested volume is in an offline domain Residence is not licensed for multihosted drive support Retry later Robot busy, cannot perform operation Robot busy, inject operation in progress Robotic status code 217 Robotic status code 226 Robotic status code 228 Robotic status code 208 Robotic status code 215
Device management status code 81 Robotic status code 223 Robotic status code 255
Robot busy, inventory operation in progress Robotic status code 254 Robot busy, multiple eject operation in progress Robotic status code 256
Media and device management status codes and messages Media and device management messages
565
Table 6-1
Media and device management messages and status codes (continued) Status code number
Robotic status code 257
Error message
Robot busy, multiple inject operation in progress Robot busy, robot diagnostics in progress Robot denied access to the resource Robot drive number in use for this robot Robot hardware or communication error Robot has misplaced the media robot host and volume group mismatch Robot/LTI protocol error Robot media access port does not exist robot number and robot host mismatch robot number and robot type mismatch
Robotic status code 260 Robotic status code 234 Device Configuration status code 25 Robotic status code 225 Robotic status code 236 Media Manager status code 82 Device management status code 76 Robotic status code 242 Media Manager status code 61 Media Manager status code 54
robot number and volume group mismatch Media Manager status code 55 Robot number does not exist Robot number does not exist. Robot number is already in use Robot number is in use by another robot Robot operation failed robot type and volume group mismatch Robot type must be controlled locally Robotic arm has no addressable holder Robotic daemon not licensed Robotic dismount failure Robotic mount failure robotic volume position is already in use Device Configuration status code 31 Robotic status code 214 Device Configuration status code 21 Device Configuration status code 30 Device management status code 74 Media Manager status code 81 Device Configuration status code 33 Robotic status code 222 Robot Error status code 17 Robotic status code 206 Robotic status code 205 Media Manager status code 37
566
Media and device management status codes and messages Media and device management messages
Table 6-1
Media and device management messages and status codes (continued) Status code number
Device Configuration status code 48 Media Manager status code 97
Error message
RSM is not supported rule does not exist in rule database
Shared Storage Option (SSO) is not licensed Device Configuration status code 53 specified robot is unknown to vmd STATUS_SUCCESS Media Manager status code 79 Robotic status code 200
Stopping device daemon with tapes assigned Device management status code 73 Success System Error Device Configuration status code 0 Device Configuration status code 87 and Media Manager status code 2 Robot Error status code 11 Robotic status code 230
System error occurred System error occurred during robot operation Tape file path exceeds 255 character maximum Tape needs to be write enabled The device is not robotic, cannot perform cleaning
The device_mappings file has invalid license Device Configuration status code 2 info The device name is not valid, no device responded The drive is DOWN The drive is not ready or inoperable Device Configuration status code 52
The drive serial number already exists in the Device Configuration status code 91 device database The eject command was aborted by the user Robotic status code 244 The EMM server failed to process the request Device Configuration status code 78
Media and device management status codes and messages Media and device management messages
567
Table 6-1
Media and device management messages and status codes (continued) Status code number
Media Manager status code 162
Error message
the global device database device name is invalid the global device database device type is invalid the media is allocated for use the operation requested has failed the query with time failed because a limit was reached
Media Management status code 199 Media Manager status code 163 Media Manager status code 184
The requested operation is not valid for the Device Configuration status code 95 specified Disk Type the request sent to the Device Allocator has Media Manager status code 190 failed the robotic daemon returned an invalid volume GUID Media Manager status code 164
the robotic library is full and may still have Media Manager status code 185 media in its map The specified Disk Array Host is not configured in NetBackup the specified pool is not empty This is a drive path operation, use the -drpath option this machine is not the database host This robot type does not support multiple media types Timeout waiting for robotic command too many volumes in volume group Device Configuration status code 96
Unable to allocate memory for this process Device configuration status code 79 Unable to connect to NDMP host verify hostname Device configuration status code 67
568
Media and device management status codes and messages Media and device management messages
Table 6-1
Media and device management messages and status codes (continued) Status code number
Device configuration status code 77 and Device management status code 78 Device configuration status code 70
Error message
Unable to connect to the EMM server
unable to find any records in the device test Media Manager status code 176 database unable to generate a unique media id Unable to initialize robot Unable to open drive Unable to open robotic path unable to open the device test state file Unable to process NDMP message Unable to SCSI unload drive unable to send exit status Unable to sense robotic device unexpected data from robotic software daemon unexpected data received Unknown drive name unknown EMM error code Unsupported NDMP version Update Drive Name Rule request failed Usage error in creating child process user id was not superuser volume daemon fork failed volume does not exist in database volume group does not exist Media Manager status code 127 Robotic status code 204 Robotic status code 209 Robotic status code 201 Media Manager status code 175 Device configuration status code 68 Robotic status code 210 Media Manager status code 67 Robotic status code 202 Media Manager status code 46
Media Manager status code 40 Device management status code 41 Media Manager status code 192 Device configuration status code 73 Device configuration status code 83 Robot Error status code 12 Media Manager status code 3 Media Manager status code 85 Media Manager status code 35 Media Manager status code 65
Media and device management status codes and messages Media and device management messages
569
Table 6-1
Media and device management messages and status codes (continued) Status code number
Media Manager status code 116 Media Manager status code 115 Media Manager status code 93 Robotic status code 249
Error message
volume has exceeded maximum mounts volume has passed expiration date volume is already assigned Volume is in home slot
Volume is in library, but not in drive domain Robotic status code 233 Volume is in use volume is not in specified pool Volume not found in library VxSS Access Denied Robotic status code 237 Media Manager status code 94 Robotic status code 232 Media Manager status code 188, Device management status code 77, and Device configuration status code 92 Media Manager status code 187
You do not have permission to create the file Device management status code 44 You must be administrator to execute You must be ROOT to start daemon Robot Error status code 14 Robot Error status code 1
570
Media and device management status codes and messages Media and device management messages
Chapter
Disaster recovery
This chapter includes the following topics:
About disaster recovery Recommended backup practices Disk recovery procedures for UNIX and Linux Disk recovery procedures for Windows Catalog recovery from an online backup Clustered NBU server recovery for UNIX and Linux Clustered NBU server recovery for Windows
572
Warning: NetBackup may not function properly if you reinstall NetBackup and recover its catalogs on a system disk to a different partition or one that is partitioned differently due to internal configuration information. Instead, configure a replacement disk with partitioning identical to the failed disk, then reinstall NetBackup on the same partition on which it was originally installed. Specific procedures that replace failed disks, build partitions and logical volumes, and reinstall operating systems can be complicated and time consuming. Such procedures are beyond the scope of this manual. Appropriate vendor-specific information should be referenced.
573
Critical policies
When configuring a policy for online catalog backup, you can designate certain NetBackup policies as critical. Critical policies back up systems and data deemed critical to end-user operation. During a catalog recovery, NetBackup verifies that all of the media that is needed to restore critical policies are available.
Automated recovery
The catalog disaster recovery file (created during an online catalog backup) is intended to automate the process of NetBackup recovery. If you recover a system other than the one that originally made the backups, it should be identical to the original system. For example, if the system that performs the recovery does not include NetBackup servers with identical names to those where the backups were made, the automated recovery may not succeed.
574
Tape media images have all expired after the catalog backup that the recovery was done from and the tape designated for and possibly re-used. Disk-based media whose images expired after the catalog backup that the recovery was done from and the images were deleted from the disk.
575
Master server disk recovery procedures Media server disk recovery procedures Client disk recovery procedures
Disk-based images that reside on SharedDisk, AdvancedDisk, or on OpenStorage disks cannot be recovered by means of the NetBackup catalog. These disk images must be recovered by means of the NetBackup import feature. For information on import, See the topic on importing NetBackup images in the NetBackup Administrators Guide, Volume I. When the disk image is imported, NetBackup does not recover the original catalog entry for the image. Instead, a new catalog entry is created.
Root file system is intact. The operating system, NetBackup software and some (if not all) other files are assumed to be lost. Root file system is lost along with everything else on the disk. This situation requires a total recovery. This recovery reloads the operating system to an alternate boot disk and boots from this disk during recovery. This operation lets you recover the root partition without risking a crash caused by overwriting files that the operating system uses during the restore.
For NetBackup master and media servers, the directory locations of the NetBackup catalog become an integral part of NetBackup catalog backups. Any recovery of the NetBackup catalog requires identical directory paths or locations be created during the NetBackup software reinstallation. Disk partitioning, symbolic links, and NetBackup catalog relocation utilities may be needed. NetBackup Bare Metal Restore (BMR) protects client systems by backing them up with a policy configured for BMR protection. Information is available that describes backup and recovery procedures. See the Bare Metal Restore System Administrator's Guide.
576
Verify that the operating system works, that any require patches are installed, and that specific configuration settings are made. Take corrective action as needed. Reinstall NetBackup software on the server you want to recover. See the NetBackup Installation Guide for UNIX for instructions.
Install any NetBackup patches that had been previously installed. See the documentation that was included with the patch software. Note: Symantec does not support the recovery of a catalog image that was backed up using an earlier version of NetBackup.
If any of the default catalog directories have changed that may be reflected in the NetBackup catalog backups, recreate those directories before the catalog recovery. The following are examples:
Use of symbolic links as part of the NetBackup catalog directory structure. Use of the NetBackup nbdb_move command to relocate parts of the NetBackup relational database catalog.
If the recovery scenario involves restoring policy or catalog backups, the appropriate recovery device(s) must be configured, which may involve the following tasks:
Install and configure the robotic software for the devices that read backups of the NetBackup catalog and regular backups of the disk being restored. If a non-robotic drive is available that can read these backups, then no robot is required. Although manual intervention is required if multiple pieces of media are required. See the NetBackup Device Configuration Guide. Using the NetBackup Device Configuration Wizard to discover and configure the recovery device in NetBackup. See the NetBackup Administrator's Guide, Volume I. Using the NetBackup command tpautoconf to discover and configure the recovery device in NetBackup.
577
See the NetBackup Command Guide. Updating the device mapping files. See the NetBackup Administrators Guide, Volume II.
If you must restore from policy or catalog backups that were done to media, the appropriate media may have to be configured in NetBackup. See the NetBackup Administrators Guide, Volume I. Configuring the media may require some or all of the following tasks:
Manually load the required media into a stand-alone recovery device. Use the NetBackup utilities such as robtest or vendor-specific robotic control software to load media into the required recovery device or devices. Using the NetBackup Volume Configuration Wizard to inventory the media contents of a robotic device. Using the vendor-specific robotic control software to load the media into the required recovery device(s).
Recover the NetBackup catalogs to the server you are recovering. The NetBackup catalogs can be recovered only to the same directory structure from which they were backed up (alternate path recovery is not allowed).
Stop and restart all NetBackup daemons. Use the following NetBackup commands, or use the Activity Monitor in the NetBackup Administration Console. Note: Your configuration may include an EMM server that is separate from the master server. If so, start NetBackup on the EMM server before starting NetBackup on the master server.
/usr/openv/netbackup/bin/bp.kill_all /usr/openv/netbackup/bin/bp.start_all
Start the NetBackup Backup, Archive, and Restore interface (or the bp command) and restore other files to the server as desired. When the files are restored, you are done.
578
recovery disk, and 5) copy the NetBackup catalogs from the alternate disk to the recovery disk. This procedure assumes that the root file system is lost along with everything else on the disk. This procedure reloads the operating system to an alternate boot disk and boots from that disk during recovery. This operation lets you recover the root partition without risking a crash caused by overwriting files that the operating system uses during the restore. To recover the master server when the root partition is lost
1 2
Load the operating system on an alternate boot disk, using the same procedure as you would normally use for the server type. Create on the alternate disk the partition and directory where NetBackup and its catalogs (if applicable) and databases resided on the original disk. By default, they reside under the /usr/openv directory. Verify that the operating system works, that any required patches are installed, and that specific configuration settings are made. Take corrective action as needed. Install NetBackup on the alternate disk. Install only the robotic software for the devices required to read backups of the NetBackup catalogs and regular backups of the disk being restored. If a non-robotic drive can read these backups, no robot is required. Install any NetBackup patches that had been previously installed. See the documentation that was included with the patch software. If any changes to the default catalog directories would be reflected in the NetBackup catalog backups, recreate those directories before the catalog recovery. Examples of those directories are the following:
5 6
Use of symbolic links as part of the NetBackup catalog directory structure. Use of the NetBackup nbdb_move command to relocate parts of the NetBackup relational database catalog.
If the recovery scenario involves restoring policy or catalog backups, the appropriate recovery device(s) must be configured. Device configuration may include the following tasks:
Install and configure the robotic software for the devices that read backups of the NetBackup catalog and regular backups of the disk being restored. If a non-robotic drive is available that can read these backups, then no robot is required. Although manual intervention is required if multiple pieces of media are required.
579
Use the NetBackup Device Configuration Wizard to discover and configure the recovery device in NetBackup. See the NetBackup Administrator's Guide, Volume I. Use the NetBackup command tpautoconf to discover and configure the recovery device in NetBackup. See the NetBackup Command manual. Update the device mapping files. See the NetBackup Administrator's Guide, Volume II.
If you must restore from policy or catalog backups that were done to media, the appropriate media may have to be configured in NetBackup. See the NetBackup Administrators Guide, Volume I. Configuring the media may require some or all of the following tasks:
Manually load the required media into a stand-alone recovery device. Use the NetBackup utilities such as robtest or vendor-specific robotic control software to load media into the required recovery device or devices. Using the NetBackup Volume Configuration Wizard to inventory the media contents of a robotic device. Using the vendor-specific robotic control software to load the media into the required recovery device(s).
Recover the NetBackup catalogs to the alternate disk. See Catalog recovery from an online backup on page 591. The catalogs can be recovered only to the same directory structure from which they were backed up (alternate path recovery is not allowed).
580
10 Start the NetBackup Backup, Archive, and Restore interface (or the bp
command) and restore the latest backed up version of all files to the disk you are recovering. You restore these files from the backup of the master server, not from the NetBackup catalog backup. Be sure to specify the disk that you recover as the alternate recovery location. Warning: Do not restore files to the /usr/openv/var, /usr/openv/db/data, or /usr/openv/volmgr/database directories (or relocated locations) or directories that contain NetBackup database data. This data was recovered to the alternate disk in step 9 and is copied back to the recovery disk in step 12.
11 Stop all NetBackup processes that you started from NetBackup on the alternate
disk. Use the Activity Monitor in the NetBackup Administration Console or the following:
/usr/openv/netbackup/bin/bp.kill_all
12 Maintaining the same directory structure, copy the NetBackup catalogs from
the alternate disk to the disk that you recover. These are the catalogs recovered in step 9.
13 Make the recovered disk the boot disk again and restart the system. 14 Start and test the copy of NetBackup on the disk that you have recovered.
Note: If your configuration includes an Enterprise Media Manager (EMM) server that is separate from the master server, start NetBackup on the EMM server before starting NetBackup on the master server.
/usr/openv/netbackup/bin/bp.start_all
Try the NetBackup Administration utilities. Also, try some backups and restores.
15 When you are satisfied that the recovery is complete, delete the NetBackup
files from the alternate disk. Or, unhook that disk, if it is a spare.
581
Recovering the NetBackup media server disk for UNIX and Linux
NetBackup 6.0 and later media servers store information in the NetBackup relational database. If you need to recover the system disk on a NetBackup media server, the recommended procedure is similar to disk recovery for the client. See Recovering the client disk on page 581. Note: A separate computer that functions as a NetBackup 6.0 or later media server is available only on NetBackup Enterprise Server. For NetBackup Server installations, the master server and the media server are installed on the same system and have the same host name. Therefore, recovering the master server disk also recovers the media server.
Reload the operating system as you normally would for a client workstation of that type. If the root file system is lost, the best approach may be to reload the operating system on an alternate boot disk and start from this disk. After restoring the system, restore root to its original partition. This operation lets you recover the root partition without risking a crash due to overwriting files that the operating system uses during the restore. The procedure is similar to the procedure that is used for the master server, except that recovering the NetBackup catalogs is not necessary. See Recovering the master server disk for Windows on page 582.
2 3
Reinstall NetBackup client software and patches. Use the NetBackup Backup, Archive, and Restore interface to select and restore files.
582
Master server disk recovery procedures Media server disk recovery procedures Client disk recovery procedures
Disk-based images that reside on SharedDisk, AdvancedDisk, or on OpenStorage disks cannot be recovered by means of the NetBackup catalog. These disk images must be recovered by means of the NetBackup import feature. For information on import, refer to the section on importing NetBackup images in the following manual: See NetBackup Administrators Guide, Volume I. Note: When the disk image is imported, NetBackup does not recover the original catalog entry for the image. Instead, a new catalog entry is created.
Windows is intact and not corrupted. The system still starts Windows, but some or all other partitions are lost. NetBackup software is assumed to be lost. All disk partitions are lost. Windows must be reinstalled, which is a total recovery. These procedures assume that the NetBackup master disk was running a supported version of Windows and that the defective hardware has been replaced.
For NetBackup master and media servers, the directory locations of the NetBackup catalog become an integral part of NetBackup catalog backups. Any recovery of the NetBackup catalog requires the identical directory paths or locations be created before the catalog recovery.
583
1 2 3
Determine the install_path in which NetBackup is installed. By default, NetBackup is installed in the C:\Program Files\VERITAS directory. Determine if any directory paths or locations need to be created for NetBackup catalog recovery. Partition any disks being recovered as they were before the failure (if partitioning is necessary). Then reformat each partition as it was before the failure. Reinstall NetBackup software on the server you are recovering. Refer to the NetBackup Installation Guide for Windows.
5 6
Install any NetBackup patches that had been previously installed. See the documentation that was included with the patch software. If any changes to the default catalog directories would be reflected in the NetBackup catalog backups, recreate those directories before the catalog recovery. For example, use the NetBackup nbdb_move command to relocate parts of the NetBackup relational database catalog. If the recovery scenario involves restoring policy or catalog backups, the appropriate recovery devices must be configured. You may have to do some or all of the following:
Install and configure the robotic software for the devices that read backups of the NetBackup catalog and regular backups of the disk being restored. If a non-robotic drive is available that can read these backups, then no robot is required. Although manual intervention is required if multiple pieces of media are required. See the NetBackup Device Configuration Guide. Use the NetBackup Device Configuration Wizard to discover and configure the recovery device in NetBackup. See the NetBackup Administrator's Guide, Volume I. Using the NetBackup command tpautoconf to discover and configure the recovery device in NetBackup. See the NetBackup Commands manual. Updating the device mapping files. See the NetBackup Administrators Guide, Volume II.
If the recovery scenario involves restoring policy or catalog backups, the appropriate recovery device(s) must be configured. Configuring the media may involve the following tasks:
584
Manually load the required media into a stand-alone recovery device. Use NetBackup utilities such as robtest or vendor-specific robotic control software to load media into the required recovery devices. Use the NetBackup Volume Configuration Wizard to inventory the media contents of a robotic device. Use the vendor-specific robotic control software to load the media into the required recovery device(s).
Recover the NetBackup catalogs. See Catalog recovery from an online backup on page 591.
10 When catalog recovery is complete, stop and restart the NetBackup services.
Use the following bpdown and bpup commands, the Activity Monitor in the NetBackup Administration Console, or the Services application in the Windows Control Panel.
install_path\NetBackup\bin\bpdown install_path\NetBackup\bin\bpup
Your configuration may include an EMM server that is separate from the master server. If so, start NetBackup on the EMM server before starting NetBackup on the master server. Warning: In step 11, do not restore files to the install_path\NetBackup\db, install_path\NetBackupDB, install_path\NetBackup\var, or install_path\Volmgr\database directories. The catalogs were recovered in step 9 and overwriting them with regular backups leave them in an inconsistent state. If the NetBackup relational database files were relocated using nbdb_move from install_path\NetBackupDB\data, they are recovered in step 9 and should not be restored in step 11.
Start the NetBackup Administration interface on the master server. Start the Backup, Archive, and Restore utility. Browse for restores and select only the partitions that were lost. Select the system directory (typically C:\Winnt), which ensures that all registry files are restored.
585
Deselect the install_path\NetBackup\db, install_path\NetBackupDB, install_path\NetBackup\var, and install_path\Volmgr\database directories (see the caution in step 10). If you reinstall Windows, select the Overwrite existing files option, which ensures that existing files are replaced with the backups. Start the restore.
12 Reboot the system, which replaces any files that were busy during the restore.
When the boot process is complete, the system is restored to the state it was in at the time of the last backup.
Install the same type and version of Windows software that was used previously. Install Windows in the same partition that was used before the failure. Install any required patches. Take corrective action as needed. Specify the default workgroup. Do not restore the domain. Install and configure special drivers or other software that is required to get the hardware operational (for example, a special driver for the disk drive). Install SCSI or other drivers as needed to communicate with the tape drives on the system. Follow any hardware manufacturer's instructions that apply, such as loading SSD on a Compaq system. Reboot the system when Windows installation is complete.
2 3 4 5
Determine the install_path in which NetBackup is installed. By default, NetBackup is installed in the C:\Program Files\VERITAS directory. Determine if any directory paths or locations need to be created for NetBackup catalog recovery. If necessary, partition any disks being recovered as they were before the failure. Then reformat each partition as it was before the failure. Reinstall NetBackup software on the server being recovered. Do not configure any NetBackup policies or devices at this time.
586
6 7
Install any NetBackup patches that had been previously installed. See the documentation that was included with the patch software. If any changes to the default catalog directories would be reflected in the NetBackup catalog backups, recreate those directories before the catalog recovery. For example, use the NetBackup nbdb_move command to relocate parts of the NetBackup relational database catalog. If the recovery scenario involves restoring policy or catalog backups, the appropriate recovery device or devices have to be configured. You may have to do all or some of the following tasks:
Install and configure the robotic software for the devices that read backups of the NetBackup catalog and regular backups of the disk being restored. If a non-robotic drive is available that can read these backups, then no robot is required. Although manual intervention is required if multiple pieces of media are required. See the NetBackup Device Configuration Guide. Using the NetBackup Device Configuration Wizard to discover and configure the recovery device in NetBackup. See the NetBackup Administrator's Guide, Volume I. Using the NetBackup command tpautoconf to discover and configure the recovery device in NetBackup. See the NetBackup Commands manual. Updating the device mapping files. See the NetBackup Administrators Guide, Volume II.
If you must restore from policy or catalog backups that were done to media, the appropriate media may have to be configured in NetBackup. See the NetBackup Administrators Guide, Volume I. When you configure the media, you may have to do some or all of the following:
Manually load the required media into a stand-alone recovery device. Use the NetBackup utilities such as robtest or vendor-specific robotic control software to load media into the required recovery devices. Using the NetBackup Volume Configuration Wizard to inventory the media contents of a robotic device. Using the vendor-specific robotic control software to load the media into the required recovery devices.
587
11 When catalog recovery is complete, stop and restart the NetBackup services.
Use the following bpdown and bpup commands, the Activity Monitor in the NetBackup Administration Console, or the Services application in the Windows Control Panel.
install_path\NetBackup\bin\bpdown install_path\NetBackup\bin\bpup
If your configuration includes an Enterprise Media Manager (EMM) server that is separate from the master server, start NetBackup on the EMM server first. Warning: In step 12, do not restore files to the install_path\NetBackup\db, install_path\NetBackupDB, install_path\NetBackup\var, or install_path\Volmgr\database directories. These directories were recovered in step 10 and overwriting them with regular backups leaves the catalogs in an inconsistent state. If the relational database files were relocated using nbdb_move from install_path\NetBackupDB\data, they are recovered in step 10 and should not be restored in step 12.
Start the NetBackup Administration interface on the master server. Start the Backup, Archive, and Restore client interface. Browse for restores and select only the partitions that were lost. Select the system directory (typically C:\Winnt), which ensures that all registry files are restored. Deselect the install_path\NetBackup\db, install_path\NetBackupDB (or relocated NetBackup relational database path), install_path\NetBackup\var, or install_path\Volmgr\database directories. See the caution in this procedure). If you reinstall Windows, select the Overwrite existing files option, which ensures that existing files are replaced with the backups.
588
13 Restart the system, which replaces any files that were busy during the restore.
When the boot process is complete, the system is restored to the state it was in at the time of the last backup.
The NetBackup client was running a supported Microsoft Windows version. The NetBackup client was backed up with a supported version of NetBackup client and server software. The NetBackup master server to which the client sent its backups is operational. You request the restore from this server.
589
The backups included the directory where the operating system and its registry resided. If the backups excluded any files that resided in the directory, you may not be able to restore the system to be identical to the previous configuration. Defective hardware has been replaced.
Windows system software to reinstall on the NetBackup client that is being restored. Reinstall the same type and version of software that was previously used. NetBackup client software to install on the client that is being restored. Special drivers or other software that is required to make the hardware operational (for example, a special driver for the disk drive). IP address and host name of the NetBackup client. IP address and host name of the NetBackup master server. The partitioning and formatting scheme that was used on the system to be restored. You must duplicate that scheme during Windows installation.
Install a minimal Windows operating system (perform the Express install). During the installation, do the following tasks:
Partition the disk as it was before the failure (if partitioning is necessary). Then, reformat each partition as it was before the failure. Install the operating system in the same partition that was used before the failure. Specify the default workgroup. Do not restore to the domain. Follow any hardware manufacturers instructions that apply.
2 3
Reboot the system when the installation is complete. Configure the NetBackup client system to re-establish network connectivity to the NetBackup master server. For example, if your network uses DNS, the configuration on the client must use the same IP address that was used before the failure. Also, it must specify the same name server (or another name server that recognizes both the NetBackup client and master server). On the client, configure DNS in the Network dialog, accessible from the Windows Control Panel.
590
Refer to the NetBackup Installation Guide for Windows for instructions. Ensure that you specify the correct names for the client server and master server.
To specify the client name, start the Backup, Archive, and Restore interface on the client and click NetBackup Client Properties on the File menu. Enter the client name on the General tab of the NetBackup Client Properties dialog. To specify the server name, click Specify NetBackup Machines and Policy Type on the File menu.
5 6
Install any NetBackup patches that had previously been installed. Enable debug logging by creating the following debug log directories on the client:
install_path\NetBackup\Logs\tar install_path\NetBackup\Logs\bpinetd
Stop and restart the NetBackup Client service. This action enables NetBackup to start logging to the bpinetd debug log.
Use the NetBackup Backup, Archive, and Restore interface to restore the system files and user files to the client system. For example, if all files are on the C drive, restoring that drive restores the entire system. To restore files, you do not need to be the administrator, but you must have restore privileges. For instructions, refer to the online Help or refer to the following: See the NetBackup Backup, Archive, and Restore Getting Started Guide. NetBackup restores the registry when it restores the Windows system files. For example, if the system files are in the C:\Winnt directory, NetBackup restores the registry when it restores that directory and all its subordinate subdirectories and files.
Check for ERR or WRN messages in the log files that are in the directories you created in step 6. If the logs indicate problems with the restore of Windows system files, resolve those problems before proceeding.
591
10 Stop the NetBackup Client service and verify that bpinetd is no longer
running.
Recovering the entire catalog from an online backup This procedure is the recommended method for recovering the entire catalog and recovers the NetBackup relational database as well as NetBackup policy files, backup image files, and configuration files. Recovering the catalog image file This method recovers only the NetBackup policy files, backup image files, and configuration files. Use this method if the NetBackup relational database is valid but NetBackup policy, backup image, or configuration files are lost. The
592
NetBackup relational database can also be recovered separately using the bprecover -nbdb command.
Note: During the catalog recovery process, services may be shut down and restarted. If NetBackup is configured as a highly available application (cluster or global cluster), freeze the cluster before starting the recovery process to prevent a failover. Then unfreeze the cluster after the recovery process is complete.
593
Your configuration may include an Enterprise Media Manager (EMM) server that is separate from the master server. If so, start NetBackup on the EMM server before starting NetBackup on the master server. Start NetBackup by entering the following: On UNIX and Linux:
/usr/openv/netbackup/bin/bp.start_all
On Windows:
install_path\NetBackup\bin\bpup
3 4 5
If the necessary devices are not already configured, configure them in NetBackup. Make available to NetBackup the media that contains the catalog backup. Click Recover the Catalogs on the NetBackup Administration Console to start the Catalog Recovery Wizard.
594
Click Next on the Welcome screen to display the Catalog Disaster Recovery File screen.
This wizard relies on the disaster recovery information that is generated during the online catalog backup. Part of the online catalog backup configuration that is included indicates where the disaster recovery information file was to be stored and-or sent. In most cases, you specify the most recent disaster recovery information file available, unless some form of corruption has occurred and you want to restore to an earlier state of the catalog. If the most recent catalog backup was an incremental, use the disaster recovery file from the incremental backup. (There is no need to first restore the full backup that is followed by the incremental.) Indicate where the disaster recovery file is stored by entering the fully qualified path to the disaster recovery file. More information is available on the email that is sent and the attached disaster recovery file. See Recovering the catalog without the disaster recovery file on page 615.
595
The wizard waits while NetBackup searches for the necessary media sources, then informs you if the necessary backup ID of the disaster recovery image is located.
Or, if the media is not located, the wizard lists which media is needed to update the database. Follow the wizard instructions to insert the media that is indicated and run an inventory to update the NetBackup database. The information that is displayed on this screen depends on whether the recovery is from a full backup or an incremental backup. If an online catalog backup policy includes both full and incremental backups, the disaster recovery email may indicate either a full or an incremental backup for recovery. An incremental backup recovery completely recovers the entire catalog because it references information from the last full backup. You dont need to first recover the last full catalog backup, then subsequent incremental backups.
596
When the required media sources are all found, click Next to display the Disaster Recovery Method screen. The Recover entire NetBackup catalog radio option is selected.
With the Recover entire NetBackup catalog radio option selected, click Next to initiate the recovery of the entire NetBackup catalog. NetBackup restores the entire NetBackup relational database, which includes the following:
NBDB database (including the EMM database) BMR database (if applicable) NetBackup policy files backup image files other configuration files
If the EMM server is located on a remote computer, the NBDB database is recovered on the remote computer.
597
If the recovery is not successful, consult the log file messages for an indication of the problem.
11 The final screen announces that the full recovery is complete. Each image
file is restored to the proper image directory, and the NetBackup relational databases (NBDB and optionally BMRDB) have been restored and recovered. If this step is part of a server recovery procedure, complete the remaining steps in the appropriate Server Disk Recovery procedure.
12 NetBackup does not run scheduled backup jobs until NetBackup is stopped
and restarted. Before you restart NetBackup, protect the media that contains any backups that were successfully performed after the catalog backup that was used to recover the catalog. This recovery can include the following:
importing the backups from the backup media into the catalog write protecting the media ejecting the media and setting it aside freezing the media
598
13 You can manually submit backup jobs before you stop and restart NetBackup.
Be aware that if you have not protected the media containing the backups done after the catalog backup, the media may be overwritten.
On Windows:
install_path\NetBackup\bin\bpdown install_path\NetBackup\bin\bpup
If a remote EMM server is being used, start NetBackup on it before you start NetBackup on the master server. If you have recovered from removable media, that media is now frozen. To unfreeze, see the following: See Unfreezing online catalog recovery media on page 624.
Note: During the catalog recovery process, services may be shut down and restarted. If NetBackup is configured as a highly available application (cluster or global cluster), freeze the cluster before starting the recovery process to prevent a failover. Then unfreeze the cluster after the recovery process is complete. The steps are the same as those in the following topic: See Recovering the entire catalog using the Catalog Recovery Wizard on page 592.
599
Start NetBackup by entering the following: If your configuration includes an Enterprise Media Manager (EMM) server that is separate from the master server, start NetBackup on the EMM server before starting NetBackup on the master server. On UNIX and Linux:
/usr/openv/netbackup/bin/bp.start_all
On Windows:
install_path\NetBackup\bin\bpup
Enter the fully qualified pathname to the Backup ID file. For example:
C:\DR_INFO\HotCatBack_1120078077_FULL
600
The image file is restored to the proper image directory and the NetBackup relational databases (NBDB and optionally BMRDB) are restored and recovered.
NetBackup does not run scheduled backup jobs until NetBackup is stopped and restarted. Before you restart NetBackup, protect the media that contains backups that were successfully performed after the catalog backup that was used to recover the catalog. This procedure can include the following tasks:
importing the backups from the backup media into the catalog write protecting the media ejecting the media and setting it aside freezing the media
On Windows:
install_path\NetBackup\bin\bpdown install_path\NetBackup\bin\bpup
If a remote EMM server is being used, start NetBackup on it before you start NetBackup on the master server.
The NetBackup relational database is valid, but NetBackup policy, backup image, or configuration files are lost.
601
You want to restore part of the NetBackup catalog before you restore the entire catalog. This procedure recovers only the catalog images and configuration files.
The catalog backup images contain information about all the data that has been backed up. This information constitutes the largest part of the NetBackup catalog. If the backup images are intact but the NetBackup relational database files are not, you can still recover these files. See Recovering relational database files from an online catalog backup on page 610. The wizard restores whatever catalog images and configuration files are in the backup set identified by the disaster recovery file. If the disaster recovery file is from a full backup, all catalog images and configuration files are restored. For an incremental backup, the wizard restores only catalog images and configuration the files that were changed since the previous backup. However, all catalog backup image files back to the last full catalog backup are automatically included in an incremental catalog backup. This operation allows for the complete restoration of all backup images by the Backup, Archive, and Restore user interface. For a catalog that was backed up using the online method of NetBackup catalog image and configuration files, recovery in either of the following ways:
Use the Catalog Recovery Wizard Use the bprecover -wizard command
During a manual recovery, the wizard recovers only NetBackup policy files, NetBackup backup image files, and other NetBackup configuration files. It does not recover the NBDB (includes EMM) or BMR databases. If the backup from which you recover is an incremental catalog backup and no catalog backup images exist in the catalog, the following occurs: only the NetBackup policy, backup image, and configuration files that are backed up in that incremental backup are restored. However, all of the catalog backup images up to the last full catalog backup are restored. So you can restore the rest of the policy, images, and configuration files from the Backup, Archive and Restore interface. If catalog backup images already exist, all files that were included in the related set of catalog backups are restored. The NBDB (includes EMM) and BMR (if applicable) databases must then be recovered by running the following:
bprecover -r -nbdb
Table 7-1 is a list of the files that you recover in a manual recovery (an asterisk indicates multiple files within that folder).
602
Files you recover when you recover catalog image files Windows
install_path\NetBackup\db\* install_path\NetBackup\vault\sessions\* install_path\NetBackup\var\*
install_path\Volmgr\database\* install_path\Volmgr\vm.conf
The following is a list of NetBackup relational database (SQL Anywhere) files that are not recovered in a manual recovery.
NBDB.db NBDB.log EMM_DATA.db EMM_INDEX.db BMRDB.db BMRDB.log BMR_DATA.db BMR_INDEX.db vxdbms.conf DARS_DATA.db DARS_INDEX.db DBM_DATA.db DBM_INDEX.db
You can recover these files. See Recovering relational database files from an online catalog backup on page 610.
Recovering the catalog image files using the Catalog Recovery Wizard
You must have root (administrative) privileges to perform this procedure.
603
Note: The Catalog Recovery wizard does not work after performing a change server operation. You must be logged on locally to the master server that is being recovered. To recover the catalog image files
Start NetBackup by entering the following: If your configuration includes an EMM server that is separate from the master server, do the following: start NetBackup on the EMM server before starting NetBackup on the master server. On UNIX and Linux:
/usr/openv/netbackup/bin/bp.start_all
On Windows:
install_path\NetBackup\bin\bpup
Click Recover the Catalogs in the NetBackup Administration Console to start the Catalog Recovery Wizard. Warning: Do not run any client backups before you recover the NetBackup catalog.
604
This wizard relies on the disaster recovery information that was generated during the online, hot catalog backup. Part of configuring the catalog backup included the indication of where the disaster recovery information was to be stored and sent.
Indicate where the disaster recovery file is stored by entering the fully qualified path to the disaster recovery file. For example:
/net/lex/Cat_DR/CatBk_1119304246_INCR
Note: Specify the most recent disaster recovery file available, unless there is a reason to restore from an earlier state. Note whether the disaster recovery file is based on a full (*_FULL) or an incremental (*_INCR) catalog backup. More information is available on the email that is sent and the attached disaster recovery file. See Recovering the catalog without the disaster recovery file on page 615.
605
The wizard waits while NetBackup searches for the necessary media sources, then tells you if the necessary backup ID of the DR image was located. If the media is not located, the wizard lists which media is needed to update the database.
Follow the wizard instructions to insert the indicated media and run an inventory to update the NetBackup database.
Click Next to display the Disaster Recovery Method dialog. Select the Recover only NetBackup catalog image and configuration files radio option and click Next.
606
The wizard displays the recovery progress and announces when the catalog has been recovered.
If the recovery is not successful, consult the log file messages for an indication of the problem.
607
The final screen indicates that the catalog backup images have been recovered.
NetBackup does not run scheduled backup jobs until NetBackup is stopped and restarted. Before you restart NetBackup, protect the media that contains backups that were successfully performed after the catalog backup that was used to recover the catalog. Media protection may include the following tasks:
importing the backups from the backup media into the catalog write protecting the media ejecting the media and setting it aside
608
Stop and restart NetBackup on all the servers. On UNIX and Linux:
/usr/openv/netbackup/bin/bp.kill_all /usr/openv/netbackup/bin/bp.start_all
On Windows:
install_path\NetBackup\bin\bpdown install_path\NetBackup\bin\bpup
If a remote EMM server is being used, start NetBackup on it before you start NetBackup on the master server.
Start NetBackup by entering the following: If your configuration includes an EMM server separate from the master server, start NetBackup on the EMM server before starting NetBackup on the master server. On UNIX and Linux:
/usr/openv/netbackup/bin/bp.start_all
On Windows:
install_path\NetBackup\bin\bpup
609
Because this operation is a partial recovery, any remaining portions of the catalog must be restored using Backup, Archive, and Restore. Please review the following log file for more information
C:\Program Files\VERITAS\NetBackup\Logs\user_ops\ Administrator\logs\Recover1123008613.log
NetBackup does not run scheduled backup jobs until NetBackup is stopped and restarted. Before you restart NetBackup, protect the media that contains backups that were successfully performed after the catalog backup that was used to recover the catalog. This media protection may include the following tasks:
importing the backups from the backup media into the catalog write protecting the media ejecting the media and setting it aside
610
Stop and restart NetBackup on all the servers. On UNIX and Linux:
/usr/openv/netbackup/bin/bp.kill_all /usr/openv/netbackup/bin/bp.start_all
On Windows:
install_path\NetBackup\bin\bpdown install_path\NetBackup\bin\bpup
If a remote EMM server is being used, start NetBackup on it before you start NetBackup on the master server.
611
On Windows:
install_path\NetBackup\bin\bpdown
Move the following set of existing database files from their current location to a temporary directory. NBDB:
nbdb.db, nbdb.log, emm_index.db, emm_data.db
BMRDB:
bmrdb.db, bmrdb.log
Change databases.conf so SQL Anywhere does not try to automatically start them when the server is started. On UNIX and Linux:
/usr/openv/db/bin/nbdb_admin -auto_start NONE
On Windows:
install_path\NetBackup\bin\nbdb_admin -auto_start NONE
Windows:
install_path\NetBackup\bin\bpup -e SQLANYs_VERITAS_NB
612
Windows:
install_path\NetBackup\bin\create_nbdb -drop
If the database has been moved or the environment is clustered, add -staging staging_dir to the end of the create_nbdb command line. If the database has been moved or the environment is clustered, and space constraints or other issues force you to create this temporary database in the final location, use the following command: UNIX and Linux:
/usr/openv/db/bin/create_nbdb -drop -data <data_dir> -index \ <index_dir> -tlog <tlog_dir> -staging <staging_dir>
Windows:
install_path\NetBackup\bin\create_nbdb -drop -data <data_dir> -index <index_dir> -tlog <tlog_dir> -staging <staging_dir>
Where the <data_dir>, <index_dir>, <tlog_dir>, and <staging_dir> values are defined in the vxdbms.conf file as VXDBMS_NB_DATA, VXDBMS_NB_INDEX, VXDBMS_NB_TLOG, and VXDBMS_NB_STAGING.
Windows:
install_path\NetBackup\bin\bpdown install_path\NetBackup\bin\bpup
613
Windows:
install_path\Volmgr\bin\tpext
If you used the nbdb_move command to relocate NetBackup database files, re-create the directories where the files were located when you backed up the catalog. The default location is the following: UNIX and Linux:
/usr/openv/db/data
Windows:
install_path\NetBackupDB\data
10 Configure the necessary recovery device in NetBackup. 11 Make available to NetBackup the media that contains the catalog backup.
Inventory the robot or add the media for stand-alone drives.
614
12 For online catalog recovery, run the following command on the master server:
UNIX and Linux:
/usr/openv/netbackup/bin/admincmd/bprecover -r -nbdb
Windows:
install_path\NetBackup\bin\bpdown install_path\NetBackup\bin\bpup
If a remote EMM server is being used, start NetBackup on it before you start NetBackup on the master server.
Follow the normal NetBackup catalog recovery procedures. Ensure that NetBackup Access Management Control is installed but disabled before you run the actual catalog recovery wizard or bprecover command. You must have root privileges to execute the recovery. Turn off the authentication and authorization services (Windows) or daemons (UNIX and Linux).
615
Recover the NetBackup catalog from the online catalog backup using the recovery wizard or bprecover command. Authentication and authorization data is not copied back to the hosts from which it was backed up. Instead, it is copied to a staging area for use in step 4. Run "bprecover -r -vxss -p <policy name>" and supply the name of the online catalog backup policy. This action recovers authentication and authorization data from the staging area to the hosts from which it was backed up. Start up the authentication and authorization services (Windows) or daemons (UNIX and Linux). Configure NetBackup to use NetBackup Access Management Control by setting up the proper Access Control host properties for master server(s), media server(s), and client(s). Restart NetBackup.
5 6
The email identifies the media that contains the Disaster Recovery file, as well as the media that was used to back up critical policies. Ensure that this media is available. Follow the normal catalog recovery steps up until the point where the NetBackup Recovery Wizard or bprecover command is called for.
616
Run the following command to retrieve all Disaster Recovery files from the catalog backup media:
bpimport -drfile media_id -drfile_dest fully_qualified_directory_name
This command recovers all disaster recovery files from the specified media ID and places them in the specified directory. The ID can be either a tape media ID or the fully qualified location of a disk storage unit.
617
Verify that the correct Disaster Recovery file is available in the specified directory and that the directory is available from the NetBackup master server.
618
Continue with the normal catalog recovery procedure by running the NetBackup Recovery Wizard or bprecover command, providing the Disaster Recovery file location when prompted. Refer to the email as your primary source for recovery instructions, because they are the most current instructions for recovering your catalog. The instructions are sent when the catalog backup is completed, or when a catalog backup image is duplicated. Note: If you restore catalog files directly using bprestore on a Solaris system, use the following path: /opt/openv/netbackup/bin/bprestore. The name of the online catalog backup policy is CatalogBackup. The email is written to the following file:
/storage/DR/CatalogBackup_1123605764_FULL.
The file name itself indicates if the backup was full or not. The following is an example of a Disaster Recovery email:
Server ant Date Tue Aug Policy CatalogBackup Catalog Backup Status the requested operation was successfully completed (status 0). To ensure that the NetBackup catalog data is protected through Tue Aug 9 11:41:48 2005, retain a copy of the attached file, and the media or files listed below: Catalog Recovery Media Media Server Disk image path * ant /storage/DiskUnit1/ant_1123605764_C1_TIR * ant /storage/DiskUnit1/ant_1123605764_C1_F1 * ant /storage/DiskUnit1/ant_1123605713_C1_F1 DR file written to /storage/DR/CatalogBackup_1123605764_FULL
9 11:41:48 2005
619
* - Primary Media Catalog Recovery Procedure for the Loss of an Entire Catalog Symantec recommends creating a detailed disaster recovery plan should it become necessary to restore your organization's data in the event of a disaster. A checklist of required tasks can be a tremendous tool in assisting associates in triage. For example, after the facility is safe for data to be restored, the power and data infrastructure need to be verified. When these tasks are completed, the following scenarios will help to quickly restore the NetBackup environment, and in turn, restore applications and data. Disaster Recovery Procedure using the DR Image File In the event of a catastrophic failure, use the following procedure to rebuild the previous NetBackup environment. Note: If new hardware is required, make sure that the devices contain drives capable of reading the media and that the drive controllers are capable of mounting the drives. 1. Install NetBackup. 2. Configure the devices necessary to read the media listed above. 3. Inventory the media. 4. Make sure that the master server can access the attached DR image file. Start the NetBackup Recovery Wizard from the NetBackup Administration Console. Or, start the wizard from a command line by entering bprecover -wizard. Disaster Recovery Procedure without the DR Image File NOTE: ONLY ATTEMPT THIS AS A LAST RESORT If you do not have the attachment included with this email, use the following instructions to recover your catalog: 1. Install NetBackup. 2. Configure the devices necessary to read the media listed above.
620
3. Inventory the media. 4. Run: bpimport -create_db_info [-server name] -id /storage/DiskUnit1 5. Go to the following directory to find the DR image file CatalogBackup_1123605764_FULL: /usr/openv/netbackup/db/images/ant/1123000000/tmp 6. Delete the other files in the directory. 7. Open CatalogBackup_1123605764_FULL file and find the BACKUP_ID (for example: ant_1123605764). 8. Run: bpimport [-server name] -backupid ant_1123605764 9. Run: bprestore -T -w [-L progress_log] -C ant -t 35 -p CatalogBackup -X -s 1123605764 -e 1123605764 / 10. Run the BAR user interface to restore the remaining image database if the DR image is a result of an incremental backup. 11. To recover the NetBackup relational database, run: bprecover -r -nbdb 12. Stop and Start NetBackup 13. Configure the devices if any device has changed since the last backup. 14. To make sure the volume information is updated, inventory the media to update the NetBackup database.
621
Verify the location of the Disaster Recovery files that are created from Full and Incremental Hot Catalog backups. These files can be stored in a specified path of the file system on the master server and in email attachments to the NetBackup administrator. Set up each master server and media server in the same configuration as the configuration used during the last catalog backup. The master server and media servers have the following same properties as the backed up catalog configuration: name, NetBackup version, operating system patch level, and path to storage devices. Configure any devices and volumes you may need for the recovery.
Locate the latest DR image file corresponding to the backup that are used for recovery. Open the file in an editor and find values for the following:
master_server use the exact name that is specified in NetBackup configuration for the Master Server media_server the location of the robot or disk storage unit that is used for catalog backup. timestamp the four most significant digits in the DR filename and six zeroes attached. media the media where the catalog backup that is specified by the DR file is located. Found in the DR file under the FRAGMENT keyword. backup_id found in the DR file under BACKUP_ID.
Create the DR recovery directory on the master server. UNIX and Linux:
/usr/openv/netbackup/db/images/master_server/timestamp/tmp
Windows:
C:\Program Files\VERITAS\NetBackup\db\images\master_server \timestamp\tmp
622
Change the value of IMAGE_TYPE to 1 Change the value of TIR_INFO to 0 Change the value of NUM_DR_MEDIAS to 0 Remove ALL lines containing DR_MEDIA_REC
If your catalog recover media is on tape, run the vmquery command to assign the media to the media server.
vmquery -assigntohost media timestamp master_server
Example:
vmquery -assigntohost DL005L 1122000000 klingon
Run a Phase II import on the media that is specified by the DR file to recover the catalog .f file from the hot catalog backup.
bpimport -server master_server -backupid backup_id
If your catalog backup was an incremental, recover all the other catalog backup images up to and including the most recent Full Catalog backup.
Open the Backup, Archive, and Restore client interface for NetBackup. Select NBU-Catalog as the policy type. Set the source clients and destination clients to your master server. Search the backups and restore all files that are located in the following directory:
install_path/netbackup/db/images/master_server
Verify that all files are restored successfully on the master server.
Restore your critical data by using the Backup, Archive, and Restore client interface or the command line.
Restore the catalog backup images for each media server which requires data recovery. To restore the backup images, select NBU-Catalog as the policy type. Source and destination clients should be your master server. Refresh your view in the BAR GUI. Traverse the file system for the master server to the following:
install_path/netbackup/db/images
623
and restore the images for each configured media server. Verify that your images are present by searching for them in the catalog.
10 Recover backup data from each media server in the previous step. Change
the Policy Type, Source, and Destination client to match the client that is used to back up the desired data. Select the desired files from the Backup, Archive, and Restore client interface and restore them.
This command restores NetBackup media usage information, ensure that media containing backups are not overwritten, and restore the storage unit configuration. You cannot recover the NetBackup relational database to a configuration that is not identical to the configuration on which the catalog was backed up. Instead, you must import each piece of backup media.
12 If your catalog recovery media is on tape, freeze the media that contains the
catalog backup that is used for recovery. This action protects the media from being reused:
bpmedia -freeze -m media -h master_server
13 Recover your policies and configuration data on each master server and media
server. Before recovering NetBackup policy files, ensure that you have recovered all of your critical data, or protected the media that contains your critical data. When policy information is recovered, NetBackup starts to run scheduled jobs that may overwrite media that was written after the last catalog backup. Open the Backup, Archive, and Restore client interface for NetBackup and select NBU-Catalog as the policy type. For each server to be restored, set the source clients and destination clients to your server, starting with the master server. Restore all files that are backed up by the hot catalog backup on each server.
624
interface. Restoring catalog files directly to their original location may cause inconsistencies in the NetBackup catalog or cause NetBackup to fail. Instead, you should restore catalog files to an alternate location.
Select the "NBU-Catalog" policy type (from the Specify NetBackup Machines and Policy Type menu). Specify the Master Server as the source client for the restore.
On the master server, go to the image database. In the master server's portion of the image catalog, locate the catalog backup image file from which the recovery was done.
Identify the associated catalog backup parent image file by viewing the PARENT_IMAGE_ID value. Identify the media that the catalog backup was written to by viewing the second to last field in the DR_MEDIA_REC line(s). Save the catalog backup parent image file identified in step a. Relocate or remove all other image files that relates to the catalog backup policy.
If the NetBackup configuration includes a remote EMM server, on the master server, go to the image database for the remote EMM server. Relocate or remove any images that relates to the catalog backup policy. On the master server, for each media that is identified in step 1b, run the following command:
bpimport -create_db_info -server server_name -id media_id
On the master server, for each media that is identified in step 1b, run the following command:
bpmedia -unfreeze -m media_id -h server_name
Disaster recovery Clustered NBU server recovery for UNIX and Linux
625
1 2 3 4 5 6
Configure the hardware, system software, and cluster environment on the replacement node. Verify that the device configuration matches that of the surviving nodes. Ensure that the NetBackup Resource group is offline on all nodes before installing NetBackup on the replacement node. Ensure that the NetBackup shared disks are not mounted on the node on which NetBackup is to be installed. Freeze the NetBackup service. Reinstall NetBackup on the new or replacement node. Be sure to use the NetBackup Virtual Name as the name of the NetBackup server. Follow the instructions for installing the NetBackup Server software. Refer to the NetBackup Installation Guide.
7 8
Install any maintenance packs and patches required to bring the newly installed node to the same patch level as the other cluster nodes. Bring the NetBackup Resource group online on a node other than the freshly installed node.
626
Disaster recovery Clustered NBU server recovery for UNIX and Linux
Log onto the node on which the NetBackup resource group is online and run the following command:
/usr/openv/netbackup/bin/cluster/cluster_config -s nbu -o add_node -n node_name
10 Switch the NetBackup resource group to the replacement node. 11 Freeze the NetBackup group. 12 Ensure that the appropriate low-level tape device and robotic control device
configuration necessary for your operating system has been performed. Information is available for the your operating system. Refer to the NetBackup Device Configuration Guide.
13 Run the Device Configuration wizard to configure the devices. You do not
have to rerun the device configuration on the pre-existing nodes. Configuration information on your particular cluster is available. Refer to the NetBackup High Availability Guide.
14 Check that the robot numbers and robot drive numbers for each robot are
consistent across all nodes of the cluster. Repeat for any other servers connected to that robot, and correct if necessary. Refer to the "Media" section of the NetBackup Administrators Guide , Volume 1.
15 Test the ability of NetBackup to perform restores using the configured devices
on the replacement node.
The shared storage hardware is restored to a working state, so that the shared disk resource can be brought online with an empty shared directory. Valid online catalog backups exist.
Disaster recovery Clustered NBU server recovery for UNIX and Linux
627
1 2
Clear the faulted NetBackup resource group, disable monitoring and bring the shared disk and virtual name resources up on a functioning node. Manually create the following directories on the shared disk: <shared disk path>/netbackup/db <shared disk path>/db/data <shared disk path>/var/global <shared disk path>/volmgr/misc/robotic_db
If this is an emm server, enter the following to bring up the database server and emm, then run tpext to initialize the emm db:
# SHARED_DISK=<top-level shared disk mount point> # dataDir=${SHARED_DISK}/db/data # /usr/openv/netbackup/bin/nbdbms_start_stop start /usr/openv/db/bin/create_nbdb \ -data ${dataDir} \ -index ${dataDir} \ -tlog ${dataDir} \ -mlog ${dataDir} \ -staging ${dataDir}/staging \ -drop # /usr/openv/volmgr/bin/tpext -loadEMM
Configure required devices and media and recover the NetBackup catalogs. See Recovering the master server when root is intact on page 576.
5 6 7
Manually shut down and restart NetBackup on the active node. Re-enable monitoring of the NetBackup resource group. Verify that the NetBackup server can now be brought online on all configured nodes.
628
1 2
Configure the hardware, system software and cluster environment on the replacement cluster. Ensure that the appropriate low-level tape device and robotic control device configuration necessary for your operating system has been performed. Refer to the NetBackup Device Configuration Guide.
Reinstall NetBackup on the each of the cluster nodes. Be sure to use the NetBackup Virtual Name as the name of the NetBackup server. Follow the instructions for installing NetBackup Server software. Refer to the NetBackup Installation Guide.
Configure the clustered NetBackup server. Refer to the NetBackup High Availability Guide.
Install any maintenance packs and patches required to bring the newly installed NetBackup server to the same patch level as the server that is being replaced. Configure required devices and media and recover the NetBackup catalogs. See Recovering the master server when root is intact on page 576.
Bring the NetBackup resource group on each node in turn and run the Device configuration wizard to configure the devices. Configuration information on your particular cluster is available. Refer to the NetBackup High Availability Guide.
629
The hardware, system software and cluster environment on the replacement node have been configured. The reconfigured or replacement node has been made a member of the cluster and has the same name as the failed node.
The following procedure applies when the shared disk and at least one configured cluster node remain available. To replace a failed node on a Windows cluster using VCS
1 2 3
Freeze the NetBackup service. Ensure that the NetBackup shared disks are not mounted on the node on which NetBackup is to be installed. Reinstall NetBackup on the new or replacement node. Be sure to use the NetBackup Virtual Name as the name of the NetBackup server. Follow the instructions for installing the NetBackup Server software. Refer to the NetBackup Installation Guide.
4 5 6
Ensure that the node is a member of an existing cluster and that it performs the necessary configuration automatically. Install any maintenance packs and patches required to bring the newly installed node to the same patch level as the other cluster nodes. Unfreeze the NetBackup service and verify that it can be brought up on the replacement node.
The shared storage hardware is restored to a working state, so that the shared disk resource can be brought online with an empty shared directory. Valid online catalog backups exist.
630
1 2 3
Clear the faulted NetBackup resource group, disable monitoring, and bring the shared disk and virtual name resources up on a functioning node. Ensure that all NetBackup shared disks are assigned the same drive letters that were used when NetBackup was originally installed and configured. To reconfigure NetBackup for the cluster, initialize the database by running the following commands in sequence on the active node:
bpclusterutil -ci tpext bpclusterutil -online
Use the appropriate NetBackup catalog recovery procedure to restore the NetBackup catalog information on the shared disk. See Recovering the master server and Windows on page 585.
If the clustered NetBackup server is a media server, verify that the restored vm.conf file contains the correct host-specific MM_SERVER_NAME configuration entry for the active node. If MM_SERVER_NAME is different from the local host name, edit the file and change the server name to the local host name: MM_SERVER_NAME=<local host name>
Use NetBackup to restore any data on the shared disks. Details are available on how to perform a restore. Refer to the NetBackup Backup, Archive, and Restore Getting Started Guide.
7 8 9
Configure required devices and media and recover the NetBackup catalogs. Manually shut down and restart NetBackup on the active node. Re-enable monitoring of the NetBackup resource group. nodes.
10 Verify that the NetBackup server can now be brought online on all configured
631
1 2
Configure the hardware, system software and cluster environment on the replacement cluster. Ensure that the appropriate low-level tape device and robotic control device configuration necessary for your operating system has been performed. Refer to the NetBackup Device Configuration Guide.
Reinstall NetBackup on the each of the cluster nodes. Be sure to use the NetBackup Virtual Name as the name of the NetBackup server. Follow the instructions for installing NetBackup Server software. Refer to the NetBackup Installation Guide.
Configure the clustered NetBackup server. Refer to the NetBackup High Availability Guide.
Install any maintenance packs and patches required to bring the newly installed NetBackup server to the same patch level as the server that is being replaced. Configure required devices and media and recover the NetBackup catalogs. See Recovering the master server and Windows on page 585.
Bring the NetBackup resource group on each node in turn and run the Device configuration wizard to configure the devices. Configuration information on your cluster (MSCS or VCS) is available. Refer to the NetBackup High Availability Guide.
632
Appendix
About backup and restore functional overview Backup and restore startup process Backup and archive processes Backups and archives - UNIX clients Restore processes NetBackup directories and files NetBackup programs and daemons NetBackup catalogs
634
Backup and restore functional overview Backup and restore startup process
Job scheduling
The scheduler process bpsched consists of the following services:
The nbpem service (Policy Execution Manager) does the following: creates policy/client tasks and determines when jobs are due to run. It starts the job and upon job completion, determines when the next job should run for the policy-client combination.
Backup and restore functional overview Backups and archives - UNIX clients
635
The nbjm service (Job Manager) accepts requests from nbpem to run backup jobs, or to run media jobs from commands such as bplabel and tpreq. nbjm acquires resources for each job, such as storage unit, drives, media, and client and policy resources, and executes the job. The nbrb service (Resource Broker) allocates resources in response to requests from nbjm. nbrb acquires physical resources from nbemm (the Enterprise Media Manager service). It also manages logical resources such as multiplex groups, maximum jobs per client, and maximum jobs per policy. nbrb is also responsible for initiating drive unloads and manages pending request queues.
Scheduled backups begin when the nbpem service detects that a job is due. nbpem checks the policy configurations for the scheduled client backups that are due. Immediate manual backups begin if the administrator chooses this option in the NetBackup Administration Console or runs the bpbackup command with the i option. This action causes bprd to contact nbpem, which then processes the policy, client, and schedule that are selected by the administrator. User-directed backups or archives begin when a user on a client starts a backup or archive through the user interface on the client. The user can also enter
636
Backup and restore functional overview Backups and archives - UNIX clients
the bpbackup or bparchive commands on the command line. This action invokes the clients bpbackup or bparchive program, which sends a request to the request daemon bprd on the master server. When bprd receives the user request, it contacts nbpem, which checks the policy configurations for schedules. By default nbpem chooses the first user-directed schedule that it finds in a policy that includes the requesting client. For user-directed backups or archives, it is also possible to specify a policy and schedule. A description is available of the UNIX BPBACKUP_POLICY and BPBACKUP_SCHED options in bp.conf and the Windows equivalents. See the NetBackup Administrators Guide, Volume I I.
Backup process
This topic uses a diagram and a table to describe each step of a backup process. PBX (not shown in the diagram) must be running for NetBackup to operate. See Resolving PBX problems on page 67. Figure A-1 illustrates the various operations that comprise the backup process.
Backup and restore functional overview Backups and archives - UNIX clients
637
Figure A-1
Master server
Configuration Database File Database NetBackup Policy Management bprd EMM Database
UNIX client
Command line
bpdbm
nbproxy
nbproxy
Mo
t un
re
t es qu
nbemm
Ca
nbrb
nbjm
tal og
nbrmms
Inf
bpcd
ag e
up
Ba ck
bpbrm
Shar e mem d ory
Cat
bpcd
alo g In fo
Im
Im
ag e
bptm (child)**
Tape Mount
ltid*
Notes: * For details on these components, see the Media and Device Management Functional Description later in this chapter. Itid is for tape backup only . ** If the media server is backing up itself (server and client on same host), there is no bptm child: bpbkar sends the data directly to shared memory.
Client disk
Table A-1 shows the sequence of operation of a backup process. Table A-1 Agent
Start-up script
638
Backup and restore functional overview Backups and archives - UNIX clients
Issues a single request (with a request ID) to nbrb, for all resources that are required by a job. nbrb gets the storage unit, tape drive, and media id information from nbemm and allocates client and policy resources. nbrb returns to nbjm an allocation sequence that contains one allocation for each resource (each allocation contains a unique ID). nbrb also returns allocation data for the specific resource type. nbrb also returns the request ID along with the allocations so that nbjm can correlate the response with the right request (and job). Note that nbrb allocates all resources that are included in a request. If the resources are temporarily unavailable the request is queued in nbrb. If the resource cannot be allocated, nbrb fails the request. nbjm starts the backup by using the client daemon bpcd to start the backup and restore manager bpbrm. For normal backup (not snapshots), nbjm starts bpbrm on the media server, which may or may not be the same system as the master server.
Starts bptm. Starts the actual backup (or archive) by using the client daemon bpcd to start the backup program and archive program bpbkar on the client.
Backup and restore functional overview Backups and archives - UNIX clients
639
640
Backup and restore functional overview Backups and archives - UNIX clients
Receives the completion status of the job from bpbrm. Releases the resources to nbrb and returns the status to nbpem.
Backup and restore functional overview Backups and archives - UNIX clients
641
Figure A-2 shows multiplexed images from two clients. Figure A-2 Multiplexed backups example (two streams)
NetBackup server
Only on maser server bpdbm bprm (parent)
UNIX client
bpcd bprm (child)
File in
File info
fo
bpbkar See Figure A-1 for process details leading to nbemm. nbemm Mount request nbrmms bptm (parent)
Ba ck
bptm (child)**
S m har em ed or y
k Bac
up I
mag
le Fi inf o
Client disk
UNIX client
ag e
bprm (child)
up
bpcd
up
Ba ck
File
bptm (child)**
Tape request
Im
info
bpbkar
Backup Image
Tape Mount
ltid*
Notes: * For details on these components, see the Media and Device Management Functional Description later in this chapter. . ** If the server is backing up itself (server and client on same host), there is no bptm child: bpbkar sends the data directly to shared memory.
Client disk
642
Backup and restore functional overview Backups and archives - UNIX clients
Figure A-3
Snapshot backup and Windows open file backup using multiple data streams
Master server
Configuration Database File Database Backup Policy Management bprd EMM Database
UNIX client
NetBackup user interface or command line
bpdbm
nbproxy
Mo
t un
st ue eq
nbemm
Ca
nbrb
nbjm
bpcd
tal og
nbrmms
Inf
bpbrm bpcd
Client disk
bpcd bpbrm
Sha r mem ed ory
ag e
ag e
up
Ba ck
Ba
ck
Cat alog
Tape request
Im
up
Im
bptm (child)**
Info
bpbkar
Backup Image
Tape Mount
ltid*
Notes: * For details on these components, see the Media and Device Management Functional Description later in this chapter. . ** If the media server is backing up itself (server and client on same host), there is no bptm child: bpbkar sends the data directly to shared memory.
A separate parent job creates all snapshots followed by a child job that backs up the snapshot. An exception is when Windows opens file backups that do not use multiple data streams. The following sequence of operation is for snapshot creation and backup that includes Windows open file backups that employ multiple data streams:
The NetBackup master server or primary client initiates the backup. This action causes the NetBackup request daemon bprd to submit a backup request
Backup and restore functional overview Backups and archives - UNIX clients
643
to the Policy Execution Manager nbpem. nbpem processes the policy configurations.
nbpem (through nbjm) starts a parent job to create the snapshot. This job is
nbjm starts an instance of bpbrm through bpcd on the media server, and bpbrm
When bpfis is finished, it sends snapshot information and completion status to bpbrm and exits. bpbrm, in turn, reports the snapshot information and status to nbjm and exits. nbjm relays the information and status to nbpem.
nbpem submits a child job for the backup to nbjm, with a file list derived from
bpbrm starts bpbkar on the client. bpbkar sends the file catalog information
to bpbrm, which relays it to the NetBackup file database bpdbm on the master server.
The next step depends on whether the media server backs up itself (bptm and bpbkar are on the same host) or the media server backs up a client that resides on a different host. If the media server backs up itself, bpbkar stores the snapshot-based image block by block in shared memory on the media server. If the media server backs up a client that resides on a different host, bptm on the server creates a child process of itself. The child receives the snapshot-based image from the client by means of socket communications and then stores the image block-by-block in shared memory. The original bptm process then takes the backup image from shared memory and sends it to the storage device (disk or tape). Information is available on how the tape request is issued. See Media and device management process on page 687.
bptm sends backup completion status to bpbrm, which passes it to nbjm.
When nbpem receives backup completion status from nbjm, nbpem tells nbjm to delete the snapshot. nbjm starts a new instance of bpbrm on the media server, and bpbrm starts a new instance of bpfis on the client. bpfis deletes the snapshot on the client, unless the snapshot is of the Instant Recovery type, in which case it is not automatically deleted. bpfis and bpbrm report their status and exit. For more information on snapshot backups involving Snapshot Client, refer to the following:
644
Backup and restore functional overview Backups and archives - UNIX clients
See the NetBackup Snapshot Client Administrators Guide. Note that Windows open file backups do not require Snapshot Client.
SAN client
For backups to disk, the SAN Client feature provides high speed data movement between NetBackup media servers and NetBackup SAN-attached clients. SAN-attached clients send backup data to the media server by means of fibre channel connections. As part of SAN Client, the FT Service Manager (FSM) is a domain layer service that resides on the EMM server. The FSM provides discovery, configuration, and event monitoring of SAN Client resources. The FSM collects fibre channel information from the client and from the media server; FSM then populates the EMM database with the information. (FSM runs in the same process as EMM.) FSM interacts with the nbftclnt process on NetBackup clients and with the nbftsrvr process on media servers. The initial stages of a backup are the same as shown in Figure A-1 Figure A-4 shows the server and client components that are unique to SAN client backup over Fibre Channel.
Backup and restore functional overview Backups and archives - UNIX clients
645
Figure A-4
nbjm
bpcd
Bptm (parent)
Backup Im age
bpbrm
catalog Info
bpbkar
Shared Memory
Shared Memory
Client disk nbftsrvr Backup Image sent over Fiber Channel nbftclnt
Storage Disk
The process flow for a SAN Client backup is as follows (in the order presented):
A start-up script launches bprd on the master server and ltid on the master server and all media servers. All other daemons and programs are started as necessary including nbpem, nbjm, nbrb, and nbemm. The policy execution manager service (nbpem) does the following:
Gets the policy list from bpdbm. Builds a work list of all scheduled jobs. Computes the due time for each job. Sorts the work list in order of due time. Submits to nbjm all jobs that are currently due. Sets a wakeup timer for the next due job.
646
Backup and restore functional overview Backups and archives - UNIX clients
When the job finishes, re-computes the due time of the next job and submits to nbjm all jobs that are currently due.
The job manager service (nbjm) requests backup resources from the resource broker (nbrb). nbrb returns information on the use of shared memory for SAN Client. nbjm starts the backup by means of the client daemon bpcd, which starts the backup and restore manager bpbrm. bpbrm starts bptm. bptm does the following:
Requests SAN Client information from nbjm. Sends a backup request to the FT server process (nbftsrvr). Sends a backup request to the FT Client process on the client (nbftclnt). nbftclnt opens a fibre channel connection to nbftsrvr on the media server, allocates shared memory, and writes shared memory information to the backup ID file.
Reads the shared memory information from the BID file (waits for the file to exist and become valid). Sends the information about files in the image to bpbrm. Writes the file data to tar, optionally compresses it, and writes the data to the shared buffer. When the buffer is full or the job is done, sets buffer flag.
The FT Client process nbftclnt waits for the shared memory buffer flag to be set. nbftclnt then transfers the image data to the FT Server (nbftsrvr) shared memory buffer, and clears the buffer flag. nbftsrvr waits for data from nbftclnt; the data is written to the shared memory buffer. When the transfer completes, nbftsrvr sets the buffer flag. bptm waits for the shared memory buffer flag to be set, writes data from the buffer to the storage device, and clears the buffer flag. At the end of the job:
bpbkar informs bpbrm and bptm that the job is complete. bptm sends bpbrm the final status of the data write. bptm directs nbftclnt to close the fibre channel connection. nbftclnt closes the fibre channel connection and deletes the BID file.
Backup and restore functional overview Backups and archives - UNIX clients
647
NBWIN is the user interface program on the client. The bpbackup function and
The server processes are the same as described for UNIX. Figure A-5 Backup and archive - Windows clients
Server For details on the server processes, see Backups and Archives - UNIX Clients earlier in this chapter.
bprd
request
NBWIN
bpbrm
File Inf
BPINETD
bptm
Backup Image
om
atio
BPCD
n
BPBKAR32
Client Disk
648
Backup and restore functional overview Backups and archives - UNIX clients
Raw partition backups are not supported. NetBackup for NetWare does not support archiving.
Figure A-6 shows the NetWare client processes. In this figure, the following item applies:
For NetWare nontarget operations, the Windows-based user interface program is called NBNWNT. For NetWare target operations, the user interface program is called BP.NLM on the Netware console. The bpbackup, bparchive, and bplist functions are merged into the user interface programs on the clients. The NetBackup NetWare client daemon is called BPCD. The bpbkar functions are merged into BPCD.
Backup and restore functional overview Backups and archives - UNIX clients
649
Figure A-6
Server
For details on the server processes, see Backups and Archives UNIX Clients earlier in this chapter.
NetWare client
bprd
Request
bpbrm
ge Ima
File information
bpcd
up ack
Synthetic backups
The typical NetBackup backup process accesses the client to create a backup. A synthetic backup is a backup image created without using the client. Instead, a synthetic backup process creates a full or a cumulative incremental image by using only previously created backup images, called component images. Note: Synthetic archives do not exist. For example, an existing full image and subsequent differential incremental images may be synthesized to create a new full image. The previous full image and the incrementals are the component images. The new synthetic full image behaves like a backup that is created through the traditional process. The new
650
Backup and restore functional overview Backups and archives - UNIX clients
synthetic full image is a backup of the client that is as current as the last incremental. The synthetic image is created by copying the most current version of each file from the most recent component image that contain the file. A synthetic backup must be created in a policy with the True Image Restore with Move Detection option selected. This option enables the synthetic backup to exclude the files that have been deleted from the client file system from appearing in the synthetic backup. Like a traditional backup, nbpem typically initiates a synthetic backup. nbpem submits a request to nbjm to start the synthetic backup job. nbjm starts bpsynth. bpsynth executes on the master server. It controls the creation of the synthetic backup image and the reading of the files that are needed from the component images. If directory bpsynth exists in the debug log directory, additional debug log messages are written to a log file in that directory.
bpsynth makes a synthetic image in several phases:
nbjm Request to make synthetic backup bpsynth Extents and media needed to form the synthetic backup bpdbm Catalog
Backup and restore functional overview Backups and archives - UNIX clients
651
bpsynth
data flow
Note that bpsynth only starts the parent bptm (writer) and bpdm (reader) process on the media server. The parent in turn starts a child process. The parent and child communicate by means of buffers in shared memory. The bpsynth process sends the extents (starting block and count) for each component image to the corresponding child bptm or bpdm reader process.
652
Backup and restore functional overview Backups and archives - UNIX clients
The parent bptm or bpdm reader process reads the data from the appropriate media into the shared buffers. The child bptm or bpdm reader process sends the data in the shared buffers to the child bptm writer process over a socket. The child bptm writer process writes the data into the shared buffers. The parent bptm writer process copies the data from the shared buffers to the media and notifies bpsynth when the synthetic image is complete.
That True Image Restore (TIR) with move detection be selected for each component image. That the component images are synthetic images.
Backup and restore functional overview Backups and archives - UNIX clients
653
Figure A-9
Master Server
nbpem 2 nbjm
bpdbm
/usr/openv/db/s taging
bprd
See Backup to tape or disk on page591. Note: the master server backs up itself.
See Backup to tape or disk on page591. Note: the master server backs up the EMM server.
A hot catalog backup consists of the following jobs that run on the master server:
A parent job that is started manually by the administrator or by a catalog backup policy schedule. A child job that backs up the NetBackup relational database files. A child job that copies the NetBackup database files on pre-6.0 media servers, if any. A child job that backs up the NetBackup database files (all files in /usr/openv/netbackup/db).
654
nbpem submits a parent job to nbjm; nbjm sends a request to bpdbm. bpdbm handles the backup of the relational database files, in two steps:
The SQL Anywhere files database agent makes an online copy of the relational database files to /usr/openv/db/staging. See the Disaster Recovery chapter for a list of the relational database files. After the files are in the staging area, the SQL Anywhere database agent backs them up in the same manner as is used for an ordinary backup.
NetBackup backs up the database files that are in /usr/openv/netbackup/db and important NetBackup files to the master server. NetBackup creates the disaster recovery file, and emails it to the administrator if the email option was selected in the policy.
Note: If the EMM server is on its own host (separate from the master server), consult this log on the EMM server: /usr/openv/netbackup/logs/admin (UNIX), or install_path\NetBackup\logs\admin (Windows). For messages pertaining only to the relational database files, see the progress log file in the following directory:
Restore processes
NetBackup restore operations, like backups, can vary according to client type. The following explains the variations.
655
The request daemon, in turn, queries bpdbm for the information and transmits it to bplist on the client. Figure A-10 List operation - UNIX and Linux client
UNIX Client
Command line
query
bpdbm
File list
bprd
File list
bplist
Refer to one of the following topics as you read through the restore process. See Figure A-11 on page 657. See Figure A-12 on page 658. The following are the processing steps in a restore (in the order presented):
When the user starts a restore, NetBackup invokes the clients bprestore program which sends a request to the request daemon, bprd. This request identifies the files and client. The request daemon then uses bpcd (client daemon) to start the backup and restore manager (bpbrm). Note: To restore Backup Exec images, bpbrm initiates mtfrd instead of tar on the clients. The server processes are the same as those used for NetBackup restores. If the disk device or tape device on which the data resides attaches to the master server, the following occurs: bprd starts the backup and restore manager on the master server. If the disk unit or tape unit connects to a media server, bprd starts the backup and restore manager on the media server. The backup and restore manager starts bptm and uses the client daemon (bpcd) to establish a connection between the NetBackup tar program on the client and bptm on the server.
656
The bptm process identifies which media (disk or tape) is needed for the restore, based on the image catalog. bptm then requests the allocation of the required media from nbrb through nbjm. nbjm then asks mds (part of nbemm)for the resources. nbemm allocates the media and selects and allocates an appropriate drive (for tape media). For tape: bptm asks ltid to mount the tape in the drive. For disk: (such as SharedDisk, AdvancedDisk, or OpenStorage), nbrb tells nbemm to issue the mount by means of nbrmms, after nbemm allocates the resources. For restore from non-shared disk (BasicDisk, PureDisk, NearStore, SnapVault), bptm does not need to ask nbrb for an allocation, because disk inherently supports concurrent access. bptm uses the file path in a read request to the system disk manager.
When the allocation is granted to it, bptm starts retrieving data. bptm stores the image block-by-block in shared memory.
bptm directs the image to the client in one of two ways. If the server restores
itself (server and client are on the same host), tar reads the data directly from shared memory. If the server restores a client that resides on a different host, it creates a child bptm process which transmits the data to tar on the client. Note: Only the part of the image that is required to satisfy the restore request is sent to the client, not necessarily the entire backup image. The NetBackup tar program writes the data on the client disk.
PBX must be running for NetBackup to operate (PBX is not shown in the next diagram). See Resolving PBX problems on page 67. Figure A-11 shows how to restore from tape in the UNIX and Linux environments:
657
Figure A-11
Master server
UNIX client
Command line
nbemm
nbrb
nbjm
bpcd
bpbrm
ck
bpcd
up
bptm (child)**
Backup Image
NetBackup tar
Tape
Mount
Notes: * For details on this component, see the Media and Device Management Functional Description later in this chapter. Itid is for tape backup only . ** If the media server is restoring its own data (server and client on same host), there is no bptm child: tar reads the data directly from shared memory.
Client disk
Figure A-12 shows how to restore from disk in the UNIX and Linux environments:
658
Figure A-12
EMM Database
Command line
nbemm
nbrb
nbjm
bprd
bprestore
ag e
up
Im
Disk volume
Note: * If the server is restoring its own data (server and client on same host), there is no . bptm child: tar reads the data directly from shared memory.
Ba ck
bpcd
bpbrm
Shar e mem d ory
bpcd
NetBackup tar
Client disk
659
Figure A-13
Master server
UNIX client
Command line
nbjm
bprd
bprestore
Shared memory
bptm child
nbftclnt
Client disk
The process flow for a SAN Client restore is as follows (in the order presented).
When the user starts a restore, NetBackup invokes the clients bprestore program which sends a request to the request daemon, bprd. This request identifies the files and client. The request daemon then uses bpcd (client daemon) to start the backup and restore manager (bpbrm). Note: To restore Backup Exec images, bpbrm invoke mtfrd instead of tar on the clients. The server processes are the same as those used for NetBackup restores.
660
If the disk device or tape device on which the data resides attaches to the master server, then bprd starts the backup and restore manager on the master server. If the disk unit or tape unit connects to a media server, bprd starts the backup and restore manager on the media server. bpbrm starts bptm and provides bptm with the backup ID and the shmfat (shared memory) flag. bptm does the following:
Requests SAN Client information from nbjm. Sends a restore request to the FT server process (nbftsrvr). Sends a restore request to the FT Client process on the client (nbftclnt). nbftclnt opens a fibre channel connection to nbftsrvr on the media server, allocates shared memory, and writes shared memory information to the backup ID file.
bpbrm starts tar by means of bpcd and provides tar with the backup ID, socket information, and the shmfat (shared memory) flag. bptm does the following:
Reads the image from the storage device. Creates a bptm child process. This process filters the backup image so that only the files that are selected for the restore are sent to the client. Writes the image data to the shared buffer on the server. When buffer is full or job is done, sets buffer flag (partial buffers may be sent to the client).
Sends the status and control information to bpbrm. Reads the shared memory information from the local backup ID file (waits for the file to exist and become valid). Waits for the buffer flag that indicates the data is ready to be read. Reads data from the buffer, extracts files and restores them. When the shmfat (shared memory) flag is provided, tar considers the data to be already filtered.
The FT Server process nbftsrvr waits for the shared memory buffer flag to be set. nbftsrvr then transfers the image data to the FT Client (nbftclnt) shared memory buffer, and clears the buffer flag. The FT Client (nbftclnt) waits for the data from nbftsrvr and writes the data to the shared memory buffer on the client. nbftclnt then sets the buffer flag.
661
bptm informs tar and bpbrm that the job is complete. bptm directs nbftclnt to close the fibre channel connection. nbftclnt closes the fibre channel connection and deletes the BID file.
NBWIN is the user interface program on the client. The bpbackup function and
NetBackup tar on UNIX. Note: To restore Backup Exec images, bpbrm invokes mtfrd.exe instead of tar32.exe on the clients. The server processes are the same as those used for NetBackup restores. The server processes are the same as described for UNIX. Figure A-14 shows the client processes involved in these operations.
662
Figure A-14
Server For details on the server processes, see Backups and archives - UNIX clients earlier in this chapter.
bprd
Request
NBWIN
BPINETD bpbrm
BPCD bptm
Backup Im
age
TAR32
Client Disk
The NetWare nontarget user interface program is called NBNWNT. The NetWare target user interface program is BP on the Netware console. The bprestore function and the bplist function are merged into the user interface programs on the clients. The NetBackup NetWare client daemon is called BPCD. The NetBackup tar functions are merged into BPCD.
663
mtfrd functionality (used to restore Backup Exec images) has been merged
into BPCD. The server processes involved in import and restore operations for Backup Exec images are the same as those involved for NetBackup restores. The server processes are the same as described for UNIX. Figure A-15 shows the restore operation for a NetWare client Figure A-15 Restore - NetWare client
Server
NetWare client
For details on the server processes, see Backups and Archives UNIX Clients earlier in this chapter.
bprd
Request
Client disk
.
664
Command line
bprd
bprecover
Restore NetBackup Database Files See Restore from tape (UNIX) or Restore from disk, depending on the catalog backup policy.
bprd
2 Restore Relational Database Files See Restore from tape (UNIX) or Restore from disk, depending on the catalog backup policy.
/usr/openv/db/ staging
665
A restore of the NetBackup database and relational database files from a hot catalog backup consists of the following steps (in the order presented):
The NetBackup database files are restored by means of the standard NetBackup restore procedure. The relational database files are restored by means of the standard NetBackup restore procedure. The database files are restored to /usr/openv/db/staging (UNIX and Linux), or to install_path\NetBackupDB\staging (Windows). After the files are restored to the staging directory, the relational database is recovered. Each transaction log in the staging area is applied in order, one by one. The relational database files are moved from the staging directory to a location determined by the following: the bp.conf file VXDBMS_NB_DATA setting on UNIX or Linux and by the corresponding registry key on Windows. The default location is /usr/openv/db/data on UNIX and Linux, and install_path\NetBackupDB\data on Windows. If the relational database files are relocated, they are moved from the staging directory to the /usr/openv/db/data/vxdbms.conf file (UNIX) or the install_path\NetBackupDB\data\vxdbms.conf file (Windows). A description is available of how the NetBackup relational database files can be relocated after installation. See "NetBackup Relational Database" in the NetBackup Administrators Guide, Volume I.
Messages that are related to this catalog recovery process are divided into the following three areas:
For messages that are related to all catalog recovery steps, consult the /usr/openv/netbackup/logs/admin logs (UNIX and Linux), or install_path\NetBackup\logs\admin (Windows). For messages that are related to the first two bulleted items, consult the tar, bpbrm, and bpcd logs. For messages pertaining only to the relational database files, see the progress logs in the following directory: /usr/openv/netbackup/logs/user_ops/root/logs (UNIX and Linux), or install_path\NetBackup\logs\user_ops\root\logs (Windows).
666
portion are present. If a host is both a client and a server, the client shares files as necessary from those in the Server portion. A Windows NetBackup server has equivalent files and folders that are located where NetBackup is installed (C:\Program Files\VERITAS by default).
NetBackup server /usr/openv/ bin/ man/ tmp/ db/ msg/ var/ java/ netbackup/ volmgr/ lib/ resources/ logs/ share/
/usr/openv/netbackup/ bin/ help/ remote_versions/ bp.conf logs/ version client/ nblog.conf version_master db/ nblog.conf.template dbext/ nbsvcmon.conf
NetBackup client /usr/openv/ bin/ java/ lib/ msg/ netbackup/ resources/ share/ tmp/ var/
667
Table A-2
File or directory in
/usr/openv/ bin/
Contains miscellaneous executable binaries including the vnetd daemon and utilities for legacy enhanced authentication. Contains the NetBackup Relational Database Manager (SQL Anywhere) and database data file. Contains the NetBackup-Java Administration Console and the Backup, Archive and Restore user interface. Contains shared libraries that are required for NetBackup operation. Contains all logs that are written by unified logging. You do not have to create subdirectories for these logs. Contains man pages for NetBackup commands. Contains the message files and a configuration file for all installed languages of NetBackup. A tar file that contains the NetBackup-Java interfaces. See Table A-3 on page 668. Contains the NetBackup message catalogs that are used by unified logging (VxUL). Contains static configuration files. These files are normally unchanged between NetBackup releases. Contains the NetBackup Relational Database Manager (SQL Anywhere) installation trace files, and the log files regarding to database start and stop. Contains the variable configuration files. These files, which are related to licensing, authentication, authorization, and networking, may change while NetBackup is running. /usr/openv/var/global contains various static and variable configuration files. In a cluster, the /global directory is shared between nodes. Contains the media and device management directories and files. See NetBackup directory structure - UNIX on page 666.
db/
java/
lib/
logs/
man/ msg/
share/
tmp/sqlany
var/
volmgr/
668
Contents of /usr/openv/netbackup
Table A-3 describes the /usr/openv/netbackup files and directories. Table A-3 Directories and files in /usr/openv/netbackup/ - servers and UNIX clients Contents
File or Directory in
/usr/openv/netbackup/ bin/
Commands, scripts, programs, daemons, and files that are required for NetBackup operation and administration. On a server, there are two subdirectories under bin. admincmd: Contains various commands that used internally by NetBackup. Use these commands ONLY if they are documented. Most of these commands are not documented and should not be used directly. goodies (UNIX only): Contains scripts and information that may be useful to the administrator. These subdirectories are not present on clients.
bp.conf
Configuration file containing options for NetBackup operation. A detailed explanation is available about each option and how to set it. See the NetBackup Administrators Guide, Vol II. On a Windows server, these options are set in the NetBackup Administration Console.
client/
NetBackup client software that is installed on the clients during installation. Do not install this directory on a media server. NetBackup catalogs. See Table A-5 on page 682.
db/
dbext/
For NetBackup database agent software, contains the version file, compressed tar file, and install_dbext script. Help files that are used by NetBackup programs. These files are in ASCII format.
help/
669
Table A-3
Directories and files in /usr/openv/netbackup/ - servers and UNIX clients (continued) Contents
File or Directory in
/usr/openv/netbackup/ logs/
Legacy debug logs for NetBackup processes. You must create the necessary subdirectories in order for these log files to be written. See Legacy NetBackup logging on page 103. See Table A-4 on page 670. for an explanation of the processes that produce the logs.
nblog.conf
670
Table A-4
Program/Daemon
bp
BP.NLM
On NetWare target clients, BP.NLM is the NetWare Loadable Module that starts the client-user interface. Started By: LOAD BP command. Stopped By: Choosing Quit Utility from the main menu. Debug Log: SYS:\VERITAS\NBUCLT\NETBACK\LOGS\BP\mmddyy.log file on the client.
bpadm
On a UNIX master server, this administrator utility has a menu-driven, character-based, interface with options for configuring and managing NetBackup. Started By: /usr/openv/netbackup/bin/bpadm command on the master server. Stopped By: Quit option from within bpadm. Debug Log: admin legacy log directory on the server.
bparchive
On UNIX clients, this program communicates with bprd on the master server when a user starts an archive. Started By: Starting an archive by using the client-user interface or by executing the /usr/openv/netbackup/bin/bparchive command on the client. Stopped By: Completion of operation. Debug Log: bparchive legacy log directory on the client.
671
Table A-4
Program/Daemon
bpbackup
bpbkar
On UNIX clients the Backup/Archive Manager generates the backup images. Started By: bpbrm on the server with the storage unit. Stopped By: Completion of operation. Debug Log: bpbkar legacy log directory on the client.
BPBKAR32
On Windows clients, the Backup/Archive Manager generates the backup images. Started By: BPCDW32 on the client. Stopped By: Completion of operation. Debug Log: BPBKAR legacy log directory in the NetBackup logs directory on the client.
bpbrm
On master and media servers, the Backup/Restore Manager manages the client and bptm or bpdm process. It also uses error status from the client and from bptm or bpdm to determine the final status of backup or restore operations. Started By: For each backup or restore, nbjm starts an instance of bpbrm on the server with the appropriate storage unit. Stopped By: Completion of operation. Debug Log: bpbrm legacy log directory on the server.
672
Table A-4
Program/Daemon
bpcd
BPCD.NLM
On NetWare clients, BPCD.NLM is the executable file that starts the NetBackup client daemon. Started By: When you enter BPSTART.NCF at the NetWare Server console. Or, add BPSTART.NCF to your autoexec.ncf file. Stopped By: UNLOAD BP command Debug Log: BPCD legacy log directory on the client.
BPCDW32.EXE
On Windows clients, BPCDW32.EXE is the executable file that starts the NetBackup client daemon. Started By: When Windows starts if the daemon is in the Startup group. Otherwise, by double clicking on its icon. Stopped By: On Windows, you can stop it through the Services application in the Control Panel. Debug Log: BPCD legacy log directory on the client.
bpdbjobs
On UNIX master servers, this program is used to clean up the NetBackup jobs database. Started By: /usr/openv/netbackup/bin/admincmd/bpdbjobs. When bprd starts, it runs this command automatically. The administrator can also execute it manually or with a cron job. Stopped By: No terminate option exists for this command outside of using kill. Debug Log: bpdbjobs legacy log directory on the server.
673
Table A-4
Program/Daemon
bpdbm
bpdm
On master and media servers, bpdm is used for the following disk operations: read phase of disk duplication, read phase of synthetic backups, disk verify and disk import, true image restore from disk, disk image deletion. Started By: For each backup or restore, bpbrm starts an instance of bpdm, on the server with the storage unit. Stopped By: Completion of operation. Debug Log: bpdm legacy log directory on the server.
bpfis
On clients, bpfis creates and deletes snapshots. Note that bpfis is part of the Snapshot Client add-on product. Started By: bpbrm. Stopped By: Completion of operation. Debug Log: bpfis legacy log directory on the client or alternate client.
bphdb
On SQL, Oracle, Informix, Sybase, DB2, and SAP database clients, bphdb executes scripts to back up the database. Started By: Client-user interface when the user starts a database backup operation. Stopped By: Completion of operation. Debug Log: bphdb legacy log directory on the client.
674
Table A-4
Program/Daemon
bpjava-msvc
bpjava-usvc
NetBackup-Java user server application program. This program services all requests from the NetBackup-Java user and administration interfaces. Started By: bpjava-msvc upon successful login through the Login dialog box that is presented when a NetBackup-Java interface is started. Stopped By: When the interface program is terminated. Debug Log: bpjava-usvc legacy log directory.
bplist
On UNIX clients, this program communicates with bprd on the master server when a user browses the database during a restore operation. Started By: Starting a search of the image database by using the client-user interface or by executing the /usr/openv/netbackup/bin/bplist command on the client. Stopped By: Completion of operation Debug Log: bplist legacy log directory on the client.
675
Table A-4
Program/Daemon
bprd
Restores Backups (scheduled and user-directed) Archives List that is backed up or archived files Manual immediate backups (started through the NetBackup administration interface manual backup option)
Started By: Initiate Request Daemon option on the Special Actions menu in bpadm (also the /usr/openv/netbackup/bin/initbprd command). Stopped By: Terminate Request Daemon option on the Special Actions menu in bpadm. Debug Log: bprd legacy log directory on the server. bprestore On UNIX clients, this program communicates with bprd on the master server when a user starts a restore. Started By: Starting restore by using the client-user interface (or by executing the /usr/openv/netbackup/bin/bprestore command on the client). Stopped By: Completion of operation Debug Log: bprestore legacy log directory on the client. BPSVR.NLM On NetWare nontarget clients, BPSVR.NLM is the program that allows the system that has the client-user interface to communicate with the Netware server that is the NetBackup client. Started By: Enter bpstart.ncf. Stopped By: Enter bpstop.ncf. Debug Log: SYS:VERITAS\NBUCLT\NetBack\logs\bpsrv\ directory on the client. BPSYS.EXE On Windows clients, BPSYS.EXE is the NetBackup System Registry Replacement utility. Started By: NetBackup as required. Stopped By: Completion of operation. Debug Log: BPSYS legacy log directory on the client.
676
Table A-4
Program/Daemon
bptm
jbpSA
A Java-based program for performing backups, archives, and restores of UNIX clients. Started By: On UNIX, the /usr/openv/netbackup/bin/jbpSA command. Debug Log: None, although the logs for the bpbackup, bparchive, bplist, and bprestore commands on the client can be useful. Also, check the bpjava-msvc and bpjava-usvc logs.
jnbSA
A Java-based administration utility for managing NetBackup on UNIX. In addition, administration of supported UNIX systems can be performed by using the NetBackup-Java Windows Display Console on a Windows system. Started By: On UNIX, the /usr/openv/netbackup/bin/jnbSA command. On a NetBackup-Java Windows Display console, the NetBackup - Java on host menu item on the Programs/NetBackup menu. Stopped By: Exit option in jnbSA. Debug Log: None, although the logs for bpjava-msvc and bpjava-usvc can be helpful.
677
Table A-4
Program/Daemon
nbemm
nbfdrv64
On a media server that is enabled for SAN Client backup over fibre channel, nbfdrv64 is the following: a user mode component that is used for both backup and restore. nbfdrv64 uses a windrvr6 proxy to move fibre channel data between nbftclnt and bptm buffers. Started By: /usr/openv/netbackup/bin/nbftsrvr Stopped By: /usr/openv/netbackup/bin/nbftsrvr -terminate Debug Log: On the server, /usr/openv/logs (UNIX) or install_path\logs (Windows). See Unified logging on page 85.
nbftclnt
On clients that are enabled for SAN Client backup over fibre channel, nbftclnt transfers the backup image over fibre channel to nbftsrvr on the media server. Started By: Started when NetBackup starts. Stopped By: /usr/openv/netbackup/bin/nbftclnt -terminate. Debug Log: On the client, /usr/openv/logs (UNIX) or install_path\logs (Windows). See Unified logging on page 85.
678
Table A-4
Program/Daemon
nbftsrvr
nbjm
On master servers, the nbjm service accepts job requests from nbpem and from media commands such as bplabel and tpreq. nbjm acquires job resources from nbrb, and runs the jobs once resources are available. Started By: Started when NetBackup starts. Stopped By: /usr/openv/netbackup/bin/nbjm -terminate Debug Log: On the server, /usr/openv/logs (UNIX) or install_path\logs (Windows). See Unified logging on page 85.
NBNWNT.EXE
For NetWare nontarget clients, NBNWNT.EXE is the executable file that starts the client-user interface on Windows systems. Started By: From the Windows Start menu, under Programs/ NetBackup. Stopped By: Exiting the client-user interface. Debug Log: none.
nbpem
On master servers, the nbpem service uses nbproxy to get the policy list from bpdbm, creates the policy-client tasks, determines when jobs are due to run, and starts due jobs. Started By: Started when NetBackup starts. Stopped By: /usr/openv/netbackup/bin/nbpem -terminate Debug Log: On the server, /usr/openv/logs (UNIX) or install_path\logs (Windows). See Unified logging on page 85.
679
Table A-4
Program/Daemon
nbproxy
nbrb
On the server that is defined as the EMM server, the nbrb service accepts resource requests from nbjm, acquires physical resources from nbemm, and manages logical resources. Started By: Started when NetBackup starts. Stopped By: /usr/openv/netbackup/bin/nbrb -terminate Debug Log: On the server, /usr/openv/logs (UNIX) or install_path\logs (Windows). See Unified logging on page 85.
ndmpagent
Controls backup and restore operations on a NAS server. ndmpagent is for remote NDMP: backing up NDMP data to a drive that is configured in a Media Manager storage unit on a NetBackup media server. Started By: bpbrm. Stopped By: Completion of backup or restore. Debug Log: On the server, /usr/openv/logs (UNIX) or install_path\logs (Windows). See Unified logging on page 85.
nbstserv
Runs on the master server. The nbstserv service manages lifecycle operations including duplication, staging, and image expiration. Started By: Started when NetBackup starts. Stopped By: /usr/openv/netbackup/bin/nbstserv -terminate Debug Log: On the server, /usr/openv/logs (UNIX) or install_path\logs (Windows). For more information about OID 226 and 272, see the following topic: See Server processes that use unified logging on page 87.
680
Table A-4
Program/Daemon
NBWIN.EXE
nbrmms
Remote Manager and Monitor Service (nbrmms) is the conduit through which EMM discovers and configures storage on media servers. In addition to configuration management, nbrmms provides all access to media server resources for monitoring and event notifications. Started By: Started when NetBackup starts, or by /usr/openv/netbackup/bin/nbrmms Stopped By: Stopped when NetBackup stops, or by /usr/openv/netbackup/bin/nbrmms -terminate Debug Log: On the server, /usr/openv/logs (UNIX) or install_path\logs (Windows). See Unified logging on page 85.
pbx_exchange
Private Branch Exchange (PBX) is a common services framework that helps limit the number of TCP/IP ports that the CORBA services of NetBackup use. Started By: Started when NetBackup starts, or by /opt/VRTSpbx/bin/vxpbx_exchanged start Stopped By: Stopped when NetBackup stops, or by /opt/VRTSpbx/bin/vxpbx_exchanged stop Debug Log: On the server, /opt/VRTSpbx/log (UNIX) or install_path\VxPBX\log (Windows). See PBX logging on page 69.
681
Table A-4
Program/Daemon
ql2300_stub
tar
On UNIX clients, the Tape ARchive program is a special version of tar provided with NetBackup and used to restore images. Started By: For each restore, bpbrm starts an instance of tar on the client. Stopped By: Completion of restore operation. Debug Log: tar legacy log directory on the client.
TAR32
On Windows clients, the TAR32 program is a special version of tar provided with NetBackup and used to restore images. Started By: For each restore, NetBackup starts an instance of TAR32 on the client. Stopped By: Completion of restore operation. Debug Log: TAR legacy log directory on the client.
682
Table A-4
Program/Daemon
windrvr6
NetBackup catalogs
The NetBackup catalogs contain the information that is used internally by NetBackup and reside in the /usr/openv/netbackup/db directory on UNIX servers and in the install_path\NetBackup\db directory on Windows NetBackup servers. Note also that the /usr/openv/netbackup/db/class directory (install_path\NetBackup\db\class on Windows) has a subdirectory for each NetBackuppolicy, that contains information about the policy. Table A-5 describes the NetBackup catalogs. Table A-5 Database
config
683
images
Information about the backup images and resides only on the master server. One of the files in the images directory is the file database. The file database is the one that NetBackup accesses when a user browses for files to restore. Job information that is used by the NetBackup job monitor (UNIX NetBackup server) and activity monitor (Windows NetBackup server). The Jobs database is on the master server. Media related information that is used by bptm. Also has an errors file that contains error history information for media and devices.
jobs
media
684
Appendix
Media and device management startup process Media and device management process Shared Storage option management process Barcode operations Media and device management components
686
Media and device management functional description Media and device management startup process
robotic
Each host with a robotic drive attached must have a robotic daemon. These daemons provide the interface between ltid and the robot or, if different drives within a robot can attach to different hosts, the robotic daemon communicates with a robotic-control daemon (see below). Robotic-control daemons centralize the control of robots when drives within a robot can connect to different hosts. A robotic-control daemon receives mount and unmount requests from the robotic daemon on the host to which the drive is attached and then communicates these requests to the robot.
robotic control
You must know the hosts involved in order to start all the daemons for a robot.
Media and device management functional description Media and device management process
687
Figure B-1
At system startup, the server automatically starts ltid which starts applicable robotic daemons.
To start the processes manually, enter: On UNIX: /usr/openv/netbackup/bin/bp.start_all On Windows: install_path \NetBackup\bin\bpup Automated Cartridge System
acsd
acsssi
tl4d
tl8d
tl8cd
tldcd
tlhcd
tlmd
tshd
688
Media and device management functional description Media and device management process
The resulting request to mount a device is passed from nbjm to nbrb, which acquires the physical resources from nbemm (the Enterprise Media Manager service). If the backup requires media in a robot, ltid sends a mount request to the robotic daemon that manages the drives in the robot that are configured on the local host. The robotic daemon then mounts the media, and sets a drive busy status in memory shared by itself and ltid. Drive busy status also appears in the Device Monitor. See Figure B-2 on page 689. Assuming that the media is physically in the robot, the media is mounted and the operation proceeds. If the media is not in the robot, nbrb creates a pending request, which appears as a pending request in the Device Monitor. An operator must then insert the media in the robot and use the appropriate Device Monitor command to resubmit the request so the mount request can occur. A mount request is also issued if the media is for a nonrobotic (standalone) drive and the drive does not contain media that meets the criteria in the request. If the request is from NetBackup and the drive does contain appropriate media, then that media is automatically assigned and the operation proceeds. More information is available on NetBackup media selection for nonrobotic drives. See the NetBackup Administrators Guide, Volume II. Note: On UNIX systems, when a tape is being mounted, the drive_mount_notify script is called. This script is in the /usr/openv/volmgr/bin directory. Information on the script can be found within the script itself. A similar script is called for the unmount process (drive_unmount_notify, in the same directory). When a robotic volume is added or removed through the media access port, the media management utility communicates with the appropriate robotic daemon to verify the volume location and/or barcode. The media management utility (through a library or command-line interface) also calls the robotic daemon for robot inventory operations. Figure B-2 shows an example of the media and device management process.
Media and device management functional description Shared Storage option management process
689
Figure B-2
User
Device monitor
Devicemanagement utility
nbemm
SDLT600
LT0-3
tl8d
Non-robotic drives
NetBackup, Storage Migrator, or users can initiate backups. nbjm makes a mount request for the backup.
690
Media and device management functional description Shared Storage option management process
nbrb tells the EMM server to obtain a drive for the backup. nbrb tells the device allocator (DA) in the EMM server to stop scanning the selected drive. nbemm tells the appropriate media server (the scan host for the selected drive) to stop scanning the drive. The stop scan request is carried out by means of oprd, ltid, and avrd in the media servers shared memory. nbemm informs nbrb when scanning on the selected drive has stopped. nbrb informs nbjm that the selected drive (A) is available for the backup. nbjm conveys the mount request and drive selection to bptm, which proceeds with the backup. To protect the integrity of the write operation, bptm uses SCSI reservations. See How NetBackup reserves drives in the NetBackup Administrators Guide, Volume II. The mount-media operation is initiated. bptm makes position checks on the drive to ensure that the drive has not been rewound by another application. bptm also does the actual write to the tape. When the backup is complete, nbjm tells nbrb to release resources. nbrb de-allocates the drive in EMM. EMM tells the scan host to resume scanning the drive. The scan request is carried out by means of oprd, ltid, and avrd in the media servers shared memory.
691
Figure B-3
User
nbjm
nbrb
nbemm/DA
ltid
St o
ltid
sc
bptm
an
ltid
ltid
bptm
avrd
avrd
bptm
Shared drive A Note: Shaded area represents shared memory on the media server.
Shared drive B
Barcode operations
Barcode reading is mainly a function of the robot hardware rather than media and device management. When a robot has a barcode reader, it scans any barcode
692
that may be on a tape and stores the code in its internal memory. This associates the slot number and the barcode of the tape in that slot. NetBackup determines that association for its own use by interrogating the robot. If a robot supports barcodes, NetBackup automatically compares a tapes barcode to what is in the EMM database as an extra measure of verification before mounting the tape. A request for media that is in a robot that can read barcodes begins in the same manner as other requests. See Figure B-4 on page 693. ltid includes the media ID and location information in a mount request to the robotic daemon for the robot that has the media ID. This request causes the robotic daemon to query the robotic-control daemon or the robot for the barcode of the tape in the designated slot. (This is a preliminary check to see if the correct media is in the slot.) The robot returns the barcode value it has in memory. The robotic daemon compares this barcode with the value it received from ltid and takes one of the following actions:
If the barcodes dont match, and the mount request is not for a NetBackup backup job, the robotic daemon informs ltid and a pending action request (Misplaced Tape) appears in the Device Monitor. An operator must then insert the correct tape in the slot. If the barcodes dont match and the mount request is for a NetBackup backup job, the robotic daemon informs ltid and the mount request is canceled. NetBackup (bptm) then requests a new volume from nbjm and from EMM. If the barcodes match, the robotic daemon requests the robot to move the tape to a drive. The robot then mounts the tape. At the start of the operation, the application (for example, NetBackup) checks the media ID and if it also matches what should be in this slot, the operation proceeds. For NetBackup, a wrong media ID results in a media manager found wrong tape in drive error (NetBackup status code 93).
Media and device management functional description Media and device management components
693
Figure B-4
Devicemanagement utility User
Barcode request
NetBackup EMM Database
Itid
nbemm
vmd
Media-management utility
694
Media and device management functional description Media and device management components
Figure B-5
NetBackup server
/usr/openv/volmgr/bin/ driver/ format/ goodies/ avrd/1 robots/1 1. Created by administrator to enable legacy debug logging.
1 /usr/openv/volmgr/debug/
daemon/1 tpcommand/1
ltid/1
reqlib/1
/vmscd
Table B-1 describes the directories and files that are of special interest. Table B-1 File or directory
bin
debug
Legacy debug logs for the Volume Manager daemon, vmd, and all requesters of vmd, ltid, and device configuration. The administrator must create these directories for debug logging to occur. Help files used by media and device management programs. These files are in ASCII format. Lock files and temporary files required by various components of media and device management.
help
misc
Media and device management functional description Media and device management components
695
Table B-2 describes the media and device management programs and daemons. The explanations include what starts and stops the program or daemon, and the log (if any) where it records its activities. On UNIX, all of the components discussed in this table reside under /usr/openv/volmgr/bin. On Windows, they reside under install_path\volmgr\bin. Note: The following table contains references to the system log. This log is managed by syslog on UNIX (the facility is daemon). On Windows the Event Viewer manages the system log (the log type is Application). Table B-2 Media and device management daemons and programs
696
Media and device management functional description Media and device management components
Table B-2
Media and device management functional description Media and device management components
697
Table B-2
698
Media and device management functional description Media and device management components
Table B-2
Media and device management functional description Media and device management components
699
Table B-2
700
Media and device management functional description Media and device management components
Table B-2
Media and device management functional description Media and device management components
701
Table B-2
702
Media and device management functional description Media and device management components
Table B-2
Appendix
704
routine). This name is visible in the bprd debug log when a connection is made as in the line:
Connection from host peername ipaddress ...
The clients configured name is then derived from the peername by querying the bpdbm process on UNIX systems. On Windows systems, you must query the NetBackup Database Manager service. The bpdbm process compares the peername to a list of client names that are generated from the following:
All clients for which a backup has been attempted All clients in all policies
The comparison is first a simple string comparison. The comparison is verified by comparing hostnames and aliases that are retrieved by using the network function gethostbyname(). If none of the comparisons succeed, a more brute force method is used, which compares all names and aliases using gethostbyname(). The configured name is the first comparison that succeeds. Note that other comparisons might also have succeeded if aliases or other "network names" are configured. If the comparison fails, the clients hostname as returned by the gethostname() function on the client is used as the configured name. An example of a failed comparison is when the client had changed its hostname but its new hostname is not yet reflected in any policies. These comparisons are logged in the bpdbm debug log if VERBOSE is set. You can determine a clients configured name by using the bpclntcmd command on the client. For example:
# /usr/openv/netbackup/bin/bpclntcmd -pn (UNIX) # install_path\NetBackup\bin\bpclntcmd -pn (Windows) expecting response from server wind.abc.me.com danr.abc.me.com danr 194.133.172.3 4823
Where the first output line identifies the server to which the request is directed and the second output line is the servers response in the following order:
Peername of the connection to the server Configured name of the client IP address of the connection to the server
705
When the client connects to the server, it sends the following three names to the server:
The browse client name is used to identify the client files to list or restore from. The user on the client can modify this name to restore files from another client. For example, on a Windows client, the user can change the client name by using the Backup, Archive, and Restore interface. (See the NetBackup online Help for instructions). For this change to work, however, the administrator must also have made a corresponding change on the server. See the NetBackup Administrators Guide, Volume I. The requesting client is the value from the gethostname() function on the client. The destination client name is a factor only if an administrator pushes a restore to a client from a server. For a user restore, the destination client and the requesting client are the same. For an administrator restore, the administrator can specify a different name for the destination client. By the time these names appear in the bprd debug log, the requesting client name has been translated into the clients configured name. The name that used to connect back to the client to complete the restore is either the clients peername or its configured name. The type of restore request (for example, from root on a server, from a client, to a different client, and so on) influences this action. When you modify client names in NetBackup policies to accommodate specific network paths, the administrator needs to consider:
The client name as configured on the client. For example, on UNIX the client name is CLIENT_NAME in the clients bp.conf file. On a Windows client, it is on the General tab of the NetBackup Client Properties dialog box. To open this dialog box, select NetBackup Client Properties from the File menu in the Backup, Archive, and Restore interface. The client as currently named in the policy configuration. Client backup and archive images that already exist as recorded in the images directory on the master server. On a UNIX or Linux server, the images directory is /usr/openv/netbackup/db/. On a Windows NetBackup server, the images directory is install_path\NetBackup\db\images.
706
Any of these client names can require manual modification by the administrator if the following: a client has multiple network connections to the server and restores from the client fail due to a connection-related problem. On UNIX, the public domain program traceroute (not included with NetBackup) often can provide valuable information about a networks configuration. Some system vendors include this program with their systems. The master server may be unable to reply to client requests, if the Domain Name Services (DNS) are used and the following is true: the name that the client obtains through its gethostname() library (UNIX) or gethostbyname() network (Windows) function is unknown to the DNS on the master server, The client and the server configurations can determine if this situation exists. gethostname() or gethostbyname()on the client may return an unqualified host name that the DNS on the master server cannot resolve. Although you can reconfigure the client or the master server DNS hosts file, this solution is not always desirable. For this reason, NetBackup provides a special file on the master server. This file is as follows:
/usr/openv/netbackup/db/altnames/host.xlate (UNIX and Linux) install_path\NetBackup\db\altnames\host.xlate (Windows)
You can create and edit this file to force the desired translation of NetBackup client host names. Each line in the host.xlate file has three elements: a numeric key and two hostnames. Each line is left-justified, and a space character separates each element of the line.
key hostname_from_client client_as_known_by_server
key is a numeric value used by NetBackup to specify the cases where the translation is to be done. Currently this value must always be 0, which indicates a configured name translation. hostname_from_client is the value to translate. This value must correspond to the name that is obtained by the clients gethostname() function and sent to the server in the request. client_as_known_by_server is the name to substitute for hostname_from_client when responding to requests. This name must be the name that is configured in the NetBackup configuration on the master server. It must also be known to the master servers network services.
707
0 danr danr.eng.aaa.com
When the master server receives a request for a configured client name (numeric key 0), the name danr is always replaced by the name danr.eng.aaa.com. The problem is resolved, assuming the following:
The clients gethostname() function returns danr. The master servers network services gethostbyname() function did not recognize the name danr. The client was configured and named in the NetBackup configuration as danr.eng.aaa.com and this name is also known to network services on the master server.
708
Appendix
About robotic test utilities Robotic tests on UNIX Robotic tests on Windows
710
If the robot is not configured, you cannot use robtest and must execute the command that applies to the robot you test.
ACS /usr/openv/volmgr/bin/acstest -r ACSLS_hostpath for acstest to work on UNIX and Linux, acssel and acsssi must be running ODL TL4 TL8 TLD TLH TLM TSH /usr/openv/volmgr/bin/odltest -r roboticpath /usr/openv/volmgr/bin/tl4test -r roboticpath /usr/openv/volmgr/bin/tl8test -r roboticpath /usr/openv/volmgr/bin/tldtest -r roboticpath /usr/openv/volmgr/bin/tlhtest -r robotic_library_path /usr/openv/volmgr/bin/tlmtest -r DAS_host /usr/openv/volmgr/bin/tshtest -r roboticpath
More information on ACS, TLH, and TLM robotic control is available. See the NetBackup Device Configuration Guide. In the previous list of commands, roboticpath is the full path to the device file for the robotic control (SCSI). You can review the section for your platform to find the appropriate value for roboticpath. An optional parameter specifies the device file path for the drives so that this utility can unload the drives using the SCSI interface.
711
Note: If the robot is not configured, you cannot use robtest and must execute the command that applies to the robot you are testing (see following list).
ACS TL4 TL8 TLD TLH install_path\Volmgr\bin\acstest -r ACSLS_HOST install_path\Volmgr\bin\tl4test -r roboticpath install_path\Volmgr\bin\tl8test -r roboticpath install_path\Volmgr\bin\tldtest -r roboticpath install_path\Volmgr\bin\tlhtest -r robotic_library_name install_path\Volmgr\bin\tlmtest -r DAS_Hostname
TLM
More information on ACS, TLH, and TLM robotic control is available. See the NetBackup Device Configuration Guide. In the previous list of commands, roboticpath is the full path to the device file for the robotic control (SCSI). You can review the section for your platform to find the appropriate value for roboticpath. An optional parameter specifies the device file path for the drives so that this utility can unload the drives using the SCSI interface. Usage is:
install_path <-p port -b bus -t target -l lan | -r roboticpath>
712
Index
A
acssel, description 695 acsssi, description 695 acstest 711 Adaptive Server Anywhere 63 admin log 104 admincmd directory 668 administration interface activity logging 127 errors 125 AdvancedDisk 575, 582 All Log Entries report 84 Allow multiple data streams 173 Alternate client restores host.xlate file 706 Alternate Read Server 220 altnames file 682 application server status codes (Java interface) 125 archiving for NBCC 149 for nbsu 140 asc value 255 ascd, description 695 ascq value 255 Auth User for PBX 69 auto-configuration problems 30 avrd, description 696
B
backup NetBackup catalogs 652 process files 634 multiplexing 640 NetWare clients 648 Windows clients 647 process overview 637, 645 snapshot overview 641 synthetic processes 649
backup (continued) UNIX clients 635 Backup Status report 84 Bare Metal Restore 572, 575, 588 base NetBackup license key 179 bin Media and Device Management 694 UNIX client 667668 BP 662 bp description 670 log 119 UNIX client log 116 bp.conf file 636 UNIX client/server 668 SERVER entries 73 bp.kill_all 70, 72, 177 BP.NLM 648, 670 bp.start_all 72, 177 bpadm description 670 bparchive description 670 log 116, 119 bpbackup description 671 log 116, 119 bpbackup log 119 BPBACKUP_POLICY 636 BPBACKUP_SCHED 636 bpbackupdb and catalog backup 210 bpbkar description 671 log 116, 119 bpbkar and VxFS_Checkpoint 244 bpbkar log 119 BPBKAR32 647, 671 bpbrm 643 description 671 bpbrm log 104 BPCD 662
714
Index
bpcd description 672 server log 104 UNIX client log 116, 119 BPCD.NLM 672 BPCDW32.EXE 672 bpdbjobs description 672 bpdbjobs log 104 bpdbm description 673 bpdbm log 104 bpdm description 673 bpdm log 104 bpdown command 7172, 584, 587 bperror command 155 bpfis 643, 673 bphdb description 673 log 116 BPINETD 647, 661 bpinetd log 119 bpinetd.log 119 bpjava-msvc 674 bpjava-msvc log 105, 127 bpjava-usvc log 127 bplist description 674 log 116, 120 bplist log 120 bpmount log 116 bpmount log 120 bporaexp log 116 bporaexp64 log 116 bporaimp log 116 bporaimp64 log 116 bppfi log on client 248 bpps 24 bprd description 675 bprd log 105 bprecover 229 bprestore description 675 log 116, 120 bprestore log 120
bpsched see also nbpem 678 bpsrv log 120 bpsrv log 120 BPSVR.NLM 675 bpsynth 650 BPSYS.EXE 675 bptm description 676 bptm log 105 bptpcinfo 78 bpup command 72, 177, 584, 587 bundling NBCC output 149 nbsu output 140
C
cache partition 173 catalog backup 652 checkpoint removing 244 class database file 682 client NetBackup configured name 704 debug logs. See UNIX clients. See Windows and NetWare clients installation problems 28 multiple hostnames 703 peername 704 software location. See UNIX clients testing configuration 34, 38 Client Backups report 84 Client Properties dialog 62 client, NetBackup Windows disk recovery 588 clone (mirror removing) 245246 CommandCentral Storage 75 communications problems PC clients 46 UNIX clients 40 compression for NBCC 149 for nbsu 140 compression device 33 config file 682
Index
715
configuration database 682 configuration device file does not exist 33 configuration problems 28
D
daemons robotic 685 robotic control 685 database backup (see catalog backup) 652 database extension 633 DAYS_TO_KEEP_LOGS 107 DAYS_TO_KEEP_LOGS vm.conf setting 110 db directory NetBackup 667668 debug level 118, 122, 128 debug logs 85, 127 analysis utilities 131 NetBackup 694 vmd 107, 694 debug.properties file 127 debugging NBCC 149 nbsu 142 define the problem 13 device configuration problems 30 Device Configuration Wizard 583 directory structure Media and Device Management 693 disk full 62 disk recovery Windows client 588 disk space for logs files 100 drive drive limit exceeded 32 type not determined 33 drive_mount_notify script 688 drive_unmount_notify script 688 driver directory 694 drives too many 31 duplex mode and performance 72
Enable robust logging 111 Enterprise Media Manager 580 Enterprise Media Manager (EMM) 635 error codes (see status codes) 156 error database 683 evaluation license key 179 Event viewer logging option 122 eventlog 123 file entries 123 exception errors in Java admin interface 125 expired license key 179
F
failure_history file 683 fibre channel 644 file database 683 file list symbolic link 241 file pathname (max length) 157 files archive process 634 backup process 634 restore process 654 format directory 694 frozen image source for symbolic link 241 fsckptadm command 244 FSM 644 FT Service Manager 644 full disk 62 full duplex mode 72 functional overview introduction 633 Media and Device Management device management 687 directories and files 693 volume management 687 NetBackup backup and archive 634 restores 654 startup 634
E
E-mail 574 EMM server 635 enable debug logging 107 Enable Robust Logging 112
G
Global Logging Level 112 Global logging level 91, 93 goodies directory 668
716
Index
L
legacy logging 103 client logs 115 configuring rotation 111 controlling size of 109 directories 104 file name format 111 locations 103 PC clients 118 rotation of 109 levels for logging 91 libsfr.so file 161 license key 179 license problems 3132 links (in file list) 241 Linux 27 local system account 240 log (bppfi) 248 log analysis utilities debug logs 131 how to run 133 installation 132 limitations 133 output format 132 Log level Windows and NetWare clients 118, 122 logging changing location of 85 levels 91 see legacy logging 103 setting level for clients 118 setting level on PC clients 122 synthetic backup 113 VxUL (see unified logging) 85 logs activity see debug 85 debug enabling detailed 127 introduction 85 event viewer logging option 122 file retention 94 overview[Logs aaa] 81 PC client activity bp 119 bparchive 119 bpbackup 119 bpbkar 119
H
Half duplex and poor performance 72 help files Media and Device Management 694 UNIX client 668 host name entries checking 49 Host Properties 61 host.xlate file 706 hostID unified logging 87
I
IDR 581, 588 ifconfig for checking NIC duplex mode 73 images database 683 images on Media report 84 inetd 27 Information E-mail 574 installation Linux 27 installation problems 27 Intelligent Disaster Recovery (IDR) 581, 588 ipconfig for checking NIC duplex mode 73
J
Java interface debug logging 127 troubleshooting background 125 jbpSA overview 676 job ID search in unified logs 96 jobs queued for long periods 62 jobs database 683
K
Keep logs For setting 94 Keep Logs setting 109 key value 255
Index
717
logs (continued) PC client activity (continued) bpcd 119 bpinetd 119 bplist 120 bpmount 120 bprestore 120 bpsrv 120 tar 120 user_ops 121 reports NetBackup 83 server activity acssi 107 admin 104 bpbrm 104 bpcd 104 bpdbjobs 104 bpdbm 104 bpdm 104 bpjava-susvc 105 bprd 105 bpsynth 105 bptm 105106 daemon 107 ltid 107 nbjm 88 nbpem 88 nbrb 88 reqlib 108 robots 108 tpcommand 108 setting retention period 107, 109 system 85 UNIX client activity bp 116 bparchive 116 bpbackup 116 bpbkar 116 bpcd 116 bphdb 116 bpjava-msvc 105 bplist 116 bpmount 116 bprestore 116 obackup_tape 117 tar 117 user_ops 117 user progress 84
logs directory UNIX client/server 669 ltid 113 ltid, description 696
M
master server test procedure 34, 38 maximum pathname length 157 MaxLogFileSizeKB 98100, 111 media database 683 Media Log Entries report 84 media server test procedure 38 message finding message text from status code 155 messages NetBackup 397 messages (see status codes) 156 mirror (VxVM snapshot) 242 mirror snapshot (removing) 245246 misc file 694 mklogdir.bat 104 moving log locations 85 multiple data streams option 173 multiplexed backups 640 multiplexing 172
N
name format legacy logging 111 NB_dbsrv daemon 63 NBCC archiving and compression 149 does the following 143 introduction 143 location of 144 nbcc-info.txt file 150 Notes on running 149 output 148 progress display 144 Server aliases file 147 troubleshooting 149 when to use 144 nbcc-info.txt file 150 nbdb_move 583 nbemm 24, 635, 677 nbfdrv64 677
718
Index
nbftclnt 115, 118, 644, 646, 660, 677 and bp.conf 74 nbftsrvr 644, 646, 660, 678 nbjm 25, 88, 635, 643, 650, 678 NBNWNT 648, 662 NBNWNT.EXE 678 nbpem 25, 88, 634635, 643, 650, 678 nbproxy 679 nbrb 24, 63, 88, 635, 678679 nbrmms 680 nbstserv 679 nbsu and status codes 141 archiving and compression 140 bundling 140 creating xml output file 141 introduction 137 location of 137 nbsu_info.txt file 143 output files 139 progress display 138 troubleshooting 142 when to use 137 nbsu_info.txt file 143 NBWIN 647, 661 NBWIN.EXE 680 ndmpagent overview 679 NearStore 651 NetBackup if unresponsive 62 product ID 87 NetBackup Administration Console debug logging 127 errors 125 NetBackup Client Service 240 start and stop 26 NetBackup consistency check see NBCC 143 NetBackup Database Manager service start and stop 26 NetBackup Device Manager service start and stop 26 NetBackup Enterprise Media Manager service start and stop 26 NetBackup Job Manager service start and stop 26 NetBackup Policy Execution Manager service start and stop 26
NetBackup Request Manager service start and stop 26 NetBackup Resource Broker service start and stop 26 NetBackup Status Collection daemon.. See vmscd NetBackup Support Utility see nbsu 137 NetBackup Volume Manager service start and stop 26 network connections multiple 703 network daemon (vnetd) 106 network interface cards 72 network problems PC clients 46 UNIX clients 40 NIC cards and full duplex 72 Novell values affecting restore 175176 NumberOfFiles 100 NumberOfLogFiles 99, 111
O
obackup_tape log 117 odld, description 696 odltest 710 off-host backup 78 OpenStorage 258, 575, 582 operating system errors 126 originator IDs list of 88 originatorID unified logging 87
P
pass-thru not supported 32 patch troubleshooting 161 patches (installing during recovery) 590 pathname length 157 PBX Auth User 69 logging 69 Secure Mode 6970 starting 68 starting/stopping 71 troubleshooting 67 pbx_exchange 68, 680
Index
719
pbxcfg 68 preliminary troubleshooting procedure 23 Private Branch Exchange (PBX) 67 problems report 84 procedures recovery Windows client disk 588 troubleshooting communications problems 40 host names and services 49 installation and configuration 27 introduction 21 master server and clients 34 media server and clients 38 preliminary 23 processes (see functional overview) 633 product ID for NetBackup 87 productID unified logging 87 progress logs user 84
Q
ql2300_stub 681 query string 128 queued jobs 62
retention of logs 94 robot drive number unknown 32 robot limit exceeded 32 type not determined 31 unknown 32 robot drive selection 688 robotic control daemons 686 robotic daemons 686 robotic test utility 709 acstest 711 odltest 710 tl4test 710711 tl8test 710711 tldtest 710711 tlhtest 711 tshtest 710 robtest 709710 robust file logging 97 RolloverMode 99 rotation legacy logging 109 of logs 93 unified logging 87
S
SAN Client 644 SAN client and bp.conf 74 SANPoint Control 75 Secure Mode for PBX 69 serialization problems 30 server installation problems 27 NetBackup debug logs 103 test procedure for master 34, 38 test procedure for media server 38 SERVER entries bp.conf 73 services entries checking 49 SharedDisk 75, 575, 582 slots too many 31 slow performance and NIC cards 72 snaplist command 243 snapoff command 244
R
raw partitions backup process 634 restore process 654 recording information 15 recovery procedures Windows client disk 588 RedHat 27 relational database 63 removing a snapshot checkpoint 244 removing a snapshot clone 245246 reports NetBackup 83 reqlib directory 109 restore problem with sparse file 186 restore process 654 NetWare client 662 Windows 2000 client 661 restoring NAS_Snapshot 240
720
Index
snapshot backup process overview 642 cache partition 173 mirror (VxVM) 242 removing 245246 stale 243 Snapshot Client pathname length 157 software version determining Macintosh clients 174 NetWare nontarget clients 174 NetWare target clients 174 UNIX client/server 669 UNIX servers 173 Windows clients 174 Windows servers 173 sparse files (restore problem) 186 stale snapshots 243 starting NetBackup processes 72, 177 startup NetBackup 634 status code finding message from 155 status codes and nbsu 141 NetBackup sorted by code 156 sorted by message 397 Status Collection Daemon 109 stderr 125 stdout 125 stopping NetBackup processes 70, 72, 177 storage units 74 SuSE 27 symbolic links 241 synthetic backup 649 logs 113 syslogd 85, 108 system logs 85
third-party copy 255 and multiplexing 172 tl4d, description 697 tl4test 710711 tl8cd, description 698 tl8d, description 697 tl8test 710711 tldcd, description 699 tldd, description 698 tldtest 710711 tlhcd, description 700 tlhd, description 699 tlhtest 711 tlmd, description 700 tpautoconf 108, 494, 579 tpconfig 108 tpconfig, overview 701 traceroute 706 troubleshooting procedure communication problems PC clients 46 UNIX clients 40 general introduction 33 master server and clients 34, 38 media server and clients 38 host name and services entries 49 installation 27 preliminary 23 try file 114 tshd, overview 701 tshtest 710
U
unavailable 74 unified logging 85 changing location of 85 client logs 115 command examples 94 configuring settings 97 controlling disk space usage 100 controlling number of log files 100 controlling size of 98 deleting logs 97 file name format 86 file rotation 93 format of files 95 listing settings 99 location 85
T
tar log 120 log files 103 NetBackup 681 TAR32 661 test utility robotic 709
Index
721
unified logging (continued) message types 86 NetBackup product ID 87 PC clients 118 processes using 88 retention 94 setting level on PC clients 122 settings levels 91 submitting to support 102 tar log files 103 viewing 95 upload directory 103 user-directed backups 636 user_ops log 106, 117, 121 utility robotic test 709
W
Windows open file backup 642 windrvr6 682
X
xinetd 27 XML 116 xml for nbsu 141
V
VERBOSE 113 verbose flag 113 VERBOSE level 93 Veritas Volume Manager 242 vm.conf 107, 113 vm.conf file 695 vmadm, overview 702 vmd 107 debug logging 107 overview 701 vmscd 109 logging 108 vmscd, overview 702 vnetd log 106 Volume Configuration Wizard 584 vxassist command 242, 247 VxFS_Checkpoint snapshot 244 vxlogcfg 86, 111 vxlogcfg command 93, 97, 99, 111 vxlogmgr command 96, 100 vxlogview command 95 query string overview 128 with job ID option 96 vxpbx_exchanged 71 vxprint command 247 VxUL logging see unified logging 85 vxvm frozen image method 242 vxvol command 242