Top
ServerView Resource Orchestrator Cloud Edition V3.1.0 DR Option Instruction
ServerView

4.2 Switchover Operation During Disasters

This section explains operations of disaster recovery switchover when a disaster strikes the primary site.

The same operations can be used for failback from the backup site to the recovered primary site.

For switchover operations during a disaster, there are the two main operations described below.

Detailed Procedure for Switchover Operation during Disasters

The details of the procedure are explained below.

  1. Stop operation of the backup site

    Stop the operations being performed on the backup site, and prepare to switch over from the primary site.

  2. Disconnect the storage

    Stop the storage replication function, and disconnect the backup site from the primary site.

  3. Mount volumes which contain the information for failover

    Mount the volume which was replicated from the primary site when performing "2. Configure the Disaster Recovery function" in "Chapter 3 Installation", to the manager of the backup site.

  4. Copy physical L-Server images

    Physical L-Server images for the primary site are replicated to the backup site. However, the image file storage folder of the backup site contains the physical L-Server images for the backup site. Perform the following operation to switch the physical L-Server images on the backup site.

    1. Save the physical L-Server images in the image file storage folder on the backup site to a different folder.

      The image file storage folder can be shown by executing the rcxadm imagemgr command with the info option.

      For details on rcxadm imagemgr command, refer to "5.9 rcxadm imagemgr" in the "Reference Guide (Command/XML) CE".

    2. Clear the content of the image file storage folder.

    3. Copy the physical L-Server images for the primary site to the image file storage folder.

      The physical L-Server images for the primary site can be found in the Image folder of the volume mounted in "3. Mount volumes which contain the information for failover".

    4. Restart the manager.

  5. Copy firewall configurations

    When using the firewall auto-configuration function for network devices, copy the newest configuration from the primary site to the firewall of the backup site.

  6. Recover customized information of the primary site

    If any files in the following folder on the primary site have been modified, restore the files from the backup saved in "2. Backup customize information of the primary site" in "4.1 Normal Operation", and restart manager if necessary.

    [Windows Manager]

    • The following files under Installation_folder\SVROR\Manager\etc\customize_data

      • l_server.rcxprop

      • logctl_env.rcxprop

      • storage_rcxprop

      • vnetwork_vmware.rcxprop

      • vnetwork_hyperv.rcxprop

      • vnetwork_excluded_vmware.rcxprop

      • rcx_base.rcxprof

      • server_spec.rcxprop

      • library_share_deny.conf

      • image_admin_hyperv.rcxprop

      • library_share_User_Group_Name_deny.conf

      • kb978336_scvmm.rcxprop

      • image_admin_hyperv_User_Group_Name.rcxprop

      • server_control.rcxprop

      • vm.rcxprop

      • vm_console.rcxprop

      • scvmm_mac_pool.rcxprop

      • scvmm_mac_pool_Tenant_Name.rcxprop

      • vm_VMware.rcxprop

      • vm_Hyper-V.rcxprop

      • create_image_previous_folder_name_scvmm.rcxprop

      • auto_replace.rcxprop

      • Physical.rcxprop

      • spare_server_config.rcxprop

    • Installation_folder\SVROR\Manager\etc\trapop.bat

    • All files under Installation_folder\SVROR\Manager\etc\event_handler

    • All files under Installation_folder\SVROR\Manager\etc\scripts

    • The following file under Installation_folder\SVROR\Manager\etc\vm

      • vds_vc

  7. Reflecting changes of settings of the primary site

    Reflect the following customization settings of the primary site, which are performed according to "Chapter 19 To Customize Environment" in the "Setup Guide CE", on the backup site.

    • "19.1 Settings for Sending Email"

    • "19.2.3 Setting Application process settings"

    • "19.2.4 Setting Application process to be used"

    • "19.3 Customizing the Dashboard"

    • "19.4 Setting the Host Names for Virtual Servers"

    • "19.5 Setting the Method for Setting Resource Names"

    • "19.6 Settings for the Overcommit Function"

    • "19.7 Editing the Environment Setup File for the L-Platform API"

    • "19.9 System Disk Settings for Physical L-Servers"

    • "19.12 Edit the License Agreement"

    • "19.13 Editing the User Agreement when Registering a User"

    After a Disaster Recovery environment is configured, if the settings of the primary site are changed according to "Chapter 8 Modifying Settings" in the "Operation Guide CE", reflect the changes of the following settings on the backup site.

    • "8.2 Settings for Sending Email"

    • "8.4 Editing Information in the Home Window"

    • "8.5.1 Settings for Permissions to Change L-Platform Templates"

    • "8.5.2 Subnet Settings at Segment Editing"

    • "8.5.3 Settings for the Simplified Reconfiguration Function"

    • "8.5.4 Distribution Ratio Settings"

    • "8.5.5 Application Process Settings"

    • "8.5.7 Edit the License Agreement"

    • "8.5.9 Default Password Setting for Sent Emails"

    • "8.5.10 Settings for the Maximum Number of Connections for the L-Platform Template"

    • "8.6.1 Settings for Tenant Management and Account Management"

    • "8.6.2 Editing the User Agreement when Registering a User"

    • "8.7 Accounting Settings"

    • "8.8 System Condition Server List Settings"

    • "8.9 Settings for Event Log Output for CMDB Agent"

  8. Batch import

    Perform batch switchover operations, using the switchover information for Disaster Recovery.

    • Operations to Perform Batch Switchover of All Resources

      Execute the following command when settings for disaster recovery are complete.

      [Windows Manager]

      >Installation_folder\SVROR\Manager\bin\rcxrecovery <RETURN>

    • Operation Involving Resource Switchover by Tenant

      1. Restore global resources

        Execute the following command when settings for disaster recovery are complete.

        [Windows Manager]

        >Installation_folder\SVROR\Manager\bin\rcxrecovery -global <RETURN>

      2. Restore each tenant resource

        Execute the following command for each tenant.

        Execute the following command when settings for disaster recovery are complete.

        [Windows Manager]

        >Installation_folder\SVROR\Manager\bin\rcxrecovery -tenant tenant1 <RETURN>

      For details on the rcxrecovery command, refer to "A.2 rcxrecovery".

      The rcxrecovery command displays messages which represent the status of the switchover process. In addition, the following operations require manual operations or decisions by an administrator. Respond to inquiries from the system to continue execution of the command.

      • When the -nocleanup option is not specified and the backup site is to be cleaned up, an inquiry will be displayed to ask if you really want it to be cleaned up. Press "y" to approve the cleanup and continue the rcxrecovery command. If cleanup of the backup site cannot be performed, press "n" to quit the command.

      • When VM management software is not configured on the physical L-Server, manual restoration of VM management software is necessary. The rcxrecovery command will show a prompt while waiting for the restoration of VM management software. In that case, continue the command after restoration of VM management software is completed.

      • If VMware ESXi is installed on the server that is the target of switchover, the VM host must be registered in SVOM. After registration is complete, continue the command execution. The command will be postponed until registration is complete.

      Information

      The rcxrecovery command performs the following:

      • Mapping of storage

        Storage resource mapping is performed, based on the files defining the relations between the storage on the primary and backup sites.

      • Division of import resources

        Imported resources are divided using the following units:

        - Global resources and resources of individual tenants

        - Physical L-Server resources and virtual L-Server resources

      • Delete the backup site environment

        Delete the following environment used for backup site operations.

        - L-Platform templates

        - L-Platform configuration information

        - Resource information

        - Accounting information

        - Metering logs

      • Import the information

        Import the following information:

        - L-Platform templates

        - L-Platform configuration information

        - Resource information

        - Accounting information

        - Metering logs

      Note

      • Configured physical servers and linked L-Servers for which only configuration definition has been created are not covered by disaster recovery failover.
        Execute the rcxadm config command as follows to back up the information which is used to relate the configuration definition to L-Server again later.

        [Windows Manager]

        >Installation_folder\SVROR\Manager\bin\rcxadm config filter -convert -indir indir -outdir outdir <RETURN>

        The following information will be output to outdir, using the file name convert.txt.
        After completing the restoration procedure, perform creation of a link with an L-Server and setting the scope again as shown below:

        [command]
        rcxadm lserver convert -with BX900_1 -name physical_lserver_1
        rcxadm lserver convert -with BX900_3 -name physical_lserver_2 -to /tenant_1
        rcxadm lserver convert -with BX900_6 -name physical_lserver_3 -label test_label3 -comment test_comment3 -to /folder_1
        [user]
        user_1, physical_lserver_1
        user_1, /tenant_1/physical_lserver_2
        user_2, /folder_1/physical_lserver_2
        [usergroup]
        usergroup_1, physical_lserver_1
        usergroup_2, /folder_1/physical_lserver_3

      • Storage using the ETERNUS dynamic LUN mirroring is switched to the LUN that was prepared on the backup site when the switchover operation is performed after a disaster. Operation may be different if the storage is not released even after deleting an L-Platform.

      • Perform operation so that the import of the resource configuration information and user definition file (XML file) using the rcxrecovery command is not executed at the same time as modification of the resource configuration information or user definition file (XML file) due to L-Server creation, or export operations using the rcxmgrexport command.

      • The RC console may not display the correct power status of L-Servers immediately after a disaster recovery failover. Wait for about 3 minutes after the failover to see the correct power status of L-Servers.

      • As the performance information (the information displayed in the dashboard and in operational status displays) depends on the configuration of the primary site, this is not carried over to the backup site when switched over. The information on the backup site is displayed.

      • Importing may fail when switchover is performed to the backup site, due to a VM guest switchover failure. In this case, perform the import again using the following procedure, but excluding the managed servers for which import failed. Disaster recovery failover fails when configuration import fails. In this case, execute the failover command again after taking the corrective action for the cause of the import failure. When you execute the command again, the -nocleanup option needs to be specified if the previous command was executed without the -nocleanup option and configuration cleanup was performed and the "cleanup of resources" process ended in "completed".
        The following shows an example of where the VM guest could not be restored during L-Server import.

        1) An error occurs during import.

        >Installation_folder\SVROR\Manager\bin\rcxrecovery <RETURN>
        FJSVrcx:ERROR:62569:/tenant1/l-platform1/l-server1:lserver:import was interrupted. Message=:67154: VM Guest not found

        2) The status of the relevant resources is displayed to help determine the cause of the error.

        >Installation_folder\SVROR\Manager\bin\rcxadm config show -type lserver -name /tenant1/l-platform1/l-server1 -dir dir1 -format xml<RETURN>

        For -dir, specify the folder that was most recently exported in the folder below.
        rcxrecovery -dir option, or the import folder specified in fa_dr.rcxprop\ManagerExport\RORexport_date

        3) Delete the resource where the problem occurred from the export file.

        >Installation_folder\SVROR\Manager\bin\rcxadm config filter -exclude lserver -name /tenant1/l-platform1/l-server1 -indir dir1 -outdir dir2 <RETURN>

        4) Use the export file output in 3) to overwrite the file in the folder specified for -dir in 2).

        5) Import again.

        >Installation_folder\SVROR\Manager\bin\rcxrecovery -nocleanup<RETURN>
  9. Restore directory service information

    Restore the directory service information. For the restoration procedure, refer to the manuals of the directory service software.

  10. Check the imported L-Platforms

    When an L-Platform is exported during its deployment or a reconfiguration, and then the L-Platform configuration information is imported using the rcxrecovery command, L-Platform information without resources may be recovered.
    In this case, information of the L-Platform whose resources may not exist is output to the log.

    Confirm the details output in logs, and take corrective action.

    For information on the content output to the log and the corrective actions, refer to "Appendix B Output Logs when Switchover Occurs because of Disaster Recovery".

  11. Recover usage charges

    • Send usage charge files

      Usage charge files for each tenant of the exported month will be recovered to the following directory on the primary site. Refer to "15.4.6 Sending Usage Charges" in the "Operation Guide CE" for the format of the usage charge files.

      • Usage Charge Files

        Stored folder

        Installation_folder\RCXCTMG\Charging\importdata

      • File Name

        - YYYYMM_Tenant_name[_Deleted_date].zip

        - YYYYMM is the cutoff year and month of the tenant.

        - Deleted_date will be appended if the tenant is deleted.

      These usage charge files will be needed for the accounts manager to determine the usage charges for each tenant on the month of the recovery.

      Send these files to the accounts manager of each tenant based on the tenant information file. The tenant information file is a file describing the tenant and it holds information such as tenant name, cutoff date, accounting E-mail address and so on. The tenant information file is stored in the same directory as the usage charge files.

      • Tenant Information File

        Stored folder

        Installation_folder\RCXCTMG\Charging\importdata

      • File Name

        tenant_info.csv

      • Format

        CSV file with "Tenant name", "Tenant display name", "Tenant deleted date", "Tenant last cutoff date", "Tenant next cutoff date", and "Accounting E-mail address" columns.

        #TenantName,TenantDisplayName,TenantDeletedDate,TenantLastCutoffDate,TenantNextCutoffDate,AccountingMailAddress
        tenant01,tenant001,,2012-05-31,2012-06-30,tenant1@example.com
        tenant02,tenant002,,2012-05-31,2012-06-30,tenant2@example.com
        tenant03,tenant003,2012-05-27 16:48,2012-05-31,2012-06-30,tenant3@example.com
        tenant04,tenant004,,2012-05-31,2012-06-30,tenant4@example.com

      Send usage charge files to the accounts manager as follows.

      • Operations to Perform Batch Switchover of All Resources

        Send the recovered usage charge files of each tenant to the accounts manager. For each usage charge file, which tenant it corresponds to can be determined from the file name. Obtain the accounting E-mail addresses from the tenant information file and send the usage charge files.

      • Operation Involving Resource Switchover by Tenant

        Obtain the accounting E-mail addresses of the switchover tenant from the tenant information file and send the target usage charge files.

      When sending the usage charge files, also notify the recipient of the following information.

      • The usage files of the backup site will be sent separately on the day after the cutoff date.

      • Usage charges for the month of the recovery have to be calculated with the usage charge files of the affected site and the backup site.

      In cases such as when the accounting calculation is stopped due to the disaster on the day of the cutoff of the tenant or the tenant has already been deleted, there is no accumulated data on the backup site. Therefore, the usage charge files will not be sent from the backup site. Notify the accounts manager that those tenants can be charged only from the usage charge files from the primary site. The cutoff date of the tenant can be obtained from tenant information file.

    • Notify the tenant administrator

      In the Usage Charge window of the backup site, the usage charges are those of the backup up site only. Therefore, the infrastructure administrator has to inform tenant administrators of this fact, and if necessary, disclose the usage charge information of the primary site.

    Note

    Note that the calculation of the usage charge for the backup site will start the day following the recovery.

  12. Change the FQDN setting

    When admin servers have different IP addresses between the primary and backup sites, and the FQDN needs to be taken over by the backup site, change the IP address for the FQDN registered in DNS server to the IP address of the admin server on the backup site after disaster recovery failover is performed.

    When the FQDN is taken over from the primary site to the backup site, clear the caches or temporary internet files of Web browsers which are used to connect to the ROR console on clients.

    When test certificates are used, import the certificate of the backup site to the browser if the certificate was not imported during installation.

    For details, refer to "Chapter 6 Importing a Certificate to a Browser" in the "Setup Guide CE".

The access rights that were modified on the primary site are not carried over to the backup site. Update them manually when the switchover is complete.