Top
ServerView Resource Orchestrator Cloud Edition V3.2.0 Operation Guide
FUJITSU Software

18.1.3 High Availability for Storage Chassis

This section explains high availability of storage chassis connected to physical L-Servers.

If performing replication between two storage units of LUNs used by a physical L-Server, quick restoration of physical L-Servers is possible, even when storage units have failed.

This section explains the switchover of disks used by physical L-Servers, between two storage units on which replication of LUNs is managed by a single manager.

Prerequisites

For details on prerequisites for high availability of storage chassis, refer to "7.2 Storage Chassis High Availability Design" in the "Design Guide CE".


Replication of Storage Unit LUNs

Replication Definition Files

The replication definition file must be created in advance. In the replication definition file, describe the relationship between the operating storage and the standby storage for replication.

The format of the replication definition file is as follows:

IP_address_of_operating_storage_unit,Operating_volume_identifier,IP_address_of_standby_storage_unit,Standby_volume_identifier

Configure the definition file using a unique combination of an IP address for an operating or standby storage unit, and an identifier for an operating or standby volume. When the information overlaps in the replication definition file, an error will occur when creating a failover or a failback script.

When Performing Switchover from Operating to Standby (Failover)

This section explains the procedure to perform switchover from operating storage units to standby storage units.

  1. Create the replication definition file.

    For details on replication definition files, refer to "Replication Definition Files".

  2. Create the following scripts by executing the rcxstorage -failover command.

    • failover script

    • Physical L-Server stopping script

    • Physical L-Server startup script

    For details on the rcxstorage command, refer to "5.23 rcxstorage" in the "Reference Guide (Command/XML) CE".

    • Create these scripts in units of operating storage.

    • These scripts are created based on the configuration information at the time which the command is executed. When changing configurations, create these scripts again.

    • Execute the rcxstorage command with the -failover option, when an operating and a standby storage unit are displayed in the storage tree. The script can be created as long as the operating storage unit is displayed in the storage tree, even if it has failed.

    Note

    These scripts cannot be created when the operation target storage unit is not displayed in the storage tree.

  3. Execute the physical L-Server stopping script on the server where the manager is being operated. This operation stops the physical L-Server targeted by the failover script. To forcibly stop the server without shutting down the operating system running on the L-Server, specify the -force option when executing the physical L-Server stopping script.

    If an error occurs during execution of the script, contact Fujitsu technical staff.

    Note

    Physical L-Servers are stopped in the order of the entries in the physical L-Server stopping script. When specification of the order of stopping physical L-Servers is necessary, edit the physical L-Server stopping script.

  4. Delete the zoning combining the WWPN of the HBA of the physical L-Server and the WWPN of the port of the operating storage from the Fibre Channel switch. For ETERNUS storage, this step is not necessary, as the zoning for Fibre Channel switch will be deleted by Resource Orchestrator when the failover script is executed.

  5. If the replication function for storage is in operation, stop it.

  6. Execute the failover script on the server where the manager is being operated.

    If an error occurs during execution of the script, contact Fujitsu technical staff.

  7. To access the standby storage, add the zoning combining the WWPN of the HBA of the physical L-Server and the WWPN of the port of the standby storage to the Fibre Channel switch. For ETERNUS storage, this step is not necessary, as the zoning for Fibre Channel switch will be added by Resource Orchestrator when the failover script is executed.

  8. When perform reading or writing for the LUN of the standby storage, modify the settings of replication if necessary.

  9. Execute the physical L-Server startup script on the server where the manager is being operated. This operation starts the physical L-Server.

    If an error occurs during execution of the script, contact Fujitsu technical staff.

    Note

    Physical L-Servers are started in the order of the entries in the physical L-Server startup script. When the specification of the order of starting physical L-Servers is necessary, edit the physical L-Server startup script.

  10. When operating an L-Platform, use the cfmg_syncdiskinfo command to reflect the information for switched disks on the L-Platform configuration information.

    [Windows Manager]

    >Installation_folder\RCXCFMG\bin\cfmg_syncdiskinfo <RETURN>

    [Linux Manager]

    # /opt/FJSVcfmg/bin/cfmg_syncdiskinfo <RETURN>

    For details on the cfmg_syncdiskinfo command, refer to "10.8 cfmg_syncdiskinfo (Synchronize Disk Information)" in the "Reference Guide (Command/XML) CE".

When Performing Switchover from Standby to Operating (Failback)

This section explains the procedure for performing switchover from standby storage units to operating storage units.

  1. Request repair of the operating storage units.

  2. Using storage management software, restore the logical configuration (RAID, LUN) of an operating storage unit.

  3. Using storage management software, check the LUN masking definition and LUN mapping definition of the operating storage unit. If the definitions relating to the WWPN of the HBA of the physical L-Server remain, delete them.

    Information

    When the storage is ETERNUS, delete the affinity group for the LUN of the operating storage unit

  4. By modifying the settings of the replication function, perform replication of the storage unit from the operating to the standby, then wait until the status of the LUNs of the operating and standby storage become equivalent.

  5. Prepare the replication definition file.

    Use the same replication definition file as that for failover. When changing the configuration after executing the failover script, correct the replication definition file.

  6. Create the following scripts by executing the rcxstorage -failback command.

    • failback script

    • Physical L-Server stopping script

    • Physical L-Server startup script

    For details on the rcxstorage command, refer to "5.23 rcxstorage" in the "Reference Guide (Command/XML) CE".

    • Create these scripts in units of operating storage.

    • These scripts can be created after executing the failover script and performing switchover to the standby storage unit.

    • These scripts are created based on the configuration information at the time which the command is executed. When changing configurations, create these scripts again.

    • Execute the rcxstorage command with the -failback option, when an operating and a standby storage unit are displayed in the storage tree. These scripts cannot be created when the operation target storage unit is not displayed in the storage tree.

  7. Execute the physical L-Server stopping script on the server where the manager is being operated. This operation stops the physical L-Server targeted by the failback script. To forcibly stop the server without shutting down the operating system running on the L-Server, specify the -force option when executing the physical L-Server stopping script.

    If an error occurs during execution of the script, contact Fujitsu technical staff.

    Note

    Physical L-Servers are stopped in the order of the entries in the physical L-Server stopping script. When specification of the order of stopping physical L-Servers is necessary, edit the physical L-Server stopping script.

  8. Delete the zoning combining the WWPN of the HBA of the physical L-Server and the WWPN of the port of the standby storage from the Fibre Channel switch. For ETERNUS storage, this step is not necessary, as the zoning for Fibre Channel switch will be deleted by Resource Orchestrator when the failback script is executed.

  9. Stop the storage replication function.

  10. Execute the failback script on the server where the manager is being operated.

    If an error occurs during execution of the script, contact Fujitsu technical staff.

  11. To access the operating storage, add the zoning combining the WWPN of the HBA of the physical L-Server and the WWPN of the port of the operating storage to the Fibre Channel switch. For ETERNUS storage, this step is not necessary, as the zoning for Fibre Channel switch will be added by Resource Orchestrator when the failback script is executed.

  12. By modifying the settings of the replication function, perform replication of the storage unit from the standby to the operating, then wait until the status of the LUNs of the standby and operating storage become equivalent.

  13. Execute the physical L-Server startup script on the server where the manager is being operated. This operation starts the physical L-Server.

    If an error occurs during execution of the script, contact Fujitsu technical staff.

    Note

    Physical L-Servers are started in the order of the entries in the physical L-Server startup script. When the specification of the order of starting physical L-Servers is necessary, edit the physical L-Server startup script.

  14. When operating an L-Platform, use the cfmg_syncdiskinfo command to reflect the information for switched disks on the L-Platform configuration information.

    [Windows Manager]

    >Installation_folder\RCXCFMG\bin\cfmg_syncdiskinfo <RETURN>

    [Linux Manager]

    # /opt/FJSVcfmg/bin/cfmg_syncdiskinfo <RETURN>

    For details on the cfmg_syncdiskinfo command, refer to "10.8 cfmg_syncdiskinfo (Synchronize Disk Information)" in the "Reference Guide (Command/XML) CE".

When Using Dynamic LUN Mirroring

This document explains some changes and points to note when using dynamic LUN mirroring.

Prerequisites

Refer to "7.2 Storage Chassis High Availability Design" in the "Design Guide CE".

Replication of Storage Unit LUNs

If disk resources are automatically generated from virtual storage, replication may be automatically set depending on the settings in the mirroring definition file. Replication settings use REC for CCM (machine-to-machine copying, transfer mode: Stack mode), so copy groups or pairs are automatically generated. The name of the generated copy group will be the value entered in the mirroring definition file.

Replication Definition Files

When replication using CCM is set, a replication definition file can be generated by executing the rcxrepdef command.

For details on the rcxrepdef command, refer to "5.22 rcxrepdef" in the "Reference Guide (Command/XML) CE".

When Performing Switchover from Operating to Standby (Failover)

The procedure for switchover is the same as that for when a LUN is created in the storage unit beforehand.

Note

The disk resources created using dynamic LUN mirroring are not deleted when failover to the standby node occurs.

Information

Start to copy using REC (transfer mode: Stack mode) for CCM for the dynamic LUN mirroring. Stop the replication function using the procedure described in step 5 in the same way as follows:

  1. Use the acec suspend command by specifying the -force option to forcibly suspend the REC session.

  2. Use the acec change command to change the REC transfer mode to Through mode.

  3. Suspend the REC session using the acec cancel command. In this case, do not specify the -force option.

When Performing Switchover from Standby to Operating (Failback)

The procedure for switchover is the same as that for when a LUN is created in the storage unit beforehand.