Top
PRIMECLUSTER  Installation and Administration Guide 4.5
FUJITSU Software

11.3.2 Changing Time to Detect RMS Heartbeat Timeout

If RMS cannot perform a heartbeat for 600 seconds, it is determined that a heartbeat fails. In the environment where the system is overloaded and a heartbeat failure occurs often, tune the heartbeat time.

Note

  • If you set the heartbeat time long, it takes long to detect an error. Therefore, tune the heartbeat time carefully.

  • If you set the heartbeat time shorter than CF heartbeat time, a warning message is output during RMS startup. For details, see the notes on "7.6 CF and RMS Heartbeats."

To tune the heartbeat time, perform the following procedure:

Default value: 600 seconds

Setting value: set it from 45 to 3600 seconds

  1. Stop a cluster application and RMS on all the nodes.
    # hvshut -a

  2. Change /usr/opt/reliant/etc/CONFIG.rms on all the nodes as follows.

    hvcm -c config -h waiting time (seconds)

    Example

    To change the default value from 600 to 800 seconds

    -h monitoring timeout (Maximum: 3600)
    hvcm -c config -h 800

  3. Start RMS on all the nodes.
    # hvcm -a

  4. Check if RMS has started with the option specified in Step 2.
    # hvdisp -h
    Check if hvcm -c config -h waiting time (seconds) (Check that the waiting time is the value set above).