If RMS cannot perform a heartbeat for 600 seconds, it is determined that a heartbeat fails. In the environment where the system is overloaded and a heartbeat failure occurs often, tune the heartbeat time.
Note
If you set the heartbeat time long, it takes long to detect an error. Therefore, tune the heartbeat time carefully.
If you set the heartbeat time shorter than CF heartbeat time, a warning message is output during RMS startup. For details, see the notes on "7.6 CF and RMS Heartbeats."
To tune the heartbeat time, perform the following procedure:
Default value: 600 seconds
Setting value: set it from 45 to 3600 seconds
Stop a cluster application and RMS on all the nodes.
# hvshut -a
Change /usr/opt/reliant/etc/CONFIG.rms on all the nodes as follows.
hvcm -c config -h waiting time (seconds)
Example
To change the default value from 600 to 800 seconds
-h monitoring timeout (Maximum: 3600)
hvcm -c config -h 800
Start RMS on all the nodes.
# hvcm -a
Check if RMS has started with the option specified in Step 2.
# hvdisp -h
Check if hvcm -c config -h waiting time (seconds) (Check that the waiting time is the value set above).