PRIMECLUSTER uses two heartbeats: the CF heartbeat and the RMS heartbeat. The types of failure that each heartbeat detects, and the default detection time for each, are as follows.
Heartbeat | Failure type detected with a heartbeat | Detection time of heartbeat timeout (default) |
---|---|---|
CF | | 10 seconds |
RMS | | |
(*1): When the PRIMECLUSTER monitoring agent is used, the monitoring agent detects the failure immediately.
(*2): In an environment where the ELM heartbeat (RMS heartbeat) is available, the ELM heartbeat detects the failure immediately. The ELM heartbeat is available by default in 4.2A00 or later.
(*3): A double fault is one example.
Note
An error detected by the CF heartbeat has a major effect on operation. For this reason, the CF heartbeat timeout detection time is set shorter than the RMS detection time.
If you set the CF detection time longer than the RMS detection time, the following warning message is output during RMS startup.
(BM, 4) The CF cluster timeout <cftimeout> exceeds the RMS timeout <rmstimeout>. This may result in RMS node elimination request before CF timeout is exceeded. Please check the CF timeout specified in /etc/default/cluster.config and the RMS heartbeat miss time specified by hvcm '-h' option.
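The relationship described in this note can be sketched as a small pre-start check. This is only an illustrative sketch in Python: the function name `check_heartbeat_timeouts` and the constant `WARNING_TEMPLATE` are hypothetical, the two timeout values are assumed to be supplied by the caller (the sketch does not read /etc/default/cluster.config or the hvcm options), and the strict "greater than" comparison is an assumption based on the word "exceeds" in the message.

```python
from typing import Optional

# Text modeled on the (BM, 4) warning quoted above; this is not produced by RMS itself.
WARNING_TEMPLATE = (
    "(BM, 4) The CF cluster timeout {cf} exceeds the RMS timeout {rms}. "
    "This may result in RMS node elimination request before CF timeout is exceeded."
)


def check_heartbeat_timeouts(cf_timeout: int, rms_timeout: int) -> Optional[str]:
    """Return a warning string if the CF timeout exceeds the RMS timeout.

    Both arguments are detection times in seconds. Returns None when the
    CF timeout does not exceed the RMS timeout.
    """
    if cf_timeout <= 0 or rms_timeout <= 0:
        raise ValueError("timeouts must be positive numbers of seconds")
    # Assumption: "exceeds" is taken to mean strictly greater than.
    if cf_timeout > rms_timeout:
        return WARNING_TEMPLATE.format(cf=cf_timeout, rms=rms_timeout)
    return None


if __name__ == "__main__":
    # 10 seconds is the CF default from the table; 30 is an illustrative RMS value.
    print(check_heartbeat_timeouts(10, 30))
    # A CF timeout raised above the RMS timeout triggers the warning text.
    print(check_heartbeat_timeouts(60, 30))
```

A check like this could be run before changing either timeout, so that the CF detection time stays shorter than the RMS detection time and the warning does not appear at RMS startup.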