PRIMECLUSTER sends heartbeats to CF and RMS. Each type of heartbeat failure that is detected from CF and RMS respectively and its detection time (default) are as follows.
Failure type detected with a heartbeat | Detection time of heartbeat timeout | |
---|---|---|
CF |
| 10 seconds |
RMS |
| 600 seconds |
(*1): When using the monitoring agent of PRIMECLUSTER, the monitoring agent detects it immediately
(*2): The ELM heartbeat (RMS heartbeat) detects it immediately.
(*3): As an example, there is a double fault.
Note
The error detected by a CF heartbeat effects well on the operation. Therefore, the detection time of heartbeat timeout (detection time) is set shorter than RMS detection time.
If you set the detection time of CF shorter than that of RMS, the following warning message is output during RMS startup.
(BM, 4) The CF cluster timeout <cftimeout> exceeds the RMS timeout <rmstimeout>. This may result in RMS node elimination request before CF timeout is exceeded. Please check the CF timeout specified in /etc/default/cluster.config and the RMS heartbeat miss time specified by hvcm '-h' option.