Top
PRIMECLUSTER Concepts Guide 4.6
FUJITSU Software

2.3.5 PRIMECLUSTER SF

The PRIMECLUSTER Shutdown Facility (SF) provides a function to guarantee that other nodes are shut down during error processing such as when contention for user resources occurs in a cluster system.

Note

When CF confirms that a cluster node has restarted and can guarantee that the node was shut down before, PRIMECLUSTER SF does not shut down the node.

PRIMECLUSTER SF is made up of the following major components:

Shutdown Agents (SA)

The SA guarantees a reliable suspension of the remote cluster node. The SA varies depending on the architecture of each cluster node.

The SA provides the following functions:

The PRIMECLUSTER Shutdown Facility provides the following Shutdown Agents:

Note

This SA is not available in PRIMERGY iRMC.

MA (Monitoring Agent)

The Monitoring Agent (MA) has the capability to monitor the state of a system and promptly detect a failure such as system panic and shutdown. This function is provided by taking advantage of the hardware features that detect the state transition and inform the upper-level modules.

Without the MA, the cluster heartbeat time-out detects only a communication failure during periodic intervals. The MA allows the PRIMECLUSTER system to quickly detect a node failure.

The MA provides the following functions:

PRIMECLUSTER SF provides the following Monitoring Agents:

RCI Monitoring Agents (SPARC Enterprise M Series)

The MA monitors the node state and detects a node failure by using the SCF/RCI mounted on SPARC Enterprise M-series. The System Control Facility (SCF), which is implemented on a hardware platform, monitors the hardware state and notifies the upper-level modules. The MA assures node elimination and prevents access to the shared disk.

Console Monitoring Agents (Available server models are limited to SPARC Enterprise M-series and most of SPARC Enterprise T-series.)

The console monitoring agent monitors message output to the console of each node using XSCF/ILOM. If an error message of a node failure is output to one node, the other node detects the message and notifies SF of a node failure. Normally, the console monitoring agent creates a loop, monitoring another node, for example, A controls B, B controls C, and C controls A. If one node goes down because of a failure, another node takes over the monitoring role instead of this failed node.

The console monitoring agent also ensures node elimination by sending a break signal to the failed node.

The figure below shows how the monitoring feature is taken over in a cluster system with three nodes if one node goes down. The arrow indicates that a node monitors another node.

Figure 2.5 MA normal operation

When a failure occurs, and Node 2 is DOWN, the following actions occur:

  • Node 1 begins to monitor Node 3.

  • The following message is output to the /var/adm/messages file of Node 1:

    FJSVcluster: INFO: DEV: 3044: The console monitoring agent took over monitoring (node: targetnode)

The figure below shows how Node 1 added Node 3 as the monitored node when Node 2 went down.

Figure 2.6 MA operation in the event of node failure

Note

If monitoring function is taken over while the console monitoring agent is stopped, the stopped console monitoring agent is resumed.

When Node 2 recovers from the failure and starts, the following actions occur:

  • The original monitoring mode is restored.

  • The following message is output to the /var/adm/messages file of Node 1:

    FJSVcluster: INFO: DEV: 3045: The console monitoring agent cancelled to monitor (node: targetnode)

The figure below shows how Node 2 returns to monitoring Node 3 once it has been restored to the cluster.

Figure 2.7 Node recovery

The following are possible messages that might be found in the /var/adm/messages file:

  • FJSVcluster: INFO: DEV: 3042: The RCI monitoring agent has been started
    Indicates that the RCI monitoring agent is enabled.

  • FJSVcluster: INFO: DEV: 3043: The RCI monitoring agent has been stopped.
    Indicates that the monitoring feature is disabled.

  • FJSVcluster: INFO: DEV: 3040: The console monitoring agent has been started (node:monitored node name)
    Indicates that the monitoring feature of the console monitoring agent is enabled.

  • FJSVcluster: INFO: DEV: 3041: The console monitoring agent has been stopped (node:monitored node name)
    Indicates that the monitoring feature of the console monitoring agent is disabled. When the monitoring feature is not enabled, the other feature that forcibly brings the node DOWN might not work.

Note

The console monitoring agent monitors the console message of the remote node. So it cannot recognize the node state in the event of an unexpected shutdown. In such a case, the node goes into the LEFTCLUSTER state, and you need to mark the remote node DOWN. For how to mark a node with DOWN, see "PRIMECLUSTER Cluster Foundation (CF) Configuration and Administration Guide."

SNMP asynchronous monitoring (SPARC M10, M12)

This function monitors the node state by using the eXtended System Control Facility (XSCF) installed in the SPARC M10, M12.

The function can ascertain node failures by having the XSCF report the node state to the software using SNMP (Simple Network Management Protocol).

This function can intentionally trigger a panic or a reset in other nodes to forcibly stop those nodes with certainty and prevent contention over user resources.

MMB asynchronous monitoring (PRIMEQUEST 2000)

This function uses the MMB, which is one of the hardware units installed in PRIMEQUEST 2000, to monitor nodes. The function can ascertain node failures by having the MMB, which is one of the standard units installed in the hardware, report the node state to the software.

This function can intentionally trigger a panic or a reset in other nodes to forcibly stop those nodes with certainty and prevent contention over user resources.

iRMC asynchronous monitoring (PRIMEQUEST 3000)

This function uses the iRMC and MMB, which are the hardware units installed in PRIMEQUEST 3000, to monitor nodes. The function can ascertain node failures by having the iRMC and MMB, the standard units installed in the hardware, report the node state to the software. This function can intentionally trigger a panic, reset or shutting off the power in other nodes to forcibly stop those nodes with certainty and prevent contention over user resources.

Note

This SA is not available in PRIMERGY iRMC.

Note

Node state monitoring of the RCI asynchronous monitoring function operates from when message (a) shown below is output until message (b) is output.
The messages for the console asynchronous monitoring function are messages (c) and (d).
The messages for the SNMP asynchronous monitoring function are messages (e) and (f).
The messages for the MMB asynchronous monitoring function are messages (g) and (h).
The messages for the iRMC asynchronous monitoring function are messages (i) and (j).
When node state monitoring is disabled, the function that forcibly stops nodes may not operate normally.

(a) FJSVcluster: INFO: DEV: 3042: The RCI monitoring agent has been started.
(b) FJSVcluster: INFO: DEV: 3043: The RCI monitoring agent has been stopped.
(c) FJSVcluster: INFO: DEV: 3040: The console monitoring agent has been started (node:monitored node name).
(d) FJSVcluster: INFO: DEV: 3041: The console monitoring agent has been stopped (node:monitored node name).
(e) FJSVcluster: INFO: DEV: 3110: The SNMP monitoring agent has been started.
(f) FJSVcluster: INFO: DEV: 3111: The SNMP monitoring agent has been stopped.
(g) FJSVcluster: INFO: DEV: 3080: The MMB monitoring agent has been started.
(h) FJSVcluster: INFO: DEV: 3081: The MMB monitoring agent has been stopped.
(i) FJSVcluster: INFO: DEV: 3120: The iRMC asynchronous monitoring agent has been started.
(j) FJSVcluster: INFO: DEV: 3121: The iRMC asynchronous monitoring agent has been stopped.