PRIMECLUSTER Global File Services Configuration and Administration Guide 4.1 (Solaris(TM) 10 Operating System)
Contents Index PreviousNext

Appendix C Trouble shooting> C.3 Common corrective actions against failures

C.3.3 Corrective action when the sfcfrmd daemon is not started

This section describes corrective actions when the sfcfrmd daemon is not started during node is started up, the node enters multi-user mode, or CF is started from GUI.

For integrated file system access being maintained, startup of the sfcfrmd daemon will be suspended until a quorum exists.

If activation of the daemon is suspended, the following message will be output;

WARNING: sfcfsrm:5001: Starting the sfcfrmd daemon was suspended because quorum dose not exist

Normally, no corrective action should be taken to activate the sfcfrmd daemon because it will be started as soon as a quorum exists.

In the following cases, a quorum does not exist, so corrective action should be taken to start operation of the GFS Shared File System.

If GFS cannot be operated because the sfcfrmd daemon is not activated, using the following steps:

Procedure 1. Check the state of all the cluster nodes.

Connect all the operating nodes and check if the same state is displayed using the cftool(1M) command or Cluster Admin GUI.

# cftool -n <Enter>
Node  Number State       Os      Cpu
sunny 1      UP          Solaris Sparc
monny 2      UP          Solaris Sparc

If the node state is different on all the operating nodes, a cluster partition error exists.

For details about cftool(1M), see "Node details" or "Manual pages" of the "PRIMECLUSTER Cluster Foundation (CF) (Solaris(TM)) Configuration and Administration Guide".

Procedure 2. If a cluster partition error occurs, take the following corrective action:

- When a LEFTCLUSTER node exists

If SF is running properly on all the cluster nodes, it will solve the cluster partition error, so no corrective action is necessary. If SF is not running properly, or forced shutdown of the node through SF fails, you need to recover the node manually. Take corrective action according to "Caused by a cluster partition" of the "PRIMECLUSTER Cluster Foundation (CF) (Solaris(TM)) Configuration and Administration Guide".

- When a LEFTCLUSTER node does not exist

Procedure 3. If a cluster partition error does not exist, forcibly restart the sfcfrmd daemon that has been suspended by executing the sfcfrmstart(1M) command on all the nodes where operation will be resumed.

# sfcfrmstart -f <Enter>

Contents Index PreviousNext

All Rights Reserved, Copyright(C) FUJITSU LIMITED 2006