Systemwalker Runbook Automation Reference Guide

3.10.1 Check for hardware errors

Description

This operation component uses IPMI to check for hardware errors based on the sensor information for the specified hardware.

Options

Basic Options

(1) ipmiipaddress

IP address defined for the server IPMI.

An argument error will occur if this is omitted.

The maximum length of the IP address is 1,024 characters. An argument error will occur if this limit is exceeded.

(2) ipmiusername

Name of the user defined for the server IPMI.

An argument error will occur if this is omitted.

The maximum length of the user name is 1,024 characters. An argument error will occur if this limit is exceeded.

(3) ipmipassword

Password of the user defined for the server IPMI.

An argument error will occur if this is omitted.

The maximum length of the password is 1,024 characters. An argument error will occur if this limit is exceeded.
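
The constraints on the three basic options can be checked before the operation component is called. The following is a minimal sketch in Python, not part of the product: the function name `find_argument_errors` and the dictionary-based input are assumptions made for illustration, and only the option names and the 1,024-character limit come from the descriptions above.

```python
MAX_OPTION_LENGTH = 1024  # limit stated for each basic option

REQUIRED_OPTIONS = ("ipmiipaddress", "ipmiusername", "ipmipassword")


def find_argument_errors(options):
    """Return a list of problems; an empty list means the basic options pass.

    `options` is a hypothetical dict mapping option names to string values.
    """
    problems = []
    for name in REQUIRED_OPTIONS:
        value = options.get(name)
        if not value:
            # Omitting a basic option is described as an argument error.
            problems.append(f"{name} is omitted")
        elif len(value) > MAX_OPTION_LENGTH:
            # Exceeding 1,024 characters is also described as an argument error.
            problems.append(f"{name} exceeds {MAX_OPTION_LENGTH} characters")
    return problems


if __name__ == "__main__":
    sample = {
        "ipmiipaddress": "192.0.2.10",  # documentation-only example address
        "ipmiusername": "admin",
        "ipmipassword": "x" * 1025,     # one character over the limit
    }
    print(find_argument_errors(sample))
```

A check of this kind only mirrors the documented limits; the operation component itself remains the authority on whether an argument is accepted.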

Advanced Options

(1) timeout

This is the completion timeout (in seconds) for executing the operation component.

Values between 300 and 86400 (1 day) can be specified.

Example: If the completion timeout is 10 minutes: 600

If the operation component has not finished executing when the specified time has passed, the processing for the operation component will be interrupted with return value 201.

(2) retry

This is the retry count for executing the operation component.

Specify the number of retry attempts to be used when the operation component terminates with return value 161. Values between 0 and 5 can be specified.

If a retried execution terminates with a return value other than 161, the operation component stops at that point, even if the specified number of retries has not been reached. The return value for the operation component will be the value from the last time the operation component was executed.

Example: To retry twice: 2

(3) retry_interval

This is the retry interval (in seconds) for the operation component.

Specify the time to wait before a retry is attempted if the operation component terminates with return value 161. Values between 1 and 14400 can be specified.

Example: To retry at 300 second intervals: 300


If timeout, retry, or retry_interval is omitted, or if a value outside the permitted range is specified, the value set in the operation components definition file takes effect. Refer to "2.5 Definition File for Operation Components" for details.
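
The behavior of the advanced options can be illustrated with a short Python sketch. It is a model of the rules described above under stated assumptions, not the product's implementation: the names `effective_value` and `run_with_retries` are hypothetical, the definition file value is passed in as a plain argument, and `execute` stands for any callable that returns a return value.

```python
import time

RETRYABLE = 161  # the only return value that triggers a retry

# Permitted ranges taken from the option descriptions above.
RANGES = {
    "timeout": (300, 86400),
    "retry": (0, 5),
    "retry_interval": (1, 14400),
}


def effective_value(name, value, definition_file_value):
    """Use the supplied value if it is within range, else the definition file value."""
    low, high = RANGES[name]
    if value is None or not (low <= value <= high):
        return definition_file_value
    return value


def run_with_retries(execute, retry, retry_interval, sleep=time.sleep):
    """Call execute() once, then up to `retry` more times while it returns 161.

    The result is the return value of the last execution.
    """
    return_value = execute()
    attempts_left = retry
    while return_value == RETRYABLE and attempts_left > 0:
        sleep(retry_interval)  # wait before each retry
        return_value = execute()
        attempts_left -= 1
    return return_value


if __name__ == "__main__":
    outcomes = iter([161, 161, 0])  # two failures, then success
    result = run_with_retries(lambda: next(outcomes), retry=2,
                              retry_interval=300, sleep=lambda seconds: None)
    print(result)
```

The `sleep` parameter exists only so that the wait can be replaced in a demonstration; with the default, the sketch would pause for the full retry interval.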

Return Values

Name                   | Return value | Description
Normal                 | 0            | Successfully checked for hardware errors. No errors have been found.
Normal                 | 2            | Successfully checked for hardware errors. No errors were detected, but information could not be acquired from some sensors.
Abnormality discovered | 1            | Successfully checked for hardware errors. Either an error was detected or information could not be acquired from any sensors.
Failure                | 161          | Failed to check for hardware errors. If a retry count has been specified, retries will be performed.
Failure                | 197          | There is an error with the content of an option.
-                      | 201          | The execution of the operation component has timed out.
-                      | 202          | The operation component has not been executed. There is a problem with the settings for executing the operation component.
-                      | 203          | The operation component has not been executed normally. There is a problem with the environment for the Management Server.
-                      | 205          | The operation component has not been executed. There is a problem with the input information specification of the operation component.
-                      | 206          | The operation component has not been executed normally. There is a problem with the output information specification of the operation component.
-                      | 207          | The operation component has not been executed. The operation component may not have been registered on the Management Server.
-                      | 208          | The Automated Operation Process has been canceled because the Automated Operation Process was recovered while the operation component was executing.
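
A later step in a process often needs to branch on the return value. The following Python sketch groups the documented values by their names; the function names `return_value_name` and `sensors_fully_read` are hypothetical, and return values that are not listed here are rejected rather than guessed at.

```python
# Names as listed for each documented return value.
RETURN_VALUE_NAMES = {
    0: "Normal",
    2: "Normal",
    1: "Abnormality discovered",
    161: "Failure",
    197: "Failure",
}

# Documented return values that have no name ("-").
UNNAMED_RETURN_VALUES = {201, 202, 203, 205, 206, 207, 208}


def return_value_name(code):
    """Return the documented name for a return value, or "-" if it has none."""
    if code in RETURN_VALUE_NAMES:
        return RETURN_VALUE_NAMES[code]
    if code in UNNAMED_RETURN_VALUES:
        return "-"
    raise ValueError(f"undocumented return value: {code}")


def sensors_fully_read(code):
    """True only for 0; with 2 no errors were detected but some sensors were unreadable."""
    return code == 0


if __name__ == "__main__":
    for code in (0, 2, 1, 161, 201):
        print(code, return_value_name(code))
```

Keeping 0 and 2 distinguishable is a design choice of this sketch: both are named "Normal", but 2 means that information could not be acquired from some sensors.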

Output information

Variable             | Description
message              | If the check for hardware errors succeeds, this variable is set to the following message: "The operation component was successful." If the check fails, the content of the error is set as a string.
sensor_status_result | This variable is set to a list of the sensor information that has been obtained.
returnCode           | This variable is set to the return value.

A list of the acquired sensor information is output to sensor_status_result in the following format.

<sensor name>|<current state>|<threshold evaluation result>

[Output example]

CPU Fan 1        | 5709.72 RPM       | ok
CPU Fan 2        | 5709.72 RPM       | ok
System 5V        | 4.91 Volts        | ok
System 3.3V      | 3.29 Volts        | ok
CPU0 Dmn 0 Temp  | 48 degrees C      | ok
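
The pipe-delimited format above is straightforward to split into fields. The following Python sketch parses sensor_status_result text into records; the names `SensorReading`, `parse_sensor_status`, and `readings_not_ok` are hypothetical. It also assumes that any threshold evaluation result other than "ok" deserves attention, which the source does not state, since the example shows only "ok".

```python
from typing import NamedTuple


class SensorReading(NamedTuple):
    name: str        # <sensor name>
    state: str       # <current state>
    evaluation: str  # <threshold evaluation result>


def parse_sensor_status(text):
    """Split each non-empty line into its three pipe-delimited fields."""
    readings = []
    for line in text.splitlines():
        if not line.strip():
            continue
        parts = [part.strip() for part in line.split("|")]
        if len(parts) != 3:
            raise ValueError(f"unexpected sensor line: {line!r}")
        readings.append(SensorReading(*parts))
    return readings


def readings_not_ok(readings):
    """Assumption: anything other than "ok" is worth reporting."""
    return [reading for reading in readings if reading.evaluation != "ok"]


if __name__ == "__main__":
    sample = (
        "CPU Fan 1        | 5709.72 RPM       | ok\n"
        "System 5V        | 4.91 Volts        | ok\n"
        "CPU0 Dmn 0 Temp  | 48 degrees C      | ok\n"
    )
    for reading in parse_sensor_status(sample):
        print(reading.name, "->", reading.state, "->", reading.evaluation)
```

Lines that do not contain exactly three fields raise an error rather than being skipped, so that an unexpected format is noticed early.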

Notes