International Business Machines Corporation
Failure detection for central electronics complex group management

Abstract:

Examples of techniques for failure detection for central electronics complex (CEC) group management are described herein. An aspect includes issuing a first virtual input/output server (VIOS) probe to a hardware management console (HMC) of a CEC group. Another aspect includes receiving a first response packet that includes health data corresponding to a plurality of VIOSes. Another aspect includes determining, based on the first response packet, that cluster down is indicated on a first VIOS. Another aspect includes, based on determining that cluster down is indicated on the first VIOS, getting a VIOS state for the first VIOS from the HMC. Another aspect includes determining, based on the VIOS state, that the first VIOS is in a down state, and determining that the first VIOS is unhealthy. Another aspect includes updating a health data entry corresponding to the first VIOS to indicate that the first VIOS is unhealthy.
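
The sequence in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name in it (ViosHealth, detect_unhealthy_vioses, the probe and get_vios_state callables, the "down" and "unhealthy" strings, and the dictionary used as the health table) is a hypothetical stand-in, since the abstract does not specify data formats or HMC interfaces. The abstract also does not say what happens when the HMC reports a state other than down, so the sketch leaves such entries unchanged.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable


@dataclass
class ViosHealth:
    """One per-VIOS record in a probe response packet (hypothetical layout)."""
    name: str
    cluster_down: bool


def detect_unhealthy_vioses(
    probe: Callable[[], Iterable[ViosHealth]],
    get_vios_state: Callable[[str], str],
    health_table: Dict[str, str],
) -> Dict[str, str]:
    """Run one probe cycle and mark VIOSes confirmed down as unhealthy.

    probe          -- assumed stand-in for issuing a VIOS probe to the HMC
    get_vios_state -- assumed stand-in for querying the HMC for one VIOS
    health_table   -- assumed mapping of VIOS name to health status
    """
    # Issue the probe and receive the response packet with health data.
    response = probe()

    for entry in response:
        # Only a VIOS that indicates cluster down needs a second check.
        if not entry.cluster_down:
            continue

        # Confirm with the HMC before declaring the VIOS unhealthy.
        state = get_vios_state(entry.name)
        if state == "down":
            health_table[entry.name] = "unhealthy"

    return health_table
```

In this sketch the HMC state query is issued only for a VIOS whose probe data indicates cluster down, which mirrors the two-step confirmation the abstract describes: the probe result alone does not mark a VIOS as unhealthy.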

Status:

Grant

Type:

Utility

Filing date:

27 Apr 2020

Issue date:

21 Dec 2021