Workday, Inc.
Distributed monitoring agents for cluster execution of jobs

Abstract:

A system with distributed monitoring agents includes a state storage, a plurality of worker agents, a first processor, and a second processor. A job is executed using a worker agent of the plurality of worker agents. The first processor is configured to execute a first monitor to monitor the job and to restart the job using job state data stored in the state storage in the event that the job fails to successfully complete. The second processor is configured to execute a second monitor to monitor the first monitor and to restart the first monitor using first monitor state data stored in the state storage in the event that the first monitor crashes.
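
The two-level supervision described in the abstract can be illustrated with a minimal single-process sketch. It is not the patented implementation: the class names (StateStorage, Job, FirstMonitor, SecondMonitor, MonitorCrash), the storage keys, and the simulated failure flags are all hypothetical, and the separate processors and worker agents are collapsed into ordinary Python objects for brevity.

```python
from dataclasses import dataclass, field


@dataclass
class StateStorage:
    """Hypothetical in-memory stand-in for the shared state storage."""
    data: dict = field(default_factory=dict)

    def save(self, key, value):
        self.data[key] = value

    def load(self, key, default=None):
        return self.data.get(key, default)


class MonitorCrash(Exception):
    """Simulates the first monitor crashing (assumption for this sketch)."""


class Job:
    """Counts up to `total` steps, checkpointing progress after each step."""

    def __init__(self, storage, total, fail_at=None):
        self.storage, self.total, self.fail_at = storage, total, fail_at

    def run(self):
        step = self.storage.load("job", 0)  # resume from the last checkpoint
        while step < self.total:
            if step == self.fail_at:
                self.fail_at = None  # fail only once in this sketch
                raise RuntimeError("job failed")
            step += 1
            self.storage.save("job", step)
        return step


class FirstMonitor:
    """Watches the job and restarts it from its stored state on failure."""

    def __init__(self, storage, job, crash_once=False):
        self.storage, self.job, self.crash_once = storage, job, crash_once

    def run(self):
        while True:
            try:
                return self.job.run()
            except RuntimeError:
                # Record monitor state before doing anything else.
                restarts = self.storage.load("first_monitor", 0) + 1
                self.storage.save("first_monitor", restarts)
                if self.crash_once:
                    raise MonitorCrash("first monitor crashed")


class SecondMonitor:
    """Watches the first monitor and replaces it if it crashes."""

    def __init__(self, storage, job):
        self.storage, self.job = storage, job

    def run(self, crash_once=False):
        monitor = FirstMonitor(self.storage, self.job, crash_once)
        while True:
            try:
                return monitor.run()
            except MonitorCrash:
                count = self.storage.load("second_monitor", 0) + 1
                self.storage.save("second_monitor", count)
                # The fresh monitor picks up all state from storage.
                monitor = FirstMonitor(self.storage, self.job)


if __name__ == "__main__":
    storage = StateStorage()
    job = Job(storage, total=5, fail_at=3)
    print(SecondMonitor(storage, job).run(crash_once=True), storage.data)
```

In this sketch the job fails once partway through, the first monitor records the failure and then crashes, and the second monitor starts a replacement first monitor that resumes the job from its last checkpoint. Keeping all progress in the shared storage, rather than in the monitors themselves, is what lets either layer be restarted without losing work.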

Status:

Grant

Type:

Utility

Filing date:

18 Jul 2017

Issue date:

12 Nov 2019