Intel Corporation
Service level agreement-based multi-hardware accelerated inference

Last updated: 8 Jun 2022

Abstract:

Various systems and methods for implementing a service-level agreement (SLA) apparatus are described. The apparatus receives a request from a requester via a network interface of a gateway, the request comprising an inference model identifier that identifies a handler of the request, and a response time indicator. The response time indicator either relates to a time within which the request is to be handled or indicates an undefined time within which the request is to be handled. The apparatus determines a network location of a handler, which is a platform or an inference model, to handle the request consistent with the response time indicator, and routes the request to the handler at that network location.
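
The routing step in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the names Handler, Request, select_handler, and route, the latency field, the example addresses, and the "pick the fastest eligible handler" policy are all assumptions made for illustration. A response time of None stands in for the "undefined time" case.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Handler:
    """Hypothetical record for a platform or inference model known to the gateway."""
    model_id: str               # inference model identifier served by this handler
    location: str               # network location (address format is an assumption)
    expected_latency_ms: float  # assumed latency estimate for this handler


@dataclass(frozen=True)
class Request:
    """Hypothetical request: model identifier plus a response time indicator."""
    model_id: str
    response_time_ms: Optional[float]  # None means the response time is undefined


def select_handler(request: Request, handlers: List[Handler]) -> Optional[Handler]:
    """Pick a handler for the requested model that is consistent with the deadline."""
    candidates = [h for h in handlers if h.model_id == request.model_id]
    if request.response_time_ms is not None:
        # Keep only handlers expected to answer within the requested time.
        candidates = [
            h for h in candidates
            if h.expected_latency_ms <= request.response_time_ms
        ]
    if not candidates:
        return None
    # Assumed policy: choose the fastest eligible handler.
    return min(candidates, key=lambda h: h.expected_latency_ms)


def route(request: Request, handlers: List[Handler]) -> Optional[str]:
    """Return the network location to forward the request to, or None if none fits."""
    handler = select_handler(request, handlers)
    return handler.location if handler is not None else None
```

In this sketch a request with a 10 ms indicator would be routed only to a handler whose estimated latency is at most 10 ms, while a request with an undefined indicator may go to any handler serving the named model. A real gateway would likely weigh load, cost, and accelerator type as well; those factors are omitted here.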

Status:

Grant

Type:

Utility

Filing date:

8 Oct 2020

Issue date:

7 Jun 2022

Full patent description

Patent application document
