Intel Corporation
Service level agreement-based multi-hardware accelerated inference

Last updated:

Abstract:

Various systems and methods are described for implementing a service-level agreement (SLA) apparatus. The apparatus receives a request from a requester via a network interface of a gateway, the request comprising an inference model identifier that identifies a handler of the request, and a response time indicator. The response time indicator either relates to a time within which the request is to be handled or indicates an undefined time within which the request is to be handled. The apparatus determines a network location of a handler, which is a platform or an inference model, to handle the request consistent with the response time indicator, and routes the request to the handler at the network location.
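
The routing behavior summarized in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. The names `Handler`, `Request`, `select_handler`, and `route`, the `expected_latency_ms` field, and the example addresses are all hypothetical. The selection policy (choose the slowest handler that still satisfies the response time, leaving faster ones free for stricter requests) is an assumption made only for illustration; an undefined response time is modeled as `None`.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Handler:
    """A platform or inference model reachable on the network (hypothetical)."""
    model_id: str               # inference model this handler serves
    location: str               # network location, e.g. "host:port"
    expected_latency_ms: float  # assumed to be known to the gateway


@dataclass(frozen=True)
class Request:
    model_id: str                      # inference model identifier
    response_time_ms: Optional[float]  # None models an undefined response time


def select_handler(request: Request, handlers: List[Handler]) -> Optional[Handler]:
    """Pick a handler consistent with the request's response time indicator."""
    # Keep only handlers that serve the requested inference model.
    candidates = [h for h in handlers if h.model_id == request.model_id]
    # A defined response time excludes handlers that are too slow;
    # an undefined one (None) leaves every matching handler eligible.
    if request.response_time_ms is not None:
        candidates = [
            h for h in candidates
            if h.expected_latency_ms <= request.response_time_ms
        ]
    if not candidates:
        return None
    # Assumed policy: use the slowest eligible handler so that faster
    # hardware stays available for requests with tighter response times.
    return max(candidates, key=lambda h: h.expected_latency_ms)


def route(request: Request, handlers: List[Handler]) -> Optional[str]:
    """Return the network location to forward the request to, if any."""
    handler = select_handler(request, handlers)
    return handler.location if handler is not None else None


if __name__ == "__main__":
    registry = [
        Handler("resnet", "10.0.0.1:9000", 5.0),
        Handler("resnet", "10.0.0.2:9000", 50.0),
    ]
    print(route(Request("resnet", 10.0), registry))
    print(route(Request("resnet", None), registry))
```

In this sketch a request that no handler can satisfy yields `None`; a real gateway would need its own policy for that case, which the abstract does not specify.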

Status:

Grant

Type:

Utility

Filing date:

8 Oct 2020

Issue date:

7 Jun 2022