Intel Corporation
Service level agreement-based multi-hardware accelerated inference
Last updated:
Abstract:
Various systems and methods for implementing a service-level agreement (SLA) apparatus receive a request from a requester via a network interface of the gateway, the request comprising an inference model identifier that identifies a handler of the request, and a response time indicator. The response time indicator relates to a time within which the request is to be handled indicates an undefined time within which the request is to be handled. The apparatus determines a network location of a handler that is a platform or an inference model to handle the request consistent with the response time indicator, and routes the request to the handler at the network location.
Status:
Grant
Type:
Utility
Filling date:
8 Oct 2020
Issue date:
7 Jun 2022