Adobe Inc.
Capacity-based scaling of queue-based services
Last updated:
Abstract:
Capacity-based scaling of queue-based resources is described. Initially, a scaling system measures capacity of service processors that are instantiated at a cloud computing service system to provide a service on behalf of a service provider, and also measures a load on these processors. In contrast to conventional scaling systems--which base scalings on a number of queued messages which the instantiated service processors process to provide the service--the scaling system measures the load in terms of not only the number of messages held in a queue but also an input rate of the messages to the queue. The described scaling system then determines whether and by how much to scale the instantiated processors based on this number of messages and input rate. Given this, the scaling system instructs the cloud computing service system how to scale the instantiated service processors to provide the service.
Utility
19 Oct 2018
8 Dec 2020