Abstract:The adoption of cloud computing has been accelerating over the last decade, while enterprise cloud users (“firms”) are struggling to manage their growing cloud expenditures in the face of intermittent digital demand surges caused by planned or random events. To deal with such challenges, a firm can employ reserved instances with the standard contract length (e.g., one year) to meet the stationary base demand, which we refer to as the base contracts, complemented by additional reserved instances with either standard or shorter contract lengths, which we refer to as the supplementary contracts, to cope with the demand surges. We first analyze a model whereby the surge and inter-surge durations are deterministic, demand magnitude is random, and cancellation of the reservations is allowed. We develop a capacity management plan for the firm including not only the optimal capacity levels, which follow a newsvendor-type solution, but also the optimal policy for managing the purchase, renewal, cancellation, or expiration of the supplementary contracts, which can be characterized as a two-threshold policy. Due to the complexity of the structure of the optimal policy, we also construct an effective heuristic policy by excluding the renewal option from the action space, which can be applied to a more general setting where the surge and inter-surge durations are random. We examine two model extensions: (1) when trades of reserved instances are allowed in a secondary marketplace; (2) when the firm does not have exact information about the distributions of the surge magnitude and duration while it can adjust the capacity levels as data unveils. Our analysis shows that the optimal policy for managing the supplementary contracts depends on the relative magnitude of the surge and inter-surge durations in relationship to the cancellation fee. Moreover, our numerical results show that cloud platforms that offer a secondary marketplace are more attractive to firms from a cost standpoint than those that allow cancellation only. The latter, without the secondary marketplace, however, can achieve parity with the former by offering a deeper discount rate for the reserved instances, thereby bypassing the cost of administering the secondary marketplace.

Delay and Price Differentiation in Cloud Computing: A Service Model, Supporting Architectures, and Performance

Performance Analysis of QoS-Differentiated Pricing in Cloud Computing: An Analytical Approach

A Novel Job Scheduling Model to Enhance Efficiency and Overall User Fairness of Cloud Computing Environment.

A Game-Theoretic Method of Fair Resource Allocation for Cloud Computing Services

A Framework for Allocating Server Time to Spot and On-demand Services in Cloud Computing

EXPRESS: Cost Optimization in Cloud Computing: Capacity Reservation for Intermittent Random Demand Surges

Revenue Maximization Using Adaptive Resource Provisioning in Cloud Computing Environments

On Optimal Service Differentiation in Congested Network Markets

Pricing the cloud: a QoS-based auction approach

Pricing Policy and Computational Resource Provisioning for Delay-Aware Mobile Edge Computing

On the Economics of Cloud Markets

SRD Service: A Stochastic Network Calculus Approach for Service Differentiation Analysis

A Competitive Penalty Model for Availability Based Cloud SLA

Less is More: Service Profit Maximization in Geo-Distributed Clouds

SLA-Driven Load Scheduling in Multi-Tier Cloud Computing: Financial Impact Considerations

A Utility Game Driven QoS Optimization for Cloud Services

Dynamic Pricing for Resource Consumption in Cloud Service.

Pricing As a Service: Personalized Pricing Strategy in Cloud Computing

Tiered Cloud Storage via Two-Stage, Latency-Aware Bidding

Reputation-Based QoS Provisioning in Cloud Computing via Dirichlet Multinomial Model

Flexible Instance: Meeting Deadlines of Delay Tolerant Jobs in the Cloud with Dynamic Pricing