Deprecated: Function create_function() is deprecated in
/home/cerezona/public_html/blog/wp-content/plugins/codecolorer/lib/geshi.php on line
4698
Elasticity and scalability via dynamic provisioning of servers and other resources is one of the tenets of cloud computing: in practice, being able to automatically grow the resources available to an application based on the level of demand is actually eased with patterns like WASABi, the Autoscaling Application Block for Azure.
But even if an application can automatically grow, a question is still unanswered: what is the minimum number of servers that should be reserved? And that is a complex question that requires trading off the losses of being unable to properly service requests because booting up instances takes some minutes, and the actual costs of those instances that may rest idle with no load.
Let’s consider a service with low margin (10%) and a really intensive computation (8 requests per second per Azure XL server) were the probability that a customer does not enter the service due to congestion is 1% and a much higher probability of 2.5% for those leaving the service after a request: to model the whole system an M/M/s queue is considered, that is, s servers experiencing a Poisson-like input process and an exponential service time, with an unlimited waiting queue. To solve the proposed queue model, R code and its resulting plot as follows:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47
| # Minimum number of roles to calculate
minRoles <- 4
# Maximum number of roles to calculate
maxRoles <- 12
# Probability of not entering service due to high-load
probNotJoin <- 0.01
lossNotJoin <- 0
# Probability of leaving the service due to high-load
probLeaving <- 0.025
lossLeaving <- 0
# Revenue per request
customerRevenue <- 1
# Profit per request
customerProfit <- 0.1
# Azure XL web/worker role ($0.96/hour)
costServer <- 0.96
# New requests/hour
arrivalRate <- 90000
# Requests/hour serviced per role
serviceRate <- 30000
serviceCost <- 0
totalCost <- 0
library(pdq, help)
for(r in minRoles :maxRoles ) {
Init (“”)
CreateMultiNode (r, “server”, CEN, FCFS)
CreateOpen (“requests”, arrivalRate )
SetDemand (“server”, “requests”, 1/serviceRate )
Solve (CANON)
queueLength <- GetQueueLength (“server”, “requests”, TRANS) - arrivalRate /serviceRate
waitingTime <- 60 * queueLength /arrivalRate
numNotJoin <- queueLength * probNotJoin * arrivalRate
numLeaving <- waitingTime * probLeaving * arrivalRate
serviceCost [r ] <- r * costServer
lossNotJoin [r ] <- numNotJoin * customerRevenue * customerProfit
lossLeaving [r ] <- numLeaving * customerRevenue * customerProfit
totalCost [r ] <- serviceCost [r ] + lossNotJoin [r ] + lossLeaving [r ]
}
plot(totalCost, xlim =c(minRoles,maxRoles ), type =“b”, col=“black”, lwd =2,
main =“Optimal Elasticity of Demand”, xlab =“Servers (m)”, ylab =“Cost ($)”)
lines(serviceCost, xlim =c(minRoles,maxRoles ), type =“b”, col=“brown”, lty =“dashed”)
lines(lossNotJoin, xlim =c(minRoles,maxRoles ), type =“b”, col=“green”, lty =“dashed”)
lines(lossLeaving, xlim =c(minRoles,maxRoles ), type =“b”, col=“orange”, lty =“dashed”) |

The black curve represents the total costs: the sum of the server costs (brown curve) and losses due to not entering the service (green curve) and leaving the service (orange curve). An optimal reservation of servers is found at s=8, the minimum of the total curve costs.