October 2018
Intermediate to advanced
556 pages
15h 18m
English
To answer this question, Little's Law comes to the rescue. This law explains how to calculate the number of requests that are processed simultaneously (or simply how many parallel workers there should be) to handle a predefined throughput at a particular latency level. In other words, using this formula, we can calculate the system capacity, or how many computers, nodes, or web application instances running in parallel we need in order to handle the required number of users per second with a stable response time:

The preceding formula may be explained as: the mean number of requests resident (or the number of requests processed ...