Real-time collaboration for Jupyter Notebooks, Linux Terminals, LaTeX, VS Code, R IDE, and more,
all in one place.
Real-time collaboration for Jupyter Notebooks, Linux Terminals, LaTeX, VS Code, R IDE, and more,
all in one place.
Path: blob/master/notebooks/queue.ipynb
Views: 531
Modeling and Simulation in Python
Case Study: Queueing theory
Copyright 2017 Allen Downey
One queue or two?
This notebook presents a solution to an exercise from Modeling and Simulation in Python. It uses features from the first four chapters to answer a question related to queueing theory, which is the study of systems that involve waiting in lines, also known as "queues".
Suppose you are designing the checkout area for a new store. There is room for two checkout counters and a waiting area for customers. You can make two lines, one for each counter, or one line that serves both counters.
In theory, you might expect a single line to be better, but it has some practical drawbacks: in order to maintain a single line, you would have to install rope barriers, and customers might be put off by what seems to be a longer line, even if it moves faster.
So you'd like to check whether the single line is really better and by how much. Simulation can help answer this question.
As we did in the bikeshare model, we'll assume that a customer is equally likely to arrive during any timestep. I'll denote this probability using the Greek letter lambda, , or the variable name lam
. The value of probably varies from day to day, so we'll have to consider a range of possibilities.
Based on data from other stores, you know that it takes 5 minutes for a customer to check out, on average. But checkout times are highly variable: most customers take less than 5 minutes, but some take substantially more. A simple way to model this variability is to assume that when a customer is checking out, they have the same probability of finishing up during each time step. I'll denote this probability using the Greek letter mu, , or the variable name mu
.
If we choose , the average number of time steps for each checkout will be 5 minutes, which is consistent with the data.
One server, one queue
Write a function called make_system
that takes lam
and mu
as parameters and returns a System
object with variables lam
, mu
, and duration
. Set duration
, which is the number of time steps to simulate, to 10 hours, expressed in minutes.
Test this function by creating a System
object with lam=1/8
and mu=1/5
.
Write an update function that takes as parameters x
, which is the total number of customer in the store, including the one checking out; t
, which is the number of minutes that have elapsed in the simulation, and system
, which is a System
object.
If there's a customer checking out, it should use flip
to decide whether they are done. And it should use flip
to decide if a new customer has arrived.
It should return the total number of customers at the end of the time step.
Test your function by calling it with x=1
, t=0
, and the System
object you created. If you run it a few times, you should see different results.
Now we can run the simulation. Here's a version of run_simulation
that creates a TimeSeries
with the total number of customers in the store, including the one checking out.
Call run_simulation
with your update function and plot the results.
After the simulation, we can compute L
, which is the average number of customers in the system, and W
, which is the average time customers spend in the store. L
and W
are related by Little's Law:
Where is the arrival rate. Here's a function that computes them.
Call compute_metrics
with the results from your simulation.
Parameter sweep
Since we don't know the actual value of , we can sweep through a range of possibilities, from 10% to 80% of the completion rate, . (If customers arrive faster than the completion rate, the queue grows without bound. In that case the metrics L
and W
just depend on how long the store is open.)
Create an array of values for lam
.
Write a function that takes an array of values for lam
, a single value for mu
, and an update function.
For each value of lam
, it should run a simulation, compute L
and W
, and store the value of W
in a SweepSeries
.
It should return the SweepSeries
.
Call your function to generate a SweepSeries
, and plot it.
If we imagine that this range of values represents arrival rates on different days, we can use the average value of W
, for a range of values of lam
, to compare different queueing strategies.
Analysis
The model I chose for this system is a common model in queueing theory, in part because many of its properties can be derived analytically.
In particular, we can derive the average time in the store as a function of and :
The following function plots the theoretical value of as a function of .
Use this function to plot the theoretical results, then plot your simulation results again on the same graph. How do they compare?
Multiple servers
Now let's try the other two queueing strategies:
One queue with two checkout counters.
Two queues, one for each counter.
The following figure shows the three scenarios:
Write an update function for one queue with two servers.
Use this update function to simulate the system, plot the results, and print the metrics.
Since we have two checkout counters now, we can consider values for that exceed .
Create a new array of values for lam
from 10% to 160% of mu
.
Use your sweep function to simulate the two server, one queue scenario with a range of values for lam
.
Plot the results and print the average value of W
across all values of lam
.
Multiple queues
To simulate the scenario with two separate queues, we need two state variables to keep track of customers in each queue.
Write an update function that takes x1
, x2
, t
, and system
as parameters and returns x1
and x2
as return values. f you are not sure how to return more than one return value, see compute_metrics
.
When a customer arrives, which queue do they join?
Write a version of run_simulation
that works with this update function.
Test your functions by running a simulation with a single value of lam
.
Sweep a range of values for lam
, plot the results, and print the average wait time across all values of lam
.
How do the results compare to the scenario with two servers and one queue.