Real-time collaboration for Jupyter Notebooks, Linux Terminals, LaTeX, VS Code, R IDE, and more,
all in one place.

GitHub Repository: AllenDowney/ModSimPy
Path: blob/master/notebooks/queue.ipynb
Views: ⁵³¹

Kernel: Python 3

Modeling and Simulation in Python

Case Study: Queueing theory

License: Creative Commons Attribution 4.0 International

In [1]:

# Configure Jupyter so figures appear in the notebook
%matplotlib inline

# Configure Jupyter to display the assigned value after an assignment
%config InteractiveShell.ast_node_interactivity='last_expr_or_assign'

# import functions from the modsim.py module
from modsim import *

# set the random number generator
np.random.seed(7)

One queue or two?

This notebook presents a solution to an exercise from Modeling and Simulation in Python. It uses features from the first four chapters to answer a question related to queueing theory, which is the study of systems that involve waiting in lines, also known as "queues".

Suppose you are designing the checkout area for a new store. There is room for two checkout counters and a waiting area for customers. You can make two lines, one for each counter, or one line that serves both counters.

In theory, you might expect a single line to be better, but it has some practical drawbacks: in order to maintain a single line, you would have to install rope barriers, and customers might be put off by what seems to be a longer line, even if it moves faster.

So you'd like to check whether the single line is really better and by how much. Simulation can help answer this question.

As we did in the bikeshare model, we'll assume that a customer is equally likely to arrive during any timestep. I'll denote this probability using the Greek letter lambda, $\lambda$ , or the variable name lam. The value of $\lambda$ probably varies from day to day, so we'll have to consider a range of possibilities.

Based on data from other stores, you know that it takes 5 minutes for a customer to check out, on average. But checkout times are highly variable: most customers take less than 5 minutes, but some take substantially more. A simple way to model this variability is to assume that when a customer is checking out, they have the same probability of finishing up during each time step. I'll denote this probability using the Greek letter mu, $\mu$ , or the variable name mu.

If we choose $\mu=1/5$ , the average number of time steps for each checkout will be 5 minutes, which is consistent with the data.

One server, one queue

Write a function called make_system that takes lam and mu as parameters and returns a System object with variables lam, mu, and duration. Set duration, which is the number of time steps to simulate, to 10 hours, expressed in minutes.

In [2]:

# Solution goes here

Test this function by creating a System object with lam=1/8 and mu=1/5.

In [3]:

# Solution goes here

Write an update function that takes as parameters x, which is the total number of customer in the store, including the one checking out; t, which is the number of minutes that have elapsed in the simulation, and system, which is a System object.

If there's a customer checking out, it should use flip to decide whether they are done. And it should use flip to decide if a new customer has arrived.

It should return the total number of customers at the end of the time step.

In [4]:

# Solution goes here

Test your function by calling it with x=1, t=0, and the System object you created. If you run it a few times, you should see different results.

In [5]:

# Solution goes here

Now we can run the simulation. Here's a version of run_simulation that creates a TimeSeries with the total number of customers in the store, including the one checking out.

In [6]:

def run_simulation(system, update_func):
    """Simulate a queueing system.
    
    system: System object
    update_func: function object
    """
    x = 0
    results = TimeSeries()
    results[0] = x
    
    for t in linrange(0, system.duration):
        x = update_func(x, t, system)
        results[t+1] = x

    return results

Call run_simulation with your update function and plot the results.

In [7]:

# Solution goes here

After the simulation, we can compute L, which is the average number of customers in the system, and W, which is the average time customers spend in the store. L and W are related by Little's Law:

$L = \lambda W$

Where $\lambda$ is the arrival rate. Here's a function that computes them.

In [8]:

def compute_metrics(results, system):
    """Compute average number of customers and wait time.
    
    results: TimeSeries of queue lengths
    system: System object
    
    returns: L, W
    """
    L = results.mean()
    W = L / system.lam
    return L, W

Call compute_metrics with the results from your simulation.

In [9]:

# Solution goes here

Parameter sweep

Since we don't know the actual value of $\lambda$ , we can sweep through a range of possibilities, from 10% to 80% of the completion rate, $\mu$ . (If customers arrive faster than the completion rate, the queue grows without bound. In that case the metrics L and W just depend on how long the store is open.)

Create an array of values for lam.

In [10]:

# Solution goes here

Write a function that takes an array of values for lam, a single value for mu, and an update function.

For each value of lam, it should run a simulation, compute L and W, and store the value of W in a SweepSeries.

It should return the SweepSeries.

In [11]:

# Solution goes here

Call your function to generate a SweepSeries, and plot it.

In [12]:

# Solution goes here

In [13]:

# Solution goes here

If we imagine that this range of values represents arrival rates on different days, we can use the average value of W, for a range of values of lam, to compare different queueing strategies.

In [14]:

# W_avg = sweep.mean()

Analysis

The model I chose for this system is a common model in queueing theory, in part because many of its properties can be derived analytically.

In particular, we can derive the average time in the store as a function of $\mu$ and $\lambda$ :

$W = 1 / (\mu - \lambda)$

The following function plots the theoretical value of $W$ as a function of $\lambda$ .

In [15]:

def plot_W(lam_array, mu):
    """Plot the theoretical mean wait time.
    
    lam_array: array of values for `lam`
    mu: probability of finishing a checkout
    """
    W = 1 / (mu - lam_array)
    plot(lam_array, W, 'g-')

Use this function to plot the theoretical results, then plot your simulation results again on the same graph. How do they compare?

In [16]:

# Solution goes here

Multiple servers

Now let's try the other two queueing strategies:

One queue with two checkout counters.
Two queues, one for each counter.

The following figure shows the three scenarios:

Write an update function for one queue with two servers.

In [17]:

# Solution goes here

Use this update function to simulate the system, plot the results, and print the metrics.

In [18]:

# Solution goes here

Since we have two checkout counters now, we can consider values for $\lambda$ that exceed $\mu$ .

Create a new array of values for lam from 10% to 160% of mu.

In [19]:

# Solution goes here

Use your sweep function to simulate the two server, one queue scenario with a range of values for lam.

Plot the results and print the average value of W across all values of lam.

In [20]:

# Solution goes here

In [21]:

# Solution goes here

Multiple queues

To simulate the scenario with two separate queues, we need two state variables to keep track of customers in each queue.

Write an update function that takes x1, x2, t, and system as parameters and returns x1 and x2 as return values. f you are not sure how to return more than one return value, see compute_metrics.

When a customer arrives, which queue do they join?

In [22]:

# Solution goes here

Write a version of run_simulation that works with this update function.

In [23]:

# Solution goes here

Test your functions by running a simulation with a single value of lam.

In [24]:

# Solution goes here

Sweep a range of values for lam, plot the results, and print the average wait time across all values of lam.

How do the results compare to the scenario with two servers and one queue.

In [25]:

# Solution goes here

In [26]:

# Solution goes here

In [27]:

# Solution goes here

In [ ]:

Real-time collaboration for Jupyter Notebooks, Linux Terminals, LaTeX, VS Code, R IDE, and more,
all in one place.

Modeling and Simulation in Python

One queue or two?

One server, one queue

Parameter sweep

Analysis

Multiple servers

Multiple queues

Product

Resources

Company

Real-time collaboration for Jupyter Notebooks, Linux Terminals, LaTeX, VS Code, R IDE, and more, all in one place.

Modeling and Simulation in Python

One queue or two?

One server, one queue

Parameter sweep

Analysis

Multiple servers

Multiple queues

Real-time collaboration for Jupyter Notebooks, Linux Terminals, LaTeX, VS Code, R IDE, and more,
all in one place.