Book a Demo!
CoCalc Logo Icon
StoreFeaturesDocsShareSupportNewsAboutPoliciesSign UpSign In
codebasics
GitHub Repository: codebasics/deep-learning-keras-tf-tutorial
Path: blob/master/45_prefatch/prefetch_caching.ipynb
1141 views
Kernel: Python 3

Optimize tensorflow pipeline performance with prefetch and caching

# TensorFlow provides the tf.data input-pipeline API benchmarked below;
# time supplies the sleeps that simulate I/O and compute latency.
import tensorflow as tf
import time

# Record the TensorFlow version the timings in this notebook were taken with.
tf.__version__
'2.5.0'

Prefetch

class FileDataset(tf.data.Dataset):
    """Synthetic file-backed dataset for pipeline benchmarking.

    File open and per-record reads are simulated with fixed sleeps
    (0.03 s to "open", 0.015 s per record) so the benefit of
    overlapping I/O with training (prefetch) is measurable.
    """

    # NOTE(review): deliberately has no `self` — tf.data calls this as a
    # plain generator callable via from_generator, not as a bound method.
    def read_file_in_batches(num_samples):
        # Opening the file
        time.sleep(0.03)
        for sample_idx in range(num_samples):
            # Reading data (line, record) from the file
            time.sleep(0.015)
            # Each element is a 1-tuple so it matches the shape=(1,) signature.
            yield (sample_idx,)

    # Constructing FileDataset(...) actually returns a tf.data.Dataset
    # instance built from the generator above, not a FileDataset object.
    def __new__(cls, num_samples=3):
        return tf.data.Dataset.from_generator(
            cls.read_file_in_batches,
            output_signature = tf.TensorSpec(shape = (1,), dtype = tf.int64),
            args=(num_samples,)
        )
def benchmark(dataset, num_epochs=2):
    """Consume *dataset* for *num_epochs* full passes.

    Each element costs a fixed 0.01 s sleep standing in for one
    training step, so total wall time reflects pipeline efficiency.
    """
    for _epoch in range(num_epochs):
        for _record in dataset:
            time.sleep(0.01)  # simulate one training step
%%timeit benchmark(FileDataset())
304 ms ± 10.8 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)
%%timeit benchmark(FileDataset().prefetch(1))
238 ms ± 6.64 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)
%%timeit benchmark(FileDataset().prefetch(tf.data.AUTOTUNE))
240 ms ± 7.28 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

As you can see above, using `prefetch` improves performance from about 304 ms to roughly 238–240 ms per run.

Cache

# Build a tiny pipeline: integers 0..4, squared, cached to a file.
dataset = tf.data.Dataset.range(5)
dataset = dataset.map(lambda x: x**2)
# NOTE(review): caching to a named file persists across notebook runs —
# a stale "mycache.txt" would be reused instead of recomputing; verify
# the file is removed when the pipeline definition changes.
dataset = dataset.cache("mycache.txt")
# The first time reading through the data will generate the data using
# `range` and `map`.
list(dataset.as_numpy_iterator())
[0, 1, 4, 9, 16]
# Subsequent iterations read from the cache. list(dataset.as_numpy_iterator())
[0, 1, 4, 9, 16]
def mapped_function(s):
    """Identity map that simulates 0.03 s of heavy per-element pre-processing.

    Returns *s* unchanged; the cost comes from an eager py_function
    side effect, making this map worth caching.
    """
    simulate_work = lambda: time.sleep(0.03)
    # Do some hard pre-processing
    tf.py_function(simulate_work, [], ())
    return s
%%timeit -r1 -n1 benchmark(FileDataset().map(mapped_function), 5)
1.25 s ± 0 ns per loop (mean ± std. dev. of 1 run, 1 loop each)
%%timeit -r1 -n1 benchmark(FileDataset().map(mapped_function).cache(), 5)
528 ms ± 0 ns per loop (mean ± std. dev. of 1 run, 1 loop each)