GitHub Repository: tensorflow/docs-l10n
Path: blob/master/site/zh-cn/lite/r1/convert/python_api.md
²⁵¹¹⁸ views

Python API 指南

本页提供了一些示例来说明如何通过 Python API 调用 TensorFlow Lite 转换器，以及解释器。

注意 : 本文介绍的是 Tensorflow nightly 版本的转换器，运行 pip install tf-nightly 安装此版本。旧版文档请参考“转换 TensorFlow 1.12 及之前版本的模型”。

[TOC]

概述

虽然也可以在命令行中调用 TensorFlow Lite 转换器，但用 Python 脚本调用 API 的方式可以作为模型开发流水线 (model development pipeline) 的一环，通常会更加便捷；可以让你更早的了解正在设计的模型是否针对移动设备

API

tf.lite.TFLiteConverter：用于将 TensorFlow 模型转换为 TensorFlow Lite 的 API。 tf.lite.Interpreter：用于调用 Python 解释器的 API。

针对不同的模型原始格式，TFLiteConverter 提供了多种用于转换的类方法。 TFLiteConverter.from_session() 用于 GraphDefs。 TFLiteConverter.from_saved_model() 用于 SavedModels。 TFLiteConverter.from_keras_model_file() 用于 tf.Keras 文件。基本示例展示简单浮点模型的用法。复杂示例展示更复杂的模型用法。

基本示例

以下部分显示了如何把基本浮点模型从各种原始数据格式转换成 TensorFlow Lite FlatBuffers。

使用 tf.Session 导出 GraphDef

以下示例展示了如何从 tf.Session 对象转换一个 TensorFlow GraphDef 成 TensorFlow Lite FlatBuffer。

import tensorflow as tf

img = tf.placeholder(name="img", dtype=tf.float32, shape=(1, 64, 64, 3))
var = tf.get_variable("weights", dtype=tf.float32, shape=(1, 64, 64, 3))
val = img + var
out = tf.identity(val, name="out")

with tf.Session() as sess:
  sess.run(tf.global_variables_initializer())
  converter = tf.lite.TFLiteConverter.from_session(sess, [img], [out])
  tflite_model = converter.convert()
  open("converted_model.tflite", "wb").write(tflite_model)

使用文件导出 GraphDef

以下示例展示了当 GraphDef 被存成文件时，是怎样转换一个 TensorFlow GraphDef 到 TensorFlow Lite FlatBuffer。支持文件后缀为 .pb 和 .pbtxt。

示例中用到的文件下载包：Mobilenet_1.0_224。该函数只支持用 freeze_graph.py 冻结的 GraphDef。

import tensorflow as tf

graph_def_file = "/path/to/Downloads/mobilenet_v1_1.0_224/frozen_graph.pb"
input_arrays = ["input"]
output_arrays = ["MobilenetV1/Predictions/Softmax"]

converter = tf.lite.TFLiteConverter.from_frozen_graph(
  graph_def_file, input_arrays, output_arrays)
tflite_model = converter.convert()
open("converted_model.tflite", "wb").write(tflite_model)

导出 SavedModel

以下示例展示了如何将 SavedModel 转换成 TensorFlow Lite FlatBuffer。

import tensorflow as tf

converter = tf.lite.TFLiteConverter.from_saved_model(saved_model_dir)
tflite_model = converter.convert()
open("converted_model.tflite", "wb").write(tflite_model)

对于更复杂的 SavedModel, 可以给 TFLiteConverter.from_saved_model() 函数传递可选参数： input_arrays，input_shapes，output_arrays，tag_set，signature_key。运行 help(tf.lite.TFLiteConverter) 查看参数详情。

导出 tf.keras 文件

以下示例展示如何将 tf.keras 模型转换成 TensorFlow Lite FlatBuffer。示例需要先安装h5py

import tensorflow as tf

converter = tf.lite.TFLiteConverter.from_keras_model_file("keras_model.h5")
tflite_model = converter.convert()
open("converted_model.tflite", "wb").write(tflite_model)

tf.keras 文件必须包含模型和权重。一个全面的包括模型构造在内的示例如下所示：

import numpy as np
import tensorflow as tf

# Generate tf.keras model.
model = tf.keras.models.Sequential()
model.add(tf.keras.layers.Dense(2, input_shape=(3,)))
model.add(tf.keras.layers.RepeatVector(3))
model.add(tf.keras.layers.TimeDistributed(tf.keras.layers.Dense(3)))
model.compile(loss=tf.keras.losses.MSE,
              optimizer=tf.keras.optimizers.RMSprop(lr=0.0001),
              metrics=[tf.keras.metrics.categorical_accuracy],
              sample_weight_mode='temporal')

x = np.random.random((1, 3))
y = np.random.random((1, 3, 3))
model.train_on_batch(x, y)
model.predict(x)

# Save tf.keras model in HDF5 format.
keras_file = "keras_model.h5"
tf.keras.models.save_model(model, keras_file)

# Convert to TensorFlow Lite model.
converter = tf.lite.TFLiteConverter.from_keras_model_file(keras_file)
tflite_model = converter.convert()
open("converted_model.tflite", "wb").write(tflite_model)

复杂示例

对于属性默认值不足的模型，在调用 convert() 之前应该设置属性值。例如设置任何常量都需要使用 tf.lite.constants.<CONSTANT_NAME>，以下示例中使用了常量 QUANTIZED_UINT8。您可以在 Python 终端中运行 help(tf.lite.TFLiteConverter) 获取有关属性的详细文档。

尽管示例中只演示了包含常量的 GraphDefs，同样的逻辑可以应用于每一种输入数据格式。

导出量化 GraphDef

以下示例展示了如何把量化模型转换成 TensorFlow Lite FlatBuffer。

import tensorflow as tf

img = tf.placeholder(name="img", dtype=tf.float32, shape=(1, 64, 64, 3))
const = tf.constant([1., 2., 3.]) + tf.constant([1., 4., 4.])
val = img + const
out = tf.fake_quant_with_min_max_args(val, min=0., max=1., name="output")

with tf.Session() as sess:
  converter = tf.lite.TFLiteConverter.from_session(sess, [img], [out])
  converter.inference_type = tf.lite.constants.QUANTIZED_UINT8
  input_arrays = converter.get_input_arrays()
  converter.quantized_input_stats = {input_arrays[0] : (0., 1.)}  # mean, std_dev
  tflite_model = converter.convert()
  open("converted_model.tflite", "wb").write(tflite_model)

TensorFlow Lite Python 解释器

从模型文件调用解释器

以下示例展示了获得 TensorFlow Lite FlatBuffer 文件后，如何使用 TensorFlow Lite Python 解释器。此代码还演示了如何对随机输入数据进行推理。您可以在 Python 终端中运行 help(tf.lite.Interpreter) 获取解释器的详细文档。

import numpy as np
import tensorflow as tf

# Load TFLite model and allocate tensors.
interpreter = tf.lite.Interpreter(model_path="converted_model.tflite")
interpreter.allocate_tensors()

# Get input and output tensors.
input_details = interpreter.get_input_details()
output_details = interpreter.get_output_details()

# Test model on random input data.
input_shape = input_details[0]['shape']
input_data = np.array(np.random.random_sample(input_shape), dtype=np.float32)
interpreter.set_tensor(input_details[0]['index'], input_data)

interpreter.invoke()

# The function `get_tensor()` returns a copy of the tensor data.
# Use `tensor()` in order to get a pointer to the tensor.
output_data = interpreter.get_tensor(output_details[0]['index'])
print(output_data)

从模型数据调用解释器

以下示例展示了如何从之前加载好的 TensorFlow Lite Flatbuffer 模型，调用 TensorFlow Lite Python 解释器。此代码显示了一个从构建 TensorFlow 模型开始的端到端案例。

import numpy as np
import tensorflow as tf

img = tf.placeholder(name="img", dtype=tf.float32, shape=(1, 64, 64, 3))
const = tf.constant([1., 2., 3.]) + tf.constant([1., 4., 4.])
val = img + const
out = tf.identity(val, name="out")

with tf.Session() as sess:
  converter = tf.lite.TFLiteConverter.from_session(sess, [img], [out])
  tflite_model = converter.convert()

# Load TFLite model and allocate tensors.
interpreter = tf.lite.Interpreter(model_content=tflite_model)
interpreter.allocate_tensors()

附加说明

源码构建

为了运行最新版本的 TensorFlow Lite Converter Python API，您可以选择一种方式安装 nightly 版本： pip（推荐）， Docker，从源代码构建 pip 包。

转换 TensorFlow 1.12 及之前版本的模型

参考下表在 TensorFlow 1.12 之前的版本中转换 TensorFlow 模型到 TensorFlow Lite 运行 help() 获取每种 API 的详情。

TensorFlow 版本	Python API
1.12	`tf.contrib.lite.TFLiteConverter`
1.9-1.11	`tf.contrib.lite.TocoConverter`
1.7-1.8	`tf.contrib.lite.toco_convert`