.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "auto_examples_tensorflow/plot_tensorflow_to_onnx.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_auto_examples_tensorflow_plot_tensorflow_to_onnx.py>`
        to download the full example code.

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_auto_examples_tensorflow_plot_tensorflow_to_onnx.py:

.. _l-plot-tensorflow-to-onnx:

Converting a TensorFlow/Keras model to ONNX
===========================================

:func:`yobx.tensorflow.to_onnx` converts a :epkg:`TensorFlow`/:epkg:`Keras`
model into an :class:`onnx.ModelProto` that can be executed with any
ONNX-compatible runtime.

The converter works by tracing the Keras model with :func:`tf.function` /
``get_concrete_function`` to obtain the underlying TensorFlow computation
graph. Each TF operation in that graph is then mapped to an equivalent ONNX
node by a registered converter.

Supported layers and activations (see :mod:`yobx.tensorflow` for the full
registry):

* :class:`tf.keras.layers.Dense` — converted to ``MatMul`` (+ ``Add`` when
  the bias is non-zero after optimisation)
* Activations: ``relu``, ``sigmoid``, ``tanh``, ``softmax``, ``relu6``
* :class:`tf.keras.Sequential` — chains the above step by step

The workflow is:

1. **Build** (and optionally train) a Keras model.
2. **Convert** by calling :func:`yobx.tensorflow.to_onnx` with a
   representative *dummy input* (a NumPy array) so it can infer the input
   dtype and shape.
3. **Run** the ONNX model with any ONNX runtime — this example uses
   :epkg:`onnxruntime`.
4. **Verify** that the ONNX outputs match Keras' predictions.

.. GENERATED FROM PYTHON SOURCE LINES 35-41

.. code-block:: Python

    import tensorflow as tf
    import numpy as np
    import onnxruntime
    from yobx.doc import plot_dot
    from yobx.tensorflow import to_onnx

.. GENERATED FROM PYTHON SOURCE LINES 42-43

Remove the following lines to run on a GPU.

.. GENERATED FROM PYTHON SOURCE LINES 43-49
.. code-block:: Python

    try:
        tf.config.set_visible_devices([], "GPU")
    except RuntimeError:
        # may fail if tensorflow has already been running before this script
        pass

.. GENERATED FROM PYTHON SOURCE LINES 50-56

1. Build a simple Dense layer
-----------------------------

We start with the simplest possible model: a single
:class:`~tf.keras.layers.Dense` layer with no activation function
(linear/identity), so the conversion reduces to ``MatMul`` + ``Add``.

.. GENERATED FROM PYTHON SOURCE LINES 56-72

.. code-block:: Python

    rng = np.random.default_rng(0)
    X = rng.standard_normal((5, 4)).astype(np.float32)

    model_linear = tf.keras.Sequential([tf.keras.layers.Dense(3, input_shape=(4,))])

    onx_linear = to_onnx(model_linear, (X,))

    print("Opset           :", onx_linear.opset_import[0].version)
    print("Number of nodes :", len(onx_linear.graph.node))
    print("Node op-types   :", [n.op_type for n in onx_linear.graph.node])
    print(
        "Graph inputs    :",
        [(inp.name, inp.type.tensor_type.elem_type) for inp in onx_linear.graph.input],
    )
    print("Graph outputs   :", [out.name for out in onx_linear.graph.output])

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    /home/xadupre/vv/this312/lib/python3.12/site-packages/keras/src/layers/core/dense.py:106: UserWarning: Do not pass an `input_shape`/`input_dim` argument to a layer. When using Sequential models, prefer using an `Input(shape)` object as the first layer in the model instead.
      super().__init__(activity_regularizer=activity_regularizer, **kwargs)
    Opset           : 21
    Number of nodes : 1
    Node op-types   : ['MatMul']
    Graph inputs    : [('X:0', 1)]
    Graph outputs   : ['Identity:0']

.. GENERATED FROM PYTHON SOURCE LINES 73-79

Run and compare
~~~~~~~~~~~~~~~

The ONNX input name follows TensorFlow's tensor-naming convention:
``"{name}:0"`` where *name* is the name passed to :func:`to_onnx`
(defaults to ``"X"``).

.. GENERATED FROM PYTHON SOURCE LINES 79-91
.. code-block:: Python

    ref = onnxruntime.InferenceSession(
        onx_linear.SerializeToString(), providers=["CPUExecutionProvider"]
    )
    (result_linear,) = ref.run(None, {"X:0": X})

    expected_linear = model_linear(X).numpy()
    print("\nKeras output (first row):", expected_linear[0])
    print("ONNX output (first row):", result_linear[0])
    assert np.allclose(expected_linear, result_linear, atol=1e-5), "Mismatch!"
    print("Outputs match ✓")

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    Keras output (first row): [-0.52251536  0.46021557 -0.23878135]
    ONNX output (first row): [-0.52251536  0.4602156  -0.23878136]
    Outputs match ✓

.. GENERATED FROM PYTHON SOURCE LINES 92-97

2. Multi-layer MLP with activations
-----------------------------------

Next we build a deeper :class:`~tf.keras.Sequential` model that exercises
the ``MatMul`` and ``Relu`` ONNX converters together (the freshly
initialised biases are zero, so they are optimised away and no ``Add``
nodes appear).

.. GENERATED FROM PYTHON SOURCE LINES 97-116

.. code-block:: Python

    X_train = rng.standard_normal((80, 8)).astype(np.float32)

    mlp = tf.keras.Sequential(
        [
            tf.keras.layers.Dense(16, activation="relu", input_shape=(8,)),
            tf.keras.layers.Dense(8, activation="relu"),
            tf.keras.layers.Dense(4),
        ]
    )
    _ = mlp(X_train)

    onx_mlp = to_onnx(mlp, (X_train[:1],))
    op_types = [n.op_type for n in onx_mlp.graph.node]
    print("\nOp-types in the MLP graph:", op_types)
    assert "MatMul" in op_types
    assert "Relu" in op_types

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    Op-types in the MLP graph: ['MatMul', 'Relu', 'MatMul', 'Relu', 'MatMul']

.. GENERATED FROM PYTHON SOURCE LINES 117-118

Verify predictions on a held-out batch.

.. GENERATED FROM PYTHON SOURCE LINES 118-130
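The comparisons in this example use ``np.testing.assert_allclose`` with a
loose ``atol`` because float32 rounding accumulates over several layers: the
check passes when ``|actual - desired| <= atol + rtol * |desired|``. A small
NumPy sketch of that criterion (the error magnitude is made up for
illustration):

```python
# Simulate an accumulated float32 rounding error of about 5e-3 and check it
# against a loose and a tight absolute tolerance.
import numpy as np

desired = np.array([1.0, -2.0, 3.0], dtype=np.float32)
actual = desired + np.float32(5e-3)

abs_err = np.abs(actual - desired)
within_atol = bool(np.all(abs_err <= 1e-2))   # passes at atol=1e-2
within_tight = bool(np.all(abs_err <= 1e-5))  # would fail at atol=1e-5
print(within_atol, within_tight)
```

This is why the single-layer model above is checked at ``atol=1e-5`` while
the three-layer MLP uses ``atol=1e-2``.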
.. code-block:: Python

    X_test = rng.standard_normal((20, 8)).astype(np.float32)

    ref_mlp = onnxruntime.InferenceSession(
        onx_mlp.SerializeToString(), providers=["CPUExecutionProvider"]
    )
    (result_mlp,) = ref_mlp.run(None, {"X:0": X_test})
    expected_mlp = mlp(X_test).numpy()

    np.testing.assert_allclose(expected_mlp, result_mlp, atol=1e-2)
    print("MLP predictions match ✓")

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    MLP predictions match ✓

.. GENERATED FROM PYTHON SOURCE LINES 131-138

3. Dynamic batch dimension
--------------------------

By default :func:`to_onnx` marks the first axis of every input as a dynamic
(symbolic) dimension. You can also pass ``dynamic_shapes`` explicitly to
name the dynamic axis, which is useful when deploying the model with
variable-length batches.

.. GENERATED FROM PYTHON SOURCE LINES 138-158

.. code-block:: Python

    onx_dyn = to_onnx(mlp, (X_train[:1],), dynamic_shapes=({0: "batch"},))

    input_shape = onx_dyn.graph.input[0].type.tensor_type.shape
    batch_dim = input_shape.dim[0]
    print("\nBatch dimension param :", batch_dim.dim_param)
    print("Batch dimension value :", batch_dim.dim_value)
    assert batch_dim.dim_param, "Expected a named dynamic dimension"

    # The converted model still produces correct results for any batch size.
    ref_dyn = onnxruntime.InferenceSession(
        onx_dyn.SerializeToString(), providers=["CPUExecutionProvider"]
    )
    for n in (1, 7, 20):
        X_batch = rng.standard_normal((n, 8)).astype(np.float32)
        (out,) = ref_dyn.run(None, {"X:0": X_batch})
        expected = mlp(X_batch).numpy()
        np.testing.assert_allclose(expected, out, atol=1e-2)
    print("Dynamic-batch model verified for batch sizes 1, 7, 20 ✓")

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    Batch dimension param : dim
    Batch dimension value : 0
    Dynamic-batch model verified for batch sizes 1, 7, 20 ✓

.. GENERATED FROM PYTHON SOURCE LINES 159-166
4. Custom op converter
----------------------

The ``extra_converters`` argument allows you to override or extend the
built-in op converters. Here we replace the ``Relu`` converter with a custom
one that clips the output at 6 (i.e. ``Relu6``) instead of the standard
unbounded rectifier.

.. GENERATED FROM PYTHON SOURCE LINES 166-187

.. code-block:: Python

    def custom_relu_converter(g, sts, outputs, op):
        """Custom converter: replace Relu with Clip(0, 6)."""
        return g.op.Clip(
            op.inputs[0].name,
            np.array(0.0, dtype=np.float32),
            np.array(6.0, dtype=np.float32),
            outputs=outputs,
            name="custom_relu6",
        )


    onx_custom = to_onnx(mlp, (X_train[:1],), extra_converters={"Relu": custom_relu_converter})

    custom_op_types = [n.op_type for n in onx_custom.graph.node]
    print("\nOp-types with custom converter:", custom_op_types)
    assert "Clip" in custom_op_types, "Expected Clip node from custom converter"
    assert "Relu" not in custom_op_types, "Relu should have been replaced"
    print("Custom converter verified ✓")

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    Op-types with custom converter: ['MatMul', 'Clip', 'MatMul', 'Clip', 'MatMul']
    Custom converter verified ✓

.. GENERATED FROM PYTHON SOURCE LINES 188-191

5. Visualize the ONNX graph
---------------------------

.. GENERATED FROM PYTHON SOURCE LINES 191-192

.. code-block:: Python

    plot_dot(onx_mlp)

.. image-sg:: /auto_examples_tensorflow/images/sphx_glr_plot_tensorflow_to_onnx_001.png
   :alt: plot tensorflow to onnx
   :srcset: /auto_examples_tensorflow/images/sphx_glr_plot_tensorflow_to_onnx_001.png
   :class: sphx-glr-single-img

.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 4.878 seconds)

.. _sphx_glr_download_auto_examples_tensorflow_plot_tensorflow_to_onnx.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: plot_tensorflow_to_onnx.ipynb <plot_tensorflow_to_onnx.ipynb>`
    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: plot_tensorflow_to_onnx.py <plot_tensorflow_to_onnx.py>`

    .. container:: sphx-glr-download sphx-glr-download-zip

      :download:`Download zipped: plot_tensorflow_to_onnx.zip <plot_tensorflow_to_onnx.zip>`

.. include:: plot_tensorflow_to_onnx.recommendations

.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_