.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "auto_examples_sklearn/plot_sklearn_pls_float32.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_auto_examples_sklearn_plot_sklearn_pls_float32.py>`
        to download the full example code.

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_auto_examples_sklearn_plot_sklearn_pls_float32.py:

.. _l-plot-sklearn-pls-float32:

Float32 vs Float64: precision loss with PLSRegression
======================================================

A common source of surprise when deploying scikit-learn models with ONNX is
**numerical discrepancy between scikit-learn and the ONNX runtime output**.
The culprit is almost always a **dtype mismatch**: the model was trained with
``float64`` (the NumPy default), but the ONNX graph was exported with
``float32`` inputs and weights.

With ``scikit-learn>=1.8`` and the initiative to follow the
`array API <https://data-apis.org/array-api/>`_ standard, the computation
dtype is more consistent and fewer discrepancies are observed between the
original `scikit-learn` model and its converted ONNX version. Nevertheless,
this example uses :class:`sklearn.cross_decomposition.PLSRegression` to
illustrate the problem step by step and to show how to fix it.

Why does float32 cause discrepancies?
--------------------------------------

Floating-point arithmetic is not exact. ``float32`` has roughly **7 decimal
digits** of precision, while ``float64`` has **15–16**. When a model trained
in ``float64`` is exported to ``float32``:

1. The **weight matrices** (``coef_``, ``_x_mean``, ``intercept_``) are cast
   from ``float64`` → ``float32``, losing precision.
2. The **intermediate computations** inside the ONNX graph also run in
   ``float32``.
3. scikit-learn itself always uses ``float64`` internally, regardless of the
   input dtype.
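The precision gap described in the list above can be observed directly with
NumPy. This minimal sketch is independent of the example code (it only
assumes ``numpy``): it prints the machine epsilon of each dtype and the
rounding error introduced by a single ``float64`` → ``float32`` cast, the
same cast applied to the model weights during a ``float32`` export.

```python
import numpy as np

# Machine epsilon: the relative spacing of representable numbers near 1.0.
eps32 = np.finfo(np.float32).eps  # 2**-23, about 7 decimal digits
eps64 = np.finfo(np.float64).eps  # 2**-52, about 15-16 decimal digits
print(f"float32 eps: {eps32:.2e}")
print(f"float64 eps: {eps64:.2e}")

# Casting a float64 "weight" to float32 rounds away the trailing digits,
# which is what happens to coef_ and _x_mean in a float32 export.
w64 = np.float64(0.123456789012345)
w32 = np.float32(w64)
print(f"cast error : {abs(float(w32) - w64):.2e}")  # a few 1e-9 at most here
```

Every prediction then multiplies many such slightly rounded weights, which
is how the per-weight error of ~1e-9 grows into the ~1e-5 prediction
differences measured below.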
For numerically sensitive operations such as the matrix multiplication at the
core of PLS, even small rounding errors in the weights can accumulate and
produce predictions that differ by orders of magnitude more than the
tolerance you would expect.

The fix: export with float64
-----------------------------

If your deployment environment supports ``float64`` tensors (all major ONNX
runtimes do), simply pass ``float64`` inputs to :func:`yobx.sklearn.to_onnx`.
The converter will then keep all weights in ``float64`` and the exported
predictions will match scikit-learn up to floating-point round-off.

.. GENERATED FROM PYTHON SOURCE LINES 51-59

.. code-block:: Python

    import numpy as np
    import onnxruntime
    from sklearn.cross_decomposition import PLSRegression
    from sklearn.datasets import make_regression

    from yobx.sklearn import to_onnx

.. GENERATED FROM PYTHON SOURCE LINES 60-66

1. Train a PLSRegression in float64 (the default)
--------------------------------------------------

We generate a regression dataset and fit a ``PLSRegression`` model.
``make_regression`` returns ``float64`` arrays by default, which is also what
scikit-learn uses internally for all computations.

.. GENERATED FROM PYTHON SOURCE LINES 66-77

.. code-block:: Python

    X, y = make_regression(n_samples=200, n_features=10, n_informative=3, random_state=0)

    # X and y are float64 here
    print("X dtype :", X.dtype)  # float64
    print("y dtype :", y.dtype)  # float64

    pls = PLSRegression(n_components=3)
    pls.fit(X, y)

    print("coef_ dtype :", pls.coef_.dtype)  # float64
    print("_x_mean dtype :", pls._x_mean.dtype)  # float64

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    X dtype : float64
    y dtype : float64
    coef_ dtype : float64
    _x_mean dtype : float64

.. GENERATED FROM PYTHON SOURCE LINES 78-84

2. Export to ONNX with float32 — the wrong way
-----------------------------------------------

Passing a ``float32`` dummy input tells the converter to build an ONNX graph
whose weights are all cast to ``float32``.
scikit-learn, however, will still use ``float64`` internally when
``predict`` is called.

.. GENERATED FROM PYTHON SOURCE LINES 84-105

.. code-block:: Python

    X_f32 = X.astype(np.float32)
    onx_f32 = to_onnx(pls, (X_f32[:1],))

    sess_f32 = onnxruntime.InferenceSession(
        onx_f32.SerializeToString(), providers=["CPUExecutionProvider"]
    )
    pred_onnx_f32 = sess_f32.run(None, {"X": X_f32})[0]

    # scikit-learn always predicts in float64
    pred_sk = pls.predict(X).ravel()
    pred_sk_f32_input = pls.predict(X_f32).ravel()  # sklearn upcasts to float64

    print("\n--- float32 ONNX export ---")
    print("sklearn (float64 input) :", pred_sk[:5])
    print("sklearn (float32 input) :", pred_sk_f32_input[:5])
    print("ONNX (float32)          :", pred_onnx_f32[:5])

    max_diff_f32 = float(np.abs(pred_onnx_f32 - pred_sk_f32_input).max())
    print(f"Max absolute difference (float32 ONNX vs sklearn): {max_diff_f32:.6e}")

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    --- float32 ONNX export ---
    sklearn (float64 input) : [-35.45234819  64.07263046 -95.9353714   30.26326687 -62.15702034]
    sklearn (float32 input) : [-35.45234417  64.07263174 -95.93536812  30.26326818 -62.15701773]
    ONNX (float32)          : [-35.452343  64.07264   -95.93536    30.26326   -62.157017]
    Max absolute difference (float32 ONNX vs sklearn): 1.574150e-05

.. GENERATED FROM PYTHON SOURCE LINES 106-112

3. Export to ONNX with float64 — the correct way
------------------------------------------------

Passing a ``float64`` dummy input keeps all weights in double precision.
The ONNX graph will now produce predictions that agree with scikit-learn up
to floating-point round-off (typically < 1e-10 for PLS).

.. GENERATED FROM PYTHON SOURCE LINES 112-127

.. code-block:: Python

    onx_f64 = to_onnx(pls, (X[:1],))  # X is already float64

    sess_f64 = onnxruntime.InferenceSession(
        onx_f64.SerializeToString(), providers=["CPUExecutionProvider"]
    )
    pred_onnx_f64 = sess_f64.run(None, {"X": X})[0]

    print("\n--- float64 ONNX export ---")
    print("sklearn (float64) :", pred_sk[:5])
    print("ONNX (float64)    :", pred_onnx_f64[:5])

    max_diff_f64 = float(np.abs(pred_onnx_f64 - pred_sk).max())
    print(f"Max absolute difference (float64 ONNX vs sklearn): {max_diff_f64:.6e}")

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    --- float64 ONNX export ---
    sklearn (float64) : [-35.45234819  64.07263046 -95.9353714   30.26326687 -62.15702034]
    ONNX (float64)    : [-35.45234819  64.07263046 -95.9353714   30.26326687 -62.15702034]
    Max absolute difference (float64 ONNX vs sklearn): 2.842171e-14

.. GENERATED FROM PYTHON SOURCE LINES 128-134

4. Side-by-side comparison
---------------------------

The summary below reports the maximum absolute difference across all 200
test samples. The ``float32`` export can introduce errors that are many
orders of magnitude larger than the ``float64`` export.

.. GENERATED FROM PYTHON SOURCE LINES 134-146

.. code-block:: Python

    print("\nSummary")
    print("=" * 50)
    print(f"  float32 ONNX max |error|: {max_diff_f32:.4e}")
    print(f"  float64 ONNX max |error|: {max_diff_f64:.4e}")
    _eps = 1e-300  # guard against division by zero when float64 error is exactly 0
    print(f"  Ratio (f32/f64)         : {max_diff_f32 / (max_diff_f64 + _eps):.1f}x")

    assert max_diff_f64 < 1e-7, f"float64 export should be near-exact, got {max_diff_f64}"
    assert max_diff_f32 > max_diff_f64, "float32 export should introduce more error than float64"
    print("\nConclusion: use float64 inputs when the model was trained with float64 ✓")

.. rst-class:: sphx-glr-script-out
.. code-block:: none

    Summary
    ==================================================
      float32 ONNX max |error|: 1.5742e-05
      float64 ONNX max |error|: 2.8422e-14
      Ratio (f32/f64)         : 553854943.0x

    Conclusion: use float64 inputs when the model was trained with float64 ✓

.. GENERATED FROM PYTHON SOURCE LINES 147-151

5. Multi-target PLSRegression
------------------------------

The same precision loss applies to multi-target regression.

.. GENERATED FROM PYTHON SOURCE LINES 151-183

.. code-block:: Python

    X2, Y2 = make_regression(
        n_samples=200, n_features=10, n_informative=5, n_targets=3, random_state=1
    )
    pls2 = PLSRegression(n_components=3)
    pls2.fit(X2, Y2)

    # float32 export
    X2_f32 = X2.astype(np.float32)
    onx2_f32 = to_onnx(pls2, (X2_f32[:1],))
    sess2_f32 = onnxruntime.InferenceSession(
        onx2_f32.SerializeToString(), providers=["CPUExecutionProvider"]
    )
    pred2_onnx_f32 = sess2_f32.run(None, {"X": X2_f32})[0]

    pred2_sk = pls2.predict(X2)
    diff2_f32 = float(np.abs(pred2_onnx_f32 - pred2_sk).max())

    # float64 export
    onx2_f64 = to_onnx(pls2, (X2[:1],))
    sess2_f64 = onnxruntime.InferenceSession(
        onx2_f64.SerializeToString(), providers=["CPUExecutionProvider"]
    )
    pred2_onnx_f64 = sess2_f64.run(None, {"X": X2})[0]
    diff2_f64 = float(np.abs(pred2_onnx_f64 - pred2_sk).max())

    print(f"\nMulti-target PLS — float32 max |error|: {diff2_f32:.4e}")
    print(f"Multi-target PLS — float64 max |error|: {diff2_f64:.4e}")

    assert diff2_f64 < 1e-7, f"float64 multi-target should be near-exact, got {diff2_f64}"
    assert diff2_f32 > diff2_f64
    print("Multi-target: same conclusion holds ✓")

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    Multi-target PLS — float32 max |error|: 4.3680e-05
    Multi-target PLS — float64 max |error|: 0.0000e+00
    Multi-target: same conclusion holds ✓

.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 0.098 seconds)

.. _sphx_glr_download_auto_examples_sklearn_plot_sklearn_pls_float32.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: plot_sklearn_pls_float32.ipynb <plot_sklearn_pls_float32.ipynb>`

    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: plot_sklearn_pls_float32.py <plot_sklearn_pls_float32.py>`

    .. container:: sphx-glr-download sphx-glr-download-zip

      :download:`Download zipped: plot_sklearn_pls_float32.zip <plot_sklearn_pls_float32.zip>`

.. include:: plot_sklearn_pls_float32.recommendations

.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_