Note

Go to the end to download the full example code.

Piecewise linear regression with scikit-learn predictors¶

The notebook illustrates an implementation of a piecewise linear regression based on scikit-learn. The bucketization can be done with a DecisionTreeRegressor or a KBinsDiscretizer. A linear model is then fitted on each bucket.

Piecewise data¶

Let’s build a toy problem based on two linear models.

import numpy
import numpy.random as npr
import matplotlib.pyplot as plt
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor
from sklearn.preprocessing import KBinsDiscretizer
from sklearn.dummy import DummyRegressor
from mlinsights.mlmodel import PiecewiseRegressor


X = npr.normal(size=(1000, 4))
alpha = [4, -2]
t = (X[:, 0] + X[:, 3] * 0.5) > 0
switch = numpy.zeros(X.shape[0])
switch[t] = 1
y = alpha[0] * X[:, 0] * t + alpha[1] * X[:, 0] * (1 - t) + X[:, 2]

fig, ax = plt.subplots(1, 1)
ax.plot(X[:, 0], y, ".")
ax.set_title("Piecewise examples")

Text(0.5, 1.0, 'Piecewise examples')

Piecewise Linear Regression with a decision tree¶

The first example is done with a decision tree.

X_train, X_test, y_train, y_test = train_test_split(X[:, :1], y)

model = PiecewiseRegressor(
    verbose=True, binner=DecisionTreeRegressor(min_samples_leaf=300)
)
model.fit(X_train, y_train)

[Parallel(n_jobs=1)]: Done   2 out of   2 | elapsed:    0.0s finished

PiecewiseRegressor(binner=DecisionTreeRegressor(min_samples_leaf=300),
                   estimator=LinearRegression(), verbose=True)

In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook.
On GitHub, the HTML representation is unable to render, please try loading this page with nbviewer.org.

PiecewiseRegressor

iFitted

Parameters

	binner	DecisionTreeR...ples_leaf=300)
	estimator	LinearRegression()
	n_jobs	None
	verbose	True

binner: DecisionTreeRegressor

DecisionTreeRegressor(min_samples_leaf=300)

DecisionTreeRegressor

?Documentation for DecisionTreeRegressor

Parameters

	criterion	'squared_error'
	splitter	'best'
	max_depth	None
	min_samples_split	2
	min_samples_leaf	300
	min_weight_fraction_leaf	0.0
	max_features	None
	random_state	None
	max_leaf_nodes	None
	min_impurity_decrease	0.0
	ccp_alpha	0.0
	monotonic_cst	None

estimator: LinearRegression

LinearRegression()

LinearRegression

?Documentation for LinearRegression

Parameters

	fit_intercept	True
	copy_X	True
	tol	1e-06
	n_jobs	None
	positive	False

pred = model.predict(X_test)
pred[:5]

array([ 0.66437562,  2.48521352,  0.17839144, -0.7044688 ,  0.77987267])

fig, ax = plt.subplots(1, 1)
ax.plot(X_test[:, 0], y_test, ".", label="data")
ax.plot(X_test[:, 0], pred, ".", label="predictions")
ax.set_title("Piecewise Linear Regression\n2 buckets")
ax.legend()

<matplotlib.legend.Legend object at 0x7dd18eb163f0>

The method transform_bins returns the bucket of each variables, the final leave from the tree.

model.transform_bins(X_test)

array([0., 0., 0., 0., 0., 0., 1., 0., 0., 1., 0., 0., 0., 1., 0., 0., 0.,
       0., 0., 0., 1., 1., 1., 0., 1., 0., 1., 1., 1., 0., 0., 1., 0., 0.,
       1., 0., 0., 0., 1., 1., 0., 0., 1., 1., 0., 0., 1., 0., 1., 1., 1.,
       0., 0., 0., 1., 1., 0., 0., 0., 0., 1., 1., 0., 0., 0., 1., 1., 0.,
       1., 0., 0., 1., 1., 0., 0., 0., 0., 0., 1., 1., 0., 0., 0., 1., 0.,
       0., 1., 1., 0., 1., 0., 0., 0., 1., 1., 1., 0., 0., 0., 0., 1., 1.,
       0., 0., 0., 1., 1., 0., 1., 1., 1., 1., 0., 1., 1., 1., 0., 0., 0.,
       0., 0., 0., 1., 0., 1., 0., 1., 0., 0., 1., 0., 0., 1., 1., 0., 0.,
       0., 1., 0., 1., 1., 0., 0., 0., 0., 0., 1., 0., 1., 0., 1., 0., 1.,
       1., 0., 1., 0., 0., 0., 0., 0., 1., 0., 0., 1., 0., 1., 0., 1., 0.,
       0., 1., 0., 1., 0., 1., 0., 0., 0., 1., 0., 1., 0., 0., 1., 0., 0.,
       0., 0., 0., 0., 1., 0., 0., 1., 1., 0., 1., 1., 0., 0., 0., 0., 0.,
       1., 1., 1., 1., 0., 0., 1., 0., 1., 1., 0., 0., 1., 0., 1., 1., 1.,
       0., 1., 1., 0., 1., 0., 0., 1., 0., 0., 1., 1., 1., 1., 1., 1., 0.,
       0., 0., 1., 0., 1., 0., 1., 0., 1., 1., 0., 1.])

Let’s try with more buckets.

model = PiecewiseRegressor(
    verbose=False, binner=DecisionTreeRegressor(min_samples_leaf=150)
)
model.fit(X_train, y_train)

PiecewiseRegressor(binner=DecisionTreeRegressor(min_samples_leaf=150),
                   estimator=LinearRegression())

PiecewiseRegressor

iFitted

Parameters

	binner	DecisionTreeR...ples_leaf=150)
	estimator	LinearRegression()
	n_jobs	None
	verbose	False

binner: DecisionTreeRegressor

DecisionTreeRegressor(min_samples_leaf=150)

DecisionTreeRegressor

?Documentation for DecisionTreeRegressor

Parameters

	criterion	'squared_error'
	splitter	'best'
	max_depth	None
	min_samples_split	2
	min_samples_leaf	150
	min_weight_fraction_leaf	0.0
	max_features	None
	random_state	None
	max_leaf_nodes	None
	min_impurity_decrease	0.0
	ccp_alpha	0.0
	monotonic_cst	None

estimator: LinearRegression

LinearRegression()

LinearRegression

?Documentation for LinearRegression

Parameters

	fit_intercept	True
	copy_X	True
	tol	1e-06
	n_jobs	None
	positive	False

fig, ax = plt.subplots(1, 1)
ax.plot(X_test[:, 0], y_test, ".", label="data")
ax.plot(X_test[:, 0], model.predict(X_test), ".", label="predictions")
ax.set_title("Piecewise Linear Regression\n4 buckets")
ax.legend()

<matplotlib.legend.Legend object at 0x7dd18ec2c9e0>

Piecewise Linear Regression with a KBinsDiscretizer¶

model = PiecewiseRegressor(verbose=True, binner=KBinsDiscretizer(n_bins=2))
model.fit(X_train, y_train)

~/vv/this312/lib/python3.12/site-packages/sklearn/preprocessing/_discretization.py:296: FutureWarning: The current default behavior, quantile_method='linear', will be changed to quantile_method='averaged_inverted_cdf' in scikit-learn version 1.9 to naturally support sample weight equivalence properties by default. Pass quantile_method='averaged_inverted_cdf' explicitly to silence this warning.
  warnings.warn(
[Parallel(n_jobs=1)]: Done   2 out of   2 | elapsed:    0.0s finished

PiecewiseRegressor(binner=KBinsDiscretizer(n_bins=2),
                   estimator=LinearRegression(), verbose=True)

PiecewiseRegressor

iFitted

Parameters

	binner	KBinsDiscretizer(n_bins=2)
	estimator	LinearRegression()
	n_jobs	None
	verbose	True

binner: KBinsDiscretizer

KBinsDiscretizer(n_bins=2)

KBinsDiscretizer

?Documentation for KBinsDiscretizer

Parameters

	n_bins	2
	encode	'onehot'
	strategy	'quantile'
	quantile_method	'warn'
	dtype	None
	subsample	200000
	random_state	None

estimator: LinearRegression

LinearRegression()

LinearRegression

?Documentation for LinearRegression

Parameters

	fit_intercept	True
	copy_X	True
	tol	1e-06
	n_jobs	None
	positive	False

fig, ax = plt.subplots(1, 1)
ax.plot(X_test[:, 0], y_test, ".", label="data")
ax.plot(X_test[:, 0], model.predict(X_test), ".", label="predictions")
ax.set_title("Piecewise Linear Regression\n2 buckets")
ax.legend()

<matplotlib.legend.Legend object at 0x7dd18eaa1d30>

model = PiecewiseRegressor(verbose=True, binner=KBinsDiscretizer(n_bins=4))
model.fit(X_train, y_train)

~/vv/this312/lib/python3.12/site-packages/sklearn/preprocessing/_discretization.py:296: FutureWarning: The current default behavior, quantile_method='linear', will be changed to quantile_method='averaged_inverted_cdf' in scikit-learn version 1.9 to naturally support sample weight equivalence properties by default. Pass quantile_method='averaged_inverted_cdf' explicitly to silence this warning.
  warnings.warn(
[Parallel(n_jobs=1)]: Done   4 out of   4 | elapsed:    0.0s finished

PiecewiseRegressor(binner=KBinsDiscretizer(n_bins=4),
                   estimator=LinearRegression(), verbose=True)

PiecewiseRegressor

iFitted

Parameters

	binner	KBinsDiscretizer(n_bins=4)
	estimator	LinearRegression()
	n_jobs	None
	verbose	True

binner: KBinsDiscretizer

KBinsDiscretizer(n_bins=4)

KBinsDiscretizer

?Documentation for KBinsDiscretizer

Parameters

	n_bins	4
	encode	'onehot'
	strategy	'quantile'
	quantile_method	'warn'
	dtype	None
	subsample	200000
	random_state	None

estimator: LinearRegression

LinearRegression()

LinearRegression

?Documentation for LinearRegression

Parameters

	fit_intercept	True
	copy_X	True
	tol	1e-06
	n_jobs	None
	positive	False

fig, ax = plt.subplots(1, 1)
ax.plot(X_test[:, 0], y_test, ".", label="data")
ax.plot(X_test[:, 0], model.predict(X_test), ".", label="predictions")
ax.set_title("Piecewise Linear Regression\n4 buckets")
ax.legend()

<matplotlib.legend.Legend object at 0x7dd18e90fe30>

The model does not enforce continuity despite the fast it looks like so. Let’s compare with a constant on each bucket.

model = PiecewiseRegressor(
    verbose="tqdm", binner=KBinsDiscretizer(n_bins=4), estimator=DummyRegressor()
)
model.fit(X_train, y_train)

~/vv/this312/lib/python3.12/site-packages/sklearn/preprocessing/_discretization.py:296: FutureWarning: The current default behavior, quantile_method='linear', will be changed to quantile_method='averaged_inverted_cdf' in scikit-learn version 1.9 to naturally support sample weight equivalence properties by default. Pass quantile_method='averaged_inverted_cdf' explicitly to silence this warning.
  warnings.warn(

  0%|          | 0/4 [00:00<?, ?it/s]
100%|██████████| 4/4 [00:00<00:00, 2593.88it/s]
[Parallel(n_jobs=1)]: Done   4 out of   4 | elapsed:    0.0s finished

PiecewiseRegressor(binner=KBinsDiscretizer(n_bins=4),
                   estimator=DummyRegressor(), verbose='tqdm')

PiecewiseRegressor

iFitted

Parameters

	binner	KBinsDiscretizer(n_bins=4)
	estimator	DummyRegressor()
	n_jobs	None
	verbose	'tqdm'

binner: KBinsDiscretizer

KBinsDiscretizer(n_bins=4)

KBinsDiscretizer

?Documentation for KBinsDiscretizer

Parameters

	n_bins	4
	encode	'onehot'
	strategy	'quantile'
	quantile_method	'warn'
	dtype	None
	subsample	200000
	random_state	None

estimator: DummyRegressor

DummyRegressor()

DummyRegressor

?Documentation for DummyRegressor

Parameters

	strategy	'mean'
	constant	None
	quantile	None

fig, ax = plt.subplots(1, 1)
ax.plot(X_test[:, 0], y_test, ".", label="data")
ax.plot(X_test[:, 0], model.predict(X_test), ".", label="predictions")
ax.set_title("Piecewise Constants\n4 buckets")
ax.legend()

<matplotlib.legend.Legend object at 0x7dd18ec922a0>

Next¶

# PR `Model trees (M5P and
# co) <https://github.com/scikit-learn/scikit-learn/issues/13106>`_ and
# issue `Model trees
# (M5P) <https://github.com/scikit-learn/scikit-learn/pull/13732>`_
# propose an implementation a piecewise regression with any kind of
# regression model. It is based on `Building Model
# Trees <https://github.com/ankonzoid/LearningX/tree/master/advanced_ML/model_tree%3E>`_.
# It fits many models to find the best splits.

Total running time of the script: (0 minutes 0.698 seconds)

Download Jupyter notebook: plot_piecewise_linear_regression.ipynb

Download Python source code: plot_piecewise_linear_regression.py

Download zipped: plot_piecewise_linear_regression.zip

Gallery generated by Sphinx-Gallery