mlinsights.sklapi¶

The following implementation play with scikit-learn API, it overwrites the code handling parameters. It is pretty much useless unless to check the stability of the API.

SkBase¶

class mlinsights.sklapi.sklearn_base.SkBase(**kwargs)[source]¶

Pattern of a learner or a transform which follows the API of scikit-learn.

static compare_params(p1: Dict[str, Any], p2: Dict[str, Any], exc: bool = True) → bool[source]¶

Compares two sets of parameters.

Parameters:

p1 – dictionary
p2 – dictionary
exc – raises an exception if error is met

Returns:

boolean

fit(X, y=None, sample_weight=None)[source]¶

Trains a model.

@param X features @param y target @param sample_weight weight @return self

get_params(deep=True)[source]¶

Returns the parameters which define the objet, all are needed to clone the object.

@param deep unused here @return dict

set_params(**values)[source]¶

Udpates parameters which define the object, all needed to clone the object.

@param values values @return dictionary

test_equality(o, exc=True)[source]¶

Compares two objects and checks parameters have the same values.

@param p1 dictionary @param p2 dictionary @param exc raises an exception if there is a difference @return boolean

SkBaseClassifier¶

class mlinsights.sklapi.sklearn_base_classifier.SkBaseClassifier(**kwargs)[source]¶

Defines a custom classifier.

predict_proba(X)[source]¶

Returns probability estimates for the test data X.

Parameters:: X – Training data, numpy array or sparse matrix of shape [n_samples,n_features]
Returns:: array, shape = (n_samples,.), Returns predicted values.

score(X, y=None, sample_weight=None)[source]¶

Returns the mean accuracy on the given test data and labels.

Parameters:

X – Training data, numpy array or sparse matrix of shape [n_samples,n_features]
y – Target values, numpy array of shape [n_samples, n_targets] (optional)
sample_weight – Weight values, numpy array of shape [n_samples, n_targets] (optional)

Returns:

score : float, Mean accuracy of self.predict(X) wrt. y.

SkException¶

class mlinsights.sklapi.sklearn_parameters.SkException[source]¶: custom exception

SkBaseLearner¶

class mlinsights.sklapi.sklearn_base_learner.SkBaseLearner(**kwargs)[source]¶

Pattern of a learner qui suit la même API que scikit-learn.

decision_function(X)[source]¶

Output of the model in case of a regressor, matrix with a score for each class and each sample for a classifier.

Parameters:: X – Samples, {array-like, sparse matrix}, shape = (n_samples, n_features)
Returns:: array, shape = (n_samples,.), Returns predicted values.

fit(X, y=None, sample_weight=None)[source]¶

Trains a model.

@param X features @param y targets @param sample_weight weight @return self

predict(X)[source]¶

Predicts.

@param X features @return prédictions

score(X, y=None, sample_weight=None)[source]¶

Returns the mean accuracy on the given test data and labels.

Parameters:

X – Training data, numpy array or sparse matrix of shape [n_samples,n_features]
y – Target values, numpy array of shape [n_samples, n_targets] (optional)
sample_weight – Weight values, numpy array of shape [n_samples, n_targets] (optional)

Returns:

score : float, Mean accuracy of self.predict(X) wrt. y.

SkLearnParameters¶

class mlinsights.sklapi.sklearn_parameters.SkLearnParameters(**kwargs)[source]¶

Defines a class to store parameters of a learner or a transform.

property Keys¶: Returns parameter names.

to_dict()[source]¶

Returns parameters as a dictionary.

@return dict

validate(name, value)[source]¶

Verifies a parameter and its value.

Parameters:

name – name
value – value

Raise:

raises SkException if error

SkBaseRegressor¶

class mlinsights.sklapi.sklearn_base_regressor.SkBaseRegressor(**kwargs)[source]¶

Defines a custom regressor.

score(X, y=None, sample_weight=None)[source]¶

Returns the mean accuracy on the given test data and labels.

Parameters:

X – Training data, numpy array or sparse matrix of shape [n_samples,n_features]
y – Target values, numpy array of shape [n_samples, n_targets] (optional)
sample_weight – Weight values, numpy array of shape [n_samples, n_targets] (optional)

Returns:

score : float, Mean accuracy of self.predict(X) wrt. y.

SkBaseTransform¶

class mlinsights.sklapi.sklearn_base_transform.SkBaseTransform(**kwargs)[source]¶

Pattern of a learner which follows the same API que scikit-learn.

fit(X, y=None, **kwargs)[source]¶

Trains a model.

@param X features @param y targets @return self

fit_transform(X, y=None, **kwargs)[source]¶

Trains and transforms the data.

Parameters:

X – features
y – targets
kwargs – additional fitting parameters

Returns:

self

transform(X)[source]¶

Transforms the data.

@param X features @return predictions

SkBaseTransformLearner¶

class mlinsights.sklapi.sklearn_base_transform_learner.SkBaseTransformLearner(model=None, method=None, **kwargs)[source]¶

A transform which hides a learner, it converts method predict into transform. This way, two learners can be inserted into the same pipeline. There is another a,d shorter implementation with class TransferTransformer.

Use two learners into a same pipeline

It is impossible to use two learners into a pipeline unless we use a class such as SkBaseTransformLearner which disguise a learner into a transform.

<<<

from sklearn.model_selection import train_test_split
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score
from sklearn.pipeline import make_pipeline
from mlinsights.sklapi import SkBaseTransformLearner

data = load_iris()
X, y = data.data, data.target
X_train, X_test, y_train, y_test = train_test_split(X, y)

try:
    pipe = make_pipeline(LogisticRegression(), DecisionTreeClassifier())
except Exception as e:
    print("ERROR:")
    print(e)
    print(".")

pipe = make_pipeline(
    SkBaseTransformLearner(LogisticRegression()), DecisionTreeClassifier()
)
pipe.fit(X_train, y_train)
pred = pipe.predict(X_test)
score = accuracy_score(y_test, pred)
print("pipeline avec deux learners :", score)

>>>

    pipeline avec deux learners : 1.0

fit(X, y=None, **kwargs)[source]¶

Trains a model.

@param X features @param y targets @param kwargs additional parameters @return self

get_params(deep=True)[source]¶

Returns the parameters mandatory to clone the class.

@param deep unused here @return dict

set_params(**values)[source]¶

Sets parameters.

@param values parameters

transform(X)[source]¶

Predictions, output of the embedded learner.

@param X features @return prédictions

SkBaseTransformStacking¶

class mlinsights.sklapi.sklearn_base_transform_stacking.SkBaseTransformStacking(models=None, method=None, **kwargs)[source]¶

Un transform qui cache plusieurs learners, arrangés selon la méthode du stacking.

Stacking de plusieurs learners dans un pipeline scikit-learn.

Ce transform assemble les résultats de plusieurs learners. Ces features servent d’entrée à un modèle de stacking.

<<<

from sklearn.model_selection import train_test_split
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score
from sklearn.pipeline import make_pipeline
from mlinsights.sklapi import SkBaseTransformStacking

data = load_iris()
X, y = data.data, data.target
X_train, X_test, y_train, y_test = train_test_split(X, y)

trans = SkBaseTransformStacking([LogisticRegression(), DecisionTreeClassifier()])
trans.fit(X_train, y_train)
pred = trans.transform(X_test)
print(pred[3:])

>>>

fit(X, y=None, **kwargs)[source]¶

Trains a model.

@param X features @param y targets @param kwargs additional parameters @return self

get_params(deep=True)[source]¶

Returns the parameters which define the object. It follows scikit-learn API.

@param deep unused here @return dict

set_params(**values)[source]¶

Sets the parameters.

@param params parameters

transform(X)[source]¶

Calls the learners predictions to convert the features.

@param X features @return prédictions