Deep dive into Scikit-learn pipeline - Hands on Session
AIEngineering AIEngineering
74.7K subscribers
5,980 views
0

 Published On Dec 5, 2020

#datascience #machinelearning #ml

Link to video containing introduction to scikit learn pipeline -    • Introduction to Scikit-Learn pipeline...  

In this video we will do end to end scikit learn pipeline hands on

Scikit-learn Pipeline can be used to chain multiple estimators into one. This is useful as there is often a fixed sequence of steps in processing the data, for example feature selection, normalization and classification. Pipeline serves multiple purposes here:

Convenience and encapsulation
You only have to call fit and predict once on your data to fit a whole sequence of estimators.

Joint parameter selection
You can grid search over parameters of all estimators in the pipeline at once.

Safety
Pipelines help avoid leaking statistics from your test data into the trained model in cross-validation, by ensuring that the same samples are used to train the transformers and predictors.

All estimators in a pipeline, except the last one, must be transformers (i.e. must have a transform method). The last estimator may be any type (transformer, classifier, etc.).

Ref: https://scikit-learn.org/stable/modul...

show more

Share/Embed