Scikit-learn Concepts
The scikit-learn (sklearn) library provides tools for preprocessing and modeling. Its main object types are estimator, transformer, predictor, and model.
Estimator
Any class with .fit() is an estimator β it learns from data.
estimator.fit(X, y) # supervised
estimator.fit(X) # unsupervised
Transformer
A transformer has .fit() and .transform(), plus .fit_transform() to do both at once.
Transformers are usually used to transform the X array. However, as we will see in the example of LabelEncoder, some transformers are made for the y array.
nan values shown in the training set in the picture indicate missing data in Python.
Predictor
A predictor is an estimator with .predict() for generating outputs.
predictor.fit(X, y)
predictor.predict(X_new)
Model
A model is a predictor with .score(), which evaluates performance.
model.fit(X, y)
model.score(X, y)
As mentioned in the previous chapter, accuracy is a metric representing the percentage of correct predictions.
The preprocessing stage involves working with transformers, and we work with predictors (more specifically with models) at the modeling stage.
Thanks for your feedback!
Ask AI
Ask AI
Ask anything or try one of the suggested questions to begin our chat
Awesome!
Completion rate improved to 3.13
Scikit-learn Concepts
Swipe to show menu
The scikit-learn (sklearn) library provides tools for preprocessing and modeling. Its main object types are estimator, transformer, predictor, and model.
Estimator
Any class with .fit() is an estimator β it learns from data.
estimator.fit(X, y) # supervised
estimator.fit(X) # unsupervised
Transformer
A transformer has .fit() and .transform(), plus .fit_transform() to do both at once.
Transformers are usually used to transform the X array. However, as we will see in the example of LabelEncoder, some transformers are made for the y array.
nan values shown in the training set in the picture indicate missing data in Python.
Predictor
A predictor is an estimator with .predict() for generating outputs.
predictor.fit(X, y)
predictor.predict(X_new)
Model
A model is a predictor with .score(), which evaluates performance.
model.fit(X, y)
model.score(X, y)
As mentioned in the previous chapter, accuracy is a metric representing the percentage of correct predictions.
The preprocessing stage involves working with transformers, and we work with predictors (more specifically with models) at the modeling stage.
Thanks for your feedback!