Sektion 1. Kapitel 32
single
Challenge: Putting It All Together
Stryg for at vise menuen
In this challenge, apply the full workflow learned in the course — from data preprocessing through training to model evaluation.
Opgave
Swipe to start coding
You are working with a penguin dataset. Build an ML pipeline to classify species with KNN, handling encoding, missing values, scaling, and tuning.
- Encode
ywithLabelEncoder. - Split with
train_test_split(test_size=0.33). - Make
ct:OneHotEncoderon'island','sex',remainder='passthrough'. - Set
param_gridforn_neighbors,weights,p. Forn_neighborsbetter to use odd values of integers. - Create
GridSearchCV(KNeighborsClassifier(), param_grid). - Pipeline:
ct→SimpleImputer('most_frequent')→StandardScaler→GridSearchCV. - Fit on train.
- Print test
.score. - Predict, print first 5 decoded labels.
- Print
.best_estimator_.
Løsning
Var alt klart?
Tak for dine kommentarer!
Sektion 1. Kapitel 32
single
Spørg AI
Spørg AI
Spørg om hvad som helst eller prøv et af de foreslåede spørgsmål for at starte vores chat