Sección 1. Capítulo 32
single
Challenge: Putting It All Together
Desliza para mostrar el menú
In this challenge, apply the full workflow learned in the course — from data preprocessing through training to model evaluation.
Tarea
Desliza para comenzar a programar
You are working with a penguin dataset. Build an ML pipeline to classify species with KNN, handling encoding, missing values, scaling, and tuning.
- Encode
ywithLabelEncoder. - Split with
train_test_split(test_size=0.33). - Make
ct:OneHotEncoderon'island','sex',remainder='passthrough'. - Set
param_gridforn_neighbors,weights,p. Forn_neighborsbetter to use odd values of integers. - Create
GridSearchCV(KNeighborsClassifier(), param_grid). - Pipeline:
ct→SimpleImputer('most_frequent')→StandardScaler→GridSearchCV. - Fit on train.
- Print test
.score. - Predict, print first 5 decoded labels.
- Print
.best_estimator_.
Solución
¿Todo estuvo claro?
¡Gracias por tus comentarios!
Sección 1. Capítulo 32
single
Pregunte a AI
Pregunte a AI
Pregunte lo que quiera o pruebe una de las preguntas sugeridas para comenzar nuestra charla