Seção 1. Capítulo 32
single
Challenge: Putting It All Together
Deslize para mostrar o menu
In this challenge, apply the full workflow learned in the course — from data preprocessing through training to model evaluation.
Tarefa
Deslize para começar a programar
You are working with a penguin dataset. Build an ML pipeline to classify species with KNN, handling encoding, missing values, scaling, and tuning.
- Encode
ywithLabelEncoder. - Split with
train_test_split(test_size=0.33). - Make
ct:OneHotEncoderon'island','sex',remainder='passthrough'. - Set
param_gridforn_neighbors,weights,p. Forn_neighborsbetter to use odd values of integers. - Create
GridSearchCV(KNeighborsClassifier(), param_grid). - Pipeline:
ct→SimpleImputer('most_frequent')→StandardScaler→GridSearchCV. - Fit on train.
- Print test
.score. - Predict, print first 5 decoded labels.
- Print
.best_estimator_.
Solução
Tudo estava claro?
Obrigado pelo seu feedback!
Seção 1. Capítulo 32
single
Pergunte à IA
Pergunte à IA
Pergunte o que quiser ou experimente uma das perguntas sugeridas para iniciar nosso bate-papo