Sección 1. Capítulo 13
single
Challenge: Encoding Categorical Variables
Desliza para mostrar el menú
To summarize the previous three chapters, here is a table showing what encoder you should use:
In this challenge, you work with the penguins dataset (no missing values). All categorical features — including the target 'species' — must be encoded for ML use.
12345import pandas as pd df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/a65bbc96-309e-4df9-a790-a1eb8c815a1c/penguins_imputed.csv') print(df.head())
Keep in mind that 'island' and 'sex' are categorical features and 'species' is a categorical target.
Tarea
Desliza para comenzar a programar
You are given a DataFrame df. Encode all categorical columns:
- Import
OneHotEncoderandLabelEncoderfromsklearn.preprocessing. - Split the data into
X(features) andy(target). - Create a
OneHotEncoderand apply it to the'island'and'sex'columns inX. - Replace those original columns with their encoded versions.
- Use
LabelEncoderon the'species'column to encodey.
Solución
¿Todo estuvo claro?
¡Gracias por tus comentarios!
Sección 1. Capítulo 13
single
Pregunte a AI
Pregunte a AI
Pregunte lo que quiera o pruebe una de las preguntas sugeridas para comenzar nuestra charla