Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Aprende Challenge: Imputing Missing Values | Section
Machine Learning Foundations with Scikit-Learn
Sección 1. Capítulo 9
single

single

bookChallenge: Imputing Missing Values

Desliza para mostrar el menú

The SimpleImputer class replaces missing values automatically.

from sklearn.impute import SimpleImputer
imputer = SimpleImputer()

Its key parameters:

  • missing_value: placeholder treated as missing (default np.nan);
  • strategy: method for filling gaps ('mean' by default);
  • fill_value: used when strategy='constant'.

As a transformer, it provides methods such as .fit(), .transform(), and .fit_transform().

Choosing how to fill missing data is essential. A common approach:

  • numerical features → mean;
  • categorical features → most frequent value.

strategy options:

  • 'mean' — fill with mean;
  • 'median' — fill with median;
  • 'most_frequent' — fill with mode;
  • 'constant' — fill with a specified value via fill_value.

missing_values defines which values are treated as missing (default NaN, but may be '' or another marker).

Note
Note

SimpleImputer expects a DataFrame, not a Series. A single-column DataFrame must be selected using double brackets:

imputer.fit_transform(df[['column']])

fit_transform() returns a 2D array, but assigning back to a DataFrame column requires a 1D array. Flatten the result using .ravel():

df['column'] = imputer.fit_transform(df[['column']]).ravel()
Tarea

Desliza para comenzar a programar

You are given a DataFrame df containing penguin data. The 'sex' column has missing values. Fill them using the most frequent category.

  1. Import SimpleImputer;
  2. Create an imputer with strategy='most_frequent';
  3. Apply it to df[['sex']];
  4. Assign the imputed values back to df['sex'].

Solución

Switch to desktopCambia al escritorio para practicar en el mundo realContinúe desde donde se encuentra utilizando una de las siguientes opciones
¿Todo estuvo claro?

¿Cómo podemos mejorarlo?

¡Gracias por tus comentarios!

Sección 1. Capítulo 9
single

single

Pregunte a AI

expand

Pregunte a AI

ChatGPT

Pregunte lo que quiera o pruebe una de las preguntas sugeridas para comenzar nuestra charla

some-alt