Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Apprendre Challenge: Imputing Missing Values | Section
Machine Learning Foundations with Scikit-Learn
Section 1. Chapitre 9
single

single

bookChallenge: Imputing Missing Values

Glissez pour afficher le menu

The SimpleImputer class replaces missing values automatically.

from sklearn.impute import SimpleImputer
imputer = SimpleImputer()

Its key parameters:

  • missing_value: placeholder treated as missing (default np.nan);
  • strategy: method for filling gaps ('mean' by default);
  • fill_value: used when strategy='constant'.

As a transformer, it provides methods such as .fit(), .transform(), and .fit_transform().

Choosing how to fill missing data is essential. A common approach:

  • numerical features → mean;
  • categorical features → most frequent value.

strategy options:

  • 'mean' — fill with mean;
  • 'median' — fill with median;
  • 'most_frequent' — fill with mode;
  • 'constant' — fill with a specified value via fill_value.

missing_values defines which values are treated as missing (default NaN, but may be '' or another marker).

Note
Note

SimpleImputer expects a DataFrame, not a Series. A single-column DataFrame must be selected using double brackets:

imputer.fit_transform(df[['column']])

fit_transform() returns a 2D array, but assigning back to a DataFrame column requires a 1D array. Flatten the result using .ravel():

df['column'] = imputer.fit_transform(df[['column']]).ravel()
Tâche

Glissez pour commencer à coder

You are given a DataFrame df containing penguin data. The 'sex' column has missing values. Fill them using the most frequent category.

  1. Import SimpleImputer;
  2. Create an imputer with strategy='most_frequent';
  3. Apply it to df[['sex']];
  4. Assign the imputed values back to df['sex'].

Solution

Switch to desktopPassez à un bureau pour une pratique réelleContinuez d'où vous êtes en utilisant l'une des options ci-dessous
Tout était clair ?

Comment pouvons-nous l'améliorer ?

Merci pour vos commentaires !

Section 1. Chapitre 9
single

single

Demandez à l'IA

expand

Demandez à l'IA

ChatGPT

Posez n'importe quelle question ou essayez l'une des questions suggérées pour commencer notre discussion

some-alt