Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Learn Challenge: Scaling the Features | Preprocessing Data with Scikit-learn
ML Introduction with scikit-learn

bookChallenge: Scaling the Features

In this challenge, scale the features of the penguins dataset (already encoded and without missing values) using StandardScaler.

12345
import pandas as pd df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/a65bbc96-309e-4df9-a790-a1eb8c815a1c/penguins_imputed_encoded.csv') print(df)
copy

Here is a little reminder of the StandardScaler class.

Task

Swipe to start coding

You are given a DataFrame named df that contains encoded and imputed penguin data. Your goal is to standardize all feature values so that each column has a mean of 0 and a variance of 1. This ensures that features are on the same scale before training a machine learning model.

  1. Import the StandardScaler class from sklearn.preprocessing.
  2. Separate the feature matrix X and the target variable y from the DataFrame.
  3. Create a StandardScaler object.
  4. Apply the scaler to the feature matrix X and store the scaled values back into X.

Solution

Everything was clear?

How can we improve it?

Thanks for your feedback!

SectionΒ 2. ChapterΒ 11
single

single

Ask AI

expand

Ask AI

ChatGPT

Ask anything or try one of the suggested questions to begin our chat

Suggested prompts:

How do I use StandardScaler to scale the penguins dataset?

Can you show me how to fit and transform the data with StandardScaler?

What does scaling the features with StandardScaler achieve?

close

Awesome!

Completion rate improved to 3.13

bookChallenge: Scaling the Features

Swipe to show menu

In this challenge, scale the features of the penguins dataset (already encoded and without missing values) using StandardScaler.

12345
import pandas as pd df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/a65bbc96-309e-4df9-a790-a1eb8c815a1c/penguins_imputed_encoded.csv') print(df)
copy

Here is a little reminder of the StandardScaler class.

Task

Swipe to start coding

You are given a DataFrame named df that contains encoded and imputed penguin data. Your goal is to standardize all feature values so that each column has a mean of 0 and a variance of 1. This ensures that features are on the same scale before training a machine learning model.

  1. Import the StandardScaler class from sklearn.preprocessing.
  2. Separate the feature matrix X and the target variable y from the DataFrame.
  3. Create a StandardScaler object.
  4. Apply the scaler to the feature matrix X and store the scaled values back into X.

Solution

Switch to desktopSwitch to desktop for real-world practiceContinue from where you are using one of the options below
Everything was clear?

How can we improve it?

Thanks for your feedback!

SectionΒ 2. ChapterΒ 11
single

single

some-alt