Challenge: Build a Preprocessing Pipeline | Choosing and Evaluating Techniques
Feature Scaling and Normalization Deep Dive

Challenge: Build a Preprocessing Pipeline

Task

You're given a small mixed-type dataset. Using scikit-learn, build a leakage-safe pipeline that combines preprocessing and a model:

  1. Split data into X (features) and y (target), then do a train/test split (test_size=0.3, random_state=42).
  2. Create a ColumnTransformer named preprocess:
    • numeric columns → StandardScaler()
    • categorical columns → OneHotEncoder(handle_unknown="ignore")
  3. Build a Pipeline named pipe with steps:
    • ("preprocess", preprocess)
    • ("clf", LogisticRegression(max_iter=1000, random_state=0))
  4. Fit on the training set only, then predict on the test set:
    • compute y_pred and test_accuracy = accuracy_score(y_test, y_pred)
  5. Add a few print statements at the end to show the shapes and the accuracy.
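
The steps above can be sketched roughly as follows. The challenge's actual dataset is not shown on this page, so the DataFrame below, its column names (age, income, city, target), and its values are hypothetical stand-ins. Only the names preprocess and pipe, the estimator settings, and the split parameters come from the task. Treat it as one possible approach, not the reference solution.

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Hypothetical stand-in data: the real challenge dataset is not shown.
df = pd.DataFrame({
    "age": [22, 25, 47, 52, 46, 56, 55, 60, 62, 61,
            18, 28, 27, 29, 49, 33, 35, 38, 41, 44],
    "income": [21, 24, 58, 63, 55, 70, 68, 75, 80, 77,
               15, 30, 29, 31, 60, 36, 40, 43, 48, 52],
    "city": ["Paris", "Lyon", "Nice", "Paris", "Lyon",
             "Nice", "Paris", "Lyon", "Nice", "Paris",
             "Lyon", "Nice", "Paris", "Lyon", "Nice",
             "Paris", "Lyon", "Nice", "Paris", "Lyon"],
    "target": [0, 0, 1, 1, 1, 1, 1, 1, 1, 1,
               0, 0, 0, 0, 1, 0, 0, 0, 0, 1],
})

# 1. Separate features and target, then split before any fitting.
X = df.drop(columns="target")
y = df["target"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42
)

# 2. Route each column type to its own transformer.
numeric_cols = ["age", "income"]   # assumed numeric columns
categorical_cols = ["city"]        # assumed categorical column
preprocess = ColumnTransformer([
    ("num", StandardScaler(), numeric_cols),
    ("cat", OneHotEncoder(handle_unknown="ignore"), categorical_cols),
])

# 3. Chain preprocessing and the classifier in one estimator.
pipe = Pipeline([
    ("preprocess", preprocess),
    ("clf", LogisticRegression(max_iter=1000, random_state=0)),
])

# 4. Fit on the training set only; the scaler and encoder never
#    see the test rows, which is what keeps the pipeline leakage-safe.
pipe.fit(X_train, y_train)
y_pred = pipe.predict(X_test)
test_accuracy = accuracy_score(y_test, y_pred)

# 5. Report shapes and the accuracy.
print("Train shape:", X_train.shape)
print("Test shape:", X_test.shape)
print("Test accuracy:", test_accuracy)
```

Because the scaler and encoder live inside pipe, calling pipe.fit(X_train, y_train) learns the scaling statistics and the category vocabulary from the training rows alone, and pipe.predict(X_test) only applies them.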

Section 5. Chapter 3