Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Oppiskele Challenge: Build a Preprocessing Pipeline | Choosing and Evaluating Techniques
Feature Scaling and Normalization Deep Dive

bookChallenge: Build a Preprocessing Pipeline

Tehtävä

Swipe to start coding

You're given a small mixed-type dataset. Build a leakage-safe preprocessing + model pipeline with scikit-learn:

  1. Split data into X (features) and y (target), then do a train/test split (test_size=0.3, random_state=42).
  2. Create a ColumnTransformer named preprocess:
    • numeric columns → StandardScaler()
    • categorical columns → OneHotEncoder(handle_unknown="ignore")
  3. Build a Pipeline named pipe with steps:
    • ("preprocess", preprocess)
    • ("clf", LogisticRegression(max_iter=1000, random_state=0))
  4. Fit on train only, then predict on test:
    • compute y_pred and test_accuracy = accuracy_score(y_test, y_pred)
  5. Add a few prints at the end to show shapes and the accuracy.

Ratkaisu

Oliko kaikki selvää?

Miten voimme parantaa sitä?

Kiitos palautteestasi!

Osio 5. Luku 3
single

single

Kysy tekoälyä

expand

Kysy tekoälyä

ChatGPT

Kysy mitä tahansa tai kokeile jotakin ehdotetuista kysymyksistä aloittaaksesi keskustelumme

close

Awesome!

Completion rate improved to 5.26

bookChallenge: Build a Preprocessing Pipeline

Pyyhkäise näyttääksesi valikon

Tehtävä

Swipe to start coding

You're given a small mixed-type dataset. Build a leakage-safe preprocessing + model pipeline with scikit-learn:

  1. Split data into X (features) and y (target), then do a train/test split (test_size=0.3, random_state=42).
  2. Create a ColumnTransformer named preprocess:
    • numeric columns → StandardScaler()
    • categorical columns → OneHotEncoder(handle_unknown="ignore")
  3. Build a Pipeline named pipe with steps:
    • ("preprocess", preprocess)
    • ("clf", LogisticRegression(max_iter=1000, random_state=0))
  4. Fit on train only, then predict on test:
    • compute y_pred and test_accuracy = accuracy_score(y_test, y_pred)
  5. Add a few prints at the end to show shapes and the accuracy.

Ratkaisu

Switch to desktopVaihda työpöytään todellista harjoitusta vartenJatka siitä, missä olet käyttämällä jotakin alla olevista vaihtoehdoista
Oliko kaikki selvää?

Miten voimme parantaa sitä?

Kiitos palautteestasi!

Osio 5. Luku 3
single

single

some-alt