Logistic Regression Mastering

Train and Test Split

The train-test split procedure is used to estimate the performance of machine learning algorithms when they are used to make predictions on data not used to train the model.

It is fast and easy to perform, and its results let you compare the performance of different machine learning algorithms on your predictive modeling problem.

Methods description

  • sklearn: The import name of the scikit-learn package, which provides simple and efficient tools for data mining and data analysis, including a range of algorithms and utilities for machine learning tasks;

  • model_selection: This submodule within sklearn provides tools for model selection and evaluation, including methods for splitting data into training and testing sets;

  • .train_test_split(): This function splits arrays or matrices into random train and test subsets. It takes arrays X and y, which hold the features and the target variable, respectively. The test_size parameter sets the proportion of the dataset to include in the test split (for example, 0.33 for 33%). The random_state parameter sets the seed used for random sampling, so the same split is produced on every run. Given X and y, it returns four arrays in this order: X_train, X_test, y_train, and y_test, the training and testing sets for the features and the target variable.
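
As a minimal sketch of the call described above, the snippet below splits a small toy dataset and checks that a fixed random_state reproduces the same split. The arrays, the 20% test size, and the seed value 42 are illustrative assumptions, not values taken from the course.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Hypothetical toy data: 10 samples with 2 features each, plus a binary target.
X = np.arange(20).reshape(10, 2)
y = np.array([0, 1, 0, 1, 0, 1, 0, 1, 0, 1])

# Hold out 20% of the rows for testing; the seed (42) is an arbitrary choice.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# Splitting again with the same seed yields the same partition.
X_train_again, X_test_again, _, _ = train_test_split(
    X, y, test_size=0.2, random_state=42
)

print(X_train.shape, X_test.shape)  # (8, 2) (2, 2)
print(np.array_equal(X_test, X_test_again))  # True
```

Leaving random_state unset would give a different shuffle on each run, which makes results harder to compare between experiments.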

Task

  1. Import train_test_split from sklearn.model_selection.

  2. Define X as all the features (exclude "target").

  3. Define y as the "target" variable.

  4. Split the data into a training set (67%) and a test set (33%).
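
The steps above can be sketched roughly as follows. The lesson's actual dataset is not shown on this page, so the DataFrame df, its feature column names, and the random_state value are hypothetical stand-ins. Only the "target" column name and the 67/33 proportions come from the task.

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# Hypothetical stand-in for the lesson's dataset (100 rows).
df = pd.DataFrame({
    "feature_a": range(100),
    "feature_b": [i % 7 for i in range(100)],
    "target": [i % 2 for i in range(100)],
})

# X holds every column except "target"; y holds the "target" column.
X = df.drop(columns="target")
y = df["target"]

# test_size=0.33 keeps 33% of the rows for testing and the remaining 67% for training.
# random_state is not required by the task; it is set here only for repeatability.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.33, random_state=42
)

print(len(X_train), len(X_test))  # 67 33
```

Because X and y are split in a single call, each row of X_train stays aligned with its label in y_train, and the same holds for the test set.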

Section 1. Chapter 5
