Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Lære Challenge: Predict Market Direction with Logistic Regression | Building and Evaluating Trading Strategies
Python for Traders

bookChallenge: Predict Market Direction with Logistic Regression

To put your understanding of machine learning in trading to the test, you'll tackle a hands-on challenge: predicting the next day's market direction using logistic regression. You'll begin with a hardcoded DataFrame of closing prices, engineer lagged return features, and use scikit-learn's logistic regression to classify whether the next day's return is positive or negative. You'll then evaluate the model's accuracy and display a confusion matrix to summarize its predictive performance.

Start by creating a DataFrame with daily closing prices. From these prices, calculate daily returns and then generate lagged versions of these returns to use as input features for your model. The target variable will be whether the following day's return is positive (market up) or not (market down or unchanged).

Note
Note

You can experiment by adjusting the number of lagged features or changing the train-test split to see how it affects the model's performance. For more on logistic regression and confusion matrices, consult the scikit-learn documentation.

Oppgave

Swipe to start coding

You are given a DataFrame of daily closing prices for a hypothetical asset. Your task is to build a logistic regression model to predict whether the next day's return will be positive (market up) or not (market down or unchanged) using lagged return features.

  • Calculate daily returns from closing prices and add them as a new column named Return.
  • Create two new columns Lag1 and Lag2 that contain the previous day's and the day-before-previous's return, respectively.
  • Define a new column Direction as your target variable: set to 1 if the next day's return is greater than 0, otherwise 0.
  • Remove any rows containing missing values caused by shifting.
  • Use Lag1 and Lag2 as features (X) and Direction as the target (y).
  • Use all but the last 5 observations for training, and the last 5 for testing.
  • Fit a LogisticRegression model using scikit-learn on the training data.
  • Use the model to make predictions on the test set.
  • Calculate and print the test accuracy.
  • Calculate and print the confusion matrix for the test predictions.

Løsning

Alt var klart?

Hvordan kan vi forbedre det?

Takk for tilbakemeldingene dine!

Seksjon 3. Kapittel 7
single

single

Spør AI

expand

Spør AI

ChatGPT

Spør om hva du vil, eller prøv ett av de foreslåtte spørsmålene for å starte chatten vår

Suggested prompts:

Can you show me how to create the DataFrame with closing prices?

How do I calculate lagged return features from the closing prices?

Can you explain how to set up the logistic regression model for this task?

close

bookChallenge: Predict Market Direction with Logistic Regression

Sveip for å vise menyen

To put your understanding of machine learning in trading to the test, you'll tackle a hands-on challenge: predicting the next day's market direction using logistic regression. You'll begin with a hardcoded DataFrame of closing prices, engineer lagged return features, and use scikit-learn's logistic regression to classify whether the next day's return is positive or negative. You'll then evaluate the model's accuracy and display a confusion matrix to summarize its predictive performance.

Start by creating a DataFrame with daily closing prices. From these prices, calculate daily returns and then generate lagged versions of these returns to use as input features for your model. The target variable will be whether the following day's return is positive (market up) or not (market down or unchanged).

Note
Note

You can experiment by adjusting the number of lagged features or changing the train-test split to see how it affects the model's performance. For more on logistic regression and confusion matrices, consult the scikit-learn documentation.

Oppgave

Swipe to start coding

You are given a DataFrame of daily closing prices for a hypothetical asset. Your task is to build a logistic regression model to predict whether the next day's return will be positive (market up) or not (market down or unchanged) using lagged return features.

  • Calculate daily returns from closing prices and add them as a new column named Return.
  • Create two new columns Lag1 and Lag2 that contain the previous day's and the day-before-previous's return, respectively.
  • Define a new column Direction as your target variable: set to 1 if the next day's return is greater than 0, otherwise 0.
  • Remove any rows containing missing values caused by shifting.
  • Use Lag1 and Lag2 as features (X) and Direction as the target (y).
  • Use all but the last 5 observations for training, and the last 5 for testing.
  • Fit a LogisticRegression model using scikit-learn on the training data.
  • Use the model to make predictions on the test set.
  • Calculate and print the test accuracy.
  • Calculate and print the confusion matrix for the test predictions.

Løsning

Switch to desktopBytt til skrivebordet for virkelighetspraksisFortsett der du er med et av alternativene nedenfor
Alt var klart?

Hvordan kan vi forbedre det?

Takk for tilbakemeldingene dine!

Seksjon 3. Kapittel 7
single

single

some-alt