Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
学ぶ Challenge: Predict Market Direction with Logistic Regression | Building and Evaluating Trading Strategies
Python for Traders
セクション 3.  7
single

single

bookChallenge: Predict Market Direction with Logistic Regression

メニューを表示するにはスワイプしてください

To put your understanding of machine learning in trading to the test, you'll tackle a hands-on challenge: predicting the next day's market direction using logistic regression. You'll begin with a hardcoded DataFrame of closing prices, engineer lagged return features, and use scikit-learn's logistic regression to classify whether the next day's return is positive or negative. You'll then evaluate the model's accuracy and display a confusion matrix to summarize its predictive performance.

Start by creating a DataFrame with daily closing prices. From these prices, calculate daily returns and then generate lagged versions of these returns to use as input features for your model. The target variable will be whether the following day's return is positive (market up) or not (market down or unchanged).

Note
Note

You can experiment by adjusting the number of lagged features or changing the train-test split to see how it affects the model's performance. For more on logistic regression and confusion matrices, consult the scikit-learn documentation.

タスク

スワイプしてコーディングを開始

You are given a DataFrame of daily closing prices for a hypothetical asset. Your task is to build a logistic regression model to predict whether the next day's return will be positive (market up) or not (market down or unchanged) using lagged return features.

  • Calculate daily returns from closing prices and add them as a new column named Return.
  • Create two new columns Lag1 and Lag2 that contain the previous day's and the day-before-previous's return, respectively.
  • Define a new column Direction as your target variable: set to 1 if the next day's return is greater than 0, otherwise 0.
  • Remove any rows containing missing values caused by shifting.
  • Use Lag1 and Lag2 as features (X) and Direction as the target (y).
  • Use all but the last 5 observations for training, and the last 5 for testing.
  • Fit a LogisticRegression model using scikit-learn on the training data.
  • Use the model to make predictions on the test set.
  • Calculate and print the test accuracy.
  • Calculate and print the confusion matrix for the test predictions.

解答

Switch to desktop実践的な練習のためにデスクトップに切り替える下記のオプションのいずれかを利用して、現在の場所から続行する
すべて明確でしたか?

どのように改善できますか?

フィードバックありがとうございます!

セクション 3.  7
single

single

AIに質問する

expand

AIに質問する

ChatGPT

何でも質問するか、提案された質問の1つを試してチャットを始めてください

some-alt