Learn Introduction to Machine Learning for Trading | Building and Evaluating Trading Strategies

Swipe to show menu

Machine learning has become an essential tool for traders seeking to uncover patterns and predict future market movements. By leveraging historical data, you can train models to classify whether the next trading day is likely to be an "up" day (price increase) or a "down" day (price decrease). This approach is particularly useful when the goal is to anticipate short-term market direction using features derived from past price action, such as lagged returns. Classification models, such as LogisticRegression, allow you to estimate the probability that the market will move in a certain direction, guiding potential trading decisions.


              123456789101112131415161718192021222324252627
            
import pandas as pd
import numpy as np
from sklearn.linear_model import LogisticRegression

# Create a simple DataFrame with daily returns
np.random.seed(42)
returns = np.random.normal(0, 0.01, 100)
df = pd.DataFrame({"Return": returns})

# Create lagged features (previous day's return)
df["Lag1"] = df["Return"].shift(1)
df = df.dropna()

# Define the target: 1 for up day, 0 for down day
df["Direction"] = (df["Return"] > 0).astype(int)

# Features and target
X = df[["Lag1"]]
y = df["Direction"]

# Fit logistic regression
model = LogisticRegression()
model.fit(X, y)

# Predict the direction for the same data (for demonstration)
df["Predicted"] = model.predict(X)
print(df[["Return", "Lag1", "Direction", "Predicted"]].head())

The effectiveness of any machine learning model for trading depends heavily on feature engineering and careful evaluation. Feature engineering is the process of creating input variables that help the model capture relevant patterns in the data. In trading, this might include lagged returns, technical indicators, or volatility measures. Once features are selected and the model is trained, you need to assess its performance using metrics such as accuracy and confusion matrices. This evaluation helps you understand not only how often the model is correct, but also the types of errors it makes—critical for real-world trading decisions where costs and risks are involved.


              123456789
            
from sklearn.metrics import accuracy_score, confusion_matrix

# Calculate accuracy
accuracy = accuracy_score(df["Direction"], df["Predicted"])
print("Model accuracy:", accuracy)

# Confusion matrix
cm = confusion_matrix(df["Direction"], df["Predicted"])
print("Confusion matrix:\n", cm)

1. What is the goal of using classification models in trading?

2. Why is feature engineering important for trading models?

Everything was clear?

Thanks for your feedback!

Section 3. Chapter 6

Ask AI

Ask anything or try one of the suggested questions to begin our chat

Section 3. Chapter 6