Learn Margin and Entropy Sampling | Query Strategies
Active Learning with Python

Margin and Entropy Sampling

Margin sampling and entropy sampling are two widely used query strategies in active learning, both designed to identify the most informative unlabeled samples for labeling. Margin sampling focuses on the difference between the highest and the second-highest predicted class probabilities for each sample. The smaller this margin, the less confident the model is about its prediction, signaling a more uncertain and potentially informative example. In contrast, entropy sampling quantifies uncertainty using the entropy of the predicted class probability distribution for each sample. Entropy measures the amount of uncertainty or randomness; higher entropy values indicate that the model is less certain about its prediction across all possible classes, rather than just the top two.
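
Concretely, for a sample x with predicted class probabilities p(c | x), the two scores can be written as follows. Margin sampling scores x by the gap between the two most probable classes, margin(x) = p(ŷ₁ | x) − p(ŷ₂ | x), and queries the samples with the smallest margin. Entropy sampling scores x by H(x) = −Σ_c p(c | x) log p(c | x) and queries the samples with the highest entropy. For a binary problem the two rankings coincide; the difference only appears with three or more classes, where entropy also accounts for probability mass outside the top two.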

import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

# Create a more complex, noisy dataset
X, y = make_classification(
    n_samples=600,
    n_features=6,
    n_informative=3,
    n_redundant=1,
    n_clusters_per_class=2,
    flip_y=0.15,     # adds label noise → much more uncertainty
    class_sep=0.6,   # higher overlap between classes
    random_state=42
)

# Train a weaker classifier to increase uncertainty
clf = LogisticRegression(max_iter=2000)
clf.fit(X, y)

# Take a batch from the dataset
probs = clf.predict_proba(X[:5])

# Margin sampling: difference between the top two class probabilities
margins = []
for prob in probs:
    sorted_probs = np.sort(prob)[::-1]
    margin = sorted_probs[0] - sorted_probs[1]
    margins.append(margin)

# Entropy sampling: entropy of the full predicted distribution
entropies = []
for prob in probs:
    entropy = -np.sum(prob * np.log(prob + 1e-12))
    entropies.append(entropy)

print("Class probabilities for each sample:")
print(probs.round(4))
print("\nMargin values (smaller = more uncertain):")
print([round(m, 4) for m in margins])
print("\nEntropy values (higher = more uncertain):")
print([round(e, 4) for e in entropies])
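
In a full active learning loop, these scores are then used to rank the unlabeled pool and decide which samples to send for labeling. Below is a minimal sketch of that selection step, reusing the margins and entropies lists from the code above; the budget of 2 queries is an arbitrary choice for illustration:

# Pick the most informative samples in the batch
n_queries = 2  # hypothetical labeling budget
margin_picks = np.argsort(margins)[:n_queries]           # smallest margins first
entropy_picks = np.argsort(entropies)[::-1][:n_queries]  # largest entropies first

print("Queried by margin sampling:", margin_picks)
print("Queried by entropy sampling:", entropy_picks)

Because the dataset above is binary, both strategies select the same samples here; with three or more classes the two rankings can differ.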

Which query strategy — margin sampling or entropy sampling — is generally more sensitive to the overall distribution of class probabilities?


