Recognizing Handwritten Digits

Train Test Split

The train_test_split function, part of scikit-learn's sklearn.model_selection module, divides a dataset into two parts: a training subset and a testing subset.

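By default, train_test_split() shuffles the samples and partitions them at random. The test_size or train_size parameter sets the size of each subset, either as a fraction of the dataset (a float between 0 and 1) or as an absolute number of samples (an integer). If neither is given, the test set defaults to 25% of the data.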
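As a minimal sketch of the call, the example below splits a small made-up array. The toy data, the test_size value of 0.25, and the random_state value are assumptions chosen for illustration and are not part of the lesson.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Toy data (hypothetical): 8 samples with 2 features each, plus 8 labels
X = np.arange(16).reshape(8, 2)
y = np.arange(8)

# Hold out 25% of the samples for testing; random_state fixes the shuffle
# so the same split is produced on every run
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42
)

print(X_train.shape, X_test.shape)  # feature matrices
print(y_train.shape, y_test.shape)  # target vectors
```

The function returns the pieces in the order X_train, X_test, y_train, y_test, and each row of X stays paired with its label in y.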


  1. Split the dataset into training and test sets. Use only the first 1000 samples for splitting.
  2. Print the shapes and sizes of the resulting training and test sets for both the feature matrix and the target vector.
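One possible way to approach the two steps above is sketched here. It assumes the data comes from scikit-learn's load_digits dataset and uses a test_size of 0.2 and a fixed random_state. The lesson does not state any of these, so adjust them to match the actual task.

```python
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split

# Assumption: the handwritten digits are loaded with load_digits
digits = load_digits()
X, y = digits.data, digits.target

# Step 1: split only the first 1000 samples
# (test_size=0.2 and random_state=42 are illustrative choices)
X_train, X_test, y_train, y_test = train_test_split(
    X[:1000], y[:1000], test_size=0.2, random_state=42
)

# Step 2: print the shape and size of each resulting array
for name, arr in [
    ("X_train", X_train),
    ("X_test", X_test),
    ("y_train", y_train),
    ("y_test", y_test),
]:
    print(f"{name}: shape={arr.shape}, size={arr.size}")
```

Here shape gives the array dimensions, while size gives the total number of elements, so for a feature matrix size equals rows times columns.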
