course content

Course Content

Principal Component Analysis

Feature Vector and Principal ComponentsFeature Vector and Principal Components

After we have our main components, we need to create a feature vector. Why do we need this new variable? At this stage, we decide whether to keep all components or discard those that have the least value. The feature vector is just a matrix of vectors from the remaining most significant components.

Thus, the creation of the feature vector is exactly the stage at which dataset dimensionality reduction occurs, because if we decide to keep only p principal components out of n, the final dataset will have only p dimensions.

pict

We can reduce a matrix with 2 components to 1 component:

pict

Finally, we have the main components and we can transform our data, i.e. reorient the data from the original axes to those represented by the principal components. This is implemented very simply by multiplying the feature vector by standardized data (the matrices must be transposed):

pict

Quiz

From which dimension to which was the dataset in the image transferred?

pict

question-icon

Choose the correct option.

Select the correct answer

Section 2.

Chapter 4