**Data scaling** and **normalization** are two terms that are often used interchangeably, but they actually refer to slightly different concepts.

Data scaling refers to transforming a dataset's values so that all features share a common scale. This can involve rescaling each feature to a specific minimum and maximum value (for example, 0 to 1), or standardizing it so that it has a mean of zero and a standard deviation of one. *The goal of data scaling is to ensure that all the dataset's features are on the same scale so that no feature dominates the others simply because of its units or magnitude*.
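The two scaling approaches described above can be sketched in a few lines of plain Python. This is a minimal illustration, not a production implementation: the function names `min_max_scale` and `standardize` and the sample `ages` and `incomes` values are made up for this example, and in practice you would more likely reach for a library scaler such as scikit-learn's `MinMaxScaler` or `StandardScaler`.

```python
from statistics import mean, pstdev


def min_max_scale(values, new_min=0.0, new_max=1.0):
    """Rescale values linearly so they span [new_min, new_max]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant feature carries no spread; map everything to new_min.
        return [new_min for _ in values]
    span = hi - lo
    return [new_min + (v - lo) * (new_max - new_min) / span for v in values]


def standardize(values):
    """Shift and scale values to mean 0 and (population) std 1."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # A constant feature cannot be scaled to unit variance.
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


# Hypothetical features measured in very different units.
ages = [20, 30, 40, 50, 60]
incomes = [20_000, 35_000, 50_000, 80_000, 120_000]

scaled_ages = min_max_scale(ages)
scaled_incomes = min_max_scale(incomes)
standardized_incomes = standardize(incomes)

print(scaled_ages)
print(scaled_incomes)
print(standardized_incomes)
```

After min-max scaling, both features lie between 0 and 1, so the large raw income values no longer outweigh the ages. Standardization does not bound the values to a fixed range; it only centers them at zero with unit spread.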

Normalization, on the other hand, refers to the process of transforming the values of a dataset so that they conform to a specific distribution. This can involve transforming the data so that it has a normal (Gaussian) distribution or some other distribution. *Normalization aims to make the data more interpretable or to meet the assumptions of a particular statistical test or machine learning algorithm*.
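As a rough illustration of changing the shape of a distribution, the sketch below applies a logarithmic transform to a right-skewed sample and compares the skewness before and after. The log transform is only one common choice and is an assumption of this example, not a method prescribed by the text; it does not guarantee a Gaussian result. The `skewness` helper and the `raw` sample values are likewise hypothetical.

```python
import math
from statistics import mean, pstdev


def skewness(values):
    """Population skewness: the mean cubed deviation divided by std cubed."""
    mu = mean(values)
    sigma = pstdev(values)
    return mean([(v - mu) ** 3 for v in values]) / sigma ** 3


# Hypothetical right-skewed sample: most values are small, a few are very large.
raw = [1, 2, 2, 3, 3, 4, 5, 8, 20, 100]

# The log transform compresses large values much more than small ones,
# pulling in the long right tail. It requires strictly positive inputs.
logged = [math.log(v) for v in raw]

raw_skew = skewness(raw)
log_skew = skewness(logged)

print(f"skewness before: {raw_skew:.2f}")
print(f"skewness after:  {log_skew:.2f}")
```

A skewness closer to zero indicates a more symmetric distribution. For data where a plain logarithm is not enough, parameterized power transforms such as Box-Cox are a frequently used alternative.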

Data scaling is the more common preprocessing step in machine learning, because many algorithms, such as distance-based methods and models trained with gradient descent, are sensitive to the magnitude of each feature and can perform poorly when features are on very different scales. Normalization is less commonly used but can be important in certain situations, such as when working with data that has a skewed distribution or when using statistical tests that assume a particular distribution.



Data Scaling vs Data Normalization
