course content

Course Content

Data Preprocessing

Challenge 1Challenge 1


In this challenge, you will need to work with the 'adult-census.csv' dataset. It contains both categorical and numerical data. Your task will be to prepare the data for processing.

  1. Read the dataset 'adult-census.csv'
  2. Explore the dataset. Carefully check which character indicates the missed data in the dataset and replace it with the np.nan object
  3. Remove rows with missing values
  4. Let's start with processing categorical data - columns 'workclass', 'sex' Use a one-hot encoding method to encode them
  5. For numeric data ('age', 'hours-per-week'), you will need to scale the data
  6. Print processed data

Everything was clear?

Section 6. Chapter 1
toggle bottom row