## Introduction to pandas

INTERMEDIATE

#python

Author: Alina Myronets

Course description

Pandas is a very easy-to-use library for analyzing any data. It is also used to work with large amounts of data, reading data in data frame and data series structures. This is a very useful tool for Data Science. We will learn about many statistical functions: how to search for correlation, mode, median, maximum and minimum values in a dataset. Learn what to do with missing values in a dataset. Learn to work with specific values in the dataset, as well as how to delete them.

Complete all chapters to get certificate

0%

The Very First Steps

In this section, we will introduce the basics of the series and dataframe data structures. We will also learn the difference between these structures.

What Is pandas?

Series in pandas

Challenge

DataFrame

Challenge 1

Challenge 2

Add a New Column 1/2

Add a New Column 2/2

Delete a Column

Challenge

Work With Columns

Challenge

Function iloc[] 1/2

Function iloc[] 2/2

Reading Files in pandas

Raw data comes in many forms: CSV, JSON, SQL, HTML, and so on. With pandas, we can work with data in a variety of formats. In this chapter, we will take a closer look at the CSV and TXT formats.

CSV Files

TXT Files

Challenge 1

Challenge 2

Analyzing the Data

In this section, we will learn how to work with raw data, how to remove unwanted data, and how to work with null values in a dataset.

Viewing the Data

Challenge 1

Challenge 2

Information on the Data 1/2

Information on the Data 2/2

Finding Null Values

Challenge

Dropping Null Values

Filling Null Values

Challenge

Describing the Data

Challenge

Max() and Min()

Challenge

Sum() and Count()

Unique Values