Introduction to pandas
INTERMEDIATE
#python
Author: Alina Myronets
Course description
Pandas is a very easy-to-use library for analyzing any data. It is also used to work with large amounts of data, reading data in data frame and data series structures. This is a very useful tool for Data Science. We will learn about many statistical functions: how to search for correlation, mode, median, maximum and minimum values in a dataset. Learn what to do with missing values in a dataset. Learn to work with specific values in the dataset, as well as how to delete them.
Complete all chapters to get certificate
0%
The Very First Steps
In this section, we will introduce the basics of the series and dataframe data structures. We will also learn the difference between these structures.
What Is pandas?
Series in pandas
Challenge
DataFrame
Challenge 1
Challenge 2
Add a New Column 1/2
Add a New Column 2/2
Delete a Column
Challenge
Work With Columns
Challenge
Function iloc[] 1/2
Function iloc[] 2/2
Reading Files in pandas
Raw data comes in many forms: CSV, JSON, SQL, HTML, and so on. With pandas, we can work with data in a variety of formats. In this chapter, we will take a closer look at the CSV and TXT formats.
CSV Files
TXT Files
Challenge 1
Challenge 2
Analyzing the Data
In this section, we will learn how to work with raw data, how to remove unwanted data, and how to work with null values in a dataset.
Viewing the Data
Challenge 1
Challenge 2
Information on the Data 1/2
Information on the Data 2/2
Finding Null Values
Challenge
Dropping Null Values
Filling Null Values
Challenge
Describing the Data
Challenge
Max() and Min()
Challenge
Sum() and Count()
Unique Values