Learn Statistics with pandas | Mean, Median and Mode with Python
Learning Statistics with Python

Statistics with pandas

The pandas library has built-in methods for calculating the mean and median. To import pandas using the pd alias, use the following syntax:

import pandas as pd

Here's an example that uses .mean() and .median() to calculate the mean and median of the 'work_year' column in the dataset named df.

Feel free to change the columns and observe the results:

import pandas as pd

df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/a849660e-ddfa-4033-80a6-94a1b7772e23/update/ds_salaries_statistics', index_col=0)

# Calculating the mean value
mean = df['work_year'].mean()

# Calculating the median value
median = df['work_year'].median()

print('The mean value is', mean)
print('The median value is', median)
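If you want to try several columns at once, the same methods also work on a selection of columns. The sketch below runs offline on a small hand-made DataFrame; its values and the 'salary_in_usd' column name are assumptions for illustration and may not match the course dataset.

```python
import pandas as pd

# Hypothetical stand-in data: the values and the 'salary_in_usd' column
# are made up for illustration and are not taken from the course dataset.
df = pd.DataFrame({
    'work_year': [2020, 2021, 2021, 2022, 2022],
    'salary_in_usd': [50000, 60000, 65000, 70000, 250000],
})

# Selecting a list of columns returns a DataFrame, so .mean() and
# .median() return a Series with one value per selected column.
means = df[['work_year', 'salary_in_usd']].mean()
medians = df[['work_year', 'salary_in_usd']].median()

print('Mean of each column:')
print(means)
print('Median of each column:')
print(medians)
```

In this made-up data, the single large salary pulls the mean of 'salary_in_usd' well above its median, which is one reason to look at both values.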

To calculate these key statistical values for a column, use the following pandas methods:

.mean()
.median()
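One detail worth knowing when you apply these methods to real data: by default, both skip missing values (NaN) instead of returning NaN. The sketch below illustrates this on a small made-up Series; the variable names are illustrative only.

```python
import pandas as pd

# Made-up values with one missing entry (None becomes NaN in a numeric Series).
values = pd.Series([10, 20, None, 40])

# By default (skipna=True), missing values are ignored.
mean_skip = values.mean()      # average of 10, 20 and 40
median_skip = values.median()  # middle value of 10, 20 and 40

# With skipna=False, a missing value makes the result NaN.
mean_strict = values.mean(skipna=False)

print('Mean, ignoring missing values:', mean_skip)
print('Median, ignoring missing values:', median_skip)
print('Mean with skipna=False:', mean_strict)
```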