Course Content
Learning Statistics with Python
Learning Statistics with Python
2. Mean, Median and Mode with Python
4. Covariance vs Correlation
Statistics with pandas
The Pandas library already has three built-in functions for calculating the mean and median. To import pandas
using the pd
alias, use the following syntax:
python
Here's an example of calculating the .mean()
and .median()
for the 'work_year'
column in the dataset named df
.
Feel free to change the columns and observe the results:
import pandas as pd df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/a849660e-ddfa-4033-80a6-94a1b7772e23/update/ds_salaries_statistics', index_col = 0) # Calculating the mean value mean = df['work_year'].mean() # Calculating the median value median = df['work_year'].median() print('The mean value is', mean) print('The median value is', median)
To calculate key statistical values, use methods designed for handling measurements, such as:
python
Everything was clear?
Thanks for your feedback!
SectionΒ 2. ChapterΒ 3