Statistics with pandas
The pandas library has built-in methods for calculating the mean and median. To import pandas using the `pd` alias, use the following syntax:
```python
import pandas as pd
```
Here's an example of calculating `.mean()` and `.median()` for the `'work_year'` column in the dataset named `df`. Feel free to change the column and observe the results:
```python
import pandas as pd

df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/a849660e-ddfa-4033-80a6-94a1b7772e23/update/ds_salaries_statistics', index_col=0)

# Calculating the mean value
mean = df['work_year'].mean()
# Calculating the median value
median = df['work_year'].median()

print('The mean value is', mean)
print('The median value is', median)
```
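If you'd like to try the same two methods without downloading the dataset, here is a minimal sketch on a small hand-made DataFrame. The values and the `'salary_in_usd'` column name are assumptions made up for illustration; they are not taken from the lesson's dataset.

```python
import pandas as pd

# Hypothetical sample data, invented for this illustration only
df = pd.DataFrame({
    'work_year': [2020, 2021, 2021, 2022, 2022],
    'salary_in_usd': [50000, 60000, 65000, 70000, 300000],
})

# Calculating the mean value of the chosen column
mean_salary = df['salary_in_usd'].mean()
# Calculating the median value of the same column
median_salary = df['salary_in_usd'].median()

print('The mean value is', mean_salary)
print('The median value is', median_salary)
```

In this made-up sample, the single large salary pulls the mean well above the median, which is one reason it can be useful to look at both values.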
To calculate key statistical values, use the pandas methods designed for numeric data, such as `.mean()` and `.median()`.
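Both methods can also be requested in a single call. The sketch below uses `.agg()` with a list of method names on a small DataFrame; the data and the `'salary_in_usd'` column name are hypothetical and chosen only for illustration.

```python
import pandas as pd

# Hypothetical sample data, invented for this illustration only
df = pd.DataFrame({
    'work_year': [2020, 2021, 2021, 2022, 2022],
    'salary_in_usd': [50000, 60000, 65000, 70000, 300000],
})

# Compute the mean and the median of several columns at once;
# the result has one row per statistic and one column per input column
summary = df[['work_year', 'salary_in_usd']].agg(['mean', 'median'])

print(summary)
```

A single value can then be read with `summary.loc['median', 'work_year']`.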