Advanced Aggregation [2/2]
Great! What is you need to apply certain functions to specific columns separately, like  find out median of one column, and mean of another one. Surely, you can  perform grouping twice. But this case is also covered by pandas and .agg() method. If you want to apply specific functions to certain columns, pass them as  a dictionary {'column': 'function'}. For instance, we can calculate  mean total income ('totinch') and median number of people in a dwelling ('hhsize') for  each number of bedrooms.
12345678# Importing the library import pandas as pd # Reading the file df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/f2947b09-5f0d-4ad9-992f-ec0b87cd4b3f/data4.csv') # Minimal and maximal prices for each dwelling type print(df.groupby('broomh').agg({'totinch': 'mean', 'hhsize': 'median'}))
Note that if you pass dictionary as the
.agg()method parameter, then there is no need to select columns after grouping.
Bedankt voor je feedback!
Vraag AI
Vraag AI
Vraag wat u wilt of probeer een van de voorgestelde vragen om onze chat te starten.
Stel mij vragen over dit onderwerp
Vat dit hoofdstuk samen
Toon voorbeelden uit de praktijk
Awesome!
Completion rate improved to 2.56
Advanced Aggregation [2/2]
Veeg om het menu te tonen
Great! What is you need to apply certain functions to specific columns separately, like  find out median of one column, and mean of another one. Surely, you can  perform grouping twice. But this case is also covered by pandas and .agg() method. If you want to apply specific functions to certain columns, pass them as  a dictionary {'column': 'function'}. For instance, we can calculate  mean total income ('totinch') and median number of people in a dwelling ('hhsize') for  each number of bedrooms.
12345678# Importing the library import pandas as pd # Reading the file df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/f2947b09-5f0d-4ad9-992f-ec0b87cd4b3f/data4.csv') # Minimal and maximal prices for each dwelling type print(df.groupby('broomh').agg({'totinch': 'mean', 'hhsize': 'median'}))
Note that if you pass dictionary as the
.agg()method parameter, then there is no need to select columns after grouping.
Bedankt voor je feedback!