Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Apprendre Filtering the DataFrame | Unveiling the Power of Data Manipulation with Pandas
Unveiling the Power of Data Manipulation with Pandas

book
Filtering the DataFrame

Filtering a pandas DataFrame refers to selecting rows based on a specific condition. You can filter a DataFrame using the [] operator with conditions inside, such as df[df['column'] > value], or the .query() method, like df.query('column > value').

For example, suppose you have a DataFrame df with columns "Name", "Age", and "Gender", and you want to select all rows where the values of the "Age" column are greater than 30. You can use the following code to filter the DataFrame:

python
# Filter the DataFrame using
# the [] operator
df_filtered = df[df["Age"] > 30]

# Filter the DataFrame using
# the .query() method
df_filtered = df.query("Age > 30")

You can also use the "&" (logical and) and "|" (logical or) operators to combine multiple conditions.

Tâche

Swipe to start coding

  1. Filter the data DataFrame using multiple conditions (use the logical and operator to combine them):
    • The value of the 'MANAGER_ID' column is greater than 100.
    • The value of the 'LOCATION_ID' column is equal to 1700.

Solution

# Filter the data based on multiple conditions
data_filtered = data[(data['MANAGER_ID'] > 100) & (data['LOCATION_ID'] == 1700)].reset_index(drop=True)

data_filtered

Mark tasks as Completed
Tout était clair ?

Comment pouvons-nous l'améliorer ?

Merci pour vos commentaires !

Section 1. Chapitre 4
AVAILABLE TO ULTIMATE ONLY
some-alt