Course Content
Advanced Techniques in pandas
Advanced Techniques in pandas
Is Data in ...?
In this section, we will continue extracting data using specific conditions. Here, you will become familiar with the helpful method called .isin()
. But firstly, you need to examine the dataset. Look at the first five rows:
import pandas as pd data = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/4bf24830-59ba-4418-969b-aaf8117d522e/cars.csv', index_col = 0) print(data.head())
Now, take a look at the example and the explanation below:
import pandas as pd data = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/4bf24830-59ba-4418-969b-aaf8117d522e/cars.csv', index_col = 0) models = ['HONDA', 'FORD', 'MERCEDES-BENZ', 'HYUNDAI'] data_extracted = data.loc[data['Manufacturer'].isin(models)] print(data_extracted.head())
Explanation:
If you remember, we always put the conditions inside the .loc[]
attribute. Here, we do the same. The .isin(list)
method checks if the values from the column are in the array. In our case, we check if values from the column 'Manufacturer'
are in the list models
.
Swipe to show code editor
Your task here is to extract data about cars where values from the column 'Color'
are equal to 'Grey'
, 'White'
, 'Black'
. Follow the algorithm to easily manage with the task:
- Create the
colors
list with the elements'Grey'
,'White'
,'Black'
(in this order). - Extract values from the column
'Color'
that the listcolor
consists of. Use the.loc[]
attribute. - Output the last five rows of the dataset
data_extracted
.
Thanks for your feedback!
Is Data in ...?
In this section, we will continue extracting data using specific conditions. Here, you will become familiar with the helpful method called .isin()
. But firstly, you need to examine the dataset. Look at the first five rows:
import pandas as pd data = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/4bf24830-59ba-4418-969b-aaf8117d522e/cars.csv', index_col = 0) print(data.head())
Now, take a look at the example and the explanation below:
import pandas as pd data = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/4bf24830-59ba-4418-969b-aaf8117d522e/cars.csv', index_col = 0) models = ['HONDA', 'FORD', 'MERCEDES-BENZ', 'HYUNDAI'] data_extracted = data.loc[data['Manufacturer'].isin(models)] print(data_extracted.head())
Explanation:
If you remember, we always put the conditions inside the .loc[]
attribute. Here, we do the same. The .isin(list)
method checks if the values from the column are in the array. In our case, we check if values from the column 'Manufacturer'
are in the list models
.
Swipe to show code editor
Your task here is to extract data about cars where values from the column 'Color'
are equal to 'Grey'
, 'White'
, 'Black'
. Follow the algorithm to easily manage with the task:
- Create the
colors
list with the elements'Grey'
,'White'
,'Black'
(in this order). - Extract values from the column
'Color'
that the listcolor
consists of. Use the.loc[]
attribute. - Output the last five rows of the dataset
data_extracted
.
Thanks for your feedback!
Is Data in ...?
In this section, we will continue extracting data using specific conditions. Here, you will become familiar with the helpful method called .isin()
. But firstly, you need to examine the dataset. Look at the first five rows:
import pandas as pd data = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/4bf24830-59ba-4418-969b-aaf8117d522e/cars.csv', index_col = 0) print(data.head())
Now, take a look at the example and the explanation below:
import pandas as pd data = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/4bf24830-59ba-4418-969b-aaf8117d522e/cars.csv', index_col = 0) models = ['HONDA', 'FORD', 'MERCEDES-BENZ', 'HYUNDAI'] data_extracted = data.loc[data['Manufacturer'].isin(models)] print(data_extracted.head())
Explanation:
If you remember, we always put the conditions inside the .loc[]
attribute. Here, we do the same. The .isin(list)
method checks if the values from the column are in the array. In our case, we check if values from the column 'Manufacturer'
are in the list models
.
Swipe to show code editor
Your task here is to extract data about cars where values from the column 'Color'
are equal to 'Grey'
, 'White'
, 'Black'
. Follow the algorithm to easily manage with the task:
- Create the
colors
list with the elements'Grey'
,'White'
,'Black'
(in this order). - Extract values from the column
'Color'
that the listcolor
consists of. Use the.loc[]
attribute. - Output the last five rows of the dataset
data_extracted
.
Thanks for your feedback!
In this section, we will continue extracting data using specific conditions. Here, you will become familiar with the helpful method called .isin()
. But firstly, you need to examine the dataset. Look at the first five rows:
import pandas as pd data = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/4bf24830-59ba-4418-969b-aaf8117d522e/cars.csv', index_col = 0) print(data.head())
Now, take a look at the example and the explanation below:
import pandas as pd data = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/4bf24830-59ba-4418-969b-aaf8117d522e/cars.csv', index_col = 0) models = ['HONDA', 'FORD', 'MERCEDES-BENZ', 'HYUNDAI'] data_extracted = data.loc[data['Manufacturer'].isin(models)] print(data_extracted.head())
Explanation:
If you remember, we always put the conditions inside the .loc[]
attribute. Here, we do the same. The .isin(list)
method checks if the values from the column are in the array. In our case, we check if values from the column 'Manufacturer'
are in the list models
.
Swipe to show code editor
Your task here is to extract data about cars where values from the column 'Color'
are equal to 'Grey'
, 'White'
, 'Black'
. Follow the algorithm to easily manage with the task:
- Create the
colors
list with the elements'Grey'
,'White'
,'Black'
(in this order). - Extract values from the column
'Color'
that the listcolor
consists of. Use the.loc[]
attribute. - Output the last five rows of the dataset
data_extracted
.