Removing Duplicates
To remove duplicate rows, use the drop_duplicates() method. By default it returns a new dataframe and leaves the original untouched. To modify the current dataframe instead, add inplace=True.
new_data = data.drop_duplicates()  # data is not modified
# or
data.drop_duplicates(inplace=True)  # data is modified
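As a minimal sketch of the difference between the two calls, the toy dataframe below has one repeated row. Its column names and values are made up for illustration and are not taken from the lesson's dataset.

```python
import pandas as pd

# Hypothetical toy data: the second and third rows are identical
data = pd.DataFrame({
    "name": ["Earth", "Mars", "Mars", "Venus"],
    "moons": [1, 2, 2, 0],
})

# Returns a new dataframe; `data` still has all four rows
new_data = data.drop_duplicates()
print(len(data), len(new_data))  # 4 3

# Modifies `data` itself and returns None
result = data.drop_duplicates(inplace=True)
print(len(data), result)  # 3 None
```

Note that the remaining rows keep their original index labels (0, 1, 3 here), so the index may have gaps after duplicates are dropped.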
Task
The planets dataset is given to you. Remove the duplicates, then check the shape of the new dataframe and compare it with the original shape.
Note that a dataframe may contain only distinct records. In that case, the shape will remain the same.
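The shape comparison can be sketched as follows. Because the planets dataset itself is not shown here, the example uses a small made-up dataframe named df as a stand-in; the column names and values are assumptions, so substitute the real dataframe when working on the task.

```python
import pandas as pd

# Hypothetical stand-in for the real dataset (rows 0 and 1 are identical)
df = pd.DataFrame({
    "method": ["Transit", "Transit", "Imaging", "Transit"],
    "year": [2010, 2010, 2008, 2011],
})

original_shape = df.shape
deduplicated = df.drop_duplicates()

print("Original shape:", original_shape)                       # (4, 2)
print("Shape after removing duplicates:", deduplicated.shape)  # (3, 2)

# Number of duplicate rows that were dropped; 0 means all rows were distinct
removed = original_shape[0] - deduplicated.shape[0]
print("Rows removed:", removed)                                # 1
```

Only the row count can change; the number of columns stays the same.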