What Will We Do With the NaN Values?

In the previous chapter, you got the following number of missing values for each column:


PassengerId	0
Survived	0
Pclass	0
Name	0
Sex	0
Age	86
SibSp	0
Parch	0
Ticket	0
Fare	1
Cabin	327
Embarked	0

The dataset has 418 rows. Look at the Cabin column: 327 of its values are missing. Filling them in makes little sense, because too little information remains to estimate them reliably. Deleting the affected rows is not an option either, since that would remove 327 of the 418 rows. So the best solution here is to delete the whole column. Let's figure out how to do this.
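To make the reasoning above concrete, here is a small sketch that turns the missing-value counts into percentages of the 418 rows. The counts are taken from the table above; the variable names (n_rows, missing, share) are only illustrative.

```python
import pandas as pd

# Total number of rows, as stated in the text above.
n_rows = 418

# Missing-value counts for the columns that have any, copied from the table above.
missing = pd.Series({"Age": 86, "Fare": 1, "Cabin": 327})

# Share of missing values per column, in percent, rounded to one decimal place.
share = (missing / n_rows * 100).round(1)
print(share)
```

More than three quarters of the Cabin values are missing, which is why dropping the column is preferred over dropping rows or filling values.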

To delete a column, apply the .drop() method to the data set. The syntax is as follows:

# If you want to delete one column
data.drop(columns = 'column_name', inplace = True)

# If you want to delete several columns
data.drop(columns = ['column_1', 'column_2'], inplace = True)

Explanation:

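.drop() - a method that removes rows or columns; with the columns argument, it removes columns;
columns = 'column_name' or columns = ['column_1', 'column_2'] - an argument of the method that specifies the name or names of the columns you want to delete;
inplace = True - an argument that makes the method modify the data set directly instead of returning a modified copy. Without it, data stays unchanged unless you assign the result back to a variable. Several other pandas methods accept this argument too; we will learn some of them later on.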
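The difference that inplace = True makes can be seen in a short sketch. The DataFrame below is a tiny made-up example, not the course data set, and the variable names are assumptions chosen for illustration.

```python
import pandas as pd

# Tiny made-up DataFrame, used only for illustration (not the course data set).
data = pd.DataFrame({
    "Name": ["A", "B", "C"],
    "Age": [22.0, None, 35.0],
    "Cabin": [None, None, "C85"],
})

# Without inplace=True, drop() returns a new DataFrame and leaves `data` unchanged.
trimmed = data.drop(columns="Cabin")
cols_original = list(data.columns)
cols_trimmed = list(trimmed.columns)

# With inplace=True, `data` itself is modified and the call returns None.
result = data.drop(columns=["Cabin"], inplace=True)
cols_after = list(data.columns)

print(cols_original)
print(cols_trimmed)
print(result)
print(cols_after)
```

Because the in-place call returns None, writing data = data.drop(..., inplace = True) would overwrite data with None; use either the assignment or inplace = True, not both.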

Task


Your task is to delete the column with the greatest number of NaN values. Follow these steps:

Drop the 'Cabin' column using the inplace = True argument.
Output 5 random rows of the data set.
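One possible way to approach these two steps is sketched below. The course data set is not available here, so the sketch builds a small stand-in DataFrame with made-up values; only the column name 'Cabin' and the variable name data come from the lesson. It assumes that .sample() is an acceptable way to pick random rows.

```python
import pandas as pd

# Stand-in data with made-up values; in the course, `data` is already loaded.
data = pd.DataFrame({
    "PassengerId": [1, 2, 3, 4, 5, 6],
    "Age": [34.5, 47.0, None, 27.0, 22.0, 14.0],
    "Cabin": [None, None, "B45", None, None, None],
})

# Step 1: drop the 'Cabin' column, modifying `data` directly.
data.drop(columns='Cabin', inplace=True)

# Step 2: pick 5 random rows; the selection differs between runs.
sample_rows = data.sample(5)
print(sample_rows)
```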
