Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Lære Challenge | Preprocessing Data: Part I
Data Manipulation using pandas

Sveip for å vise menyen

book
Challenge

You've solved the first problem with wrong column type. Let's solve the remaining one (with dots). Recall that there are 4 columns with wrong types left ('morgh', 'valueh', 'grosrth', 'omphtotinch'). These columns considered to have dots as indicators for 'Not applicable'. For instance, columns valueh and grosrth are mutually exclusive, since the first one indicates the price of dwelling (i.e., house is owned) and the second one indicates the monthly rent.

The most appropriate way to solve this problem is to replace dots by NA values. In that case, we would be able to manipulate column like a numerical one.

Oppgave

Swipe to start coding

Perform a replacement of dot symbols . by NAs for 'morgh', 'valueh', 'grosrth', 'omphtotinch' columns. Follow the next steps:

  1. Import the NumPy library with np alias.
  2. Apply the .where() method to the df dataframe.
  3. Set the condition what values must remain unchanged. These must be non-dots values.
  4. Set the other parameter to nan value from NumPy.

Løsning

Switch to desktopBytt til skrivebordet for virkelighetspraksisFortsett der du er med et av alternativene nedenfor
Alt var klart?

Hvordan kan vi forbedre det?

Takk for tilbakemeldingene dine!

Seksjon 1. Kapittel 8
Vi beklager at noe gikk galt. Hva skjedde?

Spør AI

expand
ChatGPT

Spør om hva du vil, eller prøv ett av de foreslåtte spørsmålene for å starte chatten vår

book
Challenge

You've solved the first problem with wrong column type. Let's solve the remaining one (with dots). Recall that there are 4 columns with wrong types left ('morgh', 'valueh', 'grosrth', 'omphtotinch'). These columns considered to have dots as indicators for 'Not applicable'. For instance, columns valueh and grosrth are mutually exclusive, since the first one indicates the price of dwelling (i.e., house is owned) and the second one indicates the monthly rent.

The most appropriate way to solve this problem is to replace dots by NA values. In that case, we would be able to manipulate column like a numerical one.

Oppgave

Swipe to start coding

Perform a replacement of dot symbols . by NAs for 'morgh', 'valueh', 'grosrth', 'omphtotinch' columns. Follow the next steps:

  1. Import the NumPy library with np alias.
  2. Apply the .where() method to the df dataframe.
  3. Set the condition what values must remain unchanged. These must be non-dots values.
  4. Set the other parameter to nan value from NumPy.

Løsning

Switch to desktopBytt til skrivebordet for virkelighetspraksisFortsett der du er med et av alternativene nedenfor
Alt var klart?

Hvordan kan vi forbedre det?

Takk for tilbakemeldingene dine!

Seksjon 1. Kapittel 8
Switch to desktopBytt til skrivebordet for virkelighetspraksisFortsett der du er med et av alternativene nedenfor
Vi beklager at noe gikk galt. Hva skjedde?
some-alt