Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Lære What Criterion to Choose | The First A/B Test
A/B Testing in Python

bookWhat Criterion to Choose

Imagine we have received MORE clicks with the SECOND test variant of the site! So, we can set an updated version on our main page website, and that's all!

But wait a minute!

We have to perform more manipulations to prove whether there is a NON-RANDOM difference between groups.

To do that, we will:

  • Look at the plots that represent group tests that show if there are any visual differences between groups;
  • Look at the intervals of the distributions to see if they cover each other or not;
  • Perform the hypothesis check with the statistical criterion.

A statistical criterion is a mathematical rule that allows us to reject the null hypothesis or not, that is, to conclude whether there is a non-random difference between groups. A statistical criterion creates a p-value.

To decide what criterion to choose while performing an A/B test, we need to use this scheme:

A statistical significance is a measure of confidence that a result is not random. By default, a statistical significance of 5%(or 1%) is used.

So look at the table:

ConditionAction
p-value > statistical significanceWe can not decline the H0 hypothesis
p-value < statistical significanceWe can accept the H1 hypothesis

To choose the right statistical criterion, we have to understand the distribution of our data. That is what we are going to do now!

If the normaltest shows the result < 0.05, it means that we receive the normal distribution. We need to decide what criterion to choose. To do that, let's define if our distribution is normal. Use scipy.stats.normaltest(data) to perform this test.

Don't worry if the information sounds hard! We will cope with that!

Oppgave

Swipe to start coding

  1. Import the pandas with the pd alias.
  2. Import the seaborn with the sns alias.
  3. Import the scipy.
  4. Import the statsmodels.api with the sm alias.
  5. Build the distplot using the clicks column from the df.
  6. Build the qqplot using the clicks column from the df.
  7. Perfrom the normaltest with the clicks column from the df.

Løsning

Alt var klart?

Hvordan kan vi forbedre det?

Takk for tilbakemeldingene dine!

Seksjon 2. Kapittel 2
single

single

Spør AI

expand

Spør AI

ChatGPT

Spør om hva du vil, eller prøv ett av de foreslåtte spørsmålene for å starte chatten vår

Suggested prompts:

Oppsummer dette kapittelet

Explain code

Explain why doesn't solve task

close

Awesome!

Completion rate improved to 4.55

bookWhat Criterion to Choose

Sveip for å vise menyen

Imagine we have received MORE clicks with the SECOND test variant of the site! So, we can set an updated version on our main page website, and that's all!

But wait a minute!

We have to perform more manipulations to prove whether there is a NON-RANDOM difference between groups.

To do that, we will:

  • Look at the plots that represent group tests that show if there are any visual differences between groups;
  • Look at the intervals of the distributions to see if they cover each other or not;
  • Perform the hypothesis check with the statistical criterion.

A statistical criterion is a mathematical rule that allows us to reject the null hypothesis or not, that is, to conclude whether there is a non-random difference between groups. A statistical criterion creates a p-value.

To decide what criterion to choose while performing an A/B test, we need to use this scheme:

A statistical significance is a measure of confidence that a result is not random. By default, a statistical significance of 5%(or 1%) is used.

So look at the table:

ConditionAction
p-value > statistical significanceWe can not decline the H0 hypothesis
p-value < statistical significanceWe can accept the H1 hypothesis

To choose the right statistical criterion, we have to understand the distribution of our data. That is what we are going to do now!

If the normaltest shows the result < 0.05, it means that we receive the normal distribution. We need to decide what criterion to choose. To do that, let's define if our distribution is normal. Use scipy.stats.normaltest(data) to perform this test.

Don't worry if the information sounds hard! We will cope with that!

Oppgave

Swipe to start coding

  1. Import the pandas with the pd alias.
  2. Import the seaborn with the sns alias.
  3. Import the scipy.
  4. Import the statsmodels.api with the sm alias.
  5. Build the distplot using the clicks column from the df.
  6. Build the qqplot using the clicks column from the df.
  7. Perfrom the normaltest with the clicks column from the df.

Løsning

Switch to desktopBytt til skrivebordet for virkelighetspraksisFortsett der du er med et av alternativene nedenfor
Alt var klart?

Hvordan kan vi forbedre det?

Takk for tilbakemeldingene dine!

Seksjon 2. Kapittel 2
single

single

some-alt