Course Content

Learning Statistics with Python

1. Basic Concepts

Sample vs Population Types of Statistics Types of Data Mean Value Median Value Median Value of the Even Number of Values Mean or Median Mode Value Descriptive Statistics Quiz

2. Mean, Median and Mode with Python

Examine the Dataset Calculating Mean and Median Values with Python Statistics with pandas Calculate the Mean and Median Salary

3. Variance and Standard Deviation

Population Variance Sample Variance Calculate Variance with Python Standard Deviation Standard Deviation with Python Calculating Variance and Standard Deviation

4. Covariance vs Correlation

Covariance Correlation Covariance and Correlation Quiz Calculate Covariance and Correlation

5. Confidence Interval

Explore the Data Set Confidence Interval Calculating Confidence Interval with Python Confidence Interval Width Quiz Calculate 95% Confidence Interval Advanced Confidence Interval Calculation with Python Match the Functions

6. Statistical Testing

What is t-test Hypotheses t-test Mathematically One-Tailed And Two-Tailed Test t-test Assumptions Performing a t-test in Python Conduct a t-test Paired t-test

Standard Deviation

One of the most important measurements is standard deviation. This value is similar to variance because standard deviation is the square root of variance. Therefore, the formulas will differ for the population and sample.

Definition

Standard deviation is a measure of how data is spread out in relation to the mean.

Empirical Rule

The Empirical Rule, also known as the 68–95–99.7 rule, applies when the population follows a Normal Distribution. According to this rule:

About 68% of the data falls within one standard deviation (σ) of the mean;
About 95% falls within two standard deviations (2σ);
About 99.7% falls within three standard deviations (3σ).

When dealing with samples, the percentages might not be precisely accurate, but you can expect them to be quite close to the values in the rule, especially with larger sample sizes.

Example

To illustrate this, let's examine a sample of kitten weights measured in grams:

In this scenario, the following data is being used:

Mean value is 100 grams;
Standard deviation (represented by the σ symbol in the picture) is 20 grams.

As mentioned earlier, one standard deviation above and below the mean encompasses 68% of the values. In this instance, those values range:

\textbf{from:}\ \text{mean} - \text{standard deviation} = 100 - 20 = 80;\\ \textbf{to:}\ \text{mean} + \text{standard deviation} = 100 + 20 = 120.

Everything was clear?

Thanks for your feedback!

Section 3. Chapter 4

Ask AI

Ask anything or try one of the suggested questions to begin our chat