In time series analysis, **stationarity** is a property of a stochastic process whose statistical properties do not change over time: the mean, variance, and autocorrelation structure of the process remain constant.

Stationary data example:
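
A minimal sketch of what such data can look like, assuming Gaussian white noise as the stationary process (the seed, sample size, and variable names are illustrative choices, not taken from a real dataset). It compares the mean and variance over the two halves of the series:

```python
import numpy as np

rng = np.random.default_rng(seed=0)  # fixed seed so the run is repeatable

# Gaussian white noise: independent draws with mean 0 and variance 1 (assumed toy process)
series = rng.normal(loc=0.0, scale=1.0, size=1000)

# Compare summary statistics over the two halves of the series
first_half, second_half = series[:500], series[500:]
print(f"mean:     {first_half.mean():.3f} vs {second_half.mean():.3f}")
print(f"variance: {first_half.var():.3f} vs {second_half.var():.3f}")
```

For a stationary series, both halves should give similar values, up to sampling noise.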


**Why is this property important to us?**

Stationarity is a criterion for selecting the models you can use to predict the data. If the data is stationary, you can choose from a wide range of models (Autoregression, Moving Average, Linear Regression, etc.). On the other hand, there is no universal model for predicting non-stationary processes: some non-stationary time series can be converted to stationary ones through mathematical transformations such as differencing (the approach the ARIMA model builds on), while others can be forecast using special types of neural networks.
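
As an illustration of such a transformation, here is a minimal sketch of first-order differencing. It assumes a synthetic random walk as the non-stationary series; the seed, length, and variable names are hypothetical choices made for this example only:

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(seed=42)
noise = rng.normal(size=500)

# Random walk: each value is the previous value plus a random shock (non-stationary)
random_walk = pd.Series(np.cumsum(noise))

# First-order differencing: y[t] - y[t-1]; the first value has no predecessor and is dropped
differenced = random_walk.diff().dropna()

# Compare the mean over the first and second half of each series
for name, s in [("original", random_walk), ("differenced", differenced)]:
    half = len(s) // 2
    print(f"{name}: mean {s.iloc[:half].mean():.2f} -> {s.iloc[half:].mean():.2f}")
```

Differencing a random walk recovers the random shocks it was built from, which is why the differenced series behaves like stationary noise.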


Let's move on to checking the data for stationarity. This can be done with statistical tests, most of which are known as unit root tests. Commonly used tests include:

- **Augmented Dickey-Fuller (ADF) test**;
- **Kwiatkowski-Phillips-Schmidt-Shin (KPSS) test**;
- **Phillips-Perron test**.

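Most often, we will use the Augmented Dickey-Fuller (ADF) test, but you can also take the results of the other tests into account. Note that the KPSS test reverses the hypotheses: its null hypothesis is that the series is stationary.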

Using the `statsmodels` library, we can determine whether the data is stationary in a few lines of code:

```python
from statsmodels.tsa.stattools import adfuller
import pandas as pd

dataset = pd.read_csv("time_series.csv", parse_dates=["date"])

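# ADF test; missing values are dropped first because adfuller cannot handle NaNs
result = adfuller(dataset["Price"].dropna(), autolag="AIC")
print(f"ADF Statistic: {result[0]:f}")  # test statistic
print(f"p-value: {result[1]:f}")  # p-value used for the decision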
```

How can we interpret the results obtained?

The null hypothesis of the ADF test is that the time series has a unit root, that is, it is non-stationary. If the p-value is below a chosen significance level (e.g., 0.05), we reject the null hypothesis and conclude that the time series is stationary.
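
This decision rule can be written down as a small helper. The function name `interpret_adf` and its default threshold are hypothetical, introduced here only for illustration:

```python
def interpret_adf(p_value: float, alpha: float = 0.05) -> str:
    """Turn an ADF p-value into a plain-language verdict (hypothetical helper)."""
    if p_value < alpha:
        return "reject the null hypothesis: the series looks stationary"
    return "cannot reject the null hypothesis: treat the series as non-stationary"


print(interpret_adf(0.01))
print(interpret_adf(1.0))
```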

The p-value reported for our data is 1.0, which is far above 0.05. This means that we cannot reject the null hypothesis, so we treat our data as non-stationary. We would reject the null hypothesis only if the p-value were less than 0.05.

