Autoregressive Integrated Moving Average (ARIMA) Model

Section 4. Chapter 4

The last model we will look at is the autoregressive integrated moving average (ARIMA) model. It combines autoregression, a moving average, and differencing (this is not the same as the model we discussed above). It is presented as follows:
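A common textbook form of the ARIMA(p, d, q) model is sketched below. The notation is an assumption made here for illustration: $B$ is the backshift operator ($B y_t = y_{t-1}$), $\phi_i$ and $\theta_j$ are the autoregressive and moving-average coefficients, $c$ is a constant, and $\varepsilon_t$ is a white-noise error term.

```latex
\left(1 - \sum_{i=1}^{p} \phi_i B^i\right) (1 - B)^d \, y_t
  = c + \left(1 + \sum_{j=1}^{q} \theta_j B^j\right) \varepsilon_t
```

The factor $(1 - B)^d$ is the differencing step, the sum over $\phi_i$ is the autoregressive part, and the sum over $\theta_j$ is the moving-average part.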

It may look complicated, but the parts are familiar: AR is the autoregressive model you already know, I (integrated) refers to differencing, and MA is a moving average, which models the current value as a linear combination of past forecast errors.
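To make the AR and MA parts concrete, here is a minimal sketch of a one-step prediction that combines one autoregressive term and one moving-average term. The function name arma_one_step and the coefficient values are hypothetical, chosen only for illustration; a fitted model would estimate them from data.

```python
def arma_one_step(y_prev, e_prev, c=0.0, phi=0.5, theta=0.4):
    """One-step prediction: c + phi * y_prev + theta * e_prev.

    phi and theta are made-up coefficients, not fitted values.
    """
    ar_part = phi * y_prev    # autoregressive term: weighted previous value
    ma_part = theta * e_prev  # moving-average term: weighted previous forecast error
    return c + ar_part + ma_part


# Previous value was 10.0 and the previous forecast missed by 2.0
prediction = arma_one_step(y_prev=10.0, e_prev=2.0)
print(prediction)
```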

ARIMA takes three parameters that we must choose ourselves (p, d, q):

p - the order of autoregression. It is the number of immediately preceding values in the series that are used to predict the value at the present time;

d - the order of differencing, that is, the number of times the series is differenced;

q - the order of the moving average. It sets how many previously observed error values enter the model as a linear combination.
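As a rough illustration of what the d parameter means, the sketch below applies differencing by hand in plain Python. The helper name difference and the sample series are assumptions for this example only; ARIMA performs this step internally.

```python
def difference(series, d=1):
    """Apply first-order differencing d times (each pass shortens the series by one)."""
    values = list(series)
    for _ in range(d):
        values = [curr - prev for prev, curr in zip(values, values[1:])]
    return values


trend = [1, 4, 9, 16, 25]        # a series with a quadratic trend
once = difference(trend, d=1)    # the trend is still visible
twice = difference(trend, d=2)   # the trend is removed, values are constant
print(once, twice)
```

In this toy series, one round of differencing leaves a linear trend and a second round leaves a constant, which is the kind of stabilizing effect d is meant to achieve.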

Note

We will cover differencing techniques in more detail in the next chapters.

What is the advantage of this model? It can forecast simple non-stationary processes (more precisely, processes whose mean or covariance changes over time) and tends to work best for short-term predictions.

Let's create an ARIMA model with statsmodels, using the ARIMA class:

from math import sqrt

import matplotlib.pyplot as plt
from sklearn.metrics import mean_squared_error
from statsmodels.tsa.arima.model import ARIMA

# df is assumed to be loaded earlier and to hold the series values
X = df.values
size = int(len(X) * 0.6)
train, test = X[0:size], X[size:]
history = train.tolist()
predictions = list()

# Make forecasts with the ARIMA model: refit on all known values,
# then predict one step ahead
for t in range(len(test)):
    model = ARIMA(history, order=(5, 1, 0))
    model_fit = model.fit()
    output = model_fit.forecast()
    predictions.append(output[0])
    history.append(test[t])

# Calculate RMSE
rmse = sqrt(mean_squared_error(test, predictions))
print("Test RMSE: %.3f" % rmse)

# Plot results
plt.plot(test)
plt.plot(predictions, color="red")
plt.show()

The results:

In practice, you will mostly use two methods: .forecast() and .predict(). The .forecast() method produces predictions beyond the end of the data the model was fitted on; in the example above it is called in a loop so that each one-step forecast uses all values observed so far. The .predict() method is typically used for predictions inside the dataset.
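The loop pattern itself does not depend on ARIMA. The sketch below shows the same walk-forward structure in plain Python, with a hypothetical naive_forecast function (it simply repeats the last known value) standing in for fitting a model and calling .forecast(). The sample series is made up for illustration.

```python
from math import sqrt


def naive_forecast(history):
    """Hypothetical stand-in for model.fit().forecast(): repeat the last value."""
    return history[-1]


series = [10.0, 12.0, 11.0, 13.0, 14.0, 13.0, 15.0, 16.0, 15.0, 17.0]
size = int(len(series) * 0.6)
train, test = series[:size], series[size:]

history = list(train)
predictions = []
for actual in test:
    predictions.append(naive_forecast(history))  # predict one step beyond history
    history.append(actual)                       # then reveal the true value

# RMSE computed by hand: square root of the mean squared error
mse = sum((a - p) ** 2 for a, p in zip(test, predictions)) / len(test)
rmse = sqrt(mse)
print("Test RMSE: %.3f" % rmse)
```

Each prediction only ever sees values that precede it, which is why the true observation is appended to history after the forecast is made, not before.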

Further, you can experiment with the p, d, and q parameters to improve the results. Even with the parameters used above, you can see that the model tracks the main trends well.
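One simple way to organize such experiments is to enumerate candidate (p, d, q) combinations up front. The ranges below are hypothetical and kept small on purpose, since every combination means refitting the model.

```python
from itertools import product

# Hypothetical search ranges; adjust them for your own data
p_values = range(0, 3)  # autoregressive order
d_values = range(0, 2)  # differencing order
q_values = range(0, 3)  # moving-average order

candidate_orders = list(product(p_values, d_values, q_values))
print(len(candidate_orders), candidate_orders[:3])
```

Each tuple could then be passed as the order argument of ARIMA and the resulting test RMSE values compared.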

The code may take up to one minute to run.

Task

Create an ARIMA model and train it on the pr_air_quality.csv dataset.

Within the for loop, create an ARIMA model from the history data and assign it to the model variable. Then fit the model and save the result as model_fit. Finally, make a forecast using the fitted model_fit.
Calculate the RMSE: take the square root (sqrt) of the mean squared error computed from test and predictions.
Visualize the results: pass the test values to the first .plot() call and the predictions values to the second.

Please note that the code may take a long time to complete.
