Time series stationarity and models for heteroskedasticity

Franziska_W · December 6, 2021, 8:49am

Hi guys,

I am new to the community and have been using KNIME for time series analysis for about 2 weeks. I have a few questions:

Is there a way to check whether a time series itself is stationary, other than “Analyze ARIMA Residuals”? I have the following time series after removing the seasonality, and I am not 100% sure if it is stationary:

The Analyze ARIMA Residuals node says it is stationary for the first 10 lags, but the higher the lag, the higher the autocorrelation.

And another question: Does KNIME have models for heteroskedasticity (like ARCH and GARCH)? I know this question already came up in July last year, but maybe something has changed in the meantime.

Thanks in advance and if there are any unclear points, feel free to reach out.

Best regards,
Franziska

Maarit · December 9, 2021, 2:50pm

Hi Franziska,

Thank you for the example! We don’t have another component for checking stationarity. As for your question whether the time series is stationary or not, I’ll check that!

And ARCH and GARCH models are not yet available in KNIME.

Best,
Maarit

Maarit · December 9, 2021, 4:20pm

Could you share

the granularity of the data and
the number of data points used for the model?

Franziska_W · December 10, 2021, 10:07am

Hi Maarit,

thank you for your answer and efforts.

I used the following dataset (only store 1):
Walmart Dataset (Retail) | Kaggle

It contains weekly sales; after removing the seasonality (with lag 52), 91 rows remain.

Maarit · December 15, 2021, 11:29am

Hi Franziska,

below is the answer from Prof. Daniele Tonini, who teaches the time series course at KNIME:

With seasonal series, you typically check the ACF up to the lag corresponding to twice the seasonal period. So, for instance, with monthly data you check up to 24 lags. With weekly data you should check up to 104/106 lags, but with only 91 observations the autocorrelation function is very unstable at high lags, simply because few observations are available to compute the covariance between Yt and Yt-k… so it’s quite normal to see high variability of the ACF there. If it’s not possible to collect more data points, I would consider running the Ljung-Box test only up to 52 lags.
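To make the point about high lags concrete, here is a minimal plain-Python sketch of the sample ACF and the Ljung-Box Q statistic up to lag 52. The series `y` is synthetic white noise standing in for the 91 differenced values (an assumption, not the actual Walmart data), and the helper names `acf` and `ljung_box_q` are made up for illustration. In practice a library routine such as `acorr_ljungbox` in statsmodels would also give the p-values.

```python
import math
import random

def acf(series, max_lag):
    """Sample autocorrelations r_1..r_max_lag (biased estimator, divides by the lag-0 sum)."""
    n = len(series)
    mean = sum(series) / n
    dev = [x - mean for x in series]
    c0 = sum(d * d for d in dev)
    return [sum(dev[t] * dev[t - k] for t in range(k, n)) / c0
            for k in range(1, max_lag + 1)]

def ljung_box_q(series, max_lag):
    """Ljung-Box statistic Q = n(n+2) * sum_{k=1..h} r_k^2 / (n - k)."""
    n = len(series)
    r = acf(series, max_lag)
    return n * (n + 2) * sum(rk * rk / (n - k) for k, rk in enumerate(r, start=1))

# Synthetic stand-in for the 91 seasonally differenced weekly values (assumption).
random.seed(0)
y = [random.gauss(0.0, 1.0) for _ in range(91)]

r = acf(y, 52)
band = 1.96 / math.sqrt(len(y))        # approximate 95% band under white noise
outside = sum(1 for rk in r if abs(rk) > band)
q52 = ljung_box_q(y, 52)

# At lag k only n - k pairs (Yt, Yt-k) enter the sum, so high lags rest on few points.
pairs_at_lag_52 = len(y) - 52

print(f"lags outside band: {outside} of 52")
print(f"Q(52) = {q52:.1f}, pairs available at lag 52: {pairs_at_lag_52}")
```

Q is compared against a chi-square distribution with 52 degrees of freedom (fewer if model parameters were estimated). With 91 observations, lag 52 is estimated from only 39 pairs, which is why the ACF looks erratic at the far end.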

Thank you for the question, I also learned something! I hope this helps you!

Best,
Maarit

Daniel_Weikert · December 15, 2021, 5:17pm

Would differencing be an option? Then just apply it, e.g. by using a Lag Column node, and you would be sure. (Please note I am not a data scientist, so please correct me if I am wrong. Just trying to help here.)
br

Maarit · December 16, 2021, 3:06pm

Yes, differencing should handle non-stationarity in data. However, if I read the answer correctly, @Franziska_W had already differenced the data at lag 52 to remove yearly seasonality.
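For reference, seasonal differencing at lag 52 can be sketched in plain Python as below. The series is synthetic (a yearly sine pattern plus a linear trend), and its length of 143 weeks is an assumption chosen so that 91 values remain, matching the row count mentioned earlier in the thread. The helper name `seasonal_difference` is hypothetical.

```python
import math

def seasonal_difference(series, period):
    """Return y[t] - y[t - period] for t = period .. len(series) - 1."""
    return [series[t] - series[t - period] for t in range(period, len(series))]

# Synthetic weekly series: level + yearly seasonality + linear trend (illustrative only).
weekly = [100.0 + 10.0 * math.sin(2 * math.pi * t / 52) + 0.5 * t
          for t in range(143)]

diffed = seasonal_difference(weekly, 52)

# The first 52 values are lost, and the seasonal term cancels out.
print(len(diffed))
print(min(diffed), max(diffed))
```

The sine component disappears and the linear trend turns into a constant (0.5 per week times 52 weeks). A real series would still carry noise and possibly leftover autocorrelation, which is what the ACF and Ljung-Box checks are meant to reveal.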

Franziska_W · December 20, 2021, 8:26am

Hi Maarit,

thank you so much for your help!

Best,
Franziska

Franziska_W · December 20, 2021, 8:28am

Yes, I already differenced the data. But anyway thank you for your help.

system · June 20, 2022, 8:28pm

This topic was automatically closed 182 days after the last reply. New replies are no longer allowed.