Hi KNIME Members,
I am struggling with my model. I have following datasets: Data.csv, temperature.csv and holiday.csv. I will upload these datasets within the zip file.Three datasets.zip (64.1 KB)
Dataset 1(“data”): Contains the electricity load each 30 minutes in years 2014 and 2015. Each row corresponds to the data of one day. Columns 1, 2 and 3 indicate the year, month and day of the measurements, respectively. The remaining columns show the electricity loads in 30 minutes blocks on that day. More specifically, T1 denotes the load of period 00:00-00:30, T2 00:30-01:00, and so on, and T48 is the load of the last 30 minutes of the day, i.e. period 23:30-00:00.
Dataset 2 (“Temperature”): Contains the average daily temperatures from 2012 to In the data file, the first three columns contain the information of year, month, and day, respectively. The last column indicates the average daily temperature in °C.
Dataset 3 (“Holidays”): Contains dates (in year-month-day format) of public holidays of years 2014, 2015 and 2016.
We used the following KNIME model. KNIME_Model.knwf (99.9 KB)
The dataset out of the column aggregator(Data 2014-2015 w/ max elec. value), also has a column with the value 0 or 1 depending on whether the date is a holiday or not. We use a polynomial regression learner and predictor. In this learner we want to include the variables holiday and temperature to predict the maximum electricity load per day. The problem here is that the we want a second degree polynomial regression learner, that includes these two variables. But we get the following error: ERROR Polynomial Regression Learner 0:56 Execute failed: Index: 2, Size: 2.
How can we overcome this error but also include both variables.
Is it best to only include the variable temperature, based on the outcome, multiply the predicted value of a holiday with a certain (known) factor. Or is it better to split the data in a holiday and non-holiday dataset and use two separate learners to arrive at the predicted values? Or do you have another suggestion on how to solve this problem?
Thank you in advance!