Hi everyone!
I’m learning regression and trying regression datasets.
I built following simple model
Now I need to estimate the future performance of algorithm (Question 1). How do I do that in Knime.?
Also what would be performance expectation (Question 2)? How can I be sure that the estimation is close to true performance(Question 3).?
Hi, Question 1:you can use the scorer node on the data you reserved on the second port to test. Question 2: The statistics you find as output on the scorer node answers this question. Question 3: If the data you used to train/test is a random sample of the data you’ll be using your model on in general it should perform similarly. Analyze the results and try to be sure you are not overfitting
Hi. The different statistics should help you.
R² is a measure of the degree of fitness of your model to your data. It measures the percentage of the variation of y explained by your model.
The mean absolute error is the average of the absolute value of the differences between real and forecasted values.
Chapter 3 of https://www.statlearning.com/ explains the theory behind and should help you to interpret the output of the node
Sorry for asking more. I don’t have access to this book. Can you please explain in simple terms how can I answer all these by using my simple model?
Thanks!