I have recently started learning about KNIME functionalities and I need help finding a way of doing the automatic data preparation and automatic variable selection in KNIME.
For the same purpose, I have used SPSS before with the Automatic Linear Modeling feature includes (Data preparation feature - on and best model selections method, Automatic variable selection).
How this can be done in KNIME?
Concerning automatic model building I would like to point you to two approaches. One I built a wrapper for H2O.ai AutoML module - and one version with the added benefit of using R’s vtreat to prepare the data.
KNIME Workflows and Verified Components offer automated machine learning only for classification task at the moment. That is:
as well as the most recent and simplified one:
The AutoML component is made to work with other components to cover the many use cases:
Regarding AutoML for Regression Models:
We are still working on it. For now we have an example you can look at to take inspiration that we use for education purposes in our KNIME Pros Learnathon. This is the solution of the exercise we give there to solve AutoML for regression. This is not a Verified Components and it is not recommended to work in a real use case: it does not have parameter optimization for example.
Hopefully we will supply a more refined versions this year.