Before you can do fancy machine-learning it does make sense to prepare your data - like deal with missing values, remove highly correlated variables and so on. You can do a lot of this by hand or you could employ a ready made tool like vtreat which has been implemented in R and Python.
You can read more in my Medium stories. Also check out the examples mentioned there on the KNIME Hub: