Hi,
There appears to be a bug in the RProp MLP Predictor node related to the order of the input columns used for prediction. The predictor seems to require the columns in the same order as in the data used to train the RProp MLP Learner.
Here is my setup:
- Workflow A reads in a training/test dataset and saves the MLP Learner model via the PMML Writer node
- Workflow B reads in a prediction dataset, loads the model via the PMML Reader node, and outputs a prediction
I have noticed the following:
- Prediction on the training/test dataset is good.
- Prediction on the prediction dataset was way off. After several hours of debugging, I realized that the order of the input columns (all Double values) differed from the training data. The prediction dataset also contained a few additional, unused columns.
- I added a "Column Resorter" node to restore the training column order before feeding the prediction dataset into the MLP Predictor. Problem solved.
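To illustrate the suspected behavior outside KNIME, here is a minimal Python sketch. It is not KNIME code and does not reflect the node's actual implementation: the toy model, the column names, and the weights are all made up for illustration. It only shows how applying weights by column position gives a different result once the column order changes, and how realigning the columns by name (roughly what the Column Resorter workaround does) restores the expected result.

```python
# Hypothetical toy model: weights are applied by column position, as a
# position-based predictor would do. All names and values are illustrative.
TRAINING_COLUMNS = ["height", "weight", "age"]
WEIGHTS = [1.0, 10.0, 100.0]


def predict_by_position(values):
    """Multiply each value by the weight at the same position and sum."""
    return sum(w * v for w, v in zip(WEIGHTS, values))


def align_to_training(row, training_columns=TRAINING_COLUMNS):
    """Reorder a row (dict of column name -> value) into the training order.

    Extra columns are dropped; a missing required column raises KeyError.
    """
    missing = [c for c in training_columns if c not in row]
    if missing:
        raise KeyError(f"missing columns: {missing}")
    return [row[c] for c in training_columns]


# Prediction row with a different column order and one unused column ("id").
row = {"age": 3.0, "id": 42.0, "height": 1.0, "weight": 2.0}

wrong = predict_by_position(list(row.values()))       # positional, misaligned
right = predict_by_position(align_to_training(row))   # realigned by name

print("by position:", wrong)
print("by name:    ", right)
```

The two printed values differ, which matches the symptom described above: the same data yields a different prediction purely because of column order and extra columns.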
This appears to be a major flaw. Please let me know whether you are able to reproduce this issue; if not, I am happy to sanitize and share my dataset and workflows.
Regards,
Jawahar