Feature Reduction with KNIME and Weka

roberto_cadili · January 15, 2025, 9:58am

Too many features slowing your #ML model? @nilotpalc shows how to use #KNIME & its #Weka integration to perform #FeatureReduction in a loan approval dataset and streamline modeling, improve performance, and reduce storage needs. Enjoy the data story!

PS: #HELPLINE . Want to discuss your article? Need help structuring your story? Make a date with the editors of Low Code for Data Science via Calendly → Calendly - Blog Writer

rfeigel · January 17, 2025, 4:08pm

I reset the workflow without making any changes and got the following error:

ActionAndi · January 17, 2025, 4:29pm

Great article! I’m wondering if it is valid to compare different feature reduction techniques with different dataset partitions.

roberto_cadili · January 17, 2025, 4:57pm

Hi @rfeigel, that’s expected since the input ports of the failing node are passing data columns that are likely to change a bit at each execution (e.g., there’s no seed set in the Partitioning node).

Additionally, the author decided to select manually the features to filter out for higher control on the process (check the configuration of the failing Feature Selection Filter node). This is just a workflow design choice.

How can you fix the problem? Either set seeds for reproducibility and then manually select the features to filter out, or simply select some other filtering condition in the failing node (e.g., “Select best score” ).

Hope it helps ,
Roberto

rfeigel · January 17, 2025, 6:12pm

Got it. Thank you very much.

system · April 17, 2025, 6:13pm

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.