Mix and match Spark nodes with other KNIME nodes

Hub · September 5, 2020, 7:31pm

This workflow mixes standard KNIME nodes with the Spark nodes to find the optimal parameters for a k-means clustering using the hillclimbing approach. Other optimization strategies are available - check the Parameter Optimization Loop Start Node description for more. The workflow makes use of the Create Local Big Data Environment node to create a Spark context. You can swap this node out for a Create Spark Context (Livy) node to connect to a remote cluster.

This is a companion discussion topic for the original entry at https://kni.me/w/vYGaMaKYddsXUx9G