I am using the KNIME extension for Apache Spark to process large amounts of data. Unfortunately there are currently no text-mining nodes. That’s why I have to use the Spark DataFrame Java Snippet. The node produce a NoClassDefFoundError while stemming. I use these libraries: spark-stemming-0.2.0.jar and spark-mllib_2.11-2.3.1.jar. When I try to execute the Spark DataFrame Java Snippet Node it crash. I got a NoClassDefFoundError. In Intellij IDEA the code is running.
Can anybody help?
spark_stemming.knwf (14.0 KB)
PS: You have to download these two libraries and copy to folder “spark_stemming/libs”. These libraries are too big to upload.