Hi, I’m using KNIME in the Hadoop world.
After connecting to HDFS I need to create a Spark context via Livy.
The node uses the libraries that are installed on the livy server, specifically the Jetty library.
How do I load this library from the client side? Can I load it via these settings here?
You can add custom jars using settings like the one in your screenshot. Please note that the Spark jobs run on your cluster. This means the jars must be stored on your servers or, e.g., in HDFS.
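As a sketch of what such an entry could look like: the key/value table in the Spark context node can carry standard Spark properties, for example `spark.jars` pointing at a jar you have uploaded to HDFS. The path below is hypothetical; adjust it to wherever you actually store the jar.

```properties
# Hypothetical custom setting in the Create Spark Context (Livy) node:
# key              value
spark.jars         hdfs:///user/myuser/libs/jetty-io-9.4.48.v20220622.jar
```

The value must be a location the cluster nodes can read (HDFS or a path present on every worker), not a path on your local client machine.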
Can you give me an example screenshot showing which parameter to use and how to fill in the key and value columns? In particular, how would I set the value column so that it points to the Jetty library at a specific path?
Hi Sasha,
No, I connected without problems with the Livy node, but there was a version change on the Hadoop side. The jetty-io-9.4.39.v20210325.jar library shipped in the Cloudera package gives me problems, so I would like Livy to use only the updated library jetty-io-9.4.48.v20220622.jar.
I saw that in the Livy custom settings I can specify how to override a system library with a custom one.
My question is: how do I fill in those settings?
Not sure if this is possible. You can provide additional jars to your Spark job, but not to the Livy service itself. Are you having trouble with Livy accessing HDFS, or with Spark accessing HDFS?
Cloudera provides Livy together with the Spark CDS; maybe you can update it to the latest version, and then you don’t have to use special Jetty versions?
Not sure if this is possible; Jetty is an essential lib already used inside Livy, and you might not be able to replace it this way. You can try the spark.jars.packages option in the Livy configuration dialog. You might also be able to set livy.spark.jars.packages on a global level via the Cloudera Manager and the Livy configuration.
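For illustration, spark.jars.packages takes Maven coordinates (groupId:artifactId:version), which Spark then resolves and downloads. A sketch assuming the jetty-io coordinate below is the one you need:

```properties
# Per-context: entry in the Livy/Spark context configuration dialog
spark.jars.packages=org.eclipse.jetty:jetty-io:9.4.48.v20220622

# Or globally, via Cloudera Manager in the Livy configuration:
livy.spark.jars.packages=org.eclipse.jetty:jetty-io:9.4.48.v20220622
```

Note that this adds the jar to the Spark job's classpath; whether it actually takes precedence over the Jetty version bundled with the Livy service itself is not guaranteed.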