Solutions to “Just KNIME It!” Challenge 24 - Season 4

:sun_with_face: Happy Wednesday, everybody! Today’s Just KNIME It! challenge focuses on AI Governance with Giskard. :brain:

:mag_right: You want to build a workflow that evaluates the output of LLMs, including the detection of their potential vulnerabilities. The goal is to create a workflow that facilitates decision making when picking an LLM for a new task. As an initial test, you want to check how different LLMs tackle the following task: “given a prompt with product descriptions, the LLM should create emails to customers detailing such products.” :dart: Which insights can you gather from this LLM evaluation?

Here is the challenge. Let’s use this thread to post our solutions to it, which should be uploaded to your public KNIME Hub spaces with the tag JKISeason4-24.

:sos: Need help with tags? To add tag JKISeason4-24 to your workflow, go to the description panel in KNIME Analytics Platform, click the pencil to edit it, and you will see the option for adding tags right there. :blush: Let us know if you have any problems!

Here is my submission: I used OpenAI GPT-3.5 with a low temperature to keep the output factual, and DeepSeek as the second LLM. Giskard threw some strange output that I really couldn't interpret. Other than that, the workflow still needs improvement. My flow: JKISeason4-24 – KNIME Community Hub
