Happy Wednesday, everybody! Today’s Just KNIME It! challenge focuses on AI Governance with Giskard.
You want to build a workflow that evaluates the output of LLMs and detects their potential vulnerabilities. The goal is a workflow that supports decision-making when picking an LLM for a new task. As an initial test, you want to check how different LLMs tackle the following task: “given a prompt with product descriptions, the LLM should create emails to customers describing those products.”
Which insights can you gather from this LLM evaluation?
Here is the challenge. Let’s use this thread to post our solutions to it, which should be uploaded to your public KNIME Hub spaces with the tag JKISeason4-24.
Need help with tags? To add the tag JKISeason4-24 to your workflow, go to the description panel in KNIME Analytics Platform and click the pencil to edit it; the option for adding tags is right there.
Let us know if you have any problems!