Ollama / OpenAI - local embeddings not working

I am trying to create a workflow that uses local LLMs like Llama 3.2 with the help of Ollama and the KNIME AI nodes, as described in the article by @roberto_cadili, “How to leverage open source LLMs locally via Ollama”.

I have tested several local URLs and models, but I always receive the same error message: ‘invalid input’. I wonder if there is a problem with the current implementation (Ollama version 0.3.14).

Maybe someone else can also try it and give feedback.

What does work is creating such vector stores with GPT4All and the KNIME nodes (Chroma and FAISS) and then using them with Ollama.


Hi @mlauber71, thanks for checking out the blog post 🙂.

What you’re trying to do is unfortunately not possible using the OpenAI Authenticator node.

In Step 2 of the blog post, I also explained why:

… Next, we drag and drop the OpenAI Chat Model Connector node, which we can use to connect to Ollama’s chat, instruct and code models. We need this specific connector node (and not the OpenAI LLM Connector) because Ollama has built-in compatibility only with the OpenAI Chat Completions API, which is what the OpenAI Chat Model Connector node uses under the hood.
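For context, here is a minimal sketch of the kind of Chat Completions call Ollama is compatible with. The endpoint, dummy API key and model name are assumptions based on a default local Ollama setup, not something taken from the blog post:

```python
from openai import OpenAI

# Ollama serves an OpenAI-compatible Chat Completions API on port 11434;
# the API key is required by the client but ignored by Ollama.
client = OpenAI(base_url="http://localhost:11434/v1", api_key="ollama")

reply = client.chat.completions.create(
    model="llama3.2",  # any chat model you have pulled locally
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(reply.choices[0].message.content)
```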

If we wish to leverage Ollama’s vision and embeddings models, we can do so using the nodes of the KNIME REST Client Extension.

To connect to embedding models available via Ollama, you can use the POST Request node (note: make sure the model you want to use actually supports creating embeddings).
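Under the hood, the POST Request node simply sends a JSON body to Ollama’s native embeddings endpoint. A minimal sketch of the equivalent call, assuming a default local Ollama and a pulled embedding model such as nomic-embed-text (adjust the model name to whatever you use):

```python
import requests

# Ollama's native embeddings endpoint (this is what the POST Request
# node targets); the model name is an example, use one you have pulled.
response = requests.post(
    "http://localhost:11434/api/embeddings",
    json={"model": "nomic-embed-text", "prompt": "Hello, KNIME!"},
)
response.raise_for_status()
embedding = response.json()["embedding"]  # a list of floats
print(len(embedding))  # dimensionality depends on the model
```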

To show how this would work, a couple of months ago I built a workflow to connect to embedding models and vision models from Ollama. Find it below:

Hope this helps 🙂.

Happy KNIMEing,
Roberto


…the good news is that the future is bright 🙂. So at some point it’s going to be possible.


Sourced from the Ollama blog post about OpenAI compatibility: OpenAI compatibility · Ollama Blog.
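Once Ollama exposes embeddings through its OpenAI-compatible API, the call could look like this sketch (hypothetical at the time of this thread; the endpoint and model name are assumptions):

```python
from openai import OpenAI

# Hypothetical until Ollama supports OpenAI-compatible embeddings:
# the same client as for chat, pointed at the local Ollama server.
client = OpenAI(base_url="http://localhost:11434/v1", api_key="ollama")

resp = client.embeddings.create(model="nomic-embed-text", input="Hello, KNIME!")
print(len(resp.data[0].embedding))
```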

Best,
Roberto


I made this, but I am stuck on what to do with the vectors.


Hi @tescnovonesis, nice to see that with a POST Request you were able to obtain the embeddings.

Are you asking what you can do with embeddings? There are many uses: the most common one in the context of GenAI is for populating vector stores, and subsequently retrieving documents in RAG systems.

However, that’s not the only option. In a traditional data science setting, you could use those vectors to feed a traditional ML model and obtain predictions, perform topic modeling or even use embeddings to plot semantic overlaps in your text documents - just to mention a few applications.
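As a toy illustration of that last point, semantic overlap between two documents boils down to simple vector math on their embeddings (the numbers below are stand-ins, not real embeddings):

```python
import numpy as np

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

doc_a = [0.1, 0.3, 0.5]  # stand-in for a real embedding vector
doc_b = [0.2, 0.1, 0.4]
print(cosine_similarity(doc_a, doc_b))  # closer to 1 = more similar
```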

Hope it helps :slight_smile: ,
Roberto


Hi Roberto,

Yes, is there a way to stuff those vectors into the Chroma/FAISS Vector Store Creator in KNIME?


@tescnovonesis I have modified my original workflow so that it now uses GPT4All to create vector stores and persists them with Chroma and FAISS.

If you want a more Pythonesque version that can also work on KNIME 4, you can take a look at my article, the examples mentioned there, and my LLM collection on the Hub.

Also, make sure to check out KNIME’s collection on the use of GenAI.
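For a rough idea of the Pythonesque route, here is a minimal sketch of loading precomputed vectors (e.g. collected via the POST Request node) into a FAISS index; the random data is a stand-in for real embeddings:

```python
import numpy as np
import faiss  # pip install faiss-cpu

# Stand-in for embeddings obtained from Ollama's /api/embeddings endpoint
rng = np.random.default_rng(42)
vectors = rng.random((100, 768), dtype=np.float32)  # 100 docs, 768 dims

index = faiss.IndexFlatL2(vectors.shape[1])  # exact L2 search, no training
index.add(vectors)

# Retrieve the 5 nearest documents for the first vector
distances, ids = index.search(vectors[:1], k=5)
print(ids)
```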


Hi @roberto_cadili

Is there a public code repository for the KNIME AI Extension – KNIME Community Hub, in case I get the crazy idea to develop nodes myself?

If you have the extension installed, you can go to your KNIME installation folder.
In the plugins folder there should be a directory named org.knime.python.llm_5.3.2.v202409031801 (the version number may differ…).

You can find the Python code behind that extension in this subfolder (inside the folder above):

\src\main\python\src

Found it!

How relieving 🙂

Would you know if it’s possible to make a pull request to the repository somewhere?

Hi @tescnovonesis -

From a different thread:

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.