Tess4j - How to configure another language?

Hi friends,

I’m getting an error when trying to use another language.

Path

Variable connection

Error

I’m having a problem with the Tess4J node, trying to configure another language. I haven’t found any manual, so I’m guessing that the way forward is the one below.

  1. I went to GitHub and downloaded a ‘por.traineddata’ file.
  2. I’m connecting it via a variable, and I’m getting an ERROR.

When I remove the variable, it works normally, but I need it to be in another language. Does anyone recognize this error?

I put the ‘por.traineddata’ file in the workflow’s data folder.

Reading Image Based PDFs with Tika Parser.knwf (822.3 KB)

I installed the GitHub language file directly into the KNIME plugins folder, and now it’s available as an internal choice in the Tess4J node.

I followed the same procedure you explained, copying the file to the plugin folder, but the error persists. I tried with two “por” files (downloaded from GitHub), but neither worked.

Perhaps the files aren’t compatible with the KNIME models. I couldn’t find any others online.
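
Before concluding the files are incompatible, it may be worth ruling out a bad download: saving a GitHub page from the browser, rather than the raw file, can leave you with an HTML document named ‘por.traineddata’. Below is a minimal Python sketch of that check. The function names and the file path are hypothetical, and passing the check only excludes this one failure mode; it does not prove the file works with the Tess4J node.

```python
from pathlib import Path


def looks_like_html(path):
    """Return True if the file starts like an HTML page rather than binary data."""
    with open(path, "rb") as f:
        head = f.read(512).lstrip().lower()
    return head.startswith(b"<!doctype html") or head.startswith(b"<html")


def check_traineddata(path):
    """Return a short diagnosis string for a downloaded .traineddata file."""
    p = Path(path)
    if not p.is_file():
        return "missing"
    if p.stat().st_size == 0:
        return "empty"
    if looks_like_html(p):
        return "html page, not a model"
    return "ok so far"


if __name__ == "__main__":
    # Hypothetical location; point this at your own download.
    print(check_traineddata("por.traineddata"))
```

If the result is "html page, not a model", downloading the file again through GitHub’s raw/download link would be the first thing to try.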

STEPs


Reading Image Based PDFs with Tika Parser_v2.knwf (1.0 MB)

image-based-pdf-sample.pdf (224.0 KB)


I can’t test your workflow because you stored the PDF locally, so I don’t have access to it. Create a “data” folder in your workflow and store it there. Also, the latest Tesseract version is 5.5.1. I don’t know about its compatibility with the current KNIME Tess4J node. @ScottF could you have someone weigh in?

Hi @Felipereis50.

I attached a workflow for Portuguese.

Tess4J Portugal.knwf (110.2 KB)

You must have your por.traineddata file in this folder:

…\knime\plugins\org.knime.knip.tess4j.base_1.3.3.v202307241154\tessdata or similar
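
Since the exact plugin folder name varies with the installed version, a quick way to confirm where the file ended up is to list every Tess4J tessdata folder and the language codes it contains. This is only a sketch: the `installed_languages` helper and the `C:\knime\plugins` path are hypothetical, and the folder pattern is assumed from the path above.

```python
from pathlib import Path


def installed_languages(plugins_dir):
    """Map each Tess4J tessdata folder under plugins_dir to its language codes."""
    result = {}
    # Pattern assumed from the folder name quoted in this thread.
    for tessdata in Path(plugins_dir).glob("org.knime.knip.tess4j.base_*/tessdata"):
        codes = sorted(f.stem for f in tessdata.glob("*.traineddata"))
        result[str(tessdata)] = codes
    return result


if __name__ == "__main__":
    # Hypothetical path; replace it with your own KNIME plugins folder.
    for folder, codes in installed_languages(r"C:\knime\plugins").items():
        status = "por present" if "por" in codes else "por missing"
        print(folder, "->", codes, f"({status})")
```

If more than one `org.knime.knip.tess4j.base_*` folder shows up, the file may have been copied into a version the node is not actually using.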

There is a data folder in the workflow folder that has three PDFs for you to test.

It works for me. Please try it, and if you have problems, come back here.

Br


I’m having the same problem as @Felipereis50. I’m pretty sure the por language file is installed correctly. I’m on Windows 11 with KAP 5.7.0.

Hi @rfeigel

I’m on Windows 10 with KNIME 5.7.0, and this is my log file:

knime.log (48.6 KB)

Hope this helps.

Br

@hmfa

Sorry for the late reply.

I will try it this weekend. (I’m on vacation this week.) :laughing:

