PDF parsing issue

Hi,
I am relatively new to KNIME and I am having an issue reading in PDF files. The problem occurs while using the pdf parsing node - the editable text in the pdf is not collected. My end goal is to extract the data from the pdf which is contained in the editable sections of the pdf.

Image of pdf:
image

Image of what is read from pdf parser as displayed by document viewer:
image

Sorry I can’t give more from the pdf but this is proprietary. I think it still captures the jist of the issue.

Thanks

You can try Tika Parser i stead. It may work better.

4 Likes

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.