PDF parsing issue

I am relatively new to KNIME and I am having an issue reading in PDF files. The problem occurs while using the pdf parsing node - the editable text in the pdf is not collected. My end goal is to extract the data from the pdf which is contained in the editable sections of the pdf.

Image of pdf:

Image of what is read from pdf parser as displayed by document viewer:

Sorry I can’t give more from the pdf but this is proprietary. I think it still captures the jist of the issue.


You can try Tika Parser i stead. It may work better.


This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.