Convert a PDF file to HTML

Hi @ScottF

I’ve already used the Tika Parser, but for some files I have recognition issues.

For example, some words come out interspersed with spaces, like F R E E D O M, and others contain unrecognized characters (M�dical �mergencies).

So I tried converting the PDF to different formats to work around this, and finally found that the HTML format doesn’t have this issue.

Instead of using the Tika Parser node to read a PDF file, I have in mind to convert the PDF to an HTML document and extract the text from the HTML.
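
To make the second half of that idea concrete, here is a minimal sketch of the text-extraction step only, assuming the PDF has already been converted to HTML by some other tool. It uses Python's standard `html.parser` module; the names `TextExtractor` and `html_to_text` are made up for illustration, and this is not an existing KNIME node.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Hypothetical helper: collects visible text, skipping <script>/<style>."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._chunks = []      # text fragments in document order
        self._skip_depth = 0   # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is not inside a skipped element.
        if self._skip_depth == 0:
            text = data.strip()
            if text:
                self._chunks.append(text)

    def text(self):
        return " ".join(self._chunks)


def html_to_text(html):
    """Return the visible text of an HTML string, joined by single spaces."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()
```

Something like `html_to_text(open("converted.html", encoding="utf-8").read())` would then give the plain text, where `converted.html` is a placeholder file name. The encoding is an assumption too and would have to match whatever the converter writes.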

This is why I need a method to automatically convert PDF files to HTML.
But I don’t know if we can do it in KNIME.

Thank you for your help.