I have a PDF that is annotated with comments. I would like to extract the comments section information ONLY from the PDF along with the page number they come from using KNIME.
The Tika parser can capture all the contents of the PDF, but there is too much text to extract the relevant comments as they are not tagged.
Would be nice to hear from anyone in the KNIME community which has solved this problem.
Thanks