Hi everyone, I need some help with KNIME regarding PDF processing.
I’m trying to load a folder that contains several PDFs (a company’s Annual Reports, one for each year) along with an Excel file containing a list of keywords.
My goal is to have KNIME read each individual PDF, correctly associate it with its corresponding year, and search for the keywords within it.
The issues I’m facing are:
-
I can’t manage to assign each PDF the correct title/year, so the output isn’t clean.
-
The number of resulting rows is much larger than expected: I would expect 58 keywords × 14 reports = 812 rows, but I’m getting more than 11,000.
What could be causing this?
Thanks in advance to anyone who can help!
If you can, you can also contact me via email: roberto.cirillo@unicampania.it