I have several PDFs to parse in KNIME. As an example, I am attaching two: one with 2022 data and one with 2023 data.
The two files have the same structure, but while the 2022 file is read by the Tika Parser node without issue, the 2023 file seems to be unreadable. Why is that happening, and how can I make it work again?
You are right: the 2023 data is not in text format, so Tika can’t read it. Download zipped PDF folder here
I don’t know what the website changed to make this happen.
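
If you have many files and want to find out which ones are affected, one option is to check how much text the parser actually returned for each file and flag the near-empty ones. The snippet below is only a rough sketch of that idea in Python (it could be adapted for a Python Script node). The function names, the 20-character threshold, and the file names are made up for illustration; the input is assumed to be a mapping from file name to whatever text the parser extracted. Files that end up in the second list most likely have no text layer and would need OCR before their content can be read.

```python
from typing import Dict, List, Optional, Tuple


def needs_ocr(extracted_text: Optional[str], min_chars: int = 20) -> bool:
    """Return True when the extracted text is too short to be a real text layer.

    min_chars is an arbitrary, illustrative threshold.
    """
    # Drop all whitespace so blank pages and stray newlines do not count as text.
    visible = "".join((extracted_text or "").split())
    return len(visible) < min_chars


def split_by_text_layer(
    parsed: Dict[str, Optional[str]], min_chars: int = 20
) -> Tuple[List[str], List[str]]:
    """Split file names into (readable, probably image-only) lists."""
    readable: List[str] = []
    image_only: List[str] = []
    for name, text in parsed.items():
        if needs_ocr(text, min_chars):
            image_only.append(name)
        else:
            readable.append(name)
    return readable, image_only


if __name__ == "__main__":
    # Hypothetical parser output: file name -> extracted content.
    parsed = {
        "report_2022.pdf": "Annual figures for 2022 with plenty of extractable text",
        "report_2023.pdf": "  \n ",
    }
    readable, image_only = split_by_text_layer(parsed)
    print("readable:", readable)
    print("needs OCR:", image_only)
```

A short but valid document could be flagged by mistake, so the threshold is worth adjusting to your data.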