Hello, we have 1,000+ PDFs that are structured roughly the same way and want to extract the line items to put into a CSV (about 4-5 columns, though they don’t have clear lines separating the cells, the formatting is consistent). The data is somewhat messy (lots of various descriptions), but the line items appear in the same place across the pages.
Is there an easy way to do this with KNIME?
Alternatively, we are able to drop the PDF into ChatGPT and it has no issue extracting the data we’re looking for into a table with the right prompt. But, we don’t want to go through and use ChatGPT for each of these. Would there be a KNIME-based solution that could run all of these through ChatGPT to consolidate into a CSV?