How to extract from and divide a pdf with text and tables into rows and then search for a row by a word it contains.

@Ami1 the solutions to the KNIME challenge 015 might Indeed be a starting point. Then there is an approach using R package:

You can search for a word in a PDF file and extract the location:

To extract tables there is an option to use tabulizer

Or you can try and use the python package camelot

3 Likes