I have never before worked before text processing nodes, hope that you will help me to learn something.
Currently I have pdf file, and I have to get different tables from this pdf file.
For now, what I ve found is Tika Parser which parses all data from file.
I have got successfully the data, but in one column and row. In addition, a bit messy…(((
It turns out that depending on how the PDF is formatted, this can be quite a tricky problem. This is something that’s come up on the forum before - you might start here to see how other folks have approached it: