DATA EXTRACTION BETWEEN THE ROWS

SABAREESHS37 · December 30, 2021, 5:58am

Good day Everyone,

Am having an query that I need to extract the particular data from the set of row of an extracted pdf document have extracted the data using tikka parser hear I would like to filter the data between 2 headers

andrejz · January 3, 2022, 5:57am

Hi,

I supose that you have your pdf document in rows and in some row (n) is the tittle of the header and in the next n+x rows you have the title of the second header and so on …
Use Rule engine to mark the n row as header 1, n+x as header 2 ,…
then use Missing value node to fill the rows between the two headers with the previous value … and now you can filter the header you want.

Regards

system · July 4, 2022, 5:57pm

This topic was automatically closed 182 days after the last reply. New replies are no longer allowed.