Thanks for your support over a period. I have been able to learn of new things which has increased my overall productivity. I have been trying to extract some data from multiple PDF’s from 2 weeks, but I hadn’t got any success. So I did it manually, but now since I have the time I wanted to understand who can I extract specific data from PDF.
I have seen the example pdf_extract.knwf (57.8 KB) which matches with my case.
I understand that I need to follow this sequence to get my data.
The problem i have is I am not able to generate the regex to extract my specific data.
I need to extract
- Name of the Trust - Page 1
- Income of Trust estate - Page 2
- Total tax losses carried forward to later income years - Page 3
Link to the PDF file: https://drive.google.com/file/d/1n5-MBs5R4Fhv_IuakRgiewzhCUTCVwxS/view?usp=sharing
Can some please help me creating the regex code for this. Hopefully, which this I will be able to understand extraction technique and then I can add few more items that I need to extract?