Simple PDF Text Extraction

This is a companion discussion topic for the original entry at


Which nodes, and in what sequence, should be used to extract data (both text and images) from PDF files? I have tried both the Tika parser and the PDF reader. Neither seems to detect multiple values that sit on separate lines within the same table cell.

Any help would be appreciated.

Thanks and regards