I tried all known methods from the tutorial “ The KNIME Text Processing Feature” and “Text Mining Webinar The Text processing Extension” and the text mining webinar video. I tried different node including Reader node (my file is stored on my desktop as pdf). However, when I try to configure the file if comes out gibberish unreadable format.
I am probably asking Knime to do more than its capability.
I can not download the pdf file. Could you please attach it to your post in the forum, put it somewhere else, or send it to me via email? I will have a look into it.
Hi Cheers,
The web site (http://www.dhcs.ca.gov) always has links problems. The file is big, so I am posting portion of it.
Thanks,
What a great community!
Thank you for the PDF file. With the PDF Parser, provided by the Textprocessing extension you can parse PDF files into KNIME as DocumentCells. This works fine for your PDF file.
Attached is a workflow which reads the file and creates a bow.