It would be very useful to have a node that can read email text and download attachments, so that email can be used in KNIME as a data source for text mining and other analysis.
Do you have something like this in your road map?
You can use the Tika Parser node to read emails stored as .eml files. The node can also extract attachments from the .eml files and parse them as well. Is that what you are looking for?
Thank you for your answer, but I was looking for a node that can read email text and download attachments directly from the mail server in real time, or at least from Outlook. Do you see what I mean?
There is no node for that yet. It is something for future development, though.
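Until such a node exists, one possible workaround is to fetch the mail in a script, for example from a Python Script node. The following is only a minimal sketch using Python's standard imaplib and email modules; it assumes the mail server offers IMAP over SSL, and the names fetch_unseen and extract_text_and_attachments, as well as the host, user and password arguments, are hypothetical and not part of KNIME.

```python
import email
import imaplib
from email import policy


def extract_text_and_attachments(raw_bytes):
    """Parse one raw RFC 822 message into (subject, body_text, attachments)."""
    msg = email.message_from_bytes(raw_bytes, policy=policy.default)
    # Prefer the plain-text body; fall back to an empty string if there is none.
    body_part = msg.get_body(preferencelist=("plain",))
    body_text = body_part.get_content() if body_part is not None else ""
    # Each attachment becomes (filename, decoded bytes). The payload is None
    # for attachments that are themselves messages (message/rfc822).
    attachments = [
        (part.get_filename(), part.get_payload(decode=True))
        for part in msg.iter_attachments()
    ]
    return str(msg["Subject"] or ""), body_text, attachments


def fetch_unseen(host, user, password, mailbox="INBOX"):
    """Yield parsed unread messages (hypothetical helper, assumes IMAP over SSL)."""
    with imaplib.IMAP4_SSL(host) as conn:
        conn.login(user, password)
        # readonly=True leaves the messages marked as unread on the server.
        conn.select(mailbox, readonly=True)
        _, data = conn.search(None, "UNSEEN")
        for num in data[0].split():
            _, parts = conn.fetch(num, "(RFC822)")
            yield extract_text_and_attachments(parts[0][1])
```

The parsing step is kept separate from the network step so that it can also be applied to raw .eml content that was exported by other means.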
Thank you for your reply. I hope this feature will be introduced soon.
Hi all, I’ve used the Tika Parser and it worked well; I’m able to read and analyze a folder of exported .eml files. I’ve separated the footers using the Cell Splitter node, but now I’m facing the challenge of eliminating the duplicated text blocks that appear in forwarded or replied emails. Has anybody solved this?
Maybe the approach is to eliminate all redundant .eml files by using Java regular expressions.
I am having the same problem. Does anyone know how to solve it?
Meanwhile I found a solution: keep only the last valid email of each conversation and filter out all the earlier ones, using a regex and grouping by the email title.
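The exact regex was not posted, so the following is only a sketch of how such a filter could look, written in Python. It assumes that a conversation can be identified by its subject once reply and forward prefixes are stripped, that the latest message quotes the earlier ones, and that every Date header carries a time zone. The names PREFIX, thread_key and latest_per_thread and the list of prefixes are assumptions made for illustration.

```python
import re
from email.utils import parsedate_to_datetime

# Leading reply/forward markers such as "Re:", "Fwd:", "FW:",
# plus the German "AW:" and "WG:". Extend the list as needed.
PREFIX = re.compile(r"^(?:\s*(?:re|fwd?|aw|wg)\s*:\s*)+", re.IGNORECASE)


def thread_key(subject):
    """Normalize a subject so that all mails of one conversation share a key."""
    return PREFIX.sub("", subject).strip().lower()


def latest_per_thread(emails):
    """Keep only the most recent mail per conversation.

    emails: iterable of (subject, date_header, body) tuples.
    Returns a dict mapping the thread key to (subject, body).
    """
    latest = {}
    for subject, date_header, body in emails:
        key = thread_key(subject)
        sent = parsedate_to_datetime(date_header)
        if key not in latest or sent > latest[key][0]:
            latest[key] = (sent, subject, body)
    return {key: value[1:] for key, value in latest.items()}
```

In a workflow the same idea could be expressed with a regex replacement on the subject column followed by a grouping on the normalized title, keeping the row with the latest date in each group.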