Feature Request: Read text and attachments from an email

Hi,
it would be very useful to have a node that can read email text and download attachments, so that they can be used in KNIME as a data source for text mining and other analysis.
Do you have something like this on your roadmap?
Thanks
Regards

Hi iiiaaa,

you can use the Tika Parser node to read emails stored as .eml files. The node can also extract attachments from the .eml files and parse them as well. Is that what you are looking for?

Cheers, Kilian
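
For readers who want to see what "reading an .eml file and extracting its attachments" involves, here is a minimal Python sketch using only the standard library `email` package. It is not the Tika Parser node and does not reproduce its behavior; the function names `parse_eml` and `parse_eml_folder` are hypothetical and exist only for illustration.

```python
from email import policy
from email.parser import BytesParser
from pathlib import Path


def parse_eml(raw: bytes):
    """Return (subject, body_text, attachments) for one raw email message."""
    msg = BytesParser(policy=policy.default).parsebytes(raw)
    # Prefer the plain-text body; fall back to HTML if that is all there is.
    body_part = msg.get_body(preferencelist=("plain", "html"))
    body = body_part.get_content() if body_part is not None else ""
    attachments = []
    for part in msg.iter_attachments():
        # get_filename() returns None when the attachment has no name.
        name = part.get_filename() or "unnamed"
        attachments.append((name, part.get_payload(decode=True)))
    return str(msg["subject"] or ""), body, attachments


def parse_eml_folder(folder: str):
    """Parse every .eml file in a folder chosen by the caller."""
    return {
        path.name: parse_eml(path.read_bytes())
        for path in sorted(Path(folder).glob("*.eml"))
    }
```

Each attachment comes back as a `(filename, bytes)` pair, so it can be written to disk or handed to another parser.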

Dear Kilian,
thank you for your answer, but I was looking for a node that can read email text and download attachments directly from the email host in real time, or at least from Outlook. Do you see what I mean?
Thanks
Regards

Hi iiiaaa,
There is no node for that yet. It is something for future development, though.
Best, Daria
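
As a possible stopgap outside of any dedicated node, mail can be pulled straight from a server with Python's standard `imaplib` module. The sketch below assumes the mail host (Outlook/Exchange included) exposes IMAP, which depends on your setup; `fetch_messages` and `connect` are hypothetical helper names, and the host and credentials are placeholders.

```python
import imaplib
from email import policy
from email.parser import BytesParser


def fetch_messages(conn, mailbox="INBOX", criteria="UNSEEN"):
    """Yield parsed messages from an already logged-in IMAP connection."""
    # readonly=True avoids marking the fetched mail as seen.
    conn.select(mailbox, readonly=True)
    status, data = conn.search(None, criteria)
    if status != "OK":
        return
    for num in data[0].split():
        status, parts = conn.fetch(num.decode(), "(RFC822)")
        if status != "OK":
            continue
        # parts[0] is a (response_header, raw_message_bytes) tuple.
        raw = parts[0][1]
        yield BytesParser(policy=policy.default).parsebytes(raw)


def connect(host, user, password):
    """Open an IMAP-over-SSL session; all three arguments are placeholders."""
    conn = imaplib.IMAP4_SSL(host)
    conn.login(user, password)
    return conn
```

Polling `fetch_messages(connect(...))` on a schedule would approximate "real time"; true push delivery is not covered by this sketch.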

Thank you for your reply. I hope this feature will be introduced soon.
Thanks
Regards

Hi all, I’ve used the Tika Parser and it worked well; I’m able to read and analyze a folder of exported .eml files. I’ve separated the footers using the Cell Splitter node, but now I’m facing the challenge of eliminating the text blocks that are repeated across forwarded or replied emails. Has anybody solved this?
Thanks
Dieter
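
One way to approach the repeated text blocks is to cut each body at the first line that looks like the start of quoted or forwarded text. The Python sketch below is a heuristic only: the marker patterns are assumptions about common mail clients, not an exhaustive list, and `newest_text` is a hypothetical name.

```python
import re

# Heuristic markers that commonly introduce quoted or forwarded text.
# These patterns are assumptions and will miss some mail clients.
QUOTE_MARKERS = re.compile(
    r"^(?:"
    r"-{2,}\s*Original Message\s*-{2,}"
    r"|-{2,}\s*Forwarded message\s*-{2,}"
    r"|On .+ wrote:"
    r"|From:\s.+"
    r")\s*$",
    re.IGNORECASE | re.MULTILINE,
)


def newest_text(body: str) -> str:
    """Return only the text above the first quoted or forwarded block."""
    match = QUOTE_MARKERS.search(body)
    top = body[: match.start()] if match else body
    # Also drop any remaining lines quoted with a leading '>'.
    kept = [line for line in top.splitlines() if not line.lstrip().startswith(">")]
    return "\n".join(kept).strip()
```

Localized clients use other markers (for example German "Von:" headers), so the pattern list would need extending for a real mailbox.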

Maybe the approach is to eliminate all redundant .eml files using Java regular expressions.

I am having the same problem. Does anyone know how to solve it?

Meanwhile, I found a solution: keep only the last valid email in each thread and filter out all earlier ones, using regex and grouping by email title.
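
The original workflow was not shared, so the following Python sketch is only one possible reading of "regex and grouping by email title": strip reply and forward prefixes from the subject with a regex, group by the normalized subject, and keep the most recent message per group. The prefix list and the names `thread_key` and `latest_per_thread` are assumptions made for illustration.

```python
import re
from datetime import datetime
from typing import Iterable, List, Tuple

Email = Tuple[str, datetime, str]  # (subject, sent time, body)

# Strip any run of reply/forward prefixes such as "Re:", "AW:", "Fwd:", "FW:", "WG:".
PREFIX = re.compile(r"^(?:\s*(?:re|aw|fwd?|wg)\s*:\s*)+", re.IGNORECASE)


def thread_key(subject: str) -> str:
    """Normalize a subject so that replies and forwards share one key."""
    return PREFIX.sub("", subject).strip().lower()


def latest_per_thread(emails: Iterable[Email]) -> List[Email]:
    """Keep only the most recently sent email for each normalized subject."""
    latest = {}
    for subject, sent, body in emails:
        key = thread_key(subject)
        if key not in latest or sent > latest[key][1]:
            latest[key] = (subject, sent, body)
    return list(latest.values())
```

This relies on the last message of a thread quoting everything before it; threads that branch or change subject would be treated as separate conversations.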

Hi Kilian, I am looking for that solution. Could you please share it for reference, if you have one?

Hi,
I am looking for that solution. If anyone has a reference workflow ready, please share it.

Hi @k_user -

There are several examples using the Tika Parser on the KNIME Hub. Here’s a simple one to start:

Many others available here:

Does that help?

Thanks @ScottF for sharing this.
I am looking more for a reference workflow that reads the data/attachments from a mailbox/Outlook.
It would be helpful if someone had already created this and could share it here.
Thanks

Updating this thread to let everyone know the new Email Reader (Labs) and associated nodes were added in the 5.2 release.
