So far there is no dedicated node for reading or parsing emails from email clients. To get email data into KNIME, you need to export it first, e.g. as CSV-formatted plain text. For Thunderbird there is an add-on (ImportExportTools, https://addons.mozilla.org/de/thunderbird/addon/importexporttools/) that can do this for you. Then read the data with the File Reader node and convert it into documents with the Strings To Document node.
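If the add-on's export is not an option, the same kind of CSV can be produced with a short script. The following is only a sketch, assuming the mail is available as an mbox file (the format Thunderbird uses for local folders) and that Python's standard `mailbox` and `csv` modules are acceptable. The function names and the column layout are made up for illustration and are not part of KNIME or the add-on.

```python
import csv
import mailbox


def plain_text_body(message):
    """Return the first non-attachment text/plain part as a string ('' if none)."""
    for part in message.walk():
        if part.get_content_type() == "text/plain" and part.get_filename() is None:
            payload = part.get_payload(decode=True)
            if payload is None:
                continue
            charset = part.get_content_charset() or "utf-8"
            try:
                return payload.decode(charset, errors="replace")
            except LookupError:
                # Unknown charset name in the message; fall back to UTF-8.
                return payload.decode("utf-8", errors="replace")
    return ""


def mbox_to_csv(mbox_path, csv_path):
    """Write one CSV row per message and return the number of rows written."""
    box = mailbox.mbox(mbox_path, create=False)
    count = 0
    try:
        with open(csv_path, "w", newline="", encoding="utf-8") as handle:
            writer = csv.writer(handle)
            # Hypothetical column layout; adjust to what the workflow needs.
            writer.writerow(["from", "to", "date", "subject", "body"])
            for message in box:
                writer.writerow([
                    str(message.get("From", "")),
                    str(message.get("To", "")),
                    str(message.get("Date", "")),
                    str(message.get("Subject", "")),
                    plain_text_body(message),
                ])
                count += 1
    finally:
        box.close()
    return count
```

The resulting file has one message per row, with quoted multi-line bodies, so the reader node has to be configured to accept line breaks inside quoted fields.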
There are white papers on social media analysis with KNIME available (http://tech.knime.org/examples). The papers use a Slashdot data set. However, there are no papers or tutorials available for Twitter or email data analytics.
As a portfolio manager in a generic company, I would like to bring this aspect of BI to the discussion table.
I was just wondering why business intelligence "à la mode" refers only to social media, when we (and our customers too) are sitting on zillions of texts that are constantly neglected.
I would like to import email directly from mail servers and store the content, headers, and attachments in a database. I tried the suggested Thunderbird method, but the .csv export file is limited. There is an email package in Python for working with email:
https://docs.python.org/3.5/library/email.html
I have not worked with Python in KNIME. Would using the email package and its functions there be difficult?
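For reference, pulling headers, body text, and attachments out of a single raw message with that package could look roughly like this. It is a minimal sketch, assuming Python 3.6 or later and a message that is already available as raw bytes. `parse_email` and the dictionary layout are hypothetical names chosen for illustration, and the sketch does not cover how the result would be passed to a KNIME node.

```python
from email import message_from_bytes, policy


def parse_email(raw_bytes):
    """Split one raw RFC 822 message into headers, body text, and attachments."""
    msg = message_from_bytes(raw_bytes, policy=policy.default)

    # Prefer the plain-text body; fall back to HTML if there is no plain part.
    body_part = msg.get_body(preferencelist=("plain", "html"))
    body = body_part.get_content() if body_part is not None else ""

    attachments = []
    for part in msg.iter_attachments():
        attachments.append({
            "filename": part.get_filename(),
            "content_type": part.get_content_type(),
            # decode=True undoes base64 / quoted-printable and returns bytes.
            "data": part.get_payload(decode=True),
        })

    return {
        "headers": {
            "from": str(msg.get("From", "")),
            "to": str(msg.get("To", "")),
            "date": str(msg.get("Date", "")),
            "subject": str(msg.get("Subject", "")),
        },
        "body": body,
        "attachments": attachments,
    }
```

Each attachment is returned as raw bytes together with its file name, so it can be written to disk or stored in a binary database column as needed.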
@gcarmich Possibly. However, here's a Stack Overflow question with some code that looked solid enough to start testing with. From a flow perspective, if you can get the IMAP connection authenticated and open, getting the other data you're looking for should be pretty simple.
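To illustrate the IMAP part, here is a rough sketch using Python's standard `imaplib` module. It is not the code from the Stack Overflow question. The host name, credentials, and the function names `open_connection` and `fetch_raw_messages` are placeholders, and real servers may require app passwords or OAuth rather than a plain login.

```python
import imaplib


def open_connection(host, user, password):
    """Open an SSL IMAP connection and log in (host and credentials are placeholders)."""
    conn = imaplib.IMAP4_SSL(host)
    conn.login(user, password)
    return conn


def fetch_raw_messages(conn, mailbox_name="INBOX", limit=10):
    """Return up to `limit` of the most recent messages as raw bytes."""
    # readonly=True avoids marking messages as seen on the server.
    status, _ = conn.select(mailbox_name, readonly=True)
    if status != "OK":
        raise RuntimeError("could not select mailbox %r" % mailbox_name)

    status, data = conn.search(None, "ALL")
    if status != "OK":
        raise RuntimeError("search failed")

    message_numbers = data[0].split()
    start = max(0, len(message_numbers) - limit)

    raw_messages = []
    for num in message_numbers[start:]:
        status, parts = conn.fetch(num, "(RFC822)")
        if status != "OK":
            continue
        # The message content arrives as the second item of a tuple.
        for part in parts:
            if isinstance(part, tuple):
                raw_messages.append(part[1])
    return raw_messages


# Hypothetical usage, not run here:
# conn = open_connection("imap.example.com", "user@example.com", "secret")
# messages = fetch_raw_messages(conn, limit=5)
# conn.logout()
```

Each returned item is a complete raw message, so it can be handed to the email package for extracting headers, body, and attachments.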
Thanks. I'll see what I can do with the information at the link provided. I recently posted a question on the forum but have not received any responses:
https://tech.knime.org/node/55539/view
The method I have been trying recently uses the Context.io service and scripting for authentication. The script outputs the email messages in JSON format and the service has the ability to transfer attachments. I've been having problems getting Context.io to run since my Python skills are limited.