Question on Text and Network Mining Workflow Example

Giovanni · January 16, 2013, 3:07pm

Hi all,

I have a simple question on the example workflow related to "Text and Network Mining".

The initial node, a File reader one, reads Slashdot data. The question is: as the .table data, as far as I know, is an internal KNIME format, what was the original format of Slashdot data used in this examples. Did these data pass through a File Writer node before?

Generally speaking I do not fully understand the whole potential of the File Reader node. Whan should be it used and for what data? Could you advice?

Thanks a lot in advance!

Giovanni

InsilicoConsulting · January 17, 2013, 7:46am

file reader is a generic, tab, csv or txt file parsing node. It seems to be the daay of the csv node. I have found it to work better than the csv node in situations where there are custome delimiters or missing lines , columns etc.

Giovanni · January 17, 2013, 11:30am

My apologies,

I meant Table Reader, not File Reader. The Table Reader node is the one used in the example mentioned in the thread subject.

Cheers

G

tobias.koetter · January 18, 2013, 3:28pm

Hi Giovanni,

we extracted the information from an XML dump of Slashdot which you can download here.

Bye,

Tobias

Giovanni · January 22, 2013, 10:29am

Hi Tobias, I had a look at that page but the data set is no longer available. And I've got also problems in creating an account.

No worries however, thanks for the answer!

G

madlee · January 22, 2013, 10:41am

Maybe we can call a vbs by "External Tool" to list all sheets and save it to a text file to process. I will try this approach.

But I do not know how to deploy the scripts with the workflow. :(

tobias.koetter · January 23, 2013, 8:41am

Hi Giovanni,

sorry I thought that the file is still available. I can send you the original file if you contact me via the contact form.

Bye,

Tobias

caceter · January 22, 2017, 8:45pm

Is workflow from whitepaper still available?

just found it
https://www.knime.org/white-papers#networktext1