Hi everyone,
I recently discovered Knime which seemed me very useful & powerful for data analytics, so I decided to play with and to try to replicate the workflow and the results to the example combining text and network mining (the following whitepaper is available: http://www.knime.org/files/knime_social_media_white_paper.pdf).
I downloaded the Slashdot's xml data files and started to build a workflow. However I'm a newbie and I have a problem to extract information of a xml files folder.
More precisely my problem is the following: I'm able to extract categories from one xml file in using xml reader, Xpath and ungroup nodes but when I tried to extract these same categories from the whole of xml files present in a folder and that I use the list files and the iterate list of files nodes, my collected results are not correct. I obtain the right number of rows (equal to iteration number / files number in my folder) but it seems that information of only one xml file has been iterated. So after the iterate list of files node, if I try to parse each row with a Xpath node, the resulting output table has identical rows.
Here is attached my workflow file and 2 xml files.
Would you have an idea to resolve my problem ? What are the right options or basic settings to use in the "Variable Based File Reader" ?
Thank you in advance and congrats to Knime's developpers !!! :)