Hello there,
I need help in creating a workflow for my bachelor thesis.
In the following I am going to descripe my intended procedure:
Importing text data
I want to import text data of websites and social media accounts into KNIME
The only solution I´ve got is to copy and paste the text into an .txt document and import this into KNIME
Wanted words
In the thesis there will be important words which matters to my topic. I have to analyse the text data on this words
The only solution I´ve got and found in a youtube turtorial is to use a k-means clustering element but this was not working
Count the frequency of the wanted words
I have to specify how often the wanted words appear in the text data
Visualize the result
I need something like a table which shows me how often the wanted words appear in the data
Last question:
Do I have to run this workflow for each document or is there an option to run this workflow for all documents at once and get a big overview for the results?
@Nick_B Welcome to the forum. Are the data sources a fixed list? What specific types of sites are they, e.g. websites, Facebook, etc.? Are “wanted words” also a fixed list? If the answer to these questions is yes, can you provide the lists?
Hello rfeigel.
Thank you for welcoming me and for your reply.
The data souce will be the websites of companies or the social media profiles.
I would do this data import via txt documents because I see no other option.
After determin the relevant webistes and profiles the data source will be fixed but at this time there is nothing fixed
The wanted words will be a fixed list which I will determin as a part of my thesis.
So I can incluence all the variables.
My intention in the forum is to get help to create a workflow which can be used for this purpose later.
Try this. It automates pulling the data so you don’t have to cut and paste. Read the workflow description. The example data is a list of KNIME webpages. If you have questions let me know.
Hi rfeigel,
thank you very much for your help until this.
Do I have to download any add-ins or something like that?
I can´t drag and drop this workflow.
I´ve tried to create my own based on your example and I can´t find the tool “create word count”
Thank you very much it is running now.
Now the following questions:
Can I get run this for a list of words or do I have to run this for every wanted word alone?
Can I export this with a table writer ? There is no option to get a connection out of the create word count.
Your second question is easy. Open the component and add a file writer at the appropriate point(s). The second question is doable by adding a second loop with the word list. I’m working on it. Its a little messy.
2nd problem is that from the moment I´m changing the database links.xls and save this to get the websites I need into the workflow I can´t change run the workflow with the new input search word.
(the apply button does not work)
This workflow permits multiple search words. You can take the input from an Excel file if you want. It also has your output format. I’m not crazy about the blanks in the URL column, but its your call.
Good evening,
wow this is very impressive.
It works very well. Can you say me what the problem was that I can´t change the input tabel / after that I can´t change the wanted words?
I will now create the basis to use the workflow.
Can I contact you in the future if I have problems with this workflow?
Maybe you can tell me your real name so I can mention you in the acknowledgments if you want? I would like to do this because of the help.
I don’t understand this question “Can you say me what the problem was that I can´t change the input tabel / after that I can´t change the wanted words?” Could you explain further? Also, if this solved your problem could you mark it solved? Finally, no need for a citation. Feel free to contact me.